CV
Contact Information
| Name | Musawar Ali |
| Professional Title | Computer Vision Engineer |
| engrmusawar@gmail.com | |
| Location | Via del Pozzo, 6, Reggio Emilia, 42121, Italy |
Professional Summary
I did my undergraduate and master’s studies in Software/Computer Engineering in Pakistan. In March 2026, I completed my PhD in Computer Science and Engineering from University of Bologna. My areas of focus are computer vision, and generative models such as diffusion models and neural radiance fields. I am now seeking opportunities in Computer Vision and related fields so as to continueadvancing my work in these areas, with a preference for positions in Italy/Europe.
Experience
-
2022 - 2025 Bologna, Italy
PhD Researcher
EyeCan.AI
-
2020 - 2022 Karachi, Pakistan
Lecturer
FAST National university of Computer and Emerging Sciences
-
2018 - 2020 Swabi, Pakistan
Graduate Assistant
Ghulam Ishaq Khan Institute of Engineering Sciences and Technology
-
2016 - 2017 Lahore, Pakistan
Software Engineer
The Indus Hospital
Education
-
2022 - 2026 Bologna, Italy
-
2018 - 2020 Swabi, Pakistan
MS
Ghulam Ishaq Khan Institute of Engineering Sciences and Technology
Computer Engineering
-
2012 - 2016 Jamshoro, Pakistan
BE
Mehran University of Engineering and Technology
Software Engineering
Publications
-
2026 NVS-HO: A Benchmark for Novel View Synthesis of Handheld Objects
International Conference on Pattern Recognition
We propose NVS-HO, the first benchmark designed for novel view synthesis of handheld objects in real-world environments using only RGB inputs. Each object is recorded in two complementary RGB sequences: (1) a handheld sequence, where the object is manipulated in front of a static camera, and (2) a board sequence, where the object is fixed on a ChArUco board to provide accurate camera poses via marker detection. The goal of NVS-HO is to learn a NVS model that captures the full appearance of an object from (1), whereas (2) provides the groundtruth images used for evaluation. To establish baselines, we consider both a classical SfM pipeline and a state-of-the-art pre-trained feed-forward neural network (VGGT) as pose estimators, and train NVS models based on NeRF and Gaussian Splatting. Our experiments reveal significant performance gaps in current methods under unconstrained handheld conditions, highlighting the need for more robust approaches. NVS-HO thus offers a challenging real-world benchmark to drive progress in RGB-based novel view synthesis of handheld objects.
-
2025 Few-Shot Anomaly Classification by Learning to Inpaint Nominal Images
International Conference on Image Analysis and Processing
Automated inspection is a crucial aspect in modern industrial manufacturing. Despite its importance, the methods used to perform it suffer from the data scarcity intrinsic to the problem, where only a few anomalous samples are usually available. Recently, fine-tuning of foundation inpainting models has been proposed as a solution. The fine-tuned model is then used to inpaint nominal images where areas corresponding to defective masks have been removed. While effective, random pairing sometimes applies the mask on background or logically unfeasible areas. To counteract this phenomenon, we experiment with generating high-resolution defective images inpainting the few available real defects. Since the resulting images would show limited variance in the non-defective parts, we propose to fine-tune another inpainting model to change nominal parts of generated images. Experimental results on the MVTec-AD dataset demonstrate that our method generates images with complementary properties with respect to those produced by the baseline and training on an ensemble of generated data produces a new state of the art result.
-
2024 AnomalyControl: Few-Shot Anomaly Generation by ControlNet Inpainting
IEEE Access
Quality inspection tasks, i.e., anomaly detection, localization and classification, face the scarcity of non-nominal images in real industrial scenarios. Hence, generative models have been explored as a tool to obtain defective images from few real labelled samples. Despite the fast-increasing quality of such models, generating realistic defective images remains a challenging task due to the same data scarcity problem, which makes it difficult to steer large general-purpose models to produce realistic defects for specific industrial products. In this paper, we show how casting defect generation as inpainting of nominal images and using ControlNet to specialize a state-of-the-art inpainting model based on stable diffusion can be an effective solution for the few-shot anomaly generation task. Extensive experimental results on the MVTec-AD dataset demonstrate that the high quality of the images generated by our method significantly improves the state of the art on downstream anomaly classification.
-
2021 A novel framework for generating handwritten datasets
Multimedia Tools and Applications
The performance of deep learning algorithms is highly dependent on the size and diversity of data. However, for handwritten character recognition, dataset creation, segmentation, and labeling are time consuming and laborious tasks and not much researched. This work proposes a novel and generic framework which automates the segmentation and labeling processes for handwritten datasets. First, a user collects handwritten glyphs on the proposed form. Next, based on a priori knowledge, local peaks from horizontal and vertical projection functions are computed. This helps in locating and segmenting individual samples automatically. To show the effectiveness of the proposed framework, a dataset of 160,000 samples is collected for an oriental language. We profile the segmentation of samples from one sheet with three approaches: manual, semi-automatic, and the proposed fully automatic approach. Compared to the manual and semi-automatic processes, the proposed approach is 120 × and 65 × faster, respectively. Further, we also present the classification of this dataset by traditional and state-of-the-art machine learning algorithms.