Fine-grained Vehicle Classification

Through this project, we built a model which learned to detect and classify moving objects in videos. A classifier learned to distinguish between pedestrians and vehicles, as well as predicting fine-grained results for vehicles like 2-wheelers, 4-wheelers, etc.
First, the classifier learned to distinguish between different object classes. We employed Histogram of Gradients (HOG) [N. Dalal and B. Triggs] and Scale-Invariant Feature Transform (SIFT) [David G. Lowe] for building image features. Subsequently, we used the learned classifier with a sliding window approach over video frames to localize and identify objects.
An issue which we encountered while running our model over video frames was the prediction of many false positives (a window containing no object identified as having one). To address this, we turned to the approach of hard negative mining. Here, we saved a wrongly classified empty window as background and forced the classifier to learn it as a separate class. This led to a significant reduction in the false positive rate.

Avatar
Keshaw Singh
Member of Technical Staff II

Adobe Inc.