The TLD object tracker was developed by Z. Kalal, J. Matas and K. Mikolajczyk at the University of Surrey and Czech Technical In
stitute.
It is called TLD because of it utilizes three components in para
llel to accomplish the task of long-term tracking, tracking, lea
rning and
detection [4].
Tracking
The tracker used is a median-shift tracker based on Lucas-Kanade
optical flow algorithm.
The tracker provides an estimation of the trajectory of the object based solely on the frame-to-frame movement of key points, and independent of the system's object model.
Detection
The detector is a random forest classifier based on a collection
of 2bit binary patterns.These binary patterns are discretizations of the gradients across randomly sized and located pixel patches (called groups) withi
n the region of interest.
Each group yields its own decision tree within the random forest
, the leafs of which represent different positive representations the detector has foundwithin that pixel patch.
Learning
The learning that takes place is a semi-supervised process that fuses results from the object detector and tracker to iteratively improve the object model.
False negatives close to the tracked trajectory extend the decision trees (grow the forest) by positively labeling the tracked patch and retraining the model.
False positives far from the tracked trajectory prune the forest by removing leaves that led to the false identification.
The tracker is initiated by the user bounding the object of interest with a box in a single frame. The initial random forest model is trained with 100
different affine transformations of this single labeled example [1][2][3][4].
The machine learning methods used to create the TLD are describe d in detail above, and can be summarized as semi-supervised lear ning of structured datausing a random forest classifier.The initializing object-class detector needs to recognize catego ries of objects as opposed to specific instances of a single object. From preliminary literature searches it appears as if a boosting algorithm such as Adaboost would work well[5][6]. Other viable options include part-based models[8] or bag-of-features[9].
The algorithm will be tested on one of the data sets used by the creators of TLD in order to remove potential causes of discrepa ncy between my implementation and theirs. Specifically the dirtbike video sequence[7] will be used since a bundant similar training data of dirt bikers is available on you tube[10]. The most useful data sets to use for my research, however, would be aerial views of people or vehicles, so if time permits and a source is identified, the algorithm will be trained and tested on such data sets as well.
The proposed work plan is as follows:
4/17 Lucas-Kanade Optical Tracker 4/24 Random forest classifier 5/1 P-N Learning 5/8* Working TLD Tracker 5/15 Development and training of object classifier 5/22 Working automatic initialization 5/29 Poster presentation *Project milestone