twist₊₁

Compositional representation

We incrementally generated a part-based description by grouping simple local features (edge segments) to part combinations. This part-based representation is built from a large set of features (edge segments) by hierarchically grouping them depending on their spatial and temporal density. The grouping process leads to a set of stable part combinations with increasing complexity, which can efficiently guide a spatio-temporal association step (find correspondences) of coherently moving image regions, which are part of the same target object (see Figures 2 and 3). Figure 1 visualizes the concept of the approach. Videos 1 and 2 show tracking results with this generic, but also highly target specific representation. Related publications [28,39] (see publications).