PD2T: Person-Specific Detection, Deformable Tracking

IEEE Trans Pattern Anal Mach Intell. 2018 Nov;40(11):2555-2568. doi: 10.1109/TPAMI.2017.2769654. Epub 2017 Nov 3.

Abstract

Face detection/alignment methods have reached a satisfactory state in static images captured under arbitrary conditions. Such methods typically perform (joint) fitting for each frame and are used in commercial applications; however in the majority of the real-world scenarios the dynamic scenes are of interest. We argue that generic fitting per frame is suboptimal (it discards the informative correlation of sequential frames) and propose to learn person-specific statistics from the video to improve the generic results. To that end, we introduce a meticulously studied pipeline, which we name PD2T, that performs person-specific detection and landmark localisation. We carry out extensive experimentation with a diverse set of i) generic fitting results, ii) different objects (human faces, animal faces) that illustrate the powerful properties of our proposed pipeline and experimentally verify that PD2T outperforms all the compared methods.

Publication types

  • Research Support, Non-U.S. Gov't