Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments
- PMID: 26353306
- DOI: 10.1109/TPAMI.2013.248
Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments
Abstract
We introduce a new dataset, Human3.6M, of 3.6 Million accurate 3D Human poses, acquired by recording the performance of 5 female and 6 male subjects, under 4 different viewpoints, for training realistic human sensing systems and for evaluating the next generation of human pose estimation models and algorithms. Besides increasing the size of the datasets in the current state-of-the-art by several orders of magnitude, we also aim to complement such datasets with a diverse set of motions and poses encountered as part of typical human activities (taking photos, talking on the phone, posing, greeting, eating, etc.), with additional synchronized image, human motion capture, and time of flight (depth) data, and with accurate 3D body scans of all the subject actors involved. We also provide controlled mixed reality evaluation scenarios where 3D human models are animated using motion capture and inserted using correct 3D geometry, in complex real environments, viewed with moving cameras, and under occlusion. Finally, we provide a set of large-scale statistical models and detailed evaluation baselines for the dataset illustrating its diversity and the scope for improvement by future work in the research community. Our experiments show that our best large-scale model can leverage our full training set to obtain a 20% improvement in performance compared to a training set of the scale of the largest existing public dataset for this problem. Yet the potential for improvement by leveraging higher capacity, more complex models with our large dataset, is substantially vaster and should stimulate future research. The dataset together with code for the associated large-scale learning models, features, visualization tools, as well as the evaluation server, is available online at http://vision.imar.ro/human3.6m.
Similar articles
-
Tracking people on a torus.IEEE Trans Pattern Anal Mach Intell. 2009 Mar;31(3):520-38. doi: 10.1109/TPAMI.2008.101. IEEE Trans Pattern Anal Mach Intell. 2009. PMID: 19147879
-
Learning Actionlet Ensemble for 3D Human Action Recognition.IEEE Trans Pattern Anal Mach Intell. 2014 May;36(5):914-27. doi: 10.1109/TPAMI.2013.198. IEEE Trans Pattern Anal Mach Intell. 2014. PMID: 26353226
-
Make3D: learning 3D scene structure from a single still image.IEEE Trans Pattern Anal Mach Intell. 2009 May;31(5):824-40. doi: 10.1109/TPAMI.2008.132. IEEE Trans Pattern Anal Mach Intell. 2009. PMID: 19299858
-
Biometrics: Going 3D.Sensors (Basel). 2022 Aug 24;22(17):6364. doi: 10.3390/s22176364. Sensors (Basel). 2022. PMID: 36080821 Free PMC article. Review.
-
Deep Learning-Based Motion Style Transfer Tools, Techniques and Future Challenges.Sensors (Basel). 2023 Feb 26;23(5):2597. doi: 10.3390/s23052597. Sensors (Basel). 2023. PMID: 36904801 Free PMC article. Review.
Cited by
-
MILI: Multi-person inference from a low-resolution image.Fundam Res. 2023 Mar 1;3(3):434-441. doi: 10.1016/j.fmre.2023.02.006. eCollection 2023 May. Fundam Res. 2023. PMID: 38933767 Free PMC article.
-
Review-Emerging Portable Technologies for Gait Analysis in Neurological Disorders.Front Hum Neurosci. 2022 Feb 3;16:768575. doi: 10.3389/fnhum.2022.768575. eCollection 2022. Front Hum Neurosci. 2022. PMID: 35185496 Free PMC article. Review.
-
Riemannian Spatio-Temporal Features of Locomotion for Individual Recognition.Sensors (Basel). 2018 Dec 23;19(1):56. doi: 10.3390/s19010056. Sensors (Basel). 2018. PMID: 30583609 Free PMC article.
-
The Poses for Equine Research Dataset (PFERD).Sci Data. 2024 May 15;11(1):497. doi: 10.1038/s41597-024-03312-1. Sci Data. 2024. PMID: 38750064 Free PMC article.
-
3D human pose data augmentation using Generative Adversarial Networks for robotic-assisted movement quality assessment.Front Neurorobot. 2024 Apr 5;18:1371385. doi: 10.3389/fnbot.2024.1371385. eCollection 2024. Front Neurorobot. 2024. PMID: 38644903 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Research Materials
