Point Cloud Hand-Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
- PMID: 34451117
- PMCID: PMC8402345
- DOI: 10.3390/s21165676
Point Cloud Hand-Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
Abstract
This paper presents an application of neural networks operating on multimodal 3D data (3D point cloud, RGB, thermal) to effectively and precisely segment human hands and objects held in hand to realize a safe human-robot object handover. We discuss the problems encountered in building a multimodal sensor system, while the focus is on the calibration and alignment of a set of cameras including RGB, thermal, and NIR cameras. We propose the use of a copper-plastic chessboard calibration target with an internal active light source (near-infrared and visible light). By brief heating, the calibration target could be simultaneously and legibly captured by all cameras. Based on the multimodal dataset captured by our sensor system, PointNet, PointNet++, and RandLA-Net are utilized to verify the effectiveness of applying multimodal point cloud data for hand-object segmentation. These networks were trained on various data modes (XYZ, XYZ-T, XYZ-RGB, and XYZ-RGB-T). The experimental results show a significant improvement in the segmentation performance of XYZ-RGB-T (mean Intersection over Union: 82.8% by RandLA-Net) compared with the other three modes (77.3% by XYZ-RGB, 35.7% by XYZ-T, 35.7% by XYZ), in which it is worth mentioning that the Intersection over Union for the single class of hand achieves 92.6%.
Keywords: deep neural network; hand segmentation; multimodal imaging; point cloud segmentation; thermal.
Conflict of interest statement
This article has no conflict of interest with any organization.
Figures
Similar articles
-
Interactive robot teaching based on finger trajectory using multimodal RGB-D-T-data.Front Robot AI. 2023 Mar 16;10:1120357. doi: 10.3389/frobt.2023.1120357. eCollection 2023. Front Robot AI. 2023. PMID: 37008984 Free PMC article.
-
Food Image Segmentation Using Multi-Modal Imaging Sensors with Color and Thermal Data.Sensors (Basel). 2023 Jan 4;23(2):560. doi: 10.3390/s23020560. Sensors (Basel). 2023. PMID: 36679357 Free PMC article.
-
Semantic Segmentation of Natural Materials on a Point Cloud Using Spatial and Multispectral Features.Sensors (Basel). 2020 Apr 15;20(8):2244. doi: 10.3390/s20082244. Sensors (Basel). 2020. PMID: 32326663 Free PMC article.
-
Deep Learning on Point Clouds and Its Application: A Survey.Sensors (Basel). 2019 Sep 26;19(19):4188. doi: 10.3390/s19194188. Sensors (Basel). 2019. PMID: 31561639 Free PMC article. Review.
-
A Method for Measuring Contact Points in Human-Object Interaction Utilizing Infrared Cameras.Front Robot AI. 2022 Feb 14;8:800131. doi: 10.3389/frobt.2021.800131. eCollection 2021. Front Robot AI. 2022. PMID: 35237668 Free PMC article. Review.
Cited by
-
OHO: A Multi-Modal, Multi-Purpose Dataset for Human-Robot Object Hand-Over.Sensors (Basel). 2023 Sep 11;23(18):7807. doi: 10.3390/s23187807. Sensors (Basel). 2023. PMID: 37765862 Free PMC article.
-
Triangle-Mesh-Rasterization-Projection (TMRP): An Algorithm to Project a Point Cloud onto a Consistent, Dense and Accurate 2D Raster Image.Sensors (Basel). 2023 Aug 8;23(16):7030. doi: 10.3390/s23167030. Sensors (Basel). 2023. PMID: 37631565 Free PMC article.
-
Interactive robot teaching based on finger trajectory using multimodal RGB-D-T-data.Front Robot AI. 2023 Mar 16;10:1120357. doi: 10.3389/frobt.2023.1120357. eCollection 2023. Front Robot AI. 2023. PMID: 37008984 Free PMC article.
-
Food Image Segmentation Using Multi-Modal Imaging Sensors with Color and Thermal Data.Sensors (Basel). 2023 Jan 4;23(2):560. doi: 10.3390/s23020560. Sensors (Basel). 2023. PMID: 36679357 Free PMC article.
References
-
- Redmon J., Divvala S., Girshick R., Farhadi A. You only look once: Unified, real-time object detection; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; Las Vegas, NV, USA. 27–30 June 2016; pp. 779–788.
-
- He K., Gkioxari G., Dollár P., Girshick R. Mask r-cnn; Proceedings of the IEEE International Conference on Computer Vision; Venice, Italy. 22–29 October 2017; pp. 2961–2969.
-
- Kirillov A., Wu Y., He K., Girshick R. Pointrend: Image segmentation as rendering; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; Seattle, WA, USA. 14–19 June 2020; pp. 9799–9808.
-
- Palmero C., Clapés A., Bahnsen C., Møgelmose A., Moeslund T.B., Escalera S. Multi-modal rgb–depth–thermal human body segmentation. Int. J. Comput. Vis. 2016;118:217–239.
-
- Zhao S., Yang W., Wang Y. A new hand segmentation method based on fully convolutional network; Proceedings of the 2018 Chinese Control And Decision Conference (CCDC); Shenyang, China. 9–11 June 2018; pp. 5966–5970.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous
