A Differentiable Physics Engine for Deep Learning in Robotics
- PMID: 30899218
- PMCID: PMC6416213
- DOI: 10.3389/fnbot.2019.00006
Abstract
Optimizing controllers is an important field in robotics. Currently, robots are often treated as a black box in this optimization process, which is why derivative-free optimization methods such as evolutionary algorithms or reinforcement learning are omnipresent. When gradient-based methods are used, models are kept small or rely on finite difference approximations for the Jacobian. Finite differences quickly grow expensive as the number of parameters increases, as is the case in deep learning. We propose an implementation of a modern physics engine that can compute derivatives with respect to control parameters. This engine is implemented for both CPU and GPU. First, this paper shows how such an engine speeds up the optimization process, even for small problems. Furthermore, it explains why this is an alternative to deep Q-learning for using deep learning in robotics. Finally, we argue that this is a big step for deep learning in robotics, as it opens up new possibilities to optimize robots, both in hardware and software.
Keywords: deep learning; differentiable physics engine; gradient descent; neural network controller; robotics.
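The cost argument in the abstract can be illustrated with a toy example. The sketch below is not the engine described in the paper; it assumes a hypothetical 1D point mass driven by one force value per time step, with a squared-distance loss on the final position. All names (`rollout`, `loss`, `grad_finite_difference`, `grad_adjoint`) and constants are illustrative assumptions. The finite difference gradient needs one extra simulation per parameter, while differentiating through the simulation needs a single forward rollout and a single backward sweep.

```python
# Toy comparison: finite differences vs. differentiating through a simulation.
# Hypothetical setup (not from the paper): a 1D point mass starting at rest,
# pushed by one force value per time step, integrated with semi-implicit Euler.

def rollout(forces, dt=0.01, mass=1.0):
    """Simulate the point mass and return its final position."""
    x, v = 0.0, 0.0
    for u in forces:
        v += dt * u / mass   # velocity update from the applied force
        x += dt * v          # position update using the new velocity
    return x


def loss(forces, target=1.0, dt=0.01, mass=1.0):
    """Squared distance between the final position and the target."""
    return (rollout(forces, dt, mass) - target) ** 2


def grad_finite_difference(forces, eps=1e-6):
    """Forward differences: len(forces) + 1 full rollouts."""
    base = loss(forces)
    grad = []
    for i in range(len(forces)):
        bumped = list(forces)
        bumped[i] += eps
        grad.append((loss(bumped) - base) / eps)
    return grad


def grad_adjoint(forces, target=1.0, dt=0.01, mass=1.0):
    """Reverse-mode gradient: one rollout plus one backward sweep.

    The dynamics here are linear, so the backward sweep does not need the
    stored forward states; a nonlinear engine would have to keep them.
    """
    x_final = rollout(forces, dt, mass)
    gx = 2.0 * (x_final - target)  # d loss / d x_T, constant going backward
    gv = 0.0                       # d loss / d v, accumulated going backward
    grad = [0.0] * len(forces)
    for t in reversed(range(len(forces))):
        gv += dt * gx              # x_{t+1} depends on v_{t+1} through dt
        grad[t] = gv * dt / mass   # v_{t+1} depends on u_t through dt / mass
    return grad


if __name__ == "__main__":
    forces = [0.5] * 50
    fd = grad_finite_difference(forces)
    adj = grad_adjoint(forces)
    print(max(abs(a - b) for a, b in zip(fd, adj)))
```

With 50 force parameters the finite difference version runs 51 rollouts, and that count grows linearly with the number of parameters. The reverse-mode version stays at one forward and one backward pass, which is the property that makes gradient-based optimization practical for controllers with many parameters.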
