Decentralized policy learning with partial observation and mechanical constraints for multiperson modeling

Neural Netw. 2024 Mar:171:40-52. doi: 10.1016/j.neunet.2023.11.068. Epub 2023 Dec 1.

Abstract

Extracting the rules of real-world multi-agent behaviors is a current challenge in various scientific and engineering fields. Biological agents independently have limited observation and mechanical constraints; however, most of the conventional data-driven models ignore such assumptions, resulting in lack of biological plausibility and model interpretability for behavioral analyses. Here we propose sequential generative models with partial observation and mechanical constraints in a decentralized manner, which can model agents' cognition and body dynamics, and predict biologically plausible behaviors. We formulate this as a decentralized multi-agent imitation-learning problem, leveraging binary partial observation and decentralized policy models based on hierarchical variational recurrent neural networks with physical and biomechanical penalties. Using real-world basketball and soccer datasets, we show the effectiveness of our method in terms of the constraint violations, long-term trajectory prediction, and partial observation. Our approach can be used as a multi-agent simulator to generate realistic trajectories using real-world data.

Keywords: Behavioral analysis; Multi-agent imitation learning; Neural networks; Sports.

MeSH terms

  • Cognition
  • Learning*
  • Neural Networks, Computer*