Adaptive Memory and In Materia Reinforcement Learning Enabled by Flexoelectric-like Response from Ultrathin HfO2

ACS Appl Mater Interfaces. 2022 Dec 14;14(49):54876-54884. doi: 10.1021/acsami.2c19148. Epub 2022 Nov 30.

Abstract

Reinforcement learning (RL) is a mathematical framework of neural learning by trial and error that revolutionized the field of artificial intelligence. However, until now, RL has been implemented in algorithms with the compatibly of traditional complementary metal-oxide-semiconductor-based von Neumann digital platforms, which thus limits performance in terms of latency, fault tolerance, and robustness. Here, we demonstrate that nanocolumnar (∼12 nm) HfO2 structures can be used as building blocks to conduct the RL within the material by combining its stress-adjustable charge transport and memory functions. Specifically, HfO2 nanostructures grown by the sputtering method exhibit self-assembled vertical nanocolumnar structures that generate voltage depending on the impact of stress under self-biased conditions. The observed results are attributed to the flexoelectric-like response of HfO2. Further, multilevel current (>10-3 A) modulation with touch and controlled suspension (∼10-12 A) with a short electric pulse (100 ms) were demonstrated, yielding a proof-of-concept memory with an on/off ratio greater than 109. Utilizing multipattern dynamic memory and tactile sensing, RL was implemented to successfully solve a maze game using an array of 6 × 4. This work could pave the way to do RL within materials for a variety of applications such as memory storage, neuromorphic sensors, smart robots, and human-machine interaction systems.

Keywords: adaptive memory; flexoelectric effect; in materia; reinforcement learning; ultrathin HfO2.