Tracking food intake is key to using nutrition to prevent or manage common diseases including type 2 diabetes (T2D) and obesity. Several datasets are publicly available to promote research in diet monitoring, but generally contain data from a limited set of sensors (e.g., accelerometry, food images), which limits their application to specific use cases such as activity recognition or image recognition. Also lacking are publicly available datasets with food macronutrients and their associated continuous glucose measurements; datasets containing such rich information are proprietary. To address this gap, we present CGMacros, a dataset containing multimodal information from an activity tracker, two continuous glucose monitors (CGM), food macronutrients, and food photographs, as well as anonymized participant demographics, anthropometric measurements and health parameters from blood analyses and gut microbiome profiles. CGMacros contains data for 45 participants (15 healthy, 16 pre-diabetes, 14 T2D) who consumed meals with varying and known macronutrient compositions in a free-living setting for ten consecutive days. To our knowledge, this is the first database of its kind to be made publicly available. CGMacros, and larger publicly available datasets that we hope may follow, are essential to democratize academic research in personalized nutrition and algorithmic approaches to automated diet monitoring.
© 2025. The Author(s).