Recurrent Neural Networks for Multivariate Time Series with Missing Values
- PMID: 29666385
- PMCID: PMC5904216
- DOI: 10.1038/s41598-018-24271-9
Abstract
Multivariate time series data in practical applications, such as health care, geoscience, and biology, are characterized by a variety of missing values. In time series prediction and other related tasks, it has been noted that missing values and their missing patterns are often correlated with the target labels, a phenomenon known as informative missingness. There is very limited work on exploiting the missing patterns for effective imputation and improved prediction performance. In this paper, we develop novel deep learning models, namely GRU-D, as one of the early attempts to do so. GRU-D is based on the Gated Recurrent Unit (GRU), a state-of-the-art recurrent neural network. It takes two representations of missing patterns, i.e., masking and time interval, and incorporates them into a deep model architecture so that it not only captures the long-term temporal dependencies in time series, but also utilizes the missing patterns to achieve better prediction results. Experiments on time series classification tasks using real-world clinical datasets (MIMIC-III, PhysioNet) and synthetic datasets demonstrate that our models achieve state-of-the-art performance and provide useful insights for better understanding and utilization of missing values in time series analysis.
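The abstract names the two representations of missing patterns, masking and time interval, without defining them. The following is a minimal sketch under assumed definitions: the mask is 1 where a variable is observed and 0 where it is missing, and the time interval is the time elapsed since that variable was last observed (0 at the first step). The function name `mask_and_intervals`, the use of NaN to mark missing values, and the toy data are illustrative assumptions, not taken from the abstract.

```python
import math


def mask_and_intervals(timestamps, values):
    """Compute per-variable masks and time intervals (assumed definitions).

    timestamps: list of increasing observation times, one per step.
    values: list of rows, one per step; math.nan marks a missing value.
    Returns (masks, deltas), each a list of rows shaped like `values`.
    """
    n_vars = len(values[0])
    masks, deltas = [], []
    for t, row in enumerate(values):
        # Mask: 1 if the variable is observed at this step, 0 if missing.
        mask_row = [0 if math.isnan(v) else 1 for v in row]
        if t == 0:
            # Assumption: the interval at the first step is zero.
            delta_row = [0.0] * n_vars
        else:
            gap = timestamps[t] - timestamps[t - 1]
            # If the variable was observed at the previous step, the interval
            # is just the gap; otherwise the gap accumulates.
            delta_row = [
                gap if masks[t - 1][d] == 1 else gap + deltas[t - 1][d]
                for d in range(n_vars)
            ]
        masks.append(mask_row)
        deltas.append(delta_row)
    return masks, deltas


# Toy data (hypothetical): two variables over four irregularly spaced steps.
nan = math.nan
toy_times = [0.0, 1.0, 3.0, 4.0]
toy_values = [[1.0, nan], [nan, 2.0], [nan, nan], [5.0, 3.0]]
toy_masks, toy_deltas = mask_and_intervals(toy_times, toy_values)
```

In a model along the lines the abstract describes, arrays like these would be fed to the recurrent network alongside the (imputed) values, so the network can condition on whether and how long ago each variable was measured.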
Conflict of interest statement
The authors declare no competing interests.
