Recurrent Neural Networks for Multivariate Time Series with Missing Values
- PMID: 29666385
- PMCID: PMC5904216
- DOI: 10.1038/s41598-018-24271-9
Recurrent Neural Networks for Multivariate Time Series with Missing Values
Abstract
Multivariate time series data in practical applications, such as health care, geoscience, and biology, are characterized by a variety of missing values. In time series prediction and other related tasks, it has been noted that missing values and their missing patterns are often correlated with the target labels, a.k.a., informative missingness. There is very limited work on exploiting the missing patterns for effective imputation and improving prediction performance. In this paper, we develop novel deep learning models, namely GRU-D, as one of the early attempts. GRU-D is based on Gated Recurrent Unit (GRU), a state-of-the-art recurrent neural network. It takes two representations of missing patterns, i.e., masking and time interval, and effectively incorporates them into a deep model architecture so that it not only captures the long-term temporal dependencies in time series, but also utilizes the missing patterns to achieve better prediction results. Experiments of time series classification tasks on real-world clinical datasets (MIMIC-III, PhysioNet) and synthetic datasets demonstrate that our models achieve state-of-the-art performance and provide useful insights for better understanding and utilization of missing values in time series analysis.
Conflict of interest statement
The authors declare no competing interests.
Figures
Similar articles
-
Adversarial Joint-Learning Recurrent Neural Network for Incomplete Time Series Classification.IEEE Trans Pattern Anal Mach Intell. 2022 Apr;44(4):1765-1776. doi: 10.1109/TPAMI.2020.3027975. Epub 2022 Mar 4. IEEE Trans Pattern Anal Mach Intell. 2022. PMID: 32997624
-
Attention-Based Sequence-to-Sequence Model for Time Series Imputation.Entropy (Basel). 2022 Dec 9;24(12):1798. doi: 10.3390/e24121798. Entropy (Basel). 2022. PMID: 36554203 Free PMC article.
-
CGCNImp: a causal graph convolutional network for multivariate time series imputation.PeerJ Comput Sci. 2022 Apr 29;8:e966. doi: 10.7717/peerj-cs.966. eCollection 2022. PeerJ Comput Sci. 2022. PMID: 35634128 Free PMC article.
-
Deep imputation of missing values in time series health data: A review with benchmarking.J Biomed Inform. 2023 Aug;144:104440. doi: 10.1016/j.jbi.2023.104440. Epub 2023 Jul 8. J Biomed Inform. 2023. PMID: 37429511 Review.
-
Handling missing values in healthcare data: A systematic review of deep learning-based imputation techniques.Artif Intell Med. 2023 Aug;142:102587. doi: 10.1016/j.artmed.2023.102587. Epub 2023 May 22. Artif Intell Med. 2023. PMID: 37316097 Review.
Cited by
-
Missing Data Statistics Provide Causal Insights into Data Loss in Diabetes Health Monitoring by Wearable Sensors.Sensors (Basel). 2024 Feb 27;24(5):1526. doi: 10.3390/s24051526. Sensors (Basel). 2024. PMID: 38475061 Free PMC article.
-
Identification of clinical disease trajectories in neurodegenerative disorders with natural language processing.Nat Med. 2024 Mar 12. doi: 10.1038/s41591-024-02843-9. Online ahead of print. Nat Med. 2024. PMID: 38472295
-
Feature-based 3D+t descriptors of hyperactivated human sperm beat patterns.Heliyon. 2024 Feb 23;10(5):e26645. doi: 10.1016/j.heliyon.2024.e26645. eCollection 2024 Mar 15. Heliyon. 2024. PMID: 38444471 Free PMC article.
-
Mdpg: a novel multi-disease diagnosis prediction method based on patient knowledge graphs.Health Inf Sci Syst. 2024 Mar 2;12(1):15. doi: 10.1007/s13755-024-00278-7. eCollection 2024 Dec. Health Inf Sci Syst. 2024. PMID: 38440103
-
High-Precision Microscale Particulate Matter Prediction in Diverse Environments Using a Long Short-Term Memory Neural Network and Street View Imagery.Environ Sci Technol. 2024 Feb 27;58(8):3869-3882. doi: 10.1021/acs.est.3c06511. Epub 2024 Feb 14. Environ Sci Technol. 2024. PMID: 38355131 Free PMC article.
References
-
- Rubin DB. Inference and missing data. Biom. 1976;63:581–592.
-
- Schafer, J. L. & Graham, J. W. Missing data: our view of the state of the art. Psychol. methods (2002). - PubMed
-
- Kreindler, D. M. & Lumsden, C. J. The effects of the irregular sample and missing data in time series analysis. Nonlinear Dyn. Syst. Analysis for Behav. Sci. Using Real Data (2012). - PubMed
-
- De Boor C, De Boor C, Mathématicien E-U, De Boor C, De Boor C. A practical guide to splines. New York: Springer-Verlag; 1978.
LinkOut - more resources
Full Text Sources
Other Literature Sources
