Neuroscience Cloud Analysis As a Service: An open-source platform for scalable, reproducible data analysis
- PMID: 35870448
- PMCID: PMC9464703
- DOI: 10.1016/j.neuron.2022.06.018
Neuroscience Cloud Analysis As a Service: An open-source platform for scalable, reproducible data analysis
Abstract
A key aspect of neuroscience research is the development of powerful, general-purpose data analyses that process large datasets. Unfortunately, modern data analyses have a hidden dependence upon complex computing infrastructure (e.g., software and hardware), which acts as an unaddressed deterrent to analysis users. Although existing analyses are increasingly shared as open-source software, the infrastructure and knowledge needed to deploy these analyses efficiently still pose significant barriers to use. In this work, we develop Neuroscience Cloud Analysis As a Service (NeuroCAAS): a fully automated open-source analysis platform offering automatic infrastructure reproducibility for any data analysis. We show how NeuroCAAS supports the design of simpler, more powerful data analyses and that many popular data analysis tools offered through NeuroCAAS outperform counterparts on typical infrastructure. Pairing rigorous infrastructure management with cloud resources, NeuroCAAS dramatically accelerates the dissemination and use of new data analyses for neuroscientific discovery.
Keywords: cloud compute; data analysis; ensembling; infrastructure-as-code; markerless tracking; open source; widefield calcium imaging.
Copyright © 2022 Elsevier Inc. All rights reserved.
Conflict of interest statement
Declaration of interests The authors declare no competing interests.
Figures
Comment in
-
Neuroscience data analysis in the cloud.Nat Methods. 2022 Sep;19(9):1034. doi: 10.1038/s41592-022-01613-0. Nat Methods. 2022. PMID: 36068316 No abstract available.
Similar articles
-
APRICOT: Advanced Platform for Reproducible Infrastructures in the Cloud via Open Tools.Methods Inf Med. 2020 Dec;59(S 02):e33-e45. doi: 10.1055/s-0040-1712460. Epub 2020 Aug 10. Methods Inf Med. 2020. PMID: 32777825 Free PMC article.
-
MCX Cloud-a modern, scalable, high-performance and in-browser Monte Carlo simulation platform with cloud computing.J Biomed Opt. 2022 Jan;27(8):083008. doi: 10.1117/1.JBO.27.8.083008. J Biomed Opt. 2022. PMID: 34989198 Free PMC article.
-
Cloud-enabled Biodepot workflow builder integrates image processing using Fiji with reproducible data analysis using Jupyter notebooks.Sci Rep. 2022 Sep 2;12(1):14920. doi: 10.1038/s41598-022-19173-w. Sci Rep. 2022. PMID: 36056115 Free PMC article.
-
Edge, Fog, and Cloud Against Disease: The Potential of High-Performance Cloud Computing for Pharma Drug Discovery.Methods Mol Biol. 2024;2716:181-202. doi: 10.1007/978-1-0716-3449-3_8. Methods Mol Biol. 2024. PMID: 37702940 Review.
-
Use of cloud computing in biomedicine.J Biomol Struct Dyn. 2016 Dec;34(12):2688-2697. doi: 10.1080/07391102.2015.1127182. Epub 2016 Feb 17. J Biomol Struct Dyn. 2016. PMID: 26646249 Review.
Cited by
-
Face mask recognition system using CNN model.Neurosci Inform. 2022 Sep;2(3):100035. doi: 10.1016/j.neuri.2021.100035. Epub 2021 Dec 9. Neurosci Inform. 2022. PMID: 36819833 Free PMC article. Review.
-
Multiphoton fluorescence microscopy for in vivo imaging.Cell. 2024 Aug 22;187(17):4458-4487. doi: 10.1016/j.cell.2024.07.036. Cell. 2024. PMID: 39178829 Review.
-
Sharing neurophysiology data from the Allen Brain Observatory.Elife. 2023 Jul 11;12:e85550. doi: 10.7554/eLife.85550. Elife. 2023. PMID: 37432073 Free PMC article.
-
Exploration biases forelimb reaching strategies.Cell Rep. 2024 Apr 23;43(4):113958. doi: 10.1016/j.celrep.2024.113958. Epub 2024 Mar 22. Cell Rep. 2024. PMID: 38520691 Free PMC article.
-
Lightning Pose: improved animal pose estimation via semi-supervised learning, Bayesian ensembling and cloud-native open-source tools.Nat Methods. 2024 Jul;21(7):1316-1328. doi: 10.1038/s41592-024-02319-1. Epub 2024 Jun 25. Nat Methods. 2024. PMID: 38918605
References
-
- Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M et al. (2016), Tensorflow: A system for large-scale machine learning, in ‘12th Symposium on Operating Systems Design and Implementation (OSDI 16)’, pp. 265–283.
-
- Aguiar A, Díaz J, Almaraz R, Pérez J and Garbajosa J (2018), DevOps in Practice – An Exploratory Case Study, in ‘Proceedings of the 19th International Conference on Agile Software Development: Companion, XP ‘18’, pp. 1–3.
-
- Amstutz P, Crusoe MR, Tijanic N. s., Chapman B, Chilton J, Heuer M, Kartashov A, Kern J, Leehr D, Menager H, Nedeljkovich M, Scales M, Soiland-Reyes S and Stojanovic L (2016), ‘Common Workflow Language, v1.0’. 10.6084/m9.figshare.3115156.v2 - DOI
-
- Avesani P, McPherson B, Hayashi S, Caiafa CF, Henschel R, Garyfallidis E, Kitchell L, Bullock D, Patterson A, Olivetti E et al. (2019), ‘The open diffusion data derivatives, brain data upcycling via integrated publishing of derivatives and reproducible open cloud services’, Scientific data 6(1), 1–13. - PMC - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials
