Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data

Micah J Sheller; Brandon Edwards; G Anthony Reina; Jason Martin; Sarthak Pati; Aikaterini Kotrotsou; Mikhail Milchenko; Weilin Xu; Daniel Marcus; Rivka R Colen; Spyridon Bakas

doi:10.1038/s41598-020-69250-1

Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data

Sci Rep. 2020 Jul 28;10(1):12598. doi: 10.1038/s41598-020-69250-1.

Authors

Micah J Sheller¹, Brandon Edwards¹, G Anthony Reina¹, Jason Martin¹, Sarthak Pati^{2

3}, Aikaterini Kotrotsou^{4

5}, Mikhail Milchenko⁶, Weilin Xu¹, Daniel Marcus⁶, Rivka R Colen^{4

5

7

8}, Spyridon Bakas^{9

10

11}

Affiliations

¹ Intel Corporation, 2200 Mission College Blvd., Santa Clara, CA, 95052, USA.
² Center for Biomedical Image Computing and Analytics (CBICA), University of Pennsylvania, Richards Medical Research Laboratories, Floor 7, 3700 Hamilton Walk, Philadelphia, PA, 19104, USA.
³ Department of Radiology, Perelman School of Medicine, University of Pennsylvania, Richards Medical Research Laboratories, Floor 7, 3700 Hamilton Walk, Philadelphia, PA, 19104, USA.
⁴ Department of Diagnostic Radiology, The University of Texas MD Anderson Cancer Center, 1400 Pressler St., Houston, TX, 77030, USA.
⁵ Department of Cancer Systems Imaging, The University of Texas MD Anderson Cancer Center, 1881 East Rd, 3SCRB4, Houston, TX, 77054, USA.
⁶ Department of Radiology, Washington University School of Medicine, St. Louis, MO, 63110, USA.
⁷ Hillman Cancer Center, University of Pittsburgh Medical Center, Pittsburgh, PA, 15232, USA.
⁸ Department of Radiology, University of Pittsburgh, Pittsburgh, PA, 15213, USA.
⁹ Center for Biomedical Image Computing and Analytics (CBICA), University of Pennsylvania, Richards Medical Research Laboratories, Floor 7, 3700 Hamilton Walk, Philadelphia, PA, 19104, USA. sbakas@upenn.edu.
¹⁰ Department of Radiology, Perelman School of Medicine, University of Pennsylvania, Richards Medical Research Laboratories, Floor 7, 3700 Hamilton Walk, Philadelphia, PA, 19104, USA. sbakas@upenn.edu.
¹¹ Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Richards Medical Research Laboratories, Floor 7, 3700 Hamilton Walk, Philadelphia, PA, 19104, USA. sbakas@upenn.edu.

Abstract

Several studies underscore the potential of deep learning in identifying complex patterns, leading to diagnostic and prognostic biomarkers. Identifying sufficiently large and diverse datasets, required for training, is a significant challenge in medicine and can rarely be found in individual institutions. Multi-institutional collaborations based on centrally-shared patient data face privacy and ownership challenges. Federated learning is a novel paradigm for data-private multi-institutional collaborations, where model-learning leverages all available data without sharing data between institutions, by distributing the model-training to the data-owners and aggregating their results. We show that federated learning among 10 institutions results in models reaching 99% of the model quality achieved with centralized data, and evaluate generalizability on data from institutions outside the federation. We further investigate the effects of data distribution across collaborating institutions on model quality and learning patterns, indicating that increased access to data through data private multi-institutional collaborations can benefit model quality more than the errors introduced by the collaborative method. Finally, we compare with other collaborative-learning approaches demonstrating the superiority of federated learning, and discuss practical implementation considerations. Clinical adoption of federated learning is expected to lead to models trained on datasets of unprecedented size, hence have a catalytic impact towards precision/personalized medicine.

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Humans
Information Dissemination*
Interinstitutional Relations*
Learning*
Medicine*
Patients*
Privacy*

Abstract

Publication types

MeSH terms

Grants and funding