Microscopy reveals complex patterns of cellular heterogeneity that can be biologically informative. However, a limitation of microscopy is that only a small number of biomarkers can typically be monitored simultaneously. Thus, a natural question is whether additional biomarkers provide a deeper characterization of the distribution of cellular states in a population. How much information about a cell's phenotypic state in one biomarker is gained by knowing its state in another biomarker? Here, we describe a framework for comparing phenotypic states across biomarkers. Our approach overcomes the current limitation of microscopy by not requiring costaining biomarkers on the same cells; instead, we require staining of biomarkers (possibly separately) on a common collection of phenotypically diverse cell lines. We evaluate our approach on two image datasets: 33 oncogenically diverse lung cancer cell lines stained with 7 biomarkers, and 49 less diverse subclones of one lung cancer cell line stained with 12 biomarkers. We first validate our method by comparing it to the "gold standard" of costaining. We then apply our approach to all pairs of biomarkers and use it to identify biomarkers that yield similar patterns of heterogeneity. The results presented in this work suggest that many biomarkers provide redundant information about heterogeneity. Thus, our approach provides a practical guide for selecting independently informative biomarkers and, more generally, will yield insights into both the connectivity of biological networks and the complexity of the state space of biological systems.
Keywords: bioimage informatics; biological networks; biomarker selection; systems biology; heterogeneity; information theory; microscopy; single-cell variability.
© 2014 International Society for Advancement of Cytometry.