The meaning of significant mean group differences for biomarker discovery

Eva Loth; Jumana Ahmad; Chris Chatham; Beatriz López; Ben Carter; Daisy Crawley; Bethany Oakley; Hannah Hayward; Jennifer Cooke; Antonia San José Cáceres; Danilo Bzdok; Emily Jones; Tony Charman; Christian Beckmann; Thomas Bourgeron; Roberto Toro; Jan Buitelaar; Declan Murphy; Guillaume Dumas

doi:10.1371/journal.pcbi.1009477

The meaning of significant mean group differences for biomarker discovery

PLoS Comput Biol. 2021 Nov 18;17(11):e1009477. doi: 10.1371/journal.pcbi.1009477. eCollection 2021 Nov.

Authors

Eva Loth^{1

2}, Jumana Ahmad³, Chris Chatham⁴, Beatriz López⁵, Ben Carter⁶, Daisy Crawley¹, Bethany Oakley¹, Hannah Hayward¹, Jennifer Cooke¹, Antonia San José Cáceres^{1

7}, Danilo Bzdok^{8

9

10}, Emily Jones¹¹, Tony Charman¹², Christian Beckmann¹³, Thomas Bourgeron¹⁴, Roberto Toro¹⁴, Jan Buitelaar¹³, Declan Murphy^{1

2}, Guillaume Dumas^{10

14

15}

Affiliations

¹ Department of Forensic and Neurodevelopmental Sciences, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, United Kingdom.
² Sackler Institute for Translational Neuroscience, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, United Kingdom.
³ Department of Psychology, Social Work and Counselling, Faculty of Education and Health, University of Greenwich, London, United Kingdom.
⁴ Neuroscience & Rare Diseases, Pharma Research & Early Development, Roche Innovation Center New York, New York, United States of America.
⁵ Department of Psychology, Portsmouth University, Portsmouth, United Kingdom.
⁶ Department of Biostatistics, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, United Kingdom.
⁷ Instituto de Investigación Sanitaria Gregorio Marañón, Departamento de Psiquiatría del Niño y del Adolescente, Hospital General Universitario Gregorio Marañón and Centro Investigación Biomédica en Red Salud Mental (CIBERSAM), Madrid, Spain.
⁸ Department of Biomedical Engineering, McConnell Brain-Imaging Centre (BIC), Montreal Neurological Institute (MNI), Faculty of Medicine, McGill University, Montreal, Canada.
⁹ Canadian Institute for Advanced Research (CIFAR), Canada.
¹⁰ Mila-Quebec Artificial Intelligence Institute, Montreal, Canada.
¹¹ Centre for Brain and Cognitive Development, Birkbeck, University of London, London, United Kingdom.
¹² Department of Psychology, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, United Kingdom.
¹³ Department of Cognitive Neuroscience, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Centre, Nijmegen, the Netherlands.
¹⁴ Human Genetics and Cognitive Functions, Institut Pasteur, UMR3571 CNRS, Université de Paris, Paris, France.
¹⁵ Precision Psychiatry and Social Physiology laboratory, CHU Sainte-Justine Research Center, Department of Psychiatry, University of Montreal, Quebec, Canada.

Abstract

Over the past decade, biomarker discovery has become a key goal in psychiatry to aid in the more reliable diagnosis and prognosis of heterogeneous psychiatric conditions and the development of tailored therapies. Nevertheless, the prevailing statistical approach is still the mean group comparison between "cases" and "controls," which tends to ignore within-group variability. In this educational article, we used empirical data simulations to investigate how effect size, sample size, and the shape of distributions impact the interpretation of mean group differences for biomarker discovery. We then applied these statistical criteria to evaluate biomarker discovery in one area of psychiatric research-autism research. Across the most influential areas of autism research, effect size estimates ranged from small (d = 0.21, anatomical structure) to medium (d = 0.36 electrophysiology, d = 0.5, eye-tracking) to large (d = 1.1 theory of mind). We show that in normal distributions, this translates to approximately 45% to 63% of cases performing within 1 standard deviation (SD) of the typical range, i.e., they do not have a deficit/atypicality in a statistical sense. For a measure to have diagnostic utility as defined by 80% sensitivity and 80% specificity, Cohen's d of 1.66 is required, with still 40% of cases falling within 1 SD. However, in both normal and nonnormal distributions, 1 (skewness) or 2 (platykurtic, bimodal) biologically plausible subgroups may exist despite small or even nonsignificant mean group differences. This conclusion drastically contrasts the way mean group differences are frequently reported. Over 95% of studies omitted the "on average" when summarising their findings in their abstracts ("autistic people have deficits in X"), which can be misleading as it implies that the group-level difference applies to all individuals in that group. We outline practical approaches and steps for researchers to explore mean group comparisons for the discovery of stratification biomarkers.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Autistic Disorder / diagnosis
Biomarkers / analysis*
Case-Control Studies
Computational Biology / education*
Computational Biology / statistics & numerical data
Computer Simulation
Humans
Individuality
Mental Disorders / diagnosis
Neurodevelopmental Disorders / diagnosis
Neuropsychiatry / statistics & numerical data
Neuropsychology / statistics & numerical data
Normal Distribution
Sample Size

Substances

Biomarkers

Grants and funding

EL, JA, BL, BC, DC, BO, HH, JC, ASJC, EJ, TC, CB, TB, RT, JB, DM, and GD have received funding from the Innovative Medicines Initiative 2 Joint Undertaking under grant agreement No 777394 for the project AIMS-2-TRIALS. This Joint Undertaking is a joint support from the European Union's Horizon 2020 research and innovation programme, EFPIA, AUTISM SPEAKS, Autistica, and SFARI. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.