Background: Although germline genetics influences breast cancer incidence, published research only explains approximately half of the expected association. Moreover, the accuracy of prediction models remains low. For women who develop breast cancer early, the genetic architecture is less established.Methods: To identify loci associated with early-onset breast cancer, gene-based tests were carried out using exome array data from 3,479 women with breast cancer diagnosed before age 50 and 973 age-matched controls. Replication was undertaken in a population that developed breast cancer at all ages of onset.Results: Three gene regions were associated with breast cancer incidence: FGFR2 (P = 1.23 × 10-5; replication P < 1.00 × 10-6), NEK10 (P = 3.57 × 10-4; replication P < 1.00 × 10-6), and SIVA1 (P = 5.49 × 10-4; replication P < 1.00 × 10-6). Of the 151 gene regions reported in previous literature, 19 (12.5%) showed evidence of association (P < 0.05) with the risk of early-onset breast cancer in the early-onset population. To predict incidence, whole-genome prediction was implemented on a subset of 3,076 participants who were additionally genotyped on a genome wide array. The whole-genome prediction outperformed a polygenic risk score [AUC, 0.636; 95% confidence interval (CI), 0.614-0.659 compared with 0.601; 95% CI, 0.578-0.623], and when combined with known epidemiologic risk factors, the AUC rose to 0.662 (95% CI, 0.640-0.684).Conclusions: This research supports a role for variation within FGFR2 and NEK10 in breast cancer incidence, and suggests SIVA1 as a novel risk locus.Impact: This analysis supports a shared genetic etiology between women with early- and late-onset breast cancer, and suggests whole-genome data can improve risk assessment. Cancer Epidemiol Biomarkers Prev; 27(9); 1057-64. ©2018 AACR.
©2018 American Association for Cancer Research.