Statistical estimates from black non-Hispanic female breast cancer data

Asian Pac J Cancer Prev. 2014;15(19):8371-6. doi: 10.7314/apjcp.2014.15.19.8371.


Background: The use of statistical methods has become an imperative tool in breast cancer survival data analysis. The purpose of this study was to develop the best statistical probability model using the Bayesian method to predict future survival times for the black non-Hispanic female breast cancer patients diagnosed during 1973- 2009 in the U.S.

Materials and methods: We used a stratified random sample of black non-Hispanic female breast cancer patient data from the Surveillance Epidemiology and End RESULTS (SEER) database. Survival analysis was performed using Kaplan-Meier and Cox proportional regression methods. Four advanced types of statistical models, Exponentiated Exponential (EE), Beta Generalized Exponential (BGE), Exponentiated Weibull (EW), and Beta Inverse Weibull (BIW) were utilized for data analysis. The statistical model building criteria, Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC), and Deviance Information Criteria (DIC) were used to measure the goodness of fit tests. Furthermore, we used the Bayesian approach to obtain the predictive survival inferences from the best-fit data based on the exponentiated Weibull model.

Results: We identified the highest number of black non-Hispanic female breast cancer patients in Michigan and the lowest in Hawaii. The mean (SD), of age at diagnosis (years) was 58.3 (14.43). The mean (SD), of survival time (months) for black non- Hispanic females was 66.8 (30.20). Non-Hispanic blacks had a significantly increased risk of death compared to Black Hispanics (Hazard ratio: 1.96, 95%CI: 1.51-2.54). Compared to other statistical probability models, we found that the exponentiated Weibull model better fits for the survival times. By making use of the Bayesian method predictive inferences for future survival times were obtained.

Conclusions: These findings will be of great significance in determining appropriate treatment plans and health-care cost allocation. Furthermore, the same approach should contribute to build future predictive models for any health related diseases.

MeSH terms

  • Bayes Theorem
  • Blacks / statistics & numerical data*
  • Breast Neoplasms / mortality*
  • Ethnicity / statistics & numerical data*
  • Female
  • Follow-Up Studies
  • Humans
  • Middle Aged
  • Models, Statistical*
  • Prognosis
  • SEER Program
  • Survival Rate