Identification of patients with low-risk for aneuploidy: comparative discriminatory models using linear and machine-learning classifiers in prostate cancer

Prostate. 2007 Oct 1;67(14):1524-36. doi: 10.1002/pros.20629.


Background: Prostate needle biopsy (PNB) ploidy status has proven utility to predict adverse outcomes after prostatectomy. We sought to develop models to predict ploidy status using clinicopathologic variables.

Methods: We identified a cohort of 169 patients with a diagnosis of prostatic adenocarcinoma on PNB, and estimated ploidy status (determined using Feulgen stained biopsy tissue) using four predictors, including age, prebiopsy PSA, highest Gleason score (GS), and the percentage of involvement by carcinoma at the biopsy site with the highest GS (PCARBX). Logistic regression (LR), Neural Network (NN), and CART classifiers were constructed.

Results: Univariate analyses revealed all four predictors to be significantly associated with ploidy status. On multivariable analyses, LR identified a 2-parameter model, including GS and PCARBX that had a significant ability to predict ploidy status with a 74% and 75% correct classification rate (CCR), respectively. Using the same variables, CART and NN yielded similar CCRs of 70.4%. Within GS = 6 cohort, the CART model classified over 90% of biopsies as diploid when patients had a PCARBX < 55% and a log(PSA) < 1.7.

Conclusions: Our study demonstrates that models using GS and PCARBX are able to predict PNB ploidy status with acceptable accuracy. While machine learning classifier-derived models yield similar accuracy as LR-derived models, the latter methodology has the distinct advantage of being applicable in future datasets to estimate case-specific predictions. This information may be useful in identifying potentially aneuploid patients, who can then be targeted for more aggressive therapy.

Publication types

  • Validation Study

MeSH terms

  • Adenocarcinoma / blood
  • Adenocarcinoma / classification
  • Adenocarcinoma / diagnosis
  • Adenocarcinoma / genetics*
  • Adult
  • Aged
  • Aged, 80 and over
  • Aneuploidy*
  • Biopsy, Needle
  • Cohort Studies
  • DNA, Neoplasm / genetics
  • Discriminant Analysis
  • Humans
  • Image Cytometry
  • Logistic Models
  • Male
  • Middle Aged
  • Models, Statistical
  • Neural Networks, Computer*
  • Predictive Value of Tests
  • Prostate-Specific Antigen / blood
  • Prostatic Neoplasms / blood
  • Prostatic Neoplasms / classification
  • Prostatic Neoplasms / diagnosis
  • Prostatic Neoplasms / genetics*
  • Risk Assessment


  • DNA, Neoplasm
  • Prostate-Specific Antigen