An information-theoretical model for breast cancer detection

Methods Inf Med. 2008;47(4):322-7.


Objectives: Formal diagnostic modeling is an important line of modern biological and medical research. The construction of a formal diagnostic model consists of two stages: first, the estimation of correlation between model parameters and the disease under consideration; and second, the construction of a diagnostic decision rule using these correlation estimates. A serious drawback of current diagnostic models is the absence of a unified mathematical methodological approach to implementing these two stages. The absence of a unified approach makes the theoretical/biomedical substantiation of diagnostic rules difficult and reduces the efficacy of actual diagnostic model application.

Methods: The present study constructs a formal model for breast cancer detection. The diagnostic model is based on information theory. Normalized mutual information is chosen as the measure of relevance between parameters and the patterns studied. The "nearest neighbor" rule is utilized for diagnosis, while the distance between elements is the weighted Hamming distance. The model concomitantly employs cellular fluorescence polarization as the quantitative input parameter and cell receptor expression as qualitative parameters.

Results: Twenty-four healthy individuals and 34 patients (not including the subjects analyzed for the model construction) were tested by the model. Twenty-three healthy subjects and 34 patients were correctly diagnosed.

Conclusions: The proposed diagnostic model is an open one, i.e. it can accommodate new additional parameters, which may increase its effectiveness.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Breast Neoplasms / diagnosis*
  • Female
  • Fluorescence Polarization
  • Humans
  • Information Theory*
  • Models, Statistical*