Microsatellite instability, a genetic indication of DNA mismatch impairment, provides promising treatment options. Our study aimed to detect the mutation with whole-slide image (WSI) and discover the most effective pre-trained deep-learning model to sort diagnostic slides between high microsatellite instability (MSI-H) and microsatellite stable (MSS). WSI data retrieved from public dataset were processed for training and evaluating MSI categorization model. We detected MSI in slide levels for colorectal cancer (CRC), stomach adenocarcinoma (STAD), uterine corpus, and endometrial adenocarcinoma (UCEC). Models trained with a single tissue type were evaluated with the test dataset of corresponding tissue and subsequently with the test dataset of other types of tissue (cross-tissue evaluation). Finally, another model trained with multi-tissue types was built to predict the test dataset of individual tissue. Our models achieved AUC values of 0.93, 0.84, and 0.79 in TCGA-CRC, TCGA-STAD and TCGA-UCEC, respectively. We observed that a model trained on a corresponding tumor tissue demonstrates higher accuracy, particularly compared to those trained on other tumor tissues. In the combined model trained on multi-tissue, we observed diverse outcomes regarding which model was prioritized depending on the cancer type. These results demonstrate that models trained on multiple tissues have the potential to discern features that are generalizable across different types of cancer.
Copyright: © 2025 Lee et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.