TCLP: an online cancer cell line catalogue integrating HLA type, predicted neo-epitopes, virus and gene expression

Genome Med. 2015 Nov 20;7:118. doi: 10.1186/s13073-015-0240-5.


Human cancer cell lines are an important resource for research and drug development. However, the available annotations of cell lines are sparse, incomplete, and distributed in multiple repositories. Re-analyzing publicly available raw RNA-Seq data, we determined the human leukocyte antigen (HLA) type and abundance, identified expressed viruses and calculated gene expression of 1,082 cancer cell lines. Using the determined HLA types, public databases of cell line mutations, and existing HLA binding prediction algorithms, we predicted antigenic mutations in each cell line. We integrated the results into a comprehensive knowledgebase. Using the Django web framework, we provide an interactive user interface with advanced search capabilities to find and explore cell lines and an application programming interface to extract cell line information. The portal is available at
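The neo-epitope prediction step described above (HLA types plus known mutations, passed to a binding predictor) can be illustrated with a minimal sketch of candidate generation. It is not the authors' pipeline. The function names, the restriction to single amino-acid substitutions, and the peptide lengths of 8 to 11 residues are all assumptions made for illustration. The binding prediction itself is left out, since the abstract does not name the algorithm used.

```python
from typing import Iterator, List, Sequence, Tuple


def mutant_peptides(protein: str, pos: int, alt: str, length: int = 9) -> Iterator[str]:
    """Yield every `length`-mer of the mutated protein that covers `pos` (0-based).

    Hypothetical helper: handles a single amino-acid substitution only.
    """
    if not 0 <= pos < len(protein):
        raise ValueError("mutation position outside protein")
    # Apply the substitution to the reference protein sequence.
    mutated = protein[:pos] + alt + protein[pos + 1:]
    # A window starting at `start` covers `pos` when start <= pos < start + length.
    first = max(0, pos - length + 1)
    last = min(pos, len(mutated) - length)
    for start in range(first, last + 1):
        yield mutated[start:start + length]


def candidate_pairs(
    protein: str,
    pos: int,
    alt: str,
    alleles: Sequence[str],
    lengths: Sequence[int] = (8, 9, 10, 11),  # assumed class I peptide lengths
) -> List[Tuple[str, str]]:
    """Pair each mutated peptide with each HLA allele determined for the cell line."""
    return [
        (peptide, allele)
        for n in lengths
        for peptide in mutant_peptides(protein, pos, alt, n)
        for allele in alleles
    ]


if __name__ == "__main__":
    # Toy sequence and alleles, not data from the catalogue.
    pairs = candidate_pairs("ACDEFGHIKLMNPQRSTVWY", 10, "A", ["HLA-A*02:01"], lengths=(9,))
    for peptide, allele in pairs:
        print(peptide, allele)
```

Each (peptide, allele) pair would then be scored by an HLA binding predictor, and pairs predicted to bind would be reported as candidate antigenic mutations for that cell line.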

MeSH terms

  • Algorithms
  • Cell Line, Tumor*
  • Computational Biology / methods
  • Databases, Genetic*
  • Databases, Nucleic Acid
  • Epitopes / genetics
  • Epitopes / immunology
  • Gene Expression
  • HCT116 Cells
  • HLA Antigens / genetics*
  • HLA Antigens / immunology*
  • Humans
  • Information Systems
  • Internet
  • Mutation
  • Neoplasms / genetics*
  • Neoplasms / immunology*
  • Neoplasms / pathology
  • Neoplasms / virology
  • Online Systems*
  • User-Computer Interface


Substances

  • Epitopes
  • HLA Antigens