Predicting Cancer Tissue-of-Origin by a Machine Learning Method Using DNA Somatic Mutation Data
Patients with carcinoma of unknown primary (CUP) account for 3–5% of all cancer cases. A large number of metastatic cancers require further diagnosis to determine their tissue of origin. However, diagnosis of CUP and identification of its primary site are challenging. Previous studies have suggested that molecular profiling of tissue-specific genes could be useful in inferring the primary tissue of a tumor. The purpose of this study was to evaluate the performance somatic mutations detected in a tumor to identify the cancer tissue of origin. We downloaded the somatic mutation datasets from the International Cancer Genome Consortium project. The random forest algorithm was used to extract features, and a classifier was established based on the logistic regression. Specifically, the somatic mutations of 300 genes were extracted, which are significantly enriched in functions, such as cell-to-cell adhesion. In addition, the prediction accuracy on tissue-of-origin inference for 3,374 cancer samples across 13 cancer types reached 81% in a 10-fold cross-validation. Our method could be useful in the identification of cancer tissue of origin, as well as the diagnosis and treatment of cancers.
CITE THIS COLLECTION
REFERENCES
- https://doi.org//10.1007/s11547-006-0112-6
- https://doi.org//10.1593/tlo.11325
- https://doi.org//10.1016/s0002-9440(10)63090-8
- https://doi.org//10.2353/jmoldx.2010.090197
- https://doi.org//10.18632/oncotarget.9067
- https://doi.org//10.1016/j.biocel.2016.02.007
- https://doi.org//10.1093/hmg/ddt539
- https://doi.org//10.1016/j.jmoldx.2011.04.004
- https://doi.org//10.1200/jco.2004.22.14_suppl.9545
- https://doi.org//10.1097/00008469-199301001-00026
- https://doi.org//10.1007/s00259-007-0456-8
- https://doi.org//10.1159/000336300
- https://doi.org//10.1200/jco.2012.43.3755
- https://doi.org//10.1016/j.cell.2014.06.049
SHARE
Usage metrics
Read the peer-reviewed publication
AUTHORS (11)
CATEGORIES
- Gene and Molecular Therapy
- Biomarkers
- Genetics
- Genetically Modified Animals
- Developmental Genetics (incl. Sex Determination)
- Epigenetics (incl. Genome Methylation and Epigenomics)
- Gene Expression (incl. Microarray and other genome-wide approaches)
- Livestock Cloning
- Genome Structure and Regulation
- Genetic Engineering
- Genomics