Data_Sheet_1_DECtp: Calling Differential Gene Expression Between Cancer and Normal Samples by Integrating Tumor Purity Information.docx (1.09 MB)
Download file

Data_Sheet_1_DECtp: Calling Differential Gene Expression Between Cancer and Normal Samples by Integrating Tumor Purity Information.docx

Download (1.09 MB)
dataset
posted on 28.08.2018, 04:24 by Weiwei Zhang, Haixia Long, Binsheng He, Jialiang Yang

Identifying differentially expressed genes (DEGs) between tumor and normal samples is critical for studying tumorigenesis, and has been routinely applied to identify diagnostic, prognostic, and therapeutic biomarkers for many cancers. It is well-known that solid tumor tissue samples obtained from clinical settings are always mixtures of cancer and normal cells. However, the tumor purity information is more or less ignored in traditional differential expression analyses, which might decrease the power of differential gene identification or even bias the results. In this paper, we have developed a novel differential gene calling method called DECtp by integrating tumor purity information into a generalized least square procedure, followed by the Wald test. We compared DECtp with popular methods like t-test and limma on nine simulation datasets with different sample sizes and noise levels. DECtp achieved the highest area under curves (AUCs) for all the comparisons, suggesting that cancer purity information is critical for DEG calling between tumor and normal samples. In addition, we applied DECtp into cancer and normal samples of 14 tumor types collected from The Cancer Genome Atlas (TCGA) and compared the DEGs with those called by limma. As a result, DECtp achieved more sensitive, consistent, and biologically meaningful results and identified a few novel DEGs for further experimental validation.

History