Table_1_Phenotype Prediction and Genome-Wide Association Study Using Deep Convolutional Neural Network of Soybean.xlsx (1.17 MB)

dataset
posted on 22.11.2019, 04:38 by Yang Liu, Duolin Wang, Fei He, Juexin Wang, Trupti Joshi, Dong Xu

Genomic selection uses single-nucleotide polymorphisms (SNPs) to predict quantitative phenotypes for enhancing traits in breeding populations, and it has been widely used to increase breeding efficiency in plants and animals. Existing statistical methods rely on assumed prior distributions for the effects of imputed genotypes, and these assumptions may not fit experimental datasets. Deep learning could serve as a powerful tool to predict quantitative phenotypes without imputation and to discover potentially associated genotype markers efficiently. We propose a deep-learning framework that uses convolutional neural networks (CNNs) to predict quantitative traits from SNPs and uses saliency maps to investigate genotype contributions to the trait. Missing SNP values are treated as an additional genotype category in the input to the deep learning model. We tested our framework on both simulated data and experimental soybean datasets. The results show that the deep learning model can bypass the imputation of missing values and predict quantitative phenotypes more accurately than other well-known statistical methods currently available. It can also effectively and efficiently identify significant SNP markers and SNP combinations associated with the trait in a genome-wide association study.
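As an illustration of treating missing SNP values as their own genotype category, the following is a minimal sketch rather than the authors' implementation. It assumes genotypes are coded 0, 1, 2 with -1 marking a missing call, and it assumes a one-hot encoding with four channels; the abstract specifies neither the codes nor the encoding, and the function name `one_hot_genotypes` is hypothetical.

```python
from typing import List, Sequence

# Assumed coding (not stated in the abstract): 0, 1, 2 are genotype
# classes and -1 marks a missing call.
MISSING = -1
CATEGORIES = (0, 1, 2, MISSING)


def one_hot_genotypes(samples: Sequence[Sequence[int]]) -> List[List[List[float]]]:
    """Encode each genotype call as a 4-channel one-hot vector.

    The fourth channel represents a missing call, so no imputation
    step is needed before the data reach the model.
    """
    index = {code: i for i, code in enumerate(CATEGORIES)}
    encoded = []
    for sample in samples:
        row = []
        for call in sample:
            if call not in index:
                raise ValueError(f"unexpected genotype code: {call!r}")
            vec = [0.0] * len(CATEGORIES)
            vec[index[call]] = 1.0
            row.append(vec)
        encoded.append(row)
    return encoded


if __name__ == "__main__":
    # One sample with four SNPs; the last call is missing.
    print(one_hot_genotypes([[0, 1, 2, MISSING]]))
```

Under these assumptions, each sample becomes a matrix with one row per SNP and four columns, a shape that a one-dimensional convolutional layer could take as input.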
