DataSheet1_Protein Function Prediction Based on PPI Networks: Network Reconstruction vs Edge Enrichment.pdf
Over the past decades, massive amounts of protein-protein interaction (PPI) data have been accumulated due to the advancement of high-throughput technologies, and but data quality issues (noise or incompleteness) of PPI have been still affecting protein function prediction accuracy based on PPI networks. Although two main strategies of network reconstruction and edge enrichment have been reported on the effectiveness of boosting the prediction performance in numerous literature studies, there still lack comparative studies of the performance differences between network reconstruction and edge enrichment. Inspired by the question, this study first uses three protein similarity metrics (local, global and sequence) for network reconstruction and edge enrichment in PPI networks, and then evaluates the performance differences of network reconstruction, edge enrichment and the original networks on two real PPI datasets. The experimental results demonstrate that edge enrichment work better than both network reconstruction and original networks. Moreover, for the edge enrichment of PPI networks, the sequence similarity outperformes both local and global similarity. In summary, our study can help biologists select suitable pre-processing schemes and achieve better protein function prediction for PPI networks.
History
Usage metrics
Categories
- Gene and Molecular Therapy
- Gene Expression (incl. Microarray and other genome-wide approaches)
- Genetics
- Genetically Modified Animals
- Livestock Cloning
- Developmental Genetics (incl. Sex Determination)
- Epigenetics (incl. Genome Methylation and Epigenomics)
- Biomarkers
- Genomics
- Genome Structure and Regulation
- Genetic Engineering