Table7_A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins.XLSX
Considering that traditional biological experiments are expensive and time consuming, it is important to develop effective computational models to infer potential essential proteins. In this manuscript, a novel collaborative filtering model-based method called CFMM was proposed, in which, an updated protein–domain interaction (PDI) network was constructed first by applying collaborative filtering algorithm on the original PDI network, and then, through integrating topological features of PDI networks with biological features of proteins, a calculative method was designed to infer potential essential proteins based on an improved PageRank algorithm. The novelties of CFMM lie in construction of an updated PDI network, application of the commodity-customer-based collaborative filtering algorithm, and introduction of the calculation method based on an improved PageRank algorithm, which ensured that CFMM can be applied to predict essential proteins without relying entirely on known protein–domain associations. Simulation results showed that CFMM can achieve reliable prediction accuracies of 92.16, 83.14, 71.37, 63.87, 55.84, and 52.43% in the top 1, 5, 10, 15, 20, and 25% predicted candidate key proteins based on the DIP database, which are remarkably higher than 14 competitive state-of-the-art predictive models as a whole, and in addition, CFMM can achieve satisfactory predictive performances based on different databases with various evaluation measurements, which further indicated that CFMM may be a useful tool for the identification of essential proteins in the future.
History
References
- https://doi.org//10.1093/nar/gkh121
- https://doi.org//10.1093/database/bau012
- https://doi.org//10.1086/228631
- https://doi.org//10.1371/journal.pone.0184129
- https://doi.org//10.1109/ACCESS.2020.2964571
- https://doi.org//10.1093/nar/26.1.73
- https://doi.org//10.1111/j.1440-1711.2005.01332.x
- https://doi.org//10.1007/978-3-319-60137-3
- https://doi.org//10.1101/gr.1073603
- https://doi.org//10.1103/PhysRevE.71.056103
- https://doi.org//10.1109/BIBM.2016.7822501
- https://doi.org//10.3390/molecules23071569
- https://doi.org//10.1038/nature04532
- https://doi.org//10.1038/nature00935
- https://doi.org//10.1093/molbev/msi072
- https://doi.org//10.1186/1471-2180-9-243
- https://doi.org//10.1038/35075138
- https://doi.org//10.1016/j.ymeth.2015.04.013
- https://doi.org//10.1155/JBB.2005.96
- https://doi.org//10.1038/nature04670
- https://doi.org//10.1016/j.knosys.2018.03.027
- https://doi.org//10.1016/j.compbiolchem.2011.04.002
- https://doi.org//10.1186/1752-0509-6-15
- https://doi.org//10.1109/ACCESS.2020.2993860
- https://doi.org//10.1016/j.jtbi.2020.110414
- https://doi.org//10.1007/s00500-017-2964-1
- https://doi.org//10.3389/fgene.2021.645932
- https://doi.org//10.1093/nar/gkh092
- https://doi.org//10.1093/nar/gkp931
- https://doi.org//10.1109/JBHI.2015.2513200
- https://doi.org//10.3389/fmicb.2020.592430
- https://doi.org//10.1109/TCBB.2014.2338317
- https://doi.org//10.1186/1752-0509-6-87
- https://doi.org//10.1109/BIBM.2015.7359693
- https://doi.org//10.1186/1471-2105-8-111
- https://doi.org//10.1371/journal.pone.0182031
- https://doi.org//10.1371/journal.pone.0161042
- https://doi.org//10.1016/0378-8733%2889%2990016-6
- https://doi.org//10.1126/science.1120499
- https://doi.org//10.1093/bioinformatics/btr500
- https://doi.org//10.1007/978-3-642-21260-4_12
- https://doi.org//10.1109/TCBB.2011.147
- https://doi.org//10.1002/prca.201200068
- https://doi.org//10.1016/S0022-5193%2803%2900071-7
- https://doi.org//10.1093/nar/30.1.303
- https://doi.org//10.1109/TCBB.2017.2701824
- https://doi.org//10.1093/nar/gkn858
- https://doi.org//10.1109/tcbb.2016.2615931
- https://doi.org//10.1371/journal.pone.0195410
- https://doi.org//10.1371/journal.pone.0058763
- https://doi.org//10.1109/tnb.2014.2337912
- https://doi.org//10.1186/s12859-019-2930-2
- https://doi.org//10.3390/molecules24091714
Usage metrics
Read the peer-reviewed publication
Categories
- Gene and Molecular Therapy
- Gene Expression (incl. Microarray and other genome-wide approaches)
- Genetics
- Genetically Modified Animals
- Livestock Cloning
- Developmental Genetics (incl. Sex Determination)
- Epigenetics (incl. Genome Methylation and Epigenomics)
- Biomarkers
- Genomics
- Genome Structure and Regulation
- Genetic Engineering