Table_2_Prediction of Pathogenic Factors in Dysbiotic Gut Microbiomes of Colorectal Cancer Patients Using Reverse Microbiomics.xlsx (14.46 kB)

Table_2_Prediction of Pathogenic Factors in Dysbiotic Gut Microbiomes of Colorectal Cancer Patients Using Reverse Microbiomics.xlsx

Download (14.46 kB)
posted on 2022-04-27, 04:13 authored by Haihe Wang, Kaibo Zhang, Lin Wu, Qian Qin, Yongqun He

Gut microbiome plays a crucial role in the formation and progression of colorectal cancer (CRC). To better identify the underlying gene-level pathogenic mechanisms of microbiome-associated CRC, we applied our newly developed Reverse Microbiomics (RM) to predict potential pathogenic factors using the data of microbiomes in CRC patients.


Our literature search first identified 40 bacterial species enriched and 23 species depleted in the guts of CRC patients. These bacteria were systematically modeled and analyzed using the NCBI Taxonomy ontology. Ten species, including 6 enriched species (e.g., Bacteroides fragilis, Fusobacterium nucleatum and Streptococcus equinus) and 4 depleted species (e.g., Bacteroides uniformis and Streptococcus thermophilus) were chosen for follow-up comparative genomics analysis. Vaxign was used to comparatively analyze 47 genome sequences of these ten species. In total 18 autoantigens were predicted to contribute to CRC formation, six of which were reported with experimental evidence to be correlated with drug resistance and/or cell invasiveness of CRC. Interestingly, four human homology proteins (EDK89078.1, EDK87700.1, EDK89777.1, and EDK89145.1) are conserved among all enriched strains. Furthermore, we predicted 76 potential virulence factors without homology to human proteins, including two riboflavin synthase proteins, three ATP-binding cassettes (ABC) transporter protein family proteins, and 12 outer membrane proteins (OMPs). Riboflavin synthase is present in all the enriched strains but not in depleted species. The critical role of riboflavin synthase in CRC development was further identified from its hub role in our STRING-based protein−protein interaction (PPI) network analysis and from the finding of the riboflavin metabolism as the most significantly enriched pathway in our KEGG pathway analysis. A novel model of the CRC pathogenesis involving riboflavin synthase and other related proteins including TpiA and GrxC was further proposed.


The RM strategy was used to predict 18 autoantigens and 76 potential virulence factors from CRC-associated microbiome data. In addition to many of these autoantigens and virulence factors experimentally verified as reported in the literature, our study predicted many new pathogenetic factors and developed a new model of CRC pathogenesis involving the riboflavin synthase from the enriched colorectal bacteria and other associated proteins.