Table_1_Evaluating Structural Variation Detection Tools for Long-Read Sequencing Datasets in Saccharomyces cerevisiae.xlsx
Structural variation (SV) represents a major form of genetic variations that contribute to polymorphic variations, human diseases, and phenotypes in many organisms. Long-read sequencing has been successfully used to identify novel and complex SVs. However, comparison of SV detection tools for long-read sequencing datasets has not been reported. Therefore, we developed an analysis workflow that combined two alignment tools (NGMLR and minimap2) and five callers (Sniffles, Picky, smartie-sv, PBHoney, and NanoSV) to evaluate the SV detection in six datasets of Saccharomyces cerevisiae. The accuracy of SV regions was validated by re-aligning raw reads in diverse alignment tools, SV callers, experimental conditions, and sequencing platforms. The results showed that SV detection between NGMLR and minimap2 was not significant when using the same caller. The PBHoney was with the highest average accuracy (89.04%) and Picky has the lowest average accuracy (35.85%). The accuracy of NanoSV, Sniffles, and smartie-sv was 68.67%, 60.47%, and 57.67%, respectively. In addition, smartie-sv and NanoSV detected the most and least number of SVs, and SV detection from the PacBio sequencing platform was significantly more than that from ONT (p = 0.000173).
History
References
- https://doi.org//10.1016/j.ajhg.2014.09.005
- https://doi.org//10.1038/nature13907
- https://doi.org//10.1038/s41467-018-08148-z
- https://doi.org//10.1038/s41467-017-01343-4
- https://doi.org//10.1016/j.tibtech.2019.02.003
- https://doi.org//10.2307/2531595
- https://doi.org//10.1101/pdb.top077602
- https://doi.org//10.1186/1471-2105-15-180
- https://doi.org//10.1101/gr.107995.110
- https://doi.org//10.1038/s41592-018-0002-6
- https://doi.org//10.1148/radiology.143.1.7063747
- https://doi.org//10.1038/nprot.2008.211
- https://doi.org//10.1101/gr.214007.116
- https://doi.org//10.1534/genetics.116.192823
- https://doi.org//10.1038/ncomms14061
- https://doi.org//10.1016/j.jmoldx.2016.02.003
- https://doi.org//10.1126/science.aar6343
- https://doi.org//10.1093/bioinformatics/bty191
- https://doi.org//10.1038/s41467-019-08289-9
- https://doi.org//10.1038/nature09708
- https://doi.org//10.1128/genomeA.01126-17
- https://doi.org//10.1101/gr.161497.113
- https://doi.org//10.1016/j.gpb.2015.08.002
- https://doi.org//10.1038/s41592-018-0001-7
- https://doi.org//10.1038/s41586-018-0382-x
- https://doi.org//10.1038/s41576-018-0007-0
- https://doi.org//10.1146/annurev-med-100708-204735
- https://doi.org//10.1093/bioinformatics/bts535
- https://doi.org//10.1038/nrg3373
- https://doi.org//10.1186/gb-2010-11-12-r128
- https://doi.org//10.1038/s41588-019-0427-6
- https://doi.org//10.1101/gr.142646.112
Usage metrics
Read the peer-reviewed publication
Categories
- Gene and Molecular Therapy
- Biomarkers
- Genetics
- Genetically Modified Animals
- Developmental Genetics (incl. Sex Determination)
- Epigenetics (incl. Genome Methylation and Epigenomics)
- Gene Expression (incl. Microarray and other genome-wide approaches)
- Livestock Cloning
- Genome Structure and Regulation
- Genetic Engineering
- Genomics