Data_Sheet_2_Identifying Differentially Expressed Genes of Zero Inflated Single Cell RNA Sequencing Data Using Mixed Model Score Tests.docx
Single cell RNA sequencing (scRNA-seq) allows quantitative measurement and comparison of gene expression at the resolution of single cells. Ignoring the batch effects and zero inflation of scRNA-seq data, many proposed differentially expressed (DE) methods might generate bias. We propose a method, single cell mixed model score tests (scMMSTs), to efficiently identify DE genes of scRNA-seq data with batch effects using the generalized linear mixed model (GLMM). scMMSTs treat the batch effect as a random effect. For zero inflation, scMMSTs use a weighting strategy to calculate observational weights for counts independently under zero-inflated and zero-truncated distributions. Counts data with calculated weights were subsequently analyzed using weighted GLMMs. The theoretical null distributions of the score statistics were constructed by mixed Chi-square distributions. Intensive simulations and two real datasets were used to compare edgeR-zinbwave, DESeq2-zinbwave, and scMMSTs. Our study demonstrates that scMMSTs, as supplement to standard methods, are advantageous to define DE genes of zero-inflated scRNA-seq data with batch effects.
History
References
- https://doi.org//10.1111/j.2517-6161.1995.tb02031.x
- https://doi.org//10.1111/1467-985X.00130
- https://doi.org//10.2307/2290687
- https://doi.org//10.1038/nbt.4096
- https://doi.org//10.1038/s41592-018-0254-1
- https://doi.org//10.1016/j.ajhg.2018.12.012
- https://doi.org//10.1186/s13059-016-1033-x
- https://doi.org//10.1007/978-1-4614-6868-4
- https://doi.org//10.7287/peerj.preprints.3188v1
- https://doi.org//10.18637/jss.v040.i08
- https://doi.org//10.1186/s13059-015-0844-5
- https://doi.org//10.1038/nbt.4091
- https://doi.org//10.1080/01621459.1977.10480998
- https://doi.org//10.1186/s13059-016-0938-8
- https://doi.org//10.1101/025528
- https://doi.org//10.1093/biostatistics/kxj037
- https://doi.org//10.1016/j.molcel.2015.04.005
- https://doi.org//10.1038/nm.4466
- https://doi.org//10.1016/j.neuron.2018.12.006
- https://doi.org//10.1186/s13059-014-0550-8
- https://doi.org//10.15252/msb.20188746
- https://doi.org//10.1093/nar/gks042
- https://doi.org//10.1016/j.ccr.2011.07.005
- https://doi.org//10.1186/s13073-020-00799-2
- https://doi.org//10.1038/nri.2017.76
- https://doi.org//10.1080/01621459.1955.10501973
- https://doi.org//10.1038/s41467-017-02554-5
- https://doi.org//10.1093/nar/gkv007
- https://doi.org//10.1186/1471-2105-12-77
- https://doi.org//10.1093/bioinformatics/btp616
- https://doi.org//10.1038/s41586-018-0024-3
- https://doi.org//10.1002/bimj.200610341
- https://doi.org//10.1186/s12863-019-0739-7
- https://doi.org//10.1186/s12859-019-2855-9
- https://doi.org//10.1038/nmeth.3805
- https://doi.org//10.1002/gepi.21717
- https://doi.org//10.1093/bioinformatics/bty644
- https://doi.org//10.3390/cells8101161
- https://doi.org//10.1038/nmeth.1315
- https://doi.org//10.1038/srep39921
- https://doi.org//10.1038/nn.3881
- https://doi.org//10.1186/s13059-018-1406-4
- https://doi.org//10.1101/157982
- https://doi.org//10.1038/nrg2484
- https://doi.org//10.1016/j.ajhg.2011.05.029
- https://doi.org//10.7717/peerj.3797
- https://doi.org//10.1186/s13059-017-1305-0
- https://doi.org//10.1126/science.aaa1934
Usage metrics
Read the peer-reviewed publication
Categories
- Gene and Molecular Therapy
- Biomarkers
- Genetics
- Genetically Modified Animals
- Developmental Genetics (incl. Sex Determination)
- Epigenetics (incl. Genome Methylation and Epigenomics)
- Gene Expression (incl. Microarray and other genome-wide approaches)
- Livestock Cloning
- Genome Structure and Regulation
- Genetic Engineering
- Genomics