Frontiers
Browse
Data_Sheet_1_Wavelet-Based Genomic Signal Processing for Centromere Identification and Hypothesis Generation.PDF (8.86 MB)

Data_Sheet_1_Wavelet-Based Genomic Signal Processing for Centromere Identification and Hypothesis Generation.PDF

Download (8.86 MB)
dataset
posted on 2019-05-31, 05:55 authored by Deborah Weighill, David Macaya-Sanz, Stephen Paul DiFazio, Wayne Joubert, Manesh Shah, Jeremy Schmutz, Avinash Sreedasyam, Gerald Tuskan, Daniel Jacobson

Various ‘omics data types have been generated for Populus trichocarpa, each providing a layer of information which can be represented as a density signal across a chromosome. We make use of genome sequence data, variants data across a population as well as methylation data across 10 different tissues, combined with wavelet-based signal processing to perform a comprehensive analysis of the signature of the centromere in these different data signals, and successfully identify putative centromeric regions in P. trichocarpa from these signals. Furthermore, using SNP (single nucleotide polymorphism) correlations across a natural population of P. trichocarpa, we find evidence for the co-evolution of the centromeric histone CENH3 with the sequence of the newly identified centromeric regions, and identify a new CENH3 candidate in P. trichocarpa.

History