Sequencing technologies are evolving at a rapid pace, enabling the generation of massive amounts of data in multiple dimensions (e.g., genomics, epigenomics, transcriptomic …
Data-centric approaches have been used to develop predictive methods for elucidating uncharacterized properties of proteins; however, studies indicate that these methods should …
Understanding the genetic regulatory code governing gene expression is an important challenge in molecular biology. However, how individual coding and non-coding regions of …
The transcription regulatory network inside a eukaryotic cell is defined by the combinatorial actions of transcription factors (TFs). However, TF binding studies in plants are too few in …
Cis-regulatory mutations underlie important crop domestication and improvement traits. However, limited allelic diversity has hindered functional dissection of the large number of …
Although remarkable advances have been reported in high-throughput sequencing, the ability to aptly analyze a substantial amount of rapidly generated biological …
The exponential increase in sequencing data calls for conceptual and computational advances to extract useful biological insights. One such advance, minimizers, allows for …
X Tu, S Ren, W Shen, J Li, Y Li, C Li, Y Li… - Nature …, 2022 - nature.com
Non-coding cis-regulatory variants in animal genomes are an important driving force in the evolution of transcription regulation and phenotype diversity. However, cistrome dynamics in …
We are now witnessing significant progress of deep learning methods in a variety of tasks (or datasets) of proteins. However, there is a lack of a standard benchmark to evaluate the …