Deep learning for plant genomics and crop improvement

H Wang, E Cimen, N Singh, E Buckler - Current opinion in plant biology, 2020 - Elsevier
Our era has witnessed tremendous advances in plant genomics, characterized by an
explosion of high-throughput techniques to identify multi-dimensional genome-wide …

Machine learning: its challenges and opportunities in plant system biology

M Hesami, M Alizadeh, AMP Jones… - Applied Microbiology and …, 2022 - Springer
Sequencing technologies are evolving at a rapid pace, enabling the generation of massive
amounts of data in multiple dimensions (eg, genomics, epigenomics, transcriptomic …

Learning functional properties of proteins with language models

S Unsal, H Atas, M Albayrak, K Turhan… - Nature Machine …, 2022 - nature.com
Data-centric approaches have been used to develop predictive methods for elucidating
uncharacterized properties of proteins; however, studies indicate that these methods should …

Deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure

J Zrimec, CS Börlin, F Buric, AS Muhammad… - Nature …, 2020 - nature.com
Understanding the genetic regulatory code governing gene expression is an important
challenge in molecular biology. However, how individual coding and non-coding regions of …

Reconstructing the maize leaf regulatory network using ChIP-seq data of 104 transcription factors

X Tu, MK Mejía-Guerra, JA Valdes Franco… - Nature …, 2020 - nature.com
The transcription regulatory network inside a eukaryotic cell is defined by the combinatorial
actions of transcription factors (TFs). However, TF binding studies in plants are too few in …

Dissecting cis-regulatory control of quantitative trait variation in a plant stem cell circuit

X Wang, L Aguirre, D Rodríguez-Leal, A Hendelman… - Nature Plants, 2021 - nature.com
Cis-regulatory mutations underlie important crop domestication and improvement traits,.
However, limited allelic diversity has hindered functional dissection of the large number of …

Representation learning applications in biological sequence analysis

H Iuchi, T Matsutani, K Yamada, N Iwano… - Computational and …, 2021 - Elsevier
Although remarkable advances have been reported in high-throughput sequencing, the
ability to aptly analyze a substantial amount of rapidly generated biological …

When less is more: sketching with minimizers in genomics

M Ndiaye, S Prieto-Baños, LM Fitzgerald… - Genome Biology, 2024 - Springer
The exponential increase in sequencing data calls for conceptual and computational
advances to extract useful biological insights. One such advance, minimizers, allows for …

Limited conservation in cross-species comparison of GLK transcription factor binding suggested wide-spread cistrome divergence

X Tu, S Ren, W Shen, J Li, Y Li, C Li, Y Li… - Nature …, 2022 - nature.com
Non-coding cis-regulatory variants in animal genomes are an important driving force in the
evolution of transcription regulation and phenotype diversity. However, cistrome dynamics in …

Peer: a comprehensive and multi-task benchmark for protein sequence understanding

M Xu, Z Zhang, J Lu, Z Zhu, Y Zhang… - Advances in …, 2022 - proceedings.neurips.cc
We are now witnessing significant progress of deep learning methods in a variety of tasks
(or datasets) of proteins. However, there is a lack of a standard benchmark to evaluate the …