Biological sequence classification: A review on data and general methods

C Ao, S Jiao, Y Wang, L Yu, Q Zou - Research, 2022 - spj.science.org
With the rapid development of biotechnology, the number of biological sequences has
grown exponentially. The continuous expansion of biological sequence data promotes the …

MathFeature: feature extraction package for DNA, RNA and protein sequences based on mathematical descriptors

RP Bonidia, DS Domingues, DS Sanches… - Briefings in …, 2022 - academic.oup.com
One of the main challenges in applying machine learning algorithms to biological sequence
data is how to numerically represent a sequence in a numeric input vector. Feature …

iFeatureOmega: an integrative platform for engineering, visualization and analysis of features from molecular sequences, structural and ligand data sets

Z Chen, X Liu, P Zhao, C Li, Y Wang, F Li… - Nucleic acids …, 2022 - academic.oup.com
The rapid accumulation of molecular data motivates development of innovative approaches
to computationally characterize sequences, structures and functions of biological and …

Predicting Parkinson disease related genes based on PyFeat and gradient boosted decision tree

M Helmy, E Eldaydamony, N Mekky, M Elmogy… - Scientific Reports, 2022 - nature.com
Identifying genes related to Parkinson's disease (PD) is an active research topic in
biomedical analysis, which plays a critical role in diagnosis and treatment. Recently, many …

TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization

YJ Jeon, MM Hasan, HW Park, KW Lee… - Briefings in …, 2022 - academic.oup.com
Long noncoding RNAs (lncRNAs) are primarily regulated by their cellular localization, which
is responsible for their molecular functions, including cell cycle regulation and genome …

BioAutoML: automated feature engineering and metalearning to predict noncoding RNAs in bacteria

RP Bonidia, APA Santos, BLS de Almeida… - Briefings in …, 2022 - academic.oup.com
Recent technological advances have led to an exponential expansion of biological
sequence data and extraction of meaningful information through Machine Learning (ML) …

Prediction of linear cationic antimicrobial peptides active against gram-negative and gram-positive bacteria based on machine learning models

ÜG Söylemez, M Yousef, Z Kesmen, ME Büyükkiraz… - Applied Sciences, 2022 - mdpi.com
Antimicrobial peptides (AMPs) are considered as promising alternatives to conventional
antibiotics in order to overcome the growing problems of antibiotic resistance …

ProPythia: a Python package for protein classification based on machine and deep learning

AM Sequeira, D Lousa, M Rocha - Neurocomputing, 2022 - Elsevier
The field of protein data mining has been growing rapidly in the last years. To characterize
proteins and determine their function from their amino acid sequences are challenging and …

Identification of DNA modification sites based on elastic net and bidirectional gated recurrent unit with convolutional neural network

B Yu, Y Zhang, X Wang, H Gao, J Sun, X Gao - … Signal Processing and …, 2022 - Elsevier
Abstract DNA N4-methylcytosine (4mC) and DNA N6-methyladenine (6mA) are significant
epigenetic modifications. 4mC is closely related to the restriction modification system, and …

Geographic encoding of transcripts enabled high-accuracy and isoform-aware deep learning of RNA methylation

D Huang, K Chen, B Song, Z Wei, J Su… - Nucleic acids …, 2022 - academic.oup.com
As the most pervasive epigenetic mark present on mRNA and lncRNA, N6-methyladenosine
(m6A) RNA methylation regulates all stages of RNA life in various biological processes and …