Deep learning, graph-based text representation and classification: a survey, perspectives and challenges

P Pham, LTT Nguyen, W Pedrycz, B Vo - Artificial Intelligence Review, 2023 - Springer
Recently, with the rapid developments of the Internet and social networks, there have been
tremendous increase in the amount of complex-structured text resources. These information …

PETSC: pattern-based embedding for time series classification

L Feremans, B Cule, B Goethals - Data Mining and Knowledge Discovery, 2022 - Springer
Efficient and interpretable classification of time series is an essential data mining task with
many real-world applications. Recently several dictionary-and shapelet-based time series …

Random subsequence forests

Z He, J Wang, M Jiang, L Hu, Q Zou - Information Sciences, 2024 - Elsevier
The random forest classifier is widely used in different fields due to its accuracy and
robustness. Since its invention, the random forest algorithm is naturally developed for multi …

Universal time-series representation learning: A survey

P Trirat, Y Shin, J Kang, Y Nam, J Na, M Bae… - arXiv preprint arXiv …, 2024 - arxiv.org
Time-series data exists in every corner of real-world systems and services, ranging from
satellites in the sky to wearable devices on human bodies. Learning representations by …

Geospatial knowledge in housing advertisements: Capturing and extracting spatial information from text

L Cadorel, A Blanchi, AGB Tettamanzi - Proceedings of the 11th …, 2021 - dl.acm.org
Information of the geographical and spatial type is found in numerous text documents and
constitutes a very challenging target for extraction. Geoparsing applications have been …

Mining sequential patterns with flexible constraints from MOOC data

W Song, W Ye, P Fournier-Viger - Applied Intelligence, 2022 - Springer
Online learning is playing an increasingly important role in education. Massive open online
course (MOOC) platforms are among the most important tools in online learning, and record …

Decision tree for sequences

Z He, Z Wu, G Xu, Y Liu, Q Zou - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Current decision trees such as C4. 5 and CART are widely used in different fields due to
their simplicity, accuracy and intuitive interpretation. Similar to other popular classifiers …

Rethinking travel behavior modeling representations through embeddings

FC Pereira - arXiv preprint arXiv:1909.00154, 2019 - arxiv.org
This paper introduces the concept of travel behavior embeddings, a method for re-
representing discrete variables that are typically used in travel demand modeling, such as …

Representing EHRs with temporal tree and sequential pattern mining for similarity computing

S Pokharel, G Zuccon, Y Li - Advanced Data Mining and Applications: 16th …, 2020 - Springer
The ability to rapidly identify at scale patients that are similar based on their electronic health
records (EHRs) is fundamental for a number of clinical informatics applications, such as …

Missing value replacement in strings and applications

G Bernardini, C Liu, G Loukides… - Data Mining and …, 2025 - Springer
Missing values arise routinely in real-world sequential (string) datasets due to:(1) imprecise
data measurements;(2) flexible sequence modeling, such as binding profiles of molecular …