Cost-sensitive and hybrid-attribute measure multi-decision tree over imbalanced data sets

F Li, X Zhang, X Zhang, C Du, Y Xu, YC Tian - Information Sciences, 2018 - Elsevier
One of the most popular algorithms for classification is the decision tree. However, existing
binary decision tree models do not handle well the minority class over imbalanced data sets …

[PDF][PDF] Literature review of feature selection for mining tasks

MS Pervez, DM Farid - International Journal of Computer Applications, 2015 - Citeseer
During past few decades, researchers worked on data preprocessing techniques for the
datasets. Data preprocessing techniques are needed, where the data are prepared for …

Parametric and non-parametric analyses for pedestrian crash severity prediction in Great Britain

M Rella Riccardi, F Mauriello, S Sarkar, F Galante… - Sustainability, 2022 - mdpi.com
The study aims to investigate the factors that are associated with fatal and severe vehicle–
pedestrian crashes in Great Britain by developing four parametric models and five non …

Aggregating predictions vs. aggregating features for relational classification

O Schulte, K Routley - 2014 IEEE Symposium on …, 2014 - ieeexplore.ieee.org
Relational data classification is the problem of predicting a class label of a target entity given
information about features of the entity, of the related entities, or neighbors, and of the links …

Mining multi-relational high utility itemsets from star schemas

W Song, B Jiang, Y Qiao - Intelligent Data Analysis, 2018 - content.iospress.com
Mining high utility itemsets is an interesting research problem in data mining and knowledge
discovery. Most high utility itemset discovery algorithms seek patterns in a single table, but …

Selecting Walk Schemes for Database Embedding

YL Lubarsky, J Tönshof, M Grohe… - Proceedings of the 32nd …, 2023 - dl.acm.org
Machinery for data analysis often requires a numeric representation of the input. Towards
that, a common practice is to embed components of structured data into a high-dimensional …

A comparative study and performance analysis of multirelational classification algorithms

K Shah, KS Patel - International Journal of Business …, 2022 - inderscienceonline.com
Classification is one of the important tasks in data mining in which a model is generated-
based on training dataset and that model is used to predict class label of unknown dataset …

Analyse relationnelle de concepts: une méthode polyvalente pour l'extraction de connaissance

M Wajnberg - 2020 - hal.science
À une époque où les données, souvent interprétées comme une «réalité terrain», sont
produites dans des quantités gargantuesques, un besoin de compréhension et …

[PDF][PDF] Evaluating various machine learning methods for predicting students' math performance in the 2019 TIMSS

A Elouafi, I Tammouch, S Eddarouich… - Indonesian Journal of …, 2024 - researchgate.net
The growth of a country strongly depends on the quality of its educational system. All over
the world, the education sectors are experiencing a fundamental evolution of their mode of …

A hierarchy of independence assumptions for multi-relational Bayes net classifiers

O Schulte, B Bina, B Crawford… - … IEEE Symposium on …, 2013 - ieeexplore.ieee.org
Many databases store data in relational format, with different types of entities and
information about their attributes and links between the entities. Link-based classification …