A survey on unsupervised outlier detection in high‐dimensional numerical data

A Zimek, E Schubert, HP Kriegel - Statistical Analysis and Data …, 2012 - Wiley Online Library
High‐dimensional data in Euclidean space pose special challenges to data mining
algorithms. These challenges are often indiscriminately subsumed under the term 'curse of …

Density‐based clustering

RJGB Campello, P Kröger, J Sander… - … Reviews: Data Mining …, 2020 - Wiley Online Library
Clustering refers to the task of identifying groups or clusters in a data set. In density‐based
clustering, a cluster is a set of data objects spread in the data space over a contiguous …

How molecular size impacts RMSD applications in molecular dynamics simulations

K Sargsyan, C Grauffel, C Lim - Journal of chemical theory and …, 2017 - ACS Publications
The root-mean-square deviation (RMSD) is a similarity measure widely used in analysis of
macromolecular structures and dynamics. As increasingly larger macromolecular systems …

The dark machines anomaly score challenge: benchmark data and model independent event classification for the large hadron collider

T Aarrestad, M van Beekveld, M Bona, A Boveia… - SciPost Physics, 2022 - scipost.org
We describe the outcome of a data challenge conducted as part of the Dark Machines
Initiative and the Les Houches 2019 workshop on Physics at TeV colliders. The challenged …

Big Data with Cloud Computing: an insight on the computing environment, MapReduce, and programming frameworks

A Fernández, S del Río, V López… - … : Data Mining and …, 2014 - Wiley Online Library
The term 'Big Data'has spread rapidly in the framework of Data Mining and Business
Intelligence. This new scenario can be defined by means of those problems that cannot be …

Data-driven monitoring of multimode continuous processes: A review

M Quiñones-Grueiro, A Prieto-Moreno, C Verde… - Chemometrics and …, 2019 - Elsevier
Abstract The Internet of Things benefits connectivity and functionality in industrial
environments, while Cloud Computing boosts computational capability. Hence, historical …

Context-aware misinformation detection: A benchmark of deep learning architectures using word embeddings

VI Ilie, CO Truică, ES Apostol, A Paschke - IEEE Access, 2021 - ieeexplore.ieee.org
New mass media paradigms for information distribution have emerged with the digital age.
With new digital-enabled mass media, the communication process is centered around the …

Causal unsupervised semantic segmentation

J Kim, BK Lee, YM Ro - arXiv preprint arXiv:2310.07379, 2023 - arxiv.org
Unsupervised semantic segmentation aims to achieve high-quality semantic grouping
without human-labeled annotations. With the advent of self-supervised pre-training, various …

[HTML][HTML] Data-driven modeling of multimode chemical process: Validation with a real-world distillation column

Y Choi, B Bhadriaju, H Cho, J Lim, IS Han… - Chemical Engineering …, 2023 - Elsevier
Real-world industrial processes frequently operate in different modes such as start-up,
transient, and steady-state operation. Since different operating modes are governed by …

Latent discriminant deterministic uncertainty

G Franchi, X Yu, A Bursuc, E Aldea… - … on Computer Vision, 2022 - Springer
Predictive uncertainty estimation is essential for deploying Deep Neural Networks in real-
world autonomous systems. However, most successful approaches are computationally …