Approximate OLAP of document-oriented databases: A variety-aware approach

E Gallinucci, M Golfarelli, S Rizzi - Information Systems, 2019 - Elsevier
Schemaless databases, and document-oriented databases in particular, are preferred to
relational ones for storing heterogeneous data with variable schemas and structural forms …

[PDF][PDF] Design and analysis of a water quality monitoring data service platform

J Zhang, Y Sheng, W Chen, H Lin… - Comput. Mater …, 2021 - cdn.techscience.cn
Water is one of the basic resources for human survival. Water pollution monitoring and
protection have been becoming a major problem for many countries all over the world. Most …

[PDF][PDF] Identifying and analyzing the transient and permanent barriers for big data

SN Brohi, MA Bamiah, MN Brohi - Journal of Engineering …, 2016 - jestec.taylors.edu.my
Auspiciously, big data analytics had made it possible to generate value from immense
amounts of raw data. Organizations are able to seek incredible insights which assist them in …

Bayeswipe: A scalable probabilistic framework for improving data quality

S De, Y Hu, VV Meduri, Y Chen… - Journal of Data and …, 2016 - dl.acm.org
Recent efforts in data cleaning of structured data have focused exclusively on problems like
data deduplication, record matching, and data standardization; none of the approaches …

BUNNI: Learning Repair Actions in Rule-driven Data Cleaning

G Mecca, P Papotti, D Santoro, E Veltri - ACM Journal of Data and …, 2024 - dl.acm.org
In this work, we address the challenging and open problem of involving non-expert users in
the data-repairing problem as first-class citizens. Despite a large number of proposals that …

Correlation monitoring method and model of science-technology-industry in the ai field: a case of the neural network

X Wang, Y Liu, L Chen, Y Zhang - SAGE Open, 2022 - journals.sagepub.com
This article aims to analyze the correlation status and development trend among science,
technology and industry in the Artificial Intelligence (AI) subfield. First, it constructs the …

Investigating Data Repair steps for EHR Big Data

S Juddoo - 2022 3rd International Conference on Next …, 2022 - ieeexplore.ieee.org
This paper builds on previous research with the aim of optimizing data quality
methodologies for Big Data systems, with a focus on Electronic Health Records. This …

A SYSTEMATIC MAPPING REVIEW ON DATA CLEANING METHODS IN BIG DATA ENVIRONMENTS.

C Keiji Iwata, N Verardi Galegale, M Ito… - IADIS International …, 2024 - search.ebscohost.com
The evolution of information technology combined with artificial intelligence, IoT (Internet of
Things) and robotics has made processes integrated and intelligent. The increased use of …

BayesWipe: A scalable probabilistic framework for cleaning bigdata

S De, Y Hu, MV Vamsikrishna, Y Chen… - arXiv preprint arXiv …, 2015 - arxiv.org
Recent efforts in data cleaning of structured data have focused exclusively on problems like
data deduplication, record matching, and data standardization; none of the approaches …

Leveraging decision making in cyber security analysis through data cleaning

C Zhong, H Liu, A Alnusair - Southwestern Business …, 2017 - digitalscholarship.tsu.edu
Abstract Security Operations Centers (SOCs) have been built in many institutions for
intrusion detection and incident response. A SOC employs various cyber defense …