Big Data in the construction industry: A review of present status, opportunities, and future trends

M Bilal, LO Oyedele, J Qadir, K Munir, SO Ajayi… - Advanced engineering …, 2016 - Elsevier
The ability to process large amounts of data and to extract useful insights from data has
revolutionised society. This phenomenon—dubbed as Big Data—has applications for a wide …

Big data preprocessing: methods and prospects

S García, S Ramírez-Gallego, J Luengo, JM Benítez… - Big data analytics, 2016 - Springer
The massive growth in the scale of data has been observed in recent years being a key
factor of the Big Data scenario. Big Data can be defined as high volume, velocity and variety …

[图书][B] Learning from imbalanced data sets

Learning with imbalanced data refers to the scenario in which the amounts of instances that
represent the concepts in a given problem follow a different distribution. The main issue …

Big data for cyber physical systems in industry 4.0: a survey

LD Xu, L Duan - Enterprise Information Systems, 2019 - Taylor & Francis
With the technology development in cyber physical systems and big data, there are huge
potential to apply them to achieve personalization and improve resource efficiency in …

A survey of data partitioning and sampling methods to support big data analysis

MS Mahmud, JZ Huang, S Salloum… - Big Data Mining and …, 2020 - ieeexplore.ieee.org
Computer clusters with the shared-nothing architecture are the major computing platforms
for big data processing and analysis. In cluster computing, data partitioning and sampling …

A survey on network methodologies for real-time analytics of massive IoT data and open research issues

S Verma, Y Kawamoto, ZM Fadlullah… - … Surveys & Tutorials, 2017 - ieeexplore.ieee.org
With the widespread adoption of the Internet of Things (IoT), the number of connected
devices is growing at an exponential rate, which is contributing to ever-increasing, massive …

Structural health monitoring framework based on Internet of Things: A survey

CA Tokognon, B Gao, GY Tian… - IEEE Internet of Things …, 2017 - ieeexplore.ieee.org
Internet of Things (IoT) has recently received a great attention due to its potential and
capacity to be integrated into any complex system. As a result of rapid development of …

[PDF][PDF] Comparison of naive bayes, random forest, decision tree, support vector machines, and logistic regression classifiers for text reviews classification

T Pranckevičius, V Marcinkevičius - Baltic Journal of Modern Computing, 2017 - bjmc.lu.lv
Today, a largely scalable computing environment provides a possibility of carrying out
various data-intensive natural language processing and machine-learning tasks. One of …

Big data analytics on Apache Spark

S Salloum, R Dautov, X Chen, PX Peng… - International Journal of …, 2016 - Springer
Apache Spark has emerged as the de facto framework for big data analytics with its
advanced in-memory programming model and upper-level libraries for scalable machine …

kNN-IS: An Iterative Spark-based design of the k-Nearest Neighbors classifier for big data

J Maillo, S Ramírez, I Triguero, F Herrera - Knowledge-Based Systems, 2017 - Elsevier
Abstract The k-Nearest Neighbors classifier is a simple yet effective widely renowned
method in data mining. The actual application of this model in the big data domain is not …