Detecting anomaly in big data system logs using convolutional neural network

S Lu, X Wei, Y Li, L Wang - 2018 IEEE 16th Intl Conf on …, 2018 - ieeexplore.ieee.org
Nowadays, big data systems are being widely adopted by many domains for offering
effective data solutions, such as manufacturing, healthcare, education, and media. Big data …

Log-based abnormal task detection and root cause analysis for spark

S Lu, BB Rao, X Wei, B Tak, L Wang… - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Application delays caused by abnormal tasks arecommon problems in big data computing
frameworks. Anabnormal task in Spark, which may run slowly withouterror or warning logs …

LADRA: Log-based abnormal task detection and root-cause analysis in big data processing with Spark

S Lu, X Wei, B Rao, B Tak, L Wang, L Wang - Future Generation Computer …, 2019 - Elsevier
As big data processing is being widely adopted by many domains, massive amount of
generated data become more reliant on the parallel computing platforms for analysis …

Mrapid: An efficient short job optimizer on hadoop

H Zhang, H Huang, L Wang - 2017 IEEE International Parallel …, 2017 - ieeexplore.ieee.org
Data have been generated and collected at an accelerating pace. Hadoop has made
analyzing large scale data much simpler to developers/analysts using commodity hardware …

Cloud computing for integrated stochastic groundwater uncertainty analysis

Y Liu, AY Sun, K Nelson, WE Hipke - International Journal of Digital …, 2013 - Taylor & Francis
One of the major scientific challenges and societal concerns is to make informed decisions
to ensure sustainable groundwater availability when facing deep uncertainties. A major …

TraceBoK: Toward a software requirements traceability body of knowledge

AMD Duarte, D Duarte, M Thiry - 2016 IEEE 24th International …, 2016 - ieeexplore.ieee.org
This paper introduces the idea of building a body of knowledge (BoK) to gather the better
practices in software requirements traceability that could bring major benefits for analyzing …

Hierarchical spark: A multi-cluster big data computing framework

Z Liu, H Zhang, L Wang - 2017 IEEE 10th International …, 2017 - ieeexplore.ieee.org
Nowadays, with the increasing burst of newly generated data everyday, as well as the vast
expanding needs for corresponding data analyses, grand challenges have been brought to …

Migrating GIS big data computing from Hadoop to Spark: an exemplary study Using Twitter

Z Sun, H Zhang, Z Liu, C Xu… - 2016 IEEE 9th …, 2016 - ieeexplore.ieee.org
Recent research has demonstrated that social media could provide valuable spatio-
temporal data about users activities. However, information extraction and computation from …

Auto-tuning performance of MPI parallel programs using resource management in container-based virtual cloud

H Ma, L Wang, BC Tak, L Wang… - 2016 IEEE 9th …, 2016 - ieeexplore.ieee.org
Load imbalance problem is one of the major obstacles to achieving optimal performance of
High Performance Computing applications. The approach of trying to distribute the problem …

Using clouds for technical computing

G Fox, D Gannon - Cloud Computing and Big Data, 2013 - ebooks.iospress.nl
We discuss the use of cloud computing in technical (scientific) applications and identify
characteristics such as loosely-coupled and data-intensive that lead to good performance …