Managing computational complexity using surrogate models: a critical review

R Alizadeh, JK Allen, F Mistree - Research in Engineering Design, 2020 - Springer
In simulation-based realization of complex systems, we are forced to address the issue of
computational complexity. One critical issue that must be addressed is the approximation of …

Feature selection for text classification: A review

X Deng, Y Li, J Weng, J Zhang - Multimedia Tools and Applications, 2019 - Springer
Big multimedia data is heterogeneous in essence, that is, the data may be a mixture of
video, audio, text, and images. This is due to the prevalence of novel applications in recent …

Foundations of data imbalance and solutions for a data democracy

A Kulkarni, D Chong, FA Batarseh - Data democracy, 2020 - Elsevier
Dealing with imbalanced data is a prevalent problem while performing classification on the
datasets. Many times, this problem contributes to bias while making decisions or …

Deep learning for detecting financial statement fraud

P Craja, A Kim, S Lessmann - Decision Support Systems, 2020 - Elsevier
Financial statement fraud is an area of significant consternation for potential investors,
auditing companies, and state regulators. The paper proposes an approach for detecting …

[HTML][HTML] Comparing automated text classification methods

J Hartmann, J Huppertz, C Schamp… - International Journal of …, 2019 - Elsevier
Online social media drive the growth of unstructured text data. Many marketing applications
require structuring this data at scales non-accessible to human coding, eg, to detect …

Optimization methods for large-scale machine learning

L Bottou, FE Curtis, J Nocedal - SIAM review, 2018 - SIAM
This paper provides a review and commentary on the past, present, and future of numerical
optimization algorithms in the context of machine learning applications. Through case …

[图书][B] Machine learning for text: An introduction

CC Aggarwal, CC Aggarwal - 2018 - Springer
The extraction of useful insights from text with various types of statistical algorithms is
referred to as text mining, text analytics, or machine learning from text. The choice of …

Why does China allow freer social media? Protests versus surveillance and propaganda

B Qin, D Strömberg, Y Wu - Journal of Economic Perspectives, 2017 - aeaweb.org
In this paper, we document basic facts regarding public debates about controversial political
issues on Chinese social media. Our documentation is based on a dataset of 13.2 billion …

Transfer learning with adaptive fine-tuning

G Vrbančič, V Podgorelec - IEEE Access, 2020 - ieeexplore.ieee.org
With the utilization of deep learning approaches, the key factors for a successful application
are sufficient datasets with reliable ground truth, which are generally not easy to obtain …

A survey of text classification algorithms

CC Aggarwal, CX Zhai - Mining text data, 2012 - Springer
The problem of classification has been widely studied in the data mining, machine learning,
database, and information retrieval communities with applications in a number of diverse …