Data, measurement, and causal inferences in machine learning: opportunities and challenges for marketing

JF Hair Jr, M Sarstedt - Journal of Marketing Theory and Practice, 2021 - Taylor & Francis
The emergence of digital data and the methods used to analyze them are revolutionizing
marketing research. The vast quantity of data offers marketing researchers countless …

Efficient automated processing of the unstructured documents using artificial intelligence: A systematic literature review and future directions

D Baviskar, S Ahirrao, V Potdar, K Kotecha - IEEE Access, 2021 - ieeexplore.ieee.org
The unstructured data impacts 95% of the organizations and costs them millions of dollars
annually. If managed well, it can significantly improve business productivity. The traditional …

Comparison of text preprocessing methods

CP Chai - Natural Language Engineering, 2023 - cambridge.org
Text preprocessing is not only an essential step to prepare the corpus for modeling but also
a key area that directly affects the natural language processing (NLP) application results. For …

Opinion mining in online social media: a survey

C Messaoudi, Z Guessoum… - Social network analysis …, 2022 - Springer
With the emergence of social networks, opinion detection has become an active research
area with different applications and several opinionated resources such as product reviews …

Leveraging Arabic sentiment classification using an enhanced CNN-LSTM approach and effective Arabic text preparation

AM Alayba, V Palade - Journal of King Saud University-Computer and …, 2022 - Elsevier
The high variety in the forms of the Arabic words creates significant complexity related
challenges in Natural Language Processing (NLP) tasks for Arabic text. These challenges …

A systematic review of big data innovations in smart grids

H Taherdoost - Results in Engineering, 2024 - Elsevier
Multiple industries have been revolutionized by the incorporation of data science
advancements into intelligent environment technologies, specifically in the context of smart …

Systematic literature review of information extraction from textual data: recent methods, applications, trends, and challenges

MHA Abdullah, N Aziz, SJ Abdulkadir… - IEEE …, 2023 - ieeexplore.ieee.org
Information extraction (IE) is a challenging task, particularly when dealing with highly
heterogeneous data. State-of-the-art data mining technologies struggle to process …

EmmDocClassifier: Efficient multimodal document image classifier for scarce data

S Kanchi, A Pagani, H Mokayed, M Liwicki, D Stricker… - Applied Sciences, 2022 - mdpi.com
Document classification is one of the most critical steps in the document analysis pipeline.
There are two types of approaches for document classification, known as image-based and …

Data integration for digital twins in the built environment based on federated data models

J Merino, X Xie, N Moretti, JY Chang… - Proceedings of the …, 2023 - icevirtuallibrary.com
Improving the efficiency of operations is a major challenge in facility management given the
limitations of outsourcing individual building functions to third-party companies. The status of …

Multi-layout unstructured invoice documents dataset: A dataset for template-free invoice processing and its evaluation using AI approaches

D Baviskar, S Ahirrao, K Kotecha - IEEE Access, 2021 - ieeexplore.ieee.org
The daily transaction of an organization generates a vast amount of unstructured data such
as invoices and purchase orders. Managing and analyzing unstructured data is a costly …