Turning from TF-IDF to TF-IGM for term weighting in text classification

K Chen, Z Zhang, J Long, H Zhang - Expert Systems with Applications, 2016 - Elsevier
Massive textual data management and mining usually rely on automatic text classification
technology. Term weighting is a basic problem in text classification and directly affects the …

Construction accident narrative classification: An evaluation of text mining techniques

YM Goh, CU Ubeynarayana - Accident Analysis & Prevention, 2017 - Elsevier
Learning from past accidents is fundamental to accident prevention. Thus, accident and near
miss reporting are encouraged by organizations and regulators. However, for organizations …

Deep learning and network analysis: Classifying and visualizing accident narratives in construction

B Zhong, X Pan, PED Love, L Ding, W Fang - Automation in Construction, 2020 - Elsevier
If headway is to be made to improve safety performance in construction, then there is a need
to learn from past accidents. Accident reports provide a useful source of information to make …

A personalised operation and maintenance approach for complex products based on equipment portrait of product-service system

S Ren, L Shi, Y Liu, W Cai, Y Zhang - Robotics and Computer-Integrated …, 2023 - Elsevier
Based on the holistic data of product-service system (PSS) delivery processes, equipment
portrait can be used to describe personalised user requirements and conduct targeted …

A Comparative Study of Machine Learning and NLP Techniques for Uses of Stop Words by Patients in Diagnosis of Alzheimer's Disease

S Adhikari, S Thapa, P Singh, H Huo… - … Joint Conference on …, 2021 - ieeexplore.ieee.org
Alzheimer's Disease (AD) is one of the most common forms of neuropsychological disorder
in elderly people. It is a slow progressive disease affecting the brain cells. This affects the …

Discriminative feature spamming technique for roman urdu sentiment analysis

K Mehmood, D Essam, K Shafi, MK Malik - IEEE Access, 2019 - ieeexplore.ieee.org
Term weighting is one of the most commonly used approaches, which works by assigning
weights to terms, that aims to improve the performance of information retrieval or text …

Language identification at the word level in code-mixed texts using character sequence and word embedding

OE Ojo, A Gelbukh, H Calvo, A Feldman… - Proceedings of the …, 2022 - aclanthology.org
People often switch languages in conversations or written communication in order to
communicate thoughts on social media platforms. The languages in texts of this type, also …

Keeping Deep Learning Models in Check: A History-Based Approach to Mitigate Overfitting

H Li, GK Rajbahadur, D Lin, CP Bezemer… - IEEE Access, 2024 - ieeexplore.ieee.org
In software engineering, deep learning models are increasingly deployed for critical tasks
such as bug detection and code review. However, overfitting remains a challenge that …

Social media users' perceptions of a wearable mixed reality headset during the COVID-19 pandemic: aspect-based sentiment analysis

H Jeong, A Bayro, SP Umesh, K Mamgain… - JMIR Serious …, 2022 - games.jmir.org
Background: Mixed reality (MR) devices provide real-time environments for physical-digital
interactions across many domains. Owing to the unprecedented COVID-19 pandemic, MR …

Classification and causes identification of Chinese civil aviation incident reports

Y Jiao, J Dong, J Han, H Sun - Applied Sciences, 2022 - mdpi.com
Safety is a primary concern for the civil aviation industry. Airlines record high-frequency but
potentially low-severity unsafe events, i.e., incidents, in their reports. Over the past few …