Text preprocessing for text mining in organizational research: Review and recommendations

L Hickman, S Thapa, L Tay, M Cao… - Organizational …, 2022 - journals.sagepub.com
Recent advances in text mining have provided new methods for capitalizing on the
voluminous natural language text data created by organizations, their employees, and their …

Deep learning in clinical natural language processing: a methodical review

S Wu, K Roberts, S Datta, J Du, Z Ji, Y Si… - Journal of the …, 2020 - academic.oup.com
Objective: This article methodically reviews the literature on deep learning (DL) for natural
language processing (NLP) in the clinical domain, providing quantitative analysis to answer …

AI and the everything in the whole wide world benchmark

ID Raji, EM Bender, A Paullada, E Denton… - arXiv preprint arXiv …, 2021 - arxiv.org
There is a tendency across different subfields in AI to valorize a small collection of influential
benchmarks. These benchmarks operate as stand-ins for a range of anointed common …

Reporting score distributions makes a difference: Performance study of LSTM-networks for sequence tagging

N Reimers, I Gurevych - arXiv preprint arXiv:1707.09861, 2017 - arxiv.org
In this paper we show that reporting a single performance score is insufficient to compare
non-deterministic approaches. We demonstrate for common sequence tagging tasks that the …

TIRA integrated research architecture

M Potthast, T Gollub, M Wiegmann, B Stein - … Retrieval Evaluation in a …, 2019 - Springer
Data and software are immaterial. Scientists in computer science hence have the unique
chance to let other scientists easily reproduce their findings. Similarly, and with the same …

State of the art: Reproducibility in artificial intelligence

OE Gundersen, S Kjensmo - Proceedings of the AAAI conference on …, 2018 - ojs.aaai.org
Background: Research results in artificial intelligence (AI) are criticized for not being
reproducible. Objective: To quantify the state of reproducibility of empirical AI research using …

Computer-assisted text analysis for comparative politics

C Lucas, RA Nielsen, ME Roberts, BM Stewart… - Political …, 2015 - cambridge.org
Recent advances in research tools for the systematic analysis of textual data are enabling
exciting new research throughout the social sciences. For comparative politics, scholars who …

[BOOK][B] Natural language processing for social media

A Farzindar, D Inkpen, G Hirst - 2015 - Springer
In recent years, online social networking has revolutionized interpersonal communication.
The newer research on language analysis in social media has been increasingly focusing …

Comparing apples to apple: The effects of stemmers on topic models

A Schofield, D Mimno - Transactions of the Association for …, 2016 - direct.mit.edu
Rule-based stemmers such as the Porter stemmer are frequently used to preprocess English
corpora for topic modeling. In this work, we train and evaluate topic models on a variety of …

How we do things with words: Analyzing text as social and cultural data

D Nguyen, M Liakata, S DeDeo, J Eisenstein… - Frontiers in Artificial …, 2020 - frontiersin.org
In this article we describe our experiences with computational text analysis involving rich
social and cultural concepts. We hope to achieve three primary goals. First, we aim to shed …