[PDF][PDF] A study on automatically extracted keywords in text categorization

A Hulth, B Megyesi - … of the 21st International Conference on …, 2006 - aclanthology.org
This paper presents a study on if and how automatically extracted keywords can be used to
improve text categorization. In summary we show that a higher performance—as measured …

Random walk term weighting for improved text classification

S Hassan, R Mihalcea, C Banea - International Journal of Semantic …, 2007 - World Scientific
This paper describes a new approach for estimating term weights in a document, and shows
how the new weighting scheme can be used to improve the accuracy of a text classifier. The …

[图书][B] Methods for mining and summarizing text conversations

G Carenini, R Ng, G Murray - 2011 - books.google.com
Due to the Internet Revolution, human conversational data--in written forms--are
accumulating at a phenomenal rate. At the same time, improvements in speech technology …

Discovering the core semantics of event from social media

W Liu, X Luo, Z Gong, J Xuan, NM Kou, Z Xu - Future Generation Computer …, 2016 - Elsevier
As social media is opening up such as Twitter and Sina Weibo, 1 large volumes of short
texts are flooding on the Web. The ocean of short texts dilutes the limited core semantics of …

I, me, mine: The role of personal phrases in author profiling

RM Ortega-Mendoza, A Franco-Arcega… - Experimental IR Meets …, 2016 - Springer
Abstract The Author Profiling (AP) task aims to distinguish between groups of authors
labeled by a common demographic characteristic such as gender or age by studying the …

Semantic text classification of emergent disease reports

Y Zhang, B Liu - Knowledge Discovery in Databases: PKDD 2007: 11th …, 2007 - Springer
Traditional text classification studied in the information retrieval and machine learning
literature is mainly based on topics. That is, each class represents a particular topic, eg …

Document alignment for generation of english-punjabi comparable corpora from wikipedia

V Goyal, A Kumar, MS Lehal - International Journal of E-Adoption …, 2020 - igi-global.com
Comparable corpora come as an alternative to parallel corpora for the languages where the
parallel corpora is scarce. The efficiency of the models trained on comparable corpora is …

Automatic document classification using summarization strategies

R Ferreira, RD Lins, L Cabral, F Freitas… - Proceedings of the …, 2015 - dl.acm.org
An efficient way to automatically classify documents may be provided by automatic text
summarization, the task of creating a shorter text from one or several documents. This paper …

[PDF][PDF] General-purpose text categorization applied to the medical domain

AA Argaw, A Hulth, BB Megyesi - Department of Computer an System …, 2007 - academia.edu
This paper presents work where a general-purpose text categorization method was applied
to categorize medical free-texts. The purpose of the experiments was to examine how such a …

Using content and text classification methods to characterize team performance

K Swigger, R Brazile, G Dafoulas… - 2010 5th IEEE …, 2010 - ieeexplore.ieee.org
Because of the critical role that communication plays in a team's ability to coordinate action,
the measurement and analysis of online transcripts in order to predict team performance is …