Clustering narrow domain short texts is considered to be a complex task because of the intrinsic features of the corpus to be clustered:(i) the low frequencies of vocabulary terms in …
W Tao, D Chang - Tehnički vjesnik, 2019 - hrcak.srce.hr
Sažetak With the explosive growth in Internet news media and the disorganized status of news texts, this paper puts forward an automatic classification model for news based on a …
P Makagonov, M Alexandrov, A Gelbukh - International Conference on …, 2004 - Springer
Accessibility of digital libraries and other web-based repositories has caused the illusion of accessibility of the full texts of scientific papers. However, in the majority of cases such an …
Text Categorization (TC) has become one of the major techniques for organizing and managing online information. Several studies proposed the so-called associative …
This paper focuses on the use of sense clusters for classification and clustering of very short texts such as conference abstracts. Common keyword-based techniques are effective for …
En este trabajo de tesis doctoral se investiga el problema del agrupamiento de conjuntos especiales de documentos llamados textos cortos de dominios restringidos. Para llevar a …