W Aroonmanakun - Proceedings of the Seventh Symposium on …, 2007 - academia.edu
This paper discusses problems of word and sentence segmentation in Thai. Disagreements on word segmentation are caused mostly from compound words. To set a standard resource …
L Tan, S Pal - Proceedings of the Ninth Workshop on Statistical …, 2014 - aclanthology.org
We describe the Manawi1 () system submitted to the 2014 WMT translation shared task. We participated in the English-Hindi (EN-HI) and Hindi-English (HI-EN) language pair and …
A Doucet, H Ahonen-Myka - Language resources and evaluation, 2010 - Springer
In this paper, we address the problem of the exploitation of text phrases in a multilingual context. We propose a technique to benefit from multi-word units in adhoc document …
D Bouamor, N Semmar… - Proceedings of the …, 2011 - ixa2.si.ehu.es
Identifying and translating a MultiWord Expression (MWE) in a text represents an issue for numerous applications in Natural Language Processing (NLP) as MWEs appear in all text …
S Agrawal, R Sanyal, S Sanyal - Int. J. Eng. Technol, 2018 - researchgate.net
A three phase hybrid method for automatic extraction of English multiword expressions (MWEs) has been proposed. The proposed method is based on linguistic patterns …
P Watrin, T François - Proceedings of the Workshop on Multiword …, 2011 - aclanthology.org
The identification and extraction of Multiword Expressions (MWEs) currently deliver satisfactory results. However, the integration of these results into a wider application remains …
B Al-Shboul, SH Myaeng - … Conference on Big Data and Smart …, 2014 - ieeexplore.ieee.org
Topic drift has been recognized as a major reason for ineffective retrieval of documents using query expansion. Topic drift is very important in patent domain as document …
Advanced Document Description, a Sequential Approach Page 1 DOCTORAL ABSTRACT Advanced Document Description, a Sequential Approach Antoine Doucet University of …
We investigate how, and to what extent, morphological complexity of the language influences text classification using support vector machines (SVM). The Croatian–English …