[PDF][PDF] Identifying bilingual multi-word expressions for statistical machine translation.

D Bouamor, N Semmar, P Zweigenbaum - LREC, 2012 - perso.limsi.fr
Abstract MultiWord Expressions (MWEs) repesent a key issue for numerous applications in
Natural Language Processing (NLP) especially for Machine Translation (MT). In this paper …

[PDF][PDF] Thoughts on word and sentence segmentation in Thai

W Aroonmanakun - Proceedings of the Seventh Symposium on …, 2007 - academia.edu
This paper discusses problems of word and sentence segmentation in Thai. Disagreements
on word segmentation are caused mostly from compound words. To set a standard resource …

[PDF][PDF] Manawi: Using multi-word expressions and named entities to improve machine translation

L Tan, S Pal - Proceedings of the Ninth Workshop on Statistical …, 2014 - aclanthology.org
We describe the Manawi1 () system submitted to the 2014 WMT translation shared task. We
participated in the English-Hindi (EN-HI) and Hindi-English (HI-EN) language pair and …

An efficient any language approach for the integration of phrases in document retrieval

A Doucet, H Ahonen-Myka - Language resources and evaluation, 2010 - Springer
In this paper, we address the problem of the exploitation of text phrases in a multilingual
context. We propose a technique to benefit from multi-word units in adhoc document …

[PDF][PDF] Improved statistical machine translation using multiword expressions

D Bouamor, N Semmar… - Proceedings of the …, 2011 - ixa2.si.ehu.es
Identifying and translating a MultiWord Expression (MWE) in a text represents an issue for
numerous applications in Natural Language Processing (NLP) as MWEs appear in all text …

[PDF][PDF] Hybrid method for automatic extraction of multiword expressions

S Agrawal, R Sanyal, S Sanyal - Int. J. Eng. Technol, 2018 - researchgate.net
A three phase hybrid method for automatic extraction of English multiword expressions
(MWEs) has been proposed. The proposed method is based on linguistic patterns …

[PDF][PDF] An N-gram frequency database reference to handle MWE extraction in NLP applications

P Watrin, T François - Proceedings of the Workshop on Multiword …, 2011 - aclanthology.org
The identification and extraction of Multiword Expressions (MWEs) currently deliver
satisfactory results. However, the integration of these results into a wider application remains …

Analyzing topic drift in query expansion for information retrieval from a large-scale patent database

B Al-Shboul, SH Myaeng - … Conference on Big Data and Smart …, 2014 - ieeexplore.ieee.org
Topic drift has been recognized as a major reason for ineffective retrieval of documents
using query expansion. Topic drift is very important in patent domain as document …

Advanced document description, a sequential approach

A Doucet - ACM SIGIR Forum, 2006 - dl.acm.org
Advanced Document Description, a Sequential Approach Page 1 DOCTORAL ABSTRACT
Advanced Document Description, a Sequential Approach Antoine Doucet University of …

Language morphology offset: Text classification on a Croatian–English parallel corpus

M Malenica, T Šmuc, J Šnajder, BD Bašić - Information processing & …, 2008 - Elsevier
We investigate how, and to what extent, morphological complexity of the language
influences text classification using support vector machines (SVM). The Croatian–English …