[PDF][PDF] Cross-language information retrieval based on category matching between language versions of a web directory

F Kimura, A Maeda, M Yoshikawa… - Proceedings of the sixth …, 2003 - aclanthology.org
F Kimura, A Maeda, M Yoshikawa, S Uemura
Proceedings of the sixth international workshop on Information …, 2003aclanthology.org
Since the Web consists of documents in various domains or genres, the method for Cross-
Language Information Retrieval (CLIR) of Web documents should be independent of a
particular domain. In this paper, we propose a CLIR method which employs a Web directory
provided in multiple language versions (such as Yahoo!). In the proposed method, feature
terms are first extracted from Web documents for each category in the source and the target
languages. Then, one or more corresponding categories in another language are …
Abstract
Since the Web consists of documents in various domains or genres, the method for Cross-Language Information Retrieval (CLIR) of Web documents should be independent of a particular domain. In this paper, we propose a CLIR method which employs a Web directory provided in multiple language versions (such as Yahoo!). In the proposed method, feature terms are first extracted from Web documents for each category in the source and the target languages. Then, one or more corresponding categories in another language are determined beforehand by comparing similarities between categories across languages. Using these category pairs, we intend to resolve ambiguities of simple dictionary translation by narrowing the categories to be retrieved in the target language.
aclanthology.org
以上显示的是最相近的搜索结果。 查看全部搜索结果