Multilingual document clustering using Wikipedia as external knowledge

S GSK, V Varma - Information Retrieval Facility Conference, 2011 - Springer
Abstract: This paper presents Multilingual Document Clustering (MDC) on comparable
corpora. Wikipedia has evolved to be a major structured multilingual knowledge base. It has …

Multilingual news clustering: Feature translation vs. identification of cognate named entities

S Montalvo, R Martínez, A Casillas, V Fresno - Pattern Recognition Letters, 2007 - Elsevier
In this paper we evaluate the influence of different document representations in the results of
multilingual news clustering. We aim at proving whether or not the use of only named …

A survey of research on multilingual text clustering

章成志, 王惠临 - Data Analysis and Knowledge Discovery, 2009 - manu44.magtech.com.cn

Exploiting named entities for bilingual news clustering

S Montalvo, R Martínez, V Fresno… - Journal of the …, 2015 - Wiley Online Library
In this article, we present a new algorithm for clustering a bilingual collection of comparable
news items in groups of specific topics. Our hypothesis is that named entities (NEs) are …

[PDF] Type level clustering evaluation: New measures and a POS induction case study

R Reichart, O Abend, A Rappoport - Proceedings of the …, 2010 - aclanthology.org
Clustering is a central technique in NLP. Consequently, clustering evaluation is of great
importance. Many clustering algorithms are evaluated by their success in tagging corpus …

[CITATION] Handling and maintenance of mobile communication base station faults

弓美桃, 曹剑英 - Information Security and Communications Privacy, 2014

[CITATION] Research on English-Chinese bilingual news clustering based on a hybrid strategy

韩普, 万接喜, 王东波 - Information Science, 2013

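[HTML] Research on category knowledge mining based on English-Chinese bilingual phrase-level parallel corpora

王东波, 韩普, 沈思, 魏向清 - Data Analysis and Knowledge Discovery, 2013 - manu44.magtech.com.cn
Abstract: Building on existing clustering algorithms, we run category knowledge mining experiments on an
English-Chinese bilingual phrase-level parallel corpus from the humanities and social sciences. Based on
the experimental data and the specific research needs, we determine the corresponding clustering algorithm and English …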

A comparison of unsupervised methods for ad hoc cross-lingual document retrieval

E Zosa, M Granroth-Wilding… - Proceedings of the …, 2020 - aclanthology.org
We address the problem of linking related documents across languages in a multilingual
collection. We evaluate three diverse unsupervised methods to represent and compare …

Multilingual news document clustering: two algorithms based on cognate named entities

S Montalvo, R Martínez, A Casillas, V Fresno - Text, Speech and Dialogue …, 2006 - Springer
This paper presents an approach for Multilingual News Document Clustering in comparable
corpora. We have implemented two algorithms of heuristic nature that follow the approach …