An evolutionary-based term reduction approach to bilingual clustering of Malay-English corpora

R Alfred, LC Leong, JH Obit - … of the International Conference, ICTA 2016, 2017 - Springer
The document clustering process groups the unstructured text documents into a predefined
set of clusters in order to provide more information to the users. There are many studies …

基于并行信息瓶颈的多语种文本聚类算法

闫小强, 卢耀恩, 娄铮铮, 叶阳东 - 模式识别与人工智能, 2017 - cqvip.com
聚类算法在抽取文本数据中的模式结构时, 忽略多个语种信息之间潜在的互补作用,
得到的模式结构不能充分反映数据的内在信息. 针对此问题, 文中提出基于并行信息瓶颈的多 …