Text classification from unlabeled documents with bootstrapping and feature projection techniques

Y Ko, J Seo - Information Processing & Management, 2009 - Elsevier
Many machine learning algorithms have been applied to text classification tasks. In the
machine learning paradigm, a general inductive process automatically builds a text classifier …

[PDF][PDF] 概念空间中上下位关系的意义识别研究

刘磊, 曹存根, 张春霞, 田国刚 - 计算机学报, 2009 - cjc.ict.ac.cn
摘要针对上下位关系在分类层级结构建立阶段遇到的多义性问题, 给出一种概念空间中上下位
关系意义识别的方法. 单个概念的意义识别问题被转换为概念空间中上下位关系的意义识别 …

[PDF][PDF] 机器翻译的词处理研究

杨宪泽 - 计算机工程与科学, 2009 - joces.nudt.edu.cn
摘要本文首先在讨论汉语自动分词这一难题的基础上提出最大匹配分词的改进算法然后论述
词性兼类处理的一些方法最后探讨了汉英机器翻译时名词的单复数处理算法" DA:;'1:% 2: A> E> …

隐喻自动处理研究进展

贾玉祥, 俞士汶, 朱学锋 - 中文信息学报, 2009 - cqvip.com
隐喻在人类语言中普遍存在, 是自然语言理解必须面对的问题. 该文首先探讨了对隐喻的认识及
语言中隐喻表达的分类. 把隐喻自动处理分为隐喻识别, 隐喻理解和隐喻生成三个子任务 …

[PDF][PDF] ambiguous arabic Words disambiguation: the results

L Merhbene, A Zouaghi, M Zrigui - Proceedings of the Student …, 2009 - aclanthology.org
In this paper we propose an hybrid system of Arabic words disambiguation. To achieve this
goal we use the methods employed in the domain of information retrieval: Latent semantic …

Word sense disambiguation for Turkish

E Mert, G Dalkilic - 2009 24th International Symposium on …, 2009 - ieeexplore.ieee.org
Word sense disambiguation (WSD) is the core and one of the hardest problems of many
natural language processing tasks. WSD is considered as an AI-complete problem …

[PDF][PDF] Machine translation using automatically inferred construction-based correspondence and language models

S Edelman, Z Solan - Proceedings of the 23rd Pacific Asia …, 2009 - aclanthology.org
We discuss the problem of translation in the wider context of the problem of meaning in
cognition and describe a structural statistical machine translation (MT) method motivated by …

汉英机器翻译的单词处理研究

杨宪泽 - 西南民族大学学报: 自然科学版, 2009 - cqvip.com
机器翻译涉及的技术很多. 本文的主要工作有两部分: 第一部分给出词性兼类处理的一些方法;
第二部分探讨汉英机器翻译时译文生成的处理, 包括建立汉英机器翻译的时态转换 …

[PDF][PDF] 基于专用双语词典的查询扩展

罗小聪 - 现代计算机(专业版), 2009 - core.ac.uk
* 基金项目: 国家自然科学基金(No. 2006AA010107) 收稿日期: 2009-09-03 修稿日期: 2009-10-
08 作者简介: 罗小聪(1985-), 男, 贵州凯里人, 硕士研究生, 研究方向为自然语言处理 …

[PDF][PDF] BoostWeight: An Approach to Boost the Term Weights in a Document Vector by Exploiting Open Web Directory.

G Ruhela, PK Reddy - IKE, 2009 - researchgate.net
For clustering, cosine similarity method is a popular approach to compute the similarity
between two document vectors where each document vector consists of weighted terms. It …