作者
Phuong H Nguyen, Thuan D Ngo, Dung A Phan, Thu PT Dinh, Thang Q Huynh
发表日期
2008/7/13
研讨会论文
2008 IEEE International Conference on Research, Innovation and Vision for the Future in Computing and Communication Technologies
页码范围
96-102
出版商
IEEE
简介
The spelling checking problem is considered to contain two main phases: the detecting phase and the correcting phase. In this paper, we present a new approach for Vietnamese spelling checking based on Vietnamese characteristics for each phase. Our research approach includes the use of a syllable Bi-gram in combination with parts of speech (POS) to find out suspected syllables. In the correcting phase, we based on Minimum Edit Distance, SoundEx algorithms and some heuristics to build a weight function for assessing suggestion candidates. The training corpus and the test set were collected from e-newspapers.
引用总数
201520162017201820192020202120222023202423112332
学术搜索中的文章