[PDF][PDF] Categorization according to language: A step toward combining linguistic knowledge and statistic learning

E Giguet - International Workshop of Parsing Technologies (IWPT' …, 1995 - hal.science
International Workshop of Parsing Technologies (IWPT'95), 1995hal.science
In this article, we address the problem of categorization according to language by presenting
a method based on natural properties of language which allow us to categorize any kind of
sentence with a very high success rate. The major di culties in categorization are
convergence and textual errors. Convergence since dealing with short entries involve
discarding languages from few clues. Textual errors since documents coming from di erent
electronic ways may contain spelling and grammatical errors as well as character …
Abstract
In this article, we address the problem of categorization according to language by presenting a method based on natural properties of language which allow us to categorize any kind of sentence with a very high success rate.
The major di culties in categorization are convergence and textual errors. Convergence since dealing with short entries involve discarding languages from few clues. Textual errors since documents coming from di erent electronic ways may contain spelling and grammatical errors as well as character recognition errors generated by OCR.
hal.science
以上显示的是最相近的搜索结果。 查看全部搜索结果