Per-node optimization of finite-state mechanisms for natural language processing

A Troussov, B O'Donovan, S Koskenniemi… - … on Intelligent Text …, 2003 - Springer
A Troussov, B O'Donovan, S Koskenniemi, N Glushnev
International Conference on Intelligent Text Processing and Computational …, 2003Springer
Finite-state processing is typically based on structures that allow for efficient indexing and
sequential search. However, this “rigid” framework has several disadvantages when used in
natural language processing, especially for non-alphabetical languages. The solution is to
systematically introduce polymorphic programming techniques that are adapted to particular
cases. In this paper we describe the structure of a morphological dictionary implemented
with finite-state automata using variable or polymorphic node formats. Each node is …
Abstract
Finite-state processing is typically based on structures that allow for efficient indexing and sequential search. However, this “rigid” framework has several disadvantages when used in natural language processing, especially for non-alphabetical languages. The solution is to systematically introduce polymorphic programming techniques that are adapted to particular cases. In this paper we describe the structure of a morphological dictionary implemented with finite-state automata using variable or polymorphic node formats. Each node is assigned a format from a predefined set reflecting its utility in corpora processing as measured by a number of graph theoretic metrics and statistics. Experimental results demonstrate that this approach permits a 52% increase in the performance of dictionary look-up.
Springer
以上显示的是最相近的搜索结果。 查看全部搜索结果