Retrieval methods for English-text with missrecognized OCR characters

M Ohta, A Takasu, J Adachi - Proceedings of the fourth …, 1997 - ieeexplore.ieee.org
This paper presents three probabilistic text retrieval methods designed to carry out a full-text
search of English documents containing OCR errors. By searching for any query term on the …

Bibliographic component extraction using support vector machines and hidden Markov models

T Okada, A Takasu, J Adachi - … , ECDL 2004, Bath, UK, September 12-17 …, 2004 - Springer
Article citations are composed of subfields such as author, title, journal, and year. It is useful
to automatically identify attributes of these subfields, since they are used for linking a citation …

Probabilistic automaton-based fuzzy English-text retrieval

M Ohta, A Takasu, J Adachi - IEICE TRANSACTIONS on …, 2003 - search.ieice.org
Optical Character Reader (OCR) incorrect recognition is a serious problem when searching
for OCR-scanned documents in databases such as digital libraries. In order to reduce costs …

Reduction of expanded search terms for fuzzy English-text retrieval

M Ohta, A Takasu, J Adachi - International Journal on Digital Libraries, 2000 - Springer
Optical character reader (OCR) misrecognition is a serious problem when OCR-recognized
text is used for retrieval purposes in digital libraries. We have proposed fuzzy retrieval …

[DOC] Fuzzy Logic Based Arabic Optical Character Recognition

GAA Al_Talib - 2006 - researchgate.net
Recognition of handwritten Arabic characters is by all means a difficult task due to the variety
of styles of writings of different people. The use of fuzzy logic in recognition of handwritten …

Information extraction by two dimensional parser

A Takasu - 2008 20th IEEE International Conference on Tools …, 2008 - ieeexplore.ieee.org
This paper proposes a learning algorithm for a two dimensional parser. The parser is
designed to analyze page layout of documents and extract information using both textual …

Statistical analysis of bibliographic strings for constructing an integrated document space

A Takasu - International Conference on Theory and Practice of …, 2002 - Springer
It is important to utilize retrospective documents when constructing a large digital library.
This paper proposes a method for analyzing recognized bibliographic strings using an …

A sequence labeling method using syntactical and textual patterns for record linkage

A Takasu - International Conference on Pattern Recognition and …, 2005 - Springer
Record linkage is an important application area of text pattern analysis. In this paper we
propose a new sequence labeling method that can be used to extract entities from a string …

Handwritten text retrieval using two-stage pattern matching with handwritten query

K Yamada - … Conference on Pattern Recognition (Cat. No …, 1998 - ieeexplore.ieee.org
Describes a method of retrieving handwritten text with handwritten queries. The properties of
handwritten Japanese texts have prevented conventional handwritten European text …

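A digital library system for document images organized around metadata

J Adachi - IEICE Transactions D, 2001 - search.ieice.org
The digital library system NACSIS-ELS is an information retrieval system that integrates a database of
document images scanned from academic journal pages with a bibliographic database. To enable integrated use, with metadata at its core …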