MDIW-13: a new multi-lingual and multi-script database and benchmark for script identification

Authors

Miguel A Ferrer, Abhijit Das, Moises Diaz, Aythami Morales, Cristina Carmona-Duarte, Umapada Pal

Publication date

2024/1

Journal

Cognitive Computation

Volume

Issue

Pages

131-157

Publisher

Springer US

Description

Script identification plays a vital role in applications that involve handwriting and document analysis within a multi-script and multi-lingual environment. Moreover, it exhibits a profound connection with human cognition. This paper provides a new database for benchmarking script identification algorithms, which contains both printed and handwritten documents collected from a wide variety of scripts, such as Arabic, Bengali (Bangla), Gujarati, Gurmukhi, Devanagari, Japanese, Kannada, Malayalam, Oriya, Roman, Tamil, Telugu, and Thai. The dataset consists of 1,135 documents scanned from local newspapers and from handwritten letters and notes by different native writers. Further, these documents are segmented into lines and words, yielding a total of 13,979 lines and 86,655 words in the dataset. Easy-to-go benchmarks are proposed with handcrafted and deep learning methods. The …

Total citations

Cited by 1