查看文章

academia.edu 中的 [PDF]

Indian scenario in language corpus generation

作者

Niladri Sekhar Dash

发表日期

2007

期刊

Rainbow of linguistics

卷号

页码范围

129-162

简介

In this paper an attempt is made to present a brief, general review about the history and the present status of corpus generation in electronic form in Indian languages to address various unprecedented needs of mainstream linguistics, applied linguistics, language technology, and several other interlinked disciplines. Since the history of corpus generation in Indian languages is not a long story punctuated with diversities and meanders, our present study will therefore try to focus not on the roadmaps of the enterprise, but on the milestones achieved so far with a clear emphasis on the techniques and methodologies applied for achieving these milestones. Besides, the innovative traits and the philosophical bases underlying these works are brought under our focus to justify the present schemes of work undertaken to revitalise the existing corpus generation machineries in a pan-Indian frame for future referential relevance and functional adequacies. Identification of domains of possible corpus use along with the target users in Indian context is another important goal of this paper by which the entire community of corpus users will be empowered to expand the applicational relevance of empirical language databases beyond the realms of their immediate spheres of linguistic vision and projection.

引用总数

被引用次数：29

2009201020112012201320142015201620172018201920202021202220231 1 1 2 5 2 1 8 1 2 1 4

学术搜索中的文章

Indian scenario in language corpus generation

NS Dash - Rainbow Linguist, 2007

被引用次数：29 相关文章