Author
Niladri Sekhar Dash
Publication date
2007
Journal
Rainbow of linguistics
Volume
1
Pages
129-162
Description
This paper presents a brief, general review of the history and present status of electronic corpus generation in Indian languages to address various unprecedented needs of mainstream linguistics, applied linguistics, language technology, and several other interlinked disciplines. Since the history of corpus generation in Indian languages is not a long story punctuated with diversities and meanders, the present study focuses not on the roadmaps of the enterprise but on the milestones achieved so far, with clear emphasis on the techniques and methodologies applied in achieving them. The innovative traits and philosophical bases underlying these works are also examined to justify the present schemes of work undertaken to revitalise the existing corpus generation machinery in a pan-Indian frame for future referential relevance and functional adequacy. Another important goal of this paper is to identify the domains of possible corpus use, along with the target users in the Indian context, so that the entire community of corpus users is empowered to expand the applicational relevance of empirical language databases beyond their immediate spheres of linguistic vision and projection.
Total citations
[Citations-per-year chart, 2009 to 2023; per-year counts not recoverable]