[图书][B] Contemporary corpus linguistics

P Baker - 2012 - books.google.com
Corpus linguistics uses large electronic databases of language to examine hypotheses
about language use. These can be tested scientifically with computerised analytical tools …

Corpus linguistics and language technology

NS Dash - Routledge Encyclopedia of Technology and the …, 2024 - taylorfrancis.com
The chapter on linguistics and technology is written by Professor Niladri Sekhar Dash of the
Indian Statistical Institute in India. In the chapter 'Corpus Linguistics and Language …

In search of a suitable method for disambiguation of word senses in Bengali

AR Pal, D Saha, SK Naskar, NS Dash - International Journal of Speech …, 2021 - Springer
The paper presents a study on word sense disambiguation (WSD) in Bengali, one of the less
resourced Indian languages. The overall work is carried out in two sequential phases. In the …

[PDF][PDF] Statistical analysis of Telugu text corpora

GB Kumar, KN Murthy, BB Chaudhuri - 2007 - library.isical.ac.in
Corpora and corpus based studies are relatively recent and limited in Indian languages as
compared to other major languages of the world. This paper is about statistical analyses of …

L'ús de corpus en la traducció especialitzada: compilació de corpus ad hoc i extracció de recursos terminològics

P Sánchez-Gijón - 2004 - torrossa.com
D'entrada podem dir que s' han començat a superar molts dels prejudicis que hi havia. Per
exemple s' ha posat en qüestió que la terminologia era una mera pràctica de confecció de …

Corpus design for Setswana lexicography

TJ Otlogetswe - 2008 - repository.up.ac.za
This PhD thesis is about the design of a Setswana corpus for lexicography. While various
corpora have been compiled and a variety of corpora-based researches attempted in African …

[PDF][PDF] Language corpora: present Indian need

NS Dash - Proceedings of the SCALLA 2004 Working Conference, 2004 - Citeseer
Corpora have proved their value both in linguistics and language technology. Information
obtained from corpora has challenged the intuitive language study, since intuitive …

A novel approach to build Kannada web Corpus

S Parameswarappa, VN Narayana… - 2012 International …, 2012 - ieeexplore.ieee.org
This paper introduces the Kannada Corpus tool, a suite of Perl (Program Extraction and
Reporting Language) programs implementing an iterative procedure to build Kannada …

[图书][B] Text Variability Measures in Corpus Design for Setswana Lexicography

TJ Otlogetswe - 2011 - books.google.com
This book is about the design of a Setswana corpus for lexicography. While various corpora
have been compiled and a variety of corpora-based research has been attempted in African …

Frequency and function of characters used in the Bangla text corpus

NS Dash - Literary and linguistic computing, 2004 - academic.oup.com
Empirical analysis of any natural language needs to be substantiated with the statistical
findings because without adequate knowledge from statistics any linguistic study can fall into …