Automatic language identification in texts: A survey

T Jauhiainen, M Lui, M Zampieri, T Baldwin… - Journal of Artificial …, 2019 - jair.org
Language identification ("LI") is the problem of determining the natural language that a
document or part thereof is written in. Automatic LI has been extensively researched for over …

Automatic detection of English inclusions in mixed-lingual data with an application to parsing

B Alex - 2008 - researchgate.net
The influence of English continues to grow to the extent that its expressions have begun to
permeate the original forms of other languages. It has become more acceptable, and in …

Linguistic resources for Bhojpuri, Magahi, and Maithili: statistics about them, their similarity estimates, and baselines for three applications

RK Mundotiya, MK Singh, R Kapur, S Mishra… - Transactions on Asian …, 2021 - dl.acm.org
Corpus preparation for low-resource languages and for development of human language
technology to analyze or computationally process them is a laborious task, primarily due to …

Text to speech synthesis for texts with foreign language inclusions

J Wouters, C Traber, D Hagstrand, A Wilpert… - US Patent …, 2015 - Google Patents
A speech output is generated from a text input written in a first language and
containing inclusions in a second language. Words in the native language are pronounced …

Using foreign inclusion detection to improve parsing performance

B Alex, A Dubey, F Keller - … of the 2007 joint conference on …, 2007 - aclanthology.org
Inclusions from other languages can be a significant source of errors for monolingual
parsers. We show this for English inclusions, which are sufficiently frequent to present a …

System and method for handling multiple languages in text

C Brun - US Patent 8,285,541, 2012 - Google Patents
A system and method for processing text are disclosed. The method includes receiving text
to be processed. A main language of the text is identified. At least one unknown sequence in …

Multilingual text entry using automatic language detection

Y Ehara, K Tanaka-Ishii - Proceedings of the Third International …, 2008 - aclanthology.org
Computer users increasingly need to produce text written in multiple languages. However,
typical computer interfaces require the user to change the text entry software each time a …

Prosody modification on mixed-language speech synthesis

Y Zhang, J Tao - 2008 6th International Symposium on …, 2008 - ieeexplore.ieee.org
This paper proposes a method to generate natural prosody parameters in Chinese and
English mixed-language speech synthesis system which is based on separate Chinese …

Basic linguistic resources and baselines for Bhojpuri, Magahi and Maithili for natural language processing

RK Mundotiya, MK Singh, R Kapur… - arXiv preprint arXiv …, 2020 - researchgate.net
To the best of our knowledge, our effort, which began in 2014, was one of the first to create
annotated resources for these languages. At that time, there was no publicly available …

Polyglot speech synthesis: a review

B Sharma, SRM Prasanna - IETE Technical Review, 2017 - Taylor & Francis
The term polyglot speech synthesis refers to the process of producing speech in multiple
languages and single speaker's voice from a single text-to-speech synthesis (TTS) system …