Low-resource languages: A review of past work and future challenges

A Magueresse, V Carles, E Heetderks - arXiv preprint arXiv:2006.07264, 2020 - arxiv.org
A current problem in NLP is massaging and processing low-resource languages which lack
useful training attributes such as supervised data, number of native speakers or experts, etc …

Charner: Character-level named entity recognition

O Kuru, OA Can, D Yuret - Proceedings of COLING 2016, the 26th …, 2016 - aclanthology.org
We describe and evaluate a character-level tagger for language-independent Named Entity
Recognition (NER). Instead of words, a sentence is represented as a sequence of …

Named-entity recognition in Turkish legal texts

C Çetindağ, B Yazıcıoğlu, A Koç - Natural Language Engineering, 2023 - cambridge.org
Natural language processing (NLP) technologies and applications in legal text processing
are gaining momentum. Being one of the most prominent tasks in NLP, named-entity …

Improving classifier training efficiency for automatic cyberbullying detection with feature density

J Eronen, M Ptaszynski, F Masui… - Information Processing …, 2021 - Elsevier
We study the effectiveness of Feature Density (FD) using different linguistically-backed
feature preprocessing methods in order to estimate dataset complexity, which in turn is used …

Advancing natural language processing (NLP) applications of morphologically rich languages with bidirectional encoder representations from transformers (BERT): an …

A Özçift, K Akarsu, F Yumuk, C Söylemez - Automatika: časopis za …, 2021 - hrcak.srce.hr
Sažetak Language model pre-training architectures have demonstrated to be useful to learn
language representations. bidirectional encoder representations from transformers (BERT) …

Named entity recognition in Turkish: A comparative study with detailed error analysis

O Ozcelik, C Toraman - Information Processing & Management, 2022 - Elsevier
Named entity recognition aims to detect pre-determined entity types in unstructured text.
There is a limited number of studies on this task for low-resource languages such as Turkish …

Named entity recognition with word embeddings and wikipedia categories for a low-resource language

A Das, D Ganguly, U Garain - ACM Transactions on Asian and Low …, 2017 - dl.acm.org
In this article, we propose a word embedding--based named entity recognition (NER)
approach. NER is commonly approached as a sequence labeling task with the application of …

Corpus linguistics and language technology

NS Dash - Routledge Encyclopedia of Technology and the …, 2024 - taylorfrancis.com
The chapter on linguistics and technology is written by Professor Niladri Sekhar Dash of the
Indian Statistical Institute in India. In the chapter 'Corpus Linguistics and Language …

An evaluation of recent neural sequence tagging models in Turkish named entity recognition

G Aras, D Makaroğlu, S Demir, A Cakir - Expert Systems with Applications, 2021 - Elsevier
Named entity recognition (NER) is an extensively studied task that extracts and classifies
named entities in a text. NER is crucial not only in downstream language processing …

Turkish named entity recognition with deep learning

A Güneş, AC TantuĞ - 2018 26th signal processing and …, 2018 - ieeexplore.ieee.org
Named Entity Recognition (NER) is an important task in Natural Language Processing, Data
Mining and Information Extraction areas since 1990's. While NER is a succesfully solved …