Natural language processing for dialects of a language: A survey

A Joshi, R Dabre, D Kanojia, Z Li, H Zhan… - ACM Computing …, 2024 - dl.acm.org
State-of-the-art natural language processing (NLP) models are trained on massive training
corpora, and report a superlative performance on evaluation datasets. This survey delves …

End-to-end neural relation extraction with global optimization

M Zhang, Y Zhang, G Fu - Proceedings of the 2017 conference on …, 2017 - aclanthology.org
Neural networks have shown promising results for relation extraction. State-of-the-art
models cast the task as an end-to-end problem, solved incrementally using a local classifier …

Parsing tweets into universal dependencies

Y Liu, Y Zhu, W Che, B Qin, N Schneider… - arXiv preprint arXiv …, 2018 - arxiv.org
We study the problem of analyzing tweets with Universal Dependencies. We extend the UD
guidelines to cover special constructions in tweets that affect tokenization, part-of-speech …

Twitter universal dependency parsing for African-American and mainstream American English

SL Blodgett, J Wei, B O'Connor - … of the 56th Annual Meeting of …, 2018 - aclanthology.org
Due to the presence of both Twitter-specific conventions and non-standard and dialectal
language, Twitter presents a significant parsing challenge to current dependency parsing …

Universal Dependency parsing for Hindi-English code-switching

IA Bhat, RA Bhat, M Shrivastava… - arXiv preprint arXiv …, 2018 - arxiv.org
Code-switching is a phenomenon of mixing grammatical structures of two or more
languages under varied social constraints. The code-switching data differ so radically from …

[PDF][PDF] PoSTWITA-UD: an Italian Twitter Treebank in universal dependencies

M Sanguinetti, C Bosco, A Lavelli… - Proceedings of the …, 2018 - aclanthology.org
Due to the spread of social media-based applications and the challenges posed by the
treatment of social media texts in NLP tools, tailored approaches and ad hoc resources are …

Sociolinguistically driven approaches for just natural language processing

SL Blodgett - 2021 - scholarworks.umass.edu
Natural language processing (NLP) systems are now ubiquitous. Yet the benefits of these
language technologies do not accrue evenly to all users, and indeed they can be harmful; …

Jampatoisnli: A jamaican patois natural language inference dataset

RA Armstrong, J Hewitt, C Manning - arXiv preprint arXiv:2212.03419, 2022 - arxiv.org
JamPatoisNLI provides the first dataset for natural language inference in a creole language,
Jamaican Patois. Many of the most-spoken low-resource languages are creoles. These …

DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

F Faisal, O Ahia, A Srivastava, K Ahuja… - arXiv preprint arXiv …, 2024 - arxiv.org
Language technologies should be judged on their usefulness in real-world use cases. An
often overlooked aspect in natural language processing (NLP) research and evaluation is …

Cross-lingual dependency parsing using code-mixed treebank

Z Meishan, Z Yue, F Guohong - arXiv preprint arXiv:1909.02235, 2019 - arxiv.org
Treebank translation is a promising method for cross-lingual transfer of syntactic
dependency knowledge. The basic idea is to map dependency arcs from a source treebank …