Indic-transformers: An analysis of transformer language models for Indian languages

[HTML][HTML] How can we detect homophobia and transphobia? experiments in a multilingual code-mixed setting for social media governance

BR Chakravarthi, A Hande, R Ponnusamy… - International Journal of …, 2022 - Elsevier

Homophobia or Transphobia can be defined as the hatred, discomfort, or dislike of lesbian,
gay, transgender or bisexual people. Studies have shown that these individuals were more …

被引用次数：59 相关文章

[PDF] arxiv.org

L3cube-hindbert and devbert: Pre-trained bert transformer models for devanagari based hindi and marathi languages

R Joshi - arXiv preprint arXiv:2211.11418, 2022 - arxiv.org

The monolingual Hindi BERT models currently available on the model hub do not perform
better than the multi-lingual models on downstream tasks. We present L3Cube-HindBERT, a …

被引用次数：53 相关文章所有 3 个版本

[PDF] arxiv.org

L3cube-mahacorpus and mahabert: Marathi monolingual corpus, marathi bert language models, and resources

R Joshi - arXiv preprint arXiv:2202.01159, 2022 - arxiv.org

We present L3Cube-MahaCorpus a Marathi monolingual data set scraped from different
internet sources. We expand the existing Marathi monolingual corpus with 24.8 M sentences …

被引用次数：56 相关文章所有 6 个版本

[PDF] arxiv.org

A review of bangla natural language processing tasks and the utility of transformer models

F Alam, A Hasan, T Alam, A Khan, J Tajrin… - arXiv preprint arXiv …, 2021 - arxiv.org

Bangla--ranked as the 6th most widely spoken language across the world (https://www.
ethnologue. com/guides/ethnologue200), with 230 million native speakers--is still …

被引用次数：27 相关文章所有 4 个版本

[PDF] arxiv.org

Mono vs multilingual bert for hate speech detection and text classification: A case study in marathi

A Velankar, H Patil, R Joshi - IAPR Workshop on Artificial Neural Networks …, 2022 - Springer

Transformers are the most eminent architectures used for a vast range of Natural Language
Processing tasks. These models are pre-trained over a large text corpus and are meant to …

被引用次数：52 相关文章所有 8 个版本

[PDF] neurips.cc

Distributed deep learning in open collaborations

M Diskin, A Bukhtiyarov, M Ryabinin… - Advances in …, 2021 - proceedings.neurips.cc

Modern deep learning applications require increasingly more compute to train state-of-the-
art models. To address this demand, large corporations and institutions use dedicated High …

被引用次数：57 相关文章所有 9 个版本

[PDF] arxiv.org

Hope speech detection in under-resourced kannada language

A Hande, R Priyadharshini, A Sampath… - arXiv preprint arXiv …, 2021 - arxiv.org

Numerous methods have been developed to monitor the spread of negativity in modern
years by eliminating vulgar, offensive, and fierce comments from social media platforms …

被引用次数：37 相关文章所有 3 个版本

[PDF] aclanthology.org

[PDF][PDF] Hate speech detection: a comparison of mono and multilingual transformer model with cross-language evaluation

K Ghosh, A Senapati - Proceedings of the 36th Pacific Asia …, 2022 - aclanthology.org

Warning: This paper contains examples of the language that some people may find
offensive. Transformer-based Language models have achieved state-of-the-art performance …

被引用次数：22 相关文章

[PDF] ieee.org

Authorship classification in a resource constraint language using convolutional neural networks

MR Hossain, MM Hoque, MAA Dewan… - IEEE …, 2021 - ieeexplore.ieee.org

Authorship classification is a method of automatically determining the appropriate author of
an unknown linguistic text. Although research on authorship classification has significantly …

被引用次数：24 相关文章所有 8 个版本

[PDF] arxiv.org

Role of language relatedness in multilingual fine-tuning of language models: A case study in indo-aryan languages

TI Dhamecha, R Murthy V, S Bharadwaj… - arXiv preprint arXiv …, 2021 - arxiv.org

We explore the impact of leveraging the relatedness of languages that belong to the same
family in NLP models using multilingual fine-tuning. We hypothesize and validate that …

被引用次数：23 相关文章所有 6 个版本

高级搜索

QQ 群