Parsbert: Transformer-based model for persian language understanding

M Farahani, M Gharachorloo, M Farahani… - Neural Processing …, 2021 - Springer
The surge of pre-trained language models has begun a new era in the field of Natural
Language Processing (NLP) by allowing us to build powerful language models. Among …

Semantic Relation Extraction: A Review of Approaches, Datasets, and Evaluation Methods With Looking at the Methods and Datasets in the Persian Language

H Gharagozlou, J Mohammadzadeh… - ACM Transactions on …, 2023 - dl.acm.org
A large volume of unstructured data, especially text data, is generated and exchanged daily.
Consequently, the importance of extracting patterns and discovering knowledge from textual …

ParsiNLU: A Suite of Language Understanding Challenges for Persian

D Khashabi, A Cohan, S Shakeri, P Hosseini… - Transactions of the …, 2021 - direct.mit.edu
Despite the progress made in recent years in addressing natural language understanding
(NLU) challenges, the majority of this progress remains to be concentrated on resource-rich …

[PDF][PDF] Challenges and opportunities in processing low resource languages: A study on persian

M Shamsfard - International conference language technologies for all …, 2019 - lt4all.org
This paper discusses the importance of language processing and its challenges. It first
defines low resource languages and their influencing factors. Then talking about the Persian …

Beheshti-NER: Persian named entity recognition using BERT

E Taher, SA Hoseini, M Shamsfard - arXiv preprint arXiv:2003.08875, 2020 - arxiv.org
Named entity recognition is a natural language processing task to recognize and extract
spans of text associated with named entities and classify them in semantic Categories …

Application of k-means clustering algorithm to improve effectiveness of the results recommended by journal recommender system

N Vara, M Mirzabeigi, H Sotudeh, SM Fakhrahmad - Scientometrics, 2022 - Springer
This study investigates to evaluate feasibility of k-means clustering algorithm in order to
improve effectiveness of the results recommended by RICEST Journal Finder System. More …

FaBERT: Pre-training BERT on Persian Blogs

M Masumi, SS Majd, M Shamsfard, H Beigy - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce FaBERT, a Persian BERT-base model pre-trained on the HmBlogs corpus,
encompassing both informal and formal Persian texts. FaBERT is designed to excel in …

Parsner-social: A corpus for named entity recognition in persian social media texts

M Asgari-Bidhendi, B Janfada… - Journal of AI and …, 2021 - jad.shahroodut.ac.ir
Named Entity Recognition (NER) is one of the essential prerequisites for many natural
language processing tasks. All public corpora for Persian named entity recognition, such as …

Comparative study of text representation and learning for Persian named entity recognition

MM Abdollah Pour, S Momtazi - ETRI Journal, 2022 - Wiley Online Library
Transformer models have had a great impact on natural language processing (NLP) in
recent years by realizing outstanding and efficient contextualized language models. Recent …

[HTML][HTML] Investigating the Challenges and Opportunities in Persian Language Information Retrieval through Standardized Data Collections and Deep Learning

S Moniri, T Schlosser, D Kowerko - Computers, 2024 - mdpi.com
The Persian language, also known as Farsi, is distinguished by its intricate morphological
richness, yet it contends with a paucity of linguistic resources. With an estimated 110 million …