[HTML][HTML] Investigating the Challenges and Opportunities in Persian Language Information Retrieval through Standardized Data Collections and Deep Learning

S Moniri, T Schlosser, D Kowerko - Computers, 2024 - mdpi.com
The Persian language, also known as Farsi, is distinguished by its intricate morphological
richness, yet it contends with a paucity of linguistic resources. With an estimated 110 million …

DeepLink: A novel link prediction framework based on deep learning

MM Keikha, M Rahgozar… - Journal of Information …, 2021 - journals.sagepub.com
Recently, link prediction has attracted more attention from various disciplines such as
computer science, bioinformatics and economics. In link prediction, numerous information …

The impact of corpus domain on word representation: a study on Persian word embeddings

A Hadifar, S Momtazi - language resources and evaluation, 2018 - Springer
Word embedding, has been a great success story for natural language processing in recent
years. The main purpose of this approach is providing a vector representation of words …

Time sensitive blog retrieval using temporal properties of queries

MS Zahedi, A Aleahmad, M Rahgozar… - Journal of …, 2017 - journals.sagepub.com
Blogs are one of the main user-generated contents on the web and are growing in number
rapidly. The characteristics of blogs require the development of specialized search methods …

Arman: Pre-training with semantically selecting and reordering of sentences for persian abstractive summarization

A Salemi, E Kebriaei, GN Minaei, A Shakery - arXiv preprint arXiv …, 2021 - arxiv.org
Abstractive text summarization is one of the areas influenced by the emergence of pre-
trained language models. Current pre-training works in abstractive summarization give more …

How questions are posed to a search engine? An empiricial analysis of question queries in a large scale Persian search engine log

MS Zahedi, B Mansouri, S Moradkhani… - … Conference on Web …, 2017 - ieeexplore.ieee.org
In this paper we investigate a Persian search engine log and present a comprehensive
analysis of question queries in three levels: structure, click and topic. By analyzing question …

Correction of spaces in Persian sentences for tokenization

M Panahandeh, S Ghanbari - 2019 5th Conference on …, 2019 - ieeexplore.ieee.org
The exponential growth of the Internet and its users and the emergence of Web 2.0 have
caused a large volume of textual data to be created. Automatic analysis of such data can be …

HmBlogs: A big general Persian corpus

HM Khansari, M Shamsfard - arXiv preprint arXiv:2111.02362, 2021 - arxiv.org
This paper introduces the hmBlogs corpus for Persian, as a low resource language. This
corpus has been prepared based on a collection of nearly 20 million blog posts over a …

Understanding User's Search Behavior towards Spiky Events

B Mansouri, MS Zahedi, R Campos… - … Proceedings of the The …, 2018 - dl.acm.org
Web searches are done by users every day on a million-daily basis. Many of these web
searches are related to events, social occasions that attracts society's attention. Events may …

FarsAcademic: A Standard Persian Test Collection for Information Retrieval in Scientific Texts

D Haseli, H Atapour, F Fahimniya… - International Journal of …, 2023 - ijism.isc.ac
A significant amount of scientific texts is produced in Persian and available in scientific
information databases through the Web. In this paper, FarsAcademic, a test collection of …