Authors
Muzammil Khan, Yasser Alharbi, Ali Alferaidi, Talal Saad Alharbi, Kusum Yadav
Publication date
2023/10
Journal
SAGE Open
Volume
13
Issue
4
Pages
21582440231201368
Publisher
SAGE Publications
Description
Preserving and managing digital news in low-resource languages is challenging, especially in vast collections. Well-defined attributes make it possible to uniquely identify individual digital objects and so support efficient management, including access, retrieval, preservation, usability, and transformability. A metadata element set is required to maximize the available attributes related to the digital objects. The aim is to create a comprehensive metadata set that contains all the necessary attributes and data about the digital news objects. The task is more challenging and complicated when the archive contains articles from low-resourced and morphologically complex languages like Urdu and Arabic, which are difficult for machines to understand. The study presents the challenges of low-resource languages (LRL) and research challenges. This metadata will help to link news articles based on similarity with other news articles …