A state-of-the-art survey on semantic similarity for document clustering using GloVe and density-based algorithms

SM Mohammed, K Jacksi… - Indonesian Journal of …, 2021 - pdfs.semanticscholar.org
Semantic similarity is the process of identifying relevant data semantically. The traditional
way of identifying document similarity is by using synonymous keywords and syntactician. In …

Re-evaluating word mover's distance

R Sato, M Yamada, H Kashima - … Conference on Machine …, 2022 - proceedings.mlr.press
The word mover's distance (WMD) is a fundamental technique for measuring the similarity of
two documents. As the crux of WMD, it can take advantage of the underlying geometry of the …

Document Clustering in the Age of Big Data: Incorporating Semantic Information for Improved Results

SH Haji, A Al-zebari, A Sengur, S Fattah… - Journal of Applied …, 2023 - jastt.org
There has been a meteoric rise in the total amount of digital texts as a direct result of the
proliferation of internet access. As a direct result of this, document clustering has evolved …

Fuzzy conceptualization model for document representation

P Sijin, HN Champa - 2020 IEEE International Conference on …, 2020 - ieeexplore.ieee.org
The Fuzzy Conceptualization Model (FCM) performs a fuzzy mapping of query set to the
given word corpus to obtain fuzzy membership degree for a document based on semantic …

Beyond Mere Words: Advanced Text Representations for Practical Similarity Analysis

M Wrzalik - 2024 - hlbrm.pur.hebis.de
This dissertation is concerned with the assessment of textual similarity based on novel
language models in the context of real-world application requirements. In the light of a rapid …