Automated phrase mining from massive text corpora

J Shang, J Liu, M Jiang, X Ren… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
As one of the fundamental tasks in text analysis, phrase mining aims at extracting quality
phrases from a text corpus and has various downstream applications including information …

MDERank: A masked document embedding rank approach for unsupervised keyphrase extraction

L Zhang, Q Chen, W Wang, C Deng, SL Zhang… - arXiv preprint arXiv …, 2021 - arxiv.org
Keyphrase extraction (KPE) automatically extracts phrases in a document that provide a
concise summary of the core content, which benefits downstream information retrieval and …

Ucphrase: Unsupervised context-aware quality phrase tagging

X Gu, Z Wang, Z Bi, Y Meng, L Liu, J Han… - Proceedings of the 27th …, 2021 - dl.acm.org
Identifying and understanding quality phrases from context is a fundamental task in text
mining. The most challenging part of this task arguably lies in uncommon, emerging, and …

Language model as an annotator: Unsupervised context-aware quality phrase generation

Z Zhang, Y Zuo, C Lin, J Wu - Knowledge-Based Systems, 2024 - Elsevier
Phrase mining is a fundamental text mining task that aims to identify quality phrases from
context. Nevertheless, the scarcity of extensive gold labels datasets, demanding substantial …

HAMNER: Headword amplified multi-span distantly supervised method for domain specific named entity recognition

S Liu, Y Sun, B Li, W Wang, X Zhao - … of the AAAI Conference on Artificial …, 2020 - ojs.aaai.org
Abstract To tackle Named Entity Recognition (NER) tasks, supervised methods need to
obtain sufficient cleanly annotated data, which is labor and time consuming. On the contrary …

An efficient method for high quality and cohesive topical phrase mining

B Li, X Yang, R Zhou, B Wang, C Liu… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
A phrase is a natural, meaningful, and essential semantic unit. In topic modeling, visualizing
phrases for individual topics is an effective way to explore and understand unstructured text …

ParsingPhrase: parsing-based automated quality phrase mining

Y Wu, S Zhao, S Dou, J Li - Information Sciences, 2023 - Elsevier
Phrases represent independent semantics in natural language but usually have
indeterminate lengths and different combinations. So, extracting meaningful phrases from …

Mining infrequent high-quality phrases from domain-specific corpora

L Wang, W Zhu, S Jiang, S Zhang, K Wang… - Proceedings of the 29th …, 2020 - dl.acm.org
Phrase mining is a fundamental task for text analysis and has various downstream
applications such as named entity recognition, topic modeling, and relation extraction. In this …

[图书][B] Automated taxonomy discovery and exploration

J Shen, J Han - 2022 - Springer
In today's information era, people are inundated with vast amounts of text data. Every day,
there are thousands of scientific papers, tens of thousands of news articles, corporate …

Constructing and mining heterogeneous information networks from massive text

J Shang, J Shen, L Liu, J Han - Proceedings of the 25th ACM SIGKDD …, 2019 - dl.acm.org
Real-world data exists largely in the form of unstructured texts. A grand challenge on data
mining research is to develop effective and scalable methods that may transform …