Mining quality phrases from massive text corpora

J Liu, J Shang, C Wang, X Ren, J Han - Proceedings of the 2015 ACM …, 2015 - dl.acm.org
Text data are ubiquitous and play an essential role in big data applications. However, text
data are mostly unstructured. Transforming unstructured text into structured units (e.g. …

Semantic matching in search

H Li, J Xu - Foundations and Trends® in Information …, 2014 - nowpublishers.com
Relevance is the most important factor to assure users' satisfaction in search and the
success of a search engine heavily depends on its performance on relevance. It has been …

Mining search and browse logs for web search: A survey

D Jiang, J Pei, H Li - ACM Transactions on Intelligent Systems and …, 2013 - dl.acm.org
Huge amounts of search log data have been accumulated at Web search engines.
Currently, a popular Web search engine may receive billions of queries and collect terabytes …

A piggyback system for joint entity mention detection and linking in web queries

M Cornolti, P Ferragina, M Ciaramita, S Rüd… - Proceedings of the 25th …, 2016 - dl.acm.org
In this paper we study the problem of linking open-domain web-search queries towards
entities drawn from the full entity inventory of Wikipedia articles. We introduce SMAPH-2, a …

[PDF] Knowledge graph and corpus driven segmentation and answer inference for telegraphic entity-seeking queries

M Joshi, U Sawant, S Chakrabarti - Proceedings of the 2014 …, 2014 - aclanthology.org
Much recent work focuses on formal interpretation of natural question utterances, with the
goal of executing the resulting structured queries on knowledge graphs (KGs) such as …

Towards concept-based translation models using search logs for query expansion

J Gao, JY Nie - Proceedings of the 21st ACM international conference …, 2012 - dl.acm.org
Query logs have been successfully used to improve Web search. One of the directions
exploits user clickthrough data to extract related terms to a query to perform query expansion …

Towards optimum query segmentation: in doubt without

M Hagen, M Potthast, A Beyer, B Stein - Proceedings of the 21st ACM …, 2012 - dl.acm.org
Query segmentation is the problem of identifying those keywords in a query, which together
form compound concepts or phrases like "new york times". Such segments can help a …

[BOOK] Phrase mining from massive text and its applications

J Liu, J Shang, J Han - 2022 - books.google.com
A lot of digital ink has been spilled on "big data" over the past few years. Most of this surge
owes its origin to the various types of unstructured data in the wild, among which the …

A generalized hidden markov model with discriminative training for query spelling correction

Y Li, H Duan, CX Zhai - Proceedings of the 35th international ACM SIGIR …, 2012 - dl.acm.org
Query spelling correction is a crucial component of modern search engines. Existing
methods in the literature for search query spelling correction have two major drawbacks …

Efficiently mining high quality phrases from texts

B Li, X Yang, B Wang, W Cui - Proceedings of the AAAI Conference on …, 2017 - ojs.aaai.org
Phrase mining is a key research problem for semantic analysis and text-based information
retrieval. The existing approaches based on NLP, frequency, and statistics cannot extract …