What's in a name? An unsupervised approach to link users across communities

J Liu, F Zhang, X Song, YI Song, CY Lin… - Proceedings of the sixth …, 2013 - dl.acm.org
In this paper, we consider the problem of linking users across multiple online communities.
Specifically, we focus on the alias-disambiguation step of this user linking task, which is …

Twiner: named entity recognition in targeted twitter stream

C Li, J Weng, Q He, Y Yao, A Datta, A Sun… - Proceedings of the 35th …, 2012 - dl.acm.org
Many private and/or public organizations have been reported to create and monitor targeted
Twitter streams to collect and understand users' opinions about the organizations. Targeted …

Modeling dwell time to predict click-level satisfaction

Y Kim, A Hassan, RW White, I Zitouni - … on Web search and data mining, 2014 - dl.acm.org
Clicks on search results are the most widely used behavioral signals for predicting search
satisfaction. Even though clicks are correlated with satisfaction, they can also be noisy …

N-gram counts and language models from the common crawl

C Buck, K Heafield, B Van Ooyen - Proceedings of the Language …, 2014 - research.ed.ac.uk
We contribute 5-gram counts and language models trained on the Common Crawl corpus, a
collection over 9 billion web pages. This release improves upon the Google n-gram counts …

[PDF][PDF] A broad-coverage normalization system for social media language

F Liu, F Weng, X Jiang - Proceedings of the 50th Annual Meeting …, 2012 - aclanthology.org
Social media language contains huge amount and wide variety of nonstandard tokens,
created both intentionally and unintentionally by the users. It is of crucial importance to …

Rapid detection of fake news based on machine learning methods

B Probierz, P Stefański, J Kozak - Procedia Computer Science, 2021 - Elsevier
Nowadays, it is very important to quickly recognize the false information referred to as fake
news. This is especially important in the case of news appearing on the Internet because of …

SpringerBriefs in Computer Science

S Zdonik, P Ning, S Shekhar, J Katz, X Wu, LC Jain… - 2012 - Springer
This is an introduction to multicast routing, which is the study of methods for routing from one
source to many destinations, or from many sources to many destinations. Multicast is …

Tweet segmentation and its application to named entity recognition

C Li, A Sun, J Weng, Q He - IEEE Transactions on knowledge …, 2014 - ieeexplore.ieee.org
Twitter has attracted millions of users to share and disseminate most up-to-date information,
resulting in large volumes of data produced everyday. However, many applications in …

Micropinion generation: an unsupervised approach to generating ultra-concise summaries of opinions

K Ganesan, CX Zhai, E Viegas - … of the 21st international conference on …, 2012 - dl.acm.org
This paper presents a new unsupervised approach to generating ultra-concise summaries of
opinions. We formulate the problem of generating such a micropinion summary as an …

Are large language models geospatially knowledgeable?

P Bhandari, A Anastasopoulos, D Pfoser - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Despite the impressive performance of Large Language Models (LLM) for various natural
language processing tasks, little is known about their comprehension of geographic data …