Streaming first story detection with application to Twitter

S Petrovic, M Osborne, V Lavrenko - … Technologies: The 2010 …, 2010 - research.ed.ac.uk
Human Language Technologies: The 2010 Annual Conference of the North …, 2010
Abstract
With the recent rise in popularity and size of social media, there is a growing need for systems that can extract useful information from this volume of data. We address the problem of detecting new events from a stream of Twitter posts. To make event detection feasible on web-scale corpora, we present an algorithm based on locality-sensitive hashing which is able to overcome the limitations of traditional approaches, while maintaining competitive results. In particular, a comparison with a state-of-the-art system on the first story detection task shows that we achieve over an order of magnitude speedup in processing time, while retaining comparable performance. Event detection experiments on a collection of 160 million Twitter posts show that celebrity deaths are the fastest spreading news on Twitter.
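The abstract names locality-sensitive hashing but gives no implementation details, so the following is only a minimal sketch of the general idea, not the authors' system. It assumes random-hyperplane hashing over bag-of-words vectors with cosine similarity. The class name `LSHNoveltyDetector`, its parameters (`num_bits`, `num_tables`, `seed`), and the whitespace tokenizer are all hypothetical choices made for illustration. Each incoming post is compared only against earlier posts that share a hash bucket, and its novelty score is one minus the highest cosine similarity found.

```python
import math
import random
from collections import Counter, defaultdict


def cosine(a, b):
    """Cosine similarity between two sparse term-count dicts."""
    dot = sum(v * b.get(k, 0) for k, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


class LSHNoveltyDetector:
    """Hypothetical streaming novelty scorer using random-hyperplane LSH."""

    def __init__(self, num_bits=8, num_tables=4, seed=0):
        self.num_bits = num_bits
        self.num_tables = num_tables
        self.seed = seed
        # One bucket map per hash table: signature -> list of stored vectors.
        self.tables = [defaultdict(list) for _ in range(num_tables)]
        # Hyperplane weights are created lazily per (table, bit, term),
        # so the vocabulary does not need to be known in advance.
        self.weights = {}

    def _weight(self, table, bit, term):
        key = (table, bit, term)
        if key not in self.weights:
            rng = random.Random(f"{self.seed}-{table}-{bit}-{term}")
            self.weights[key] = rng.gauss(0.0, 1.0)
        return self.weights[key]

    def _signature(self, table, vec):
        # One bit per hyperplane: the sign of the projection onto it.
        bits = []
        for bit in range(self.num_bits):
            proj = sum(c * self._weight(table, bit, t) for t, c in vec.items())
            bits.append(proj >= 0)
        return tuple(bits)

    def process(self, text):
        """Score a post against colliding earlier posts, then index it."""
        vec = dict(Counter(text.lower().split()))
        best = 0.0
        signatures = []
        for table in range(self.num_tables):
            sig = self._signature(table, vec)
            signatures.append(sig)
            for other in self.tables[table][sig]:
                best = max(best, cosine(vec, other))
        for table, sig in enumerate(signatures):
            self.tables[table][sig].append(vec)
        return 1.0 - best  # close to 1.0 means no similar earlier post found


detector = LSHNoveltyDetector()
for post in ["earthquake hits city centre",
             "earthquake hits city centre",
             "singer announces new album"]:
    print(round(detector.process(post), 3), post)
```

A post scoring above some chosen threshold would be flagged as a candidate first story. This sketch keeps every post in memory indefinitely, which a real streaming system could not do. Bounding bucket sizes or expiring old posts is left out here for brevity.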