[BOOK][B] An introduction to information retrieval

CD Manning - 2009 - edl.emi.gov.et
As recently as the 1990s, studies showed that most people preferred getting information
from other people rather than from information retrieval systems. Of course, in that time …

[BOOK][B] Modern information retrieval

R Baeza-Yates, B Ribeiro-Neto - 1999 - people.ischool.berkeley.edu
Information retrieval (IR) has changed considerably in recent years with the expansion of the
World Wide Web and the advent of modern and inexpensive graphical user interfaces and …

[BOOK][B] Web data mining: exploring hyperlinks, contents, and usage data

B Liu - 2011 - Springer
Liu has written a comprehensive text on Web mining, which consists of two parts. The first
part covers the data mining and machine learning foundations, where all the essential …

Data-Centric Systems and Applications

MJ Carey, S Ceri, P Bernstein, U Dayal, C Faloutsos… - Italy: Springer, 2006 - Springer
The rapid growth of the Web in the past two decades has made it the largest publicly
accessible data source in the world. Web mining aims to discover useful information or …

[PDF][PDF] Building large corpora from the web using a new efficient tool chain.

R Schäfer, F Bildhauer - LREC, 2012 - researchgate.net
Over the last decade, methods of web corpus construction and the evaluation of web
corpora have been actively researched. Prominently, the WaCky initiative has provided both …

Web crawling

C Olston, M Najork - Foundations and Trends® in Information …, 2010 - nowpublishers.com
This is a survey of the science and practice of web crawling. While at first glance web
crawling may appear to be merely an application of breadth-first-search, the truth is that …

[PDF][PDF] Introduction to information retrieval

CD Manning, P Raghavan, H Schütze - 2008 - 155.0.49.213
Introduction to Information Retrieval is the first textbook with a coherent treatment of classical
and web information retrieval, including web search and the related areas of text …

Measuring semantic similarity between words using web search engines.

D Bollegala, Y Matsuo, M Ishizuka - WWW, 2007 - dl.acm.org
Semantic similarity measures play important roles in information retrieval and Natural
Language Processing. Previous work in semantic web-related applications such as …

Estimating and sampling graphs with multidimensional random walks

B Ribeiro, D Towsley - Proceedings of the 10th ACM SIGCOMM …, 2010 - dl.acm.org
Estimating characteristics of large graphs via sampling is a vital part of the study of complex
networks. Current sampling methods such as (independent) random vertex and random …

Oblivious data structures

XS Wang, K Nayak, C Liu, THH Chan, E Shi… - Proceedings of the …, 2014 - dl.acm.org
We design novel, asymptotically more efficient data structures and algorithms for programs
whose data access patterns exhibit some degree of predictability. To this end, we propose …