Sign in
Author
|
Conference
|
Journal
|
Organization
|
Year
|
DOI
Look for results that meet for the following criteria:
since
equal to
before
between
and
Search in all fields of study
Limit my searches in the following fields of study
Agriculture Science
Arts & Humanities
Biology
Chemistry
Computer Science
Economics & Business
Engineering
Environmental Sciences
Geosciences
Material Science
Mathematics
Medicine
Physics
Social Science
Multidisciplinary
Keywords
(7)
Greedy Algorithm
Indexation
Search Engine
Set Cover
Set Covering Problem
Web Pages
Word Frequency
Subscribe
Academic
Publications
An algorithmic treatment of strong queries
An algorithmic treatment of strong queries,10.1145/1935826.1935928,Ravi Kumar,Silvio Lattanzi,Prabhakar Raghavan
Edit
An algorithmic treatment of strong queries
BibTex
|
RIS
|
RefWorks
Download
Ravi Kumar
,
Silvio Lattanzi
,
Prabhakar Raghavan
A strong query for a target document with respect to an index is the smallest query for which the target document is returned by the index as the top result for the query. The strong query problem was first studied more than a decade ago in the context of measuring
search engine
overlap. Despite its simple-to-state nature and its longevity in the field, this problem has not been sufficiently addressed in a formal manner. In this paper we provide the first rigorous treatment of the strong query problem. We show an interesting connection between this problem and the
set cover
problem, and use it to obtain basic hardness and algorithmic results. Experiments on more than 10K documents show that our proposed algorithm performs much better than the widely-used word frequency-based heuristic. En route, our study suggests that less than four words on average can be sufficient to uniquely identify web pages.
Conference:
Web Search and Data Mining - WSDM
, pp. 775-784, 2011
DOI:
10.1145/1935826.1935928
Cumulative
Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.
(
dl.acm.org
)
(
portal.acm.org
)
(
portal.acm.org
)
(
www.informatik.uni-trier.de
)
(
doi.acm.org
)
More »
References
(21)
The webgraph framework I: compression techniques
(
Citations: 151
)
Paolo Boldi
,
Sebastiano Vigna
Conference:
World Wide Web Conference Series - WWW
, pp. 595-602, 2004
A scalable pattern mining approach to web graph compression with communities
(
Citations: 25
)
Gregory Buehrer
,
Kumar Chellapilla
Conference:
Web Search and Data Mining - WSDM
, pp. 95-106, 2008
Max-cover in map-reduce
(
Citations: 2
)
Flavio Chierichetti
,
Ravi Kumar
,
Andrew Tomkins
Conference:
World Wide Web Conference Series - WWW
, pp. 231-240, 2010
Automatic retrieval of similar content using search engine query interface
(
Citations: 3
)
Ali Dasdan
,
Paolo D'alberto
,
Santanu Kolay
,
Chris Drome
Conference:
International Conference on Information and Knowledge Management - CIKM
, pp. 701-710, 2009
A note on the probabilistic analysis of the minimum set cover problem
(
Citations: 1
)
Jaime Davila
,
Sanguthevar Rajasekaran