Do not crawl in the DUST: Different URLs with similar text | Z Bar-Yossef, I Keidar, U Schonfeld | ACM Transactions on the Web (TWEB) 3 (1), 1-31, 2009 | 151 | 2009 |
Topical semantics of Twitter links | MJ Welch, U Schonfeld, D He, J Cho | Proceedings of the fourth ACM international conference on Web search and …, 2011 | 144 | 2011 |
RankMass Crawler: A Crawler with High PageRank Coverage Guarantee. | J Cho, U Schonfeld | VLDB 7, 23-28, 2007 | 76 | 2007 |
Sitemaps: above and beyond the crawl of duty | U Schonfeld, N Shivakumar | Proceedings of the 18th international conference on World wide web, 991-1000, 2009 | 66 | 2009 |
Search engine coverage | A Azagury, C Leue, U Schonfeld | US Patent App. 11/185,999, 2007 | 37 | 2007 |
Do not crawl in the DUST: different URLs with similar text | U Schonfeld, Z Bar-Yossef, I Keidar | Proceedings of the 15th international conference on World Wide Web, 1015-1016, 2006 | 25 | 2006 |
Automated search intent discovery | R Kraft, U Schonfeld | US Patent App. 14/501,222, 2016 | 9 | 2016 |
System and method for detecting duplicate content items | U Schonfeld, A Bhattacharjee, R Ahuja | US Patent App. 11/939,834, 2009 | 3 | 2009 |
Crawling for results | U Schonfeld | University of California at Los Angeles, 2011 | | 2011 |
Do not Crawl in the DUST: Different URLs with Similar Text Extended Abstract | U Schonfeld, Z Bar-Yossef, I Keidar | | | |