Distributed location aware web crawling
Samaras, George S.
SourceThirteenth International World Wide Web Conference Proceedings, WWW2004
Thirteenth International World Wide Web Conference Proceedings, WWW2004
Google Scholar check
MetadataShow full item record
Distributed crawling has shown that it can overcome important limitations of the today's crawling paradigm. However, the optimal benefits of this approach are usually limited to the sites hosting the crawler. In this work, we propose a location-aware method, called IPMicra, that utilizes an IP address hierarchy, and allows crawling of links in a near optimal location aware manner.