dc.contributor.author: Dikaiakos, Marios D. (en)
dc.contributor.author: Stassopoulou, Athena (en)
dc.contributor.author: Papageorgiou, Loizos (en)
dc.creator: Dikaiakos, Marios D. (en)
dc.creator: Stassopoulou, Athena (en)
dc.creator: Papageorgiou, Loizos (en)
dc.date.accessioned: 2019-11-13T10:39:52Z
dc.date.available: 2019-11-13T10:39:52Z
dc.date.issued: 2005
dc.identifier.uri: http://gnosis.library.ucy.ac.cy/handle/7/53844
dc.description.abstract: In this paper, we present a characterization study of search-engine crawlers. For the purposes of our work, we use Web-server access logs from five academic sites in three different countries. Based on these logs, we analyze the activity of different crawlers that belong to five search engines: Google, AltaVista, Inktomi, FastSearch and CiteSeer. We compare crawler behavior to the characteristics of the general World-Wide Web traffic and to general characterization studies. We analyze crawler requests to derive insights into the behavior and strategy of crawlers. We propose a set of simple metrics that describe qualitative characteristics of crawler behavior, vis-à-vis a crawler's preference on resources of a particular format, its frequency of visits on a Web site, and the pervasiveness of its visits to a particular site. To the best of our knowledge, this is the first extensive and in depth characterization of search-engine crawlers. Our results and observations provide useful insights into crawler behavior and serve as basis of our ongoing work on the automatic detection of Web crawlers. © 2005 Elsevier B.V. All rights reserved. (en)
dc.source: Computer Communications (en)
dc.source.uri: https://www.scopus.com/inward/record.uri?eid=2-s2.0-17644390582&doi=10.1016%2fj.comcom.2005.01.003&partnerID=40&md5=bbaae7e21912c52febdc917fe80e21e6
dc.subject: World Wide Web (en)
dc.subject: Search engines (en)
dc.subject: Resource allocation (en)
dc.subject: Servers (en)
dc.subject: Java programming language (en)
dc.subject: Web crawlers (en)
dc.subject: Crawlers (en)
dc.subject: HTTP (en)
dc.subject: Local area networks (en)
dc.subject: Web characterization (en)
dc.subject: Web servers (en)
dc.subject: Web traffic (en)
dc.title: An investigation of web crawler behavior: Characterization and metrics (en)
dc.type: info:eu-repo/semantics/article
dc.identifier.doi: 10.1016/j.comcom.2005.01.003
dc.description.volume: 28
dc.description.issue: 8
dc.description.startingpage: 880
dc.description.endingpage: 897
dc.author.faculty: 002 Σχολή Θετικών και Εφαρμοσμένων Επιστημών / Faculty of Pure and Applied Sciences
dc.author.department: Τμήμα Πληροφορικής / Department of Computer Science
dc.type.uhtype: Article (en)
dc.description.notes: Cited By: 45 (en)
dc.source.abbreviation: Comput.Commun. (en)
dc.contributor.orcid: Dikaiakos, Marios D. [0000-0002-4350-6058]
dc.gnosis.orcid: 0000-0002-4350-6058

