Show simple item record

dc.contributor.authorZeinalipour-Yazdi, Constantinos D.en
dc.contributor.authorDikaiakos, Marios D.en
dc.contributor.editorGal A.en
dc.contributor.editorHalevy A.en
dc.creatorZeinalipour-Yazdi, Constantinos D.en
dc.creatorDikaiakos, Marios D.en
dc.date.accessioned2019-11-13T10:43:03Z
dc.date.available2019-11-13T10:43:03Z
dc.date.issued2002
dc.identifier.issn0302-9743
dc.identifier.urihttp://gnosis.library.ucy.ac.cy/handle/7/55178
dc.description.abstractWeb crawlers are the key component of services running on Internet and providing searching and indexing support for the entire Web, for corporate Intranets and large portal sites. More recently, crawlers have also been used as tools to conduct focused Web searches and to gather data about the characteristics of the WWW. In this paper, we study the employment of crawlers as a programmable, scalable, and distributed component in future Internet middleware infrastructures and proxy services. In particular, we present the architecture and implementation of, and experimentation withWebRACE, a high-performance, distributedWeb crawler, filtering server and object cache.We address the challenge of designing and implementing modular, open, distributed, and scalable crawlers, using Java. We describe our design and implementation decisions, and various optimizations. We discuss the advantages and disadvantages of using Java to implement the WebRACE-crawler, and present an evaluation of its performance. WebRACE is designed in the context of eRACE, an extensible Retrieval Annotation Caching Engine, which collects, annotates and disseminates information from heterogeneous Internet sources and protocols, according to XML-encoded user profiles that determine the urgency and relevance of collected information. © Springer-Verlag Berlin Heidelberg 2002.en
dc.source5th International Workshop on Next Generation Information Technologies and Systems, NGITS 2002en
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-84937389622&partnerID=40&md5=d4388abbc8563ad5c4a2c10c99e37230
dc.subjectWorld Wide Weben
dc.subjectInterneten
dc.subjectSearch enginesen
dc.subjectDesign and implementationsen
dc.subjectInternet protocolsen
dc.subjectMiddlewareen
dc.subjectSocial networking (online)en
dc.subjectUser profileen
dc.subjectIntegrated circuit designen
dc.subjectDistributed componentsen
dc.subjectCorporate intraneten
dc.subjectDistributed crawleren
dc.subjectFuture interneten
dc.subjectInternet sourcesen
dc.subjectProxy servicesen
dc.titleDesign and implementation of a distributed crawler and filtering processoren
dc.typeinfo:eu-repo/semantics/article
dc.description.volume2382
dc.description.startingpage58
dc.description.endingpage74
dc.author.faculty002 Σχολή Θετικών και Εφαρμοσμένων Επιστημών / Faculty of Pure and Applied Sciences
dc.author.departmentΤμήμα Πληροφορικής / Department of Computer Science
dc.type.uhtypeArticleen
dc.description.notes<p>Sponsors:en
dc.description.notesConference code: 121059en
dc.description.notesCited By :17</p>en
dc.source.abbreviationLect. Notes Comput. Sci.en
dc.contributor.orcidZeinalipour-Yazdi, Constantinos D. [0000-0002-8388-1549]
dc.contributor.orcidDikaiakos, Marios D. [0000-0002-4350-6058]
dc.gnosis.orcid0000-0002-8388-1549
dc.gnosis.orcid0000-0002-4350-6058


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record