Web robot detection: A probabilistic reasoning approach
dc.contributor.author | Stassopoulou, Athena | en |
dc.contributor.author | Dikaiakos, Marios D. | en |
dc.creator | Stassopoulou, Athena | en |
dc.creator | Dikaiakos, Marios D. | en |
dc.date.accessioned | 2019-11-13T10:42:21Z | |
dc.date.available | 2019-11-13T10:42:21Z | |
dc.date.issued | 2009 | |
dc.identifier.issn | 1389-1286 | |
dc.identifier.uri | http://gnosis.library.ucy.ac.cy/handle/7/55015 | |
dc.description.abstract | In this paper, we introduce a probabilistic modeling approach for addressing the problem of Web robot detection from Web-server access logs. More specifically, we construct a Bayesian network that automatically classifies access-log sessions as being crawler- or human-induced, by combining various pieces of evidence proven to characterize crawler and human behavior. Our approach uses an adaptive-threshold technique to extract Web sessions from access logs. Then, we apply machine learning techniques to determine the parameters of the probabilistic model. The resulting classification is based on the maximum posterior probability of all classes given the available evidence. We apply our method to real Web-server logs and obtain results that demonstrate the robustness and effectiveness of probabilistic reasoning for crawler detection. © 2008 Elsevier B.V. All rights reserved. | en |
dc.source | Computer Networks | en |
dc.source.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-58549116778&doi=10.1016%2fj.comnet.2008.09.021&partnerID=40&md5=d3dbb16373690bc2d0cbc50a029613fe | |
dc.subject | Behavioral research | en |
dc.subject | Learning algorithms | en |
dc.subject | Robotics | en |
dc.subject | Human behaviors | en |
dc.subject | Bayesian networks | en |
dc.subject | Bayesian | en |
dc.subject | Learning systems | en |
dc.subject | Machine learning techniques | en |
dc.subject | Inference engines | en |
dc.subject | Classifiers | en |
dc.subject | Web servers | en |
dc.subject | Bayesian classifiers | en |
dc.subject | Posterior probabilities | en |
dc.subject | Probabilistic model (PM) | en |
dc.subject | Probabilistic reasoning | en |
dc.subject | Reasoning approach | en |
dc.subject | Web crawler detection | en |
dc.subject | Web robot detection | en |
dc.subject | Web sessions | en |
dc.title | Web robot detection: A probabilistic reasoning approach | en |
dc.type | info:eu-repo/semantics/article | |
dc.identifier.doi | 10.1016/j.comnet.2008.09.021 | |
dc.description.volume | 53 | |
dc.description.issue | 3 | |
dc.description.startingpage | 265 | |
dc.description.endingpage | 278 | |
dc.author.faculty | 002 Σχολή Θετικών και Εφαρμοσμένων Επιστημών / Faculty of Pure and Applied Sciences | |
dc.author.department | Τμήμα Πληροφορικής / Department of Computer Science | |
dc.type.uhtype | Article | en |
dc.source.abbreviation | Comput.Networks | en |
dc.contributor.orcid | Dikaiakos, Marios D. [0000-0002-4350-6058] | |
dc.gnosis.orcid | 0000-0002-4350-6058 | |
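
The abstract describes classifying each session by the maximum posterior probability of the classes given the available evidence. The sketch below illustrates that decision rule only. It assumes a naive Bayes structure (the simplest form of Bayesian network, in which each evidence variable depends only on the class), and the evidence names `requested_robots_txt` and `high_image_ratio`, the priors, and all probability values are hypothetical placeholders. This record does not give the network structure or the learned parameters of the paper, so none of this should be read as the authors' model.

```python
from math import exp, log

# Hypothetical class priors (illustrative values, not taken from the paper).
PRIOR = {"human": 0.8, "crawler": 0.2}

# Hypothetical conditional probability tables: P(evidence value | class).
CPT = {
    "requested_robots_txt": {
        "human": {True: 0.01, False: 0.99},
        "crawler": {True: 0.60, False: 0.40},
    },
    "high_image_ratio": {
        "human": {True: 0.70, False: 0.30},
        "crawler": {True: 0.10, False: 0.90},
    },
}


def posterior(evidence):
    """Return P(class | evidence) under the naive Bayes assumption."""
    log_scores = {}
    for cls, prior in PRIOR.items():
        score = log(prior)
        for name, value in evidence.items():
            score += log(CPT[name][cls][value])
        log_scores[cls] = score
    # Normalize in a numerically stable way by subtracting the largest log score.
    top = max(log_scores.values())
    unnormalized = {cls: exp(s - top) for cls, s in log_scores.items()}
    total = sum(unnormalized.values())
    return {cls: v / total for cls, v in unnormalized.items()}


def classify(evidence):
    """Pick the class with the maximum posterior probability."""
    post = posterior(evidence)
    return max(post, key=post.get)


if __name__ == "__main__":
    session = {"requested_robots_txt": True, "high_image_ratio": False}
    print(classify(session), posterior(session))
```

In practice the conditional probability tables would be estimated from labeled sessions rather than written by hand, which corresponds to the parameter-learning step the abstract mentions.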