Show simple item record

dc.contributor.authorEfstathiades, Haritonen
dc.contributor.authorAntoniades, Demetrisen
dc.contributor.authorPallis, George C.en
dc.contributor.authorDikaiakos, Marios D.en
dc.creatorEfstathiades, Haritonen
dc.creatorAntoniades, Demetrisen
dc.creatorPallis, George C.en
dc.creatorDikaiakos, Marios D.en
dc.date.accessioned2019-11-13T10:39:58Z
dc.date.available2019-11-13T10:39:58Z
dc.date.issued2017
dc.identifier.isbn978-1-5090-4607-2
dc.identifier.urihttp://gnosis.library.ucy.ac.cy/handle/7/53889
dc.description.abstractThe popularity and huge amount of information published in Online Social Networks (OSN) established them as one of the main data sources for a variety of research community fields. However, the design of a large-scale dataset collection campaign is a major problem for organizations and researchers who aim in addressing their research questions by analyzing this type of data. OSN platforms provide Application Programming Interfaces (API) to third party developers, which enable them to retrieve and use this data for applications deployment. However, due to OSN imposed limitations, the process of retrieving large scale data with the use of these APIs is challenging and time consuming, resulting in datasets which are either incomplete or outdated. It is relatively impossible for an individual scientist or research group to follow an efficient dataset collection procedure and build a large sample in a short amount of time. In this paper we present a framework for efficient crowd crawling of OSN. Our framework is based on the use of multiple OSN accounts, which are engaged in an efficient distributed collection process able to circumvent the imposed limitations without violating the terms of use. We present an evaluation of the proposed solution and demonstrate its performance in terms of dataset completeness and timeliness, for the case study of Twitter, one of the most popular platforms used in research. © 2016 IEEE.en
dc.publisherInstitute of Electrical and Electronics Engineers Inc.en
dc.sourceProceedings - 2016 IEEE 2nd International Conference on Collaboration and Internet Computing, IEEE CIC 2016en
dc.source2nd IEEE International Conference on Collaboration and Internet Computing, IEEE CIC 2016en
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85013188444&doi=10.1109%2fCIC.2016.056&partnerID=40&md5=a51af2e9b55f8dcbf071583750b625d1
dc.subjectAmount of informationen
dc.subjectData acquisitionen
dc.subjectApplication programming interfaces (API)en
dc.subjectSocial networking (online)en
dc.subjectResearch communitiesen
dc.subjectOn-line social networksen
dc.subjectOnline social networksen
dc.subjectDistributed collectionsen
dc.subjectDistributed dataen
dc.subjectDistributed data collection frameworken
dc.subjectLarge-scale dataseten
dc.subjectLarge-scale dataset collectionen
dc.subjectOnline social networks (OSN)en
dc.subjectResearch questionsen
dc.titleDistributed large-scale data collection in online social networksen
dc.typeinfo:eu-repo/semantics/conferenceObject
dc.identifier.doi10.1109/CIC.2016.056
dc.description.startingpage373
dc.description.endingpage380
dc.author.faculty002 Σχολή Θετικών και Εφαρμοσμένων Επιστημών / Faculty of Pure and Applied Sciences
dc.author.departmentΤμήμα Πληροφορικής / Department of Computer Science
dc.type.uhtypeConference Objecten
dc.description.notes<p>Sponsors: Drexel Universityen
dc.description.notesRMITen
dc.description.notesUniversity of Pittsburghen
dc.description.notesUniversity Zhejiang Universityen
dc.description.notesConference code: 125951</p>en
dc.contributor.orcidPallis, George C. [0000-0003-1815-5468]
dc.contributor.orcidDikaiakos, Marios D. [0000-0002-4350-6058]
dc.gnosis.orcid0000-0003-1815-5468
dc.gnosis.orcid0000-0002-4350-6058


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record