Distributed large-scale data collection in online social networks
Date
2017ISBN
978-1-5090-4607-2Publisher
Institute of Electrical and Electronics Engineers Inc.Source
Proceedings - 2016 IEEE 2nd International Conference on Collaboration and Internet Computing, IEEE CIC 20162nd IEEE International Conference on Collaboration and Internet Computing, IEEE CIC 2016
Pages
373-380Google Scholar check
Keyword(s):
Metadata
Show full item recordAbstract
The popularity and huge amount of information published in Online Social Networks (OSN) established them as one of the main data sources for a variety of research community fields. However, the design of a large-scale dataset collection campaign is a major problem for organizations and researchers who aim in addressing their research questions by analyzing this type of data. OSN platforms provide Application Programming Interfaces (API) to third party developers, which enable them to retrieve and use this data for applications deployment. However, due to OSN imposed limitations, the process of retrieving large scale data with the use of these APIs is challenging and time consuming, resulting in datasets which are either incomplete or outdated. It is relatively impossible for an individual scientist or research group to follow an efficient dataset collection procedure and build a large sample in a short amount of time. In this paper we present a framework for efficient crowd crawling of OSN. Our framework is based on the use of multiple OSN accounts, which are engaged in an efficient distributed collection process able to circumvent the imposed limitations without violating the terms of use. We present an evaluation of the proposed solution and demonstrate its performance in terms of dataset completeness and timeliness, for the case study of Twitter, one of the most popular platforms used in research. © 2016 IEEE.
Collections
Cite as
Related items
Showing items related by title, author, creator and subject.
-
Conference Object
SELECT: A Distributed Publish/Subscribe Notification System for Online Social Networks
Apolónia, Nuno; Antaris, Stefanos; Girdzijauskas, Sarunas; Pallis, George; Dikaiakos, Marios (2018)Publish/subscribe (pub/sub) mechanisms constitute an attractive communication paradigm in the design of large-scale notification systems for Online Social Networks (OSNs). To accommodate the large-scale workloads of ...
-
Conference Object
Lessons learned from online social networking of physical things
Kamilaris, Andreas; Papadiomidous, D.; Pitsillides, Andreas (2011)Social networking is a core part of the global online experience. The Web 2.0 has been transformed into a social Web, extending the social capabilities of users. A big challenge for the Web is to become ubiquitous, blended ...
-
Book Chapter
On the Impact of Online Social Networks in Content Delivery
Kilanioti, Irene; Georgiou, Chryssis; Pallis, George C. (Wiley Blackwell, 2014)This chapter presents the existing approaches that can be leveraged for the scaling of rich media content in content delivery networks (CDNs) using information from online social networks (OSNs). It also presents a taxonomy ...