dc.contributor.advisor | Kyrkou, Christos | en |
dc.contributor.advisor | Theocharides, Theocharis | en |
dc.contributor.author | Telegraph, Kristina | en |
dc.coverage.spatial | Cyprus | en |
dc.creator | Telegraph, Kristina | en |
dc.date.accessioned | 2023-07-05T06:08:26Z | |
dc.date.available | 2023-07-05T06:08:26Z | |
dc.date.issued | 2023-06-01 | |
dc.identifier.uri | http://gnosis.library.ucy.ac.cy/handle/7/65562 | en |
dc.description.abstract | Image object detection has shown tremendous success in recent years, leading to its adaptation to the domain of video. However, the major advancements based on single-shot deep learning models process single frames individually. Hence, relying on spatial information alone can be problematic in cases where there are occlusions, blurred/unclear background, lack of information in low-resolution, and changing lighting conditions, all of which are common occurrences in transportation monitoring applications. Overcoming these problems necessitates incorporating both spatial and temporal information into the detection process. To address this challenge, several spatiotemporal detection models were investigated, which used sequences of video frames and explicit motion cues to build better representations of the scene context. First, a representative custom dataset of video sequences of aerial road network footage from an unmanned aerial vehicle was collected and annotated with three vehicle classes, to be used for model training and validation. Then, different spatiotemporal models were developed and incorporated into the YOLO framework. Overall, the spatiotemporal models show significant improvement in results, with the best model showing a mean average precision (mAP50) of 83.1% for all classes, which is a 16.22% improvement over its corresponding single frame model. The addition of attention mechanisms to the spatiotemporal models’ architecture was also explored. Inference tests were carried out to perform qualitative and inference speed comparisons. Finally, it was concluded that the addition of temporal information to deep learning object detectors is in fact an effective approach to improve vehicle detection in aerial video data. | en |
dc.description.sponsorship | KIOS Research and Innovation Center of Excellence | en |
dc.language.iso | eng | en |
dc.publisher | Πανεπιστήμιο Κύπρου, Πολυτεχνική Σχολή / University of Cyprus, Faculty of Engineering | |
dc.rights | info:eu-repo/semantics/openAccess | en |
dc.rights | Open Access | en |
dc.title | Enhancing aerial vehicle detection in transportation monitoring using spatiotemporal object detection models | en |
dc.type | info:eu-repo/semantics/masterThesis | en |
dc.contributor.committeemember | Kyrkou, Christos | en |
dc.contributor.committeemember | Theocharides, Theocharis | en |
dc.contributor.committeemember | Michael, Maria | en |
dc.contributor.committeemember | Timotheou, Stelios | en |
dc.contributor.department | Τμήμα Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών / Department of Electrical and Computer Engineering | |
dc.subject.uncontrolledterm | OBJECT DETECTION | en |
dc.subject.uncontrolledterm | SPATIOTEMPORAL DETECTION | en |
dc.subject.uncontrolledterm | DEEP LEARNING | en |
dc.subject.uncontrolledterm | ATTENTION | en |
dc.subject.uncontrolledterm | COMPUTER VISION | en |
dc.subject.uncontrolledterm | MACHINE LEARNING | en |
dc.author.faculty | Πολυτεχνική Σχολή / Faculty of Engineering | |
dc.author.department | Τμήμα Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών / Department of Electrical and Computer Engineering | |
dc.type.uhtype | Master Thesis | en |
dc.contributor.orcid | Kyrkou, Christos [0000-0002-7926-7642] | |
dc.contributor.orcid | Theocharides, Theocharis [0000-0001-7222-9152] | |
dc.gnosis.orcid | 0000-0002-7926-7642 | |
dc.gnosis.orcid | 0000-0001-7222-9152 | |