Show simple item record

dc.contributor.advisorChristodoulou, Chrisen
dc.contributor.advisorVassiliades, Vassilisen
dc.contributor.authorPastellas, Ioannisen
dc.coverage.spatialCyprusen
dc.creatorPastellas, Ioannisen
dc.date.accessioned2024-07-24T10:09:38Z
dc.date.available2024-07-24T10:09:38Z
dc.date.issued2024-06
dc.identifier.urihttp://gnosis.library.ucy.ac.cy/handle/7/66325en
dc.description.abstractOffline reinforcement learning (RL) has emerged as a promising approach for training intelligent agents without requiring real-time interaction with an environment, addressing a key limitation of traditional RL. This capability is particularly valuable in domains where direct interactions are dangerous or impractical, such as healthcare, finance, and hazardous environments. By leveraging offline RL, it is possible to create autonomous agents capable of deriving optimal policies from static datasets, thereby facilitating automation in diverse decision-making realms. This study explores the application of offline RL techniques in the context of World of Tanks, an online multiplayer tank combat game. We evaluated several offline RL algorithms, including Conservative Q-Learning (CQL), Implicit Q-Learning (IQL), Decision Transformer (DT), and Deep Deterministic Policy Gradient (DDPG). Our results indicate that CQL and IQL achieved significant returns under various discount factors, demonstrating robustness and adaptability in offline settings. Notably, higher discount factors led to better cumulative returns, particularly for CQL and IQL. Effective handling of data distribution shifts was crucial for algorithm robustness, with regularization techniques in CQL and modified architectures in IQL proving effective. In addition, offline RL algorithms (IQL, CQL, Decision Transformer) seem to perform better than the baselines ( Behavioral Cloning, policies from dataset). The volume of training data significantly influenced the performance of offline RL algorithms, with larger datasets enhancing learning effectiveness. However, evaluating offline RL policies remains challenging due to the lack of real-time interaction with the environment. We employed methods such as model-based dynamics and policy value estimation, despite their limitations in accurately predicting real-world performance. This study contributes to the methodology of offline RL research and suggests directions for future advancements.en
dc.language.isoengen
dc.publisherΠανεπιστήμιο Κύπρου, Σχολή Θετικών και Εφαρμοσμένων Επιστημών / University of Cyprus, Faculty of Pure and Applied Sciences
dc.rightsinfo:eu-repo/semantics/openAccessen
dc.rightsOpen Accessen
dc.rightsCC0 1.0 Universal*
dc.rights.urihttp://creativecommons.org/publicdomain/zero/1.0/*
dc.titleOffline Reinforcement Learning in World Of Tanksen
dc.title.alternativeOffline (χωρίς διάδραση με περιβάλλον) Ενισχυτική Mάθηση στο World Of Tanksel
dc.typeinfo:eu-repo/semantics/masterThesisen
dc.contributor.committeememberAristidou, Andreasen
dc.contributor.departmentΠανεπιστήμιο Κύπρου, Σχολή Θετικών και Εφαρμοσμένων Επιστημών, Τμήμα Πληροφορικήςel
dc.contributor.departmentUniversity of Cyprus, Faculty of Pure and Applied Sciences, Department of Computer Scienceen
dc.subject.uncontrolledtermΤΕΧΝΗΤΗ ΝΟΗΜΟΣΥΝΗel
dc.subject.uncontrolledtermARTIFICIAL INTELLIGENCEen
dc.subject.uncontrolledtermREINFORCEMENT LEARNINGen
dc.subject.uncontrolledtermMACHINE LEARNINGen
dc.subject.uncontrolledtermΕΝΙΣΧΥΤΙΚΗ ΜΑΘΗΣΗel
dc.subject.uncontrolledtermΜΗΧΑΝΙΚΗ ΜΑΘΗΣΗel
dc.author.facultyΣχολή Θετικών και Εφαρμοσμένων Επιστημών / Faculty of Pure and Applied Sciences
dc.author.departmentΤμήμα Πληροφορικής / Department of Computer Science
dc.type.uhtypeMaster Thesisen
dc.contributor.orcidPastellas, Ioannis [0000-0002-1193-6280]
dc.contributor.orcidChristodoulou, Chris [0000-0001-9398-5256]
dc.contributor.orcidVassiliades, Vassilis [0000-0002-1336-5629]
dc.contributor.orcidAristidou, Andreas [0000-0001-7754-0791]
dc.gnosis.orcid0000-0002-1193-6280
dc.gnosis.orcid0000-0001-9398-5256
dc.gnosis.orcid0000-0002-1336-5629
dc.gnosis.orcid0000-0001-7754-0791


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

info:eu-repo/semantics/openAccess
Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess