An extension of a hierarchical reinforcement learning algorithm for multiagent settings

This paper compares and investigates single-agent reinforcement learning (RL) algorithms on the simple and an extended taxi problem domain, and multiagent RL algorithms on a multiagent extension of the simple taxi problem domain we created. In particular, we extend the Policy Hill Climbing (PHC) and the Win or Learn Fast-PHC (WoLF-PHC) algorithms by combining them with the MAXQ hierarchical decomposition and investigate their efficiency. The results are very promising for the multiagent domain as they indicate that these two newly-created algorithms are the most efficient ones from the algorithms we compared. © 2012 Springer-Verlag.

An extension of a hierarchical reinforcement learning algorithm for multiagent settings

Date

Author

ISSN

Source

Volume

Pages

Keyword(s):

Metadata

Abstract

Links

DOI

URI

Collections

Cite as APAVancouverHarvardBibTeX

Related items

Spiking neural networks with different reinforcement learning (RL) schemes in a multiagent setting ﻿

Charging Policies for PHEVs used for Service Delivery: A Reinforcement Learning Approach ﻿

Multiagent reinforcement learning: Spiking and nonspiking agents in the Iterated Prisoner's Dilemma ﻿

Cite as

Spiking neural networks with different reinforcement learning (RL) schemes in a multiagent setting

Charging Policies for PHEVs used for Service Delivery: A Reinforcement Learning Approach

Multiagent reinforcement learning: Spiking and nonspiking agents in the Iterated Prisoner's Dilemma