Improving interactive reinforcement learning: What makes a good teacher?

Interactive reinforcement learning has become an important apprenticeship approach to speed up convergence in classic reinforcement learning problems. In this regard, a variant of interactive reinforcement learning is policy shaping which uses a parent-like trainer to propose the next action to be p...

Descripción completa

Guardado en:

Detalles Bibliográficos
Autores principales:	Cruz, Francisco, Magg, Sven, Naga, Yukie, Wermter, Stefan
Formato:	Objeto de conferencia Resumen
Lenguaje:	Inglés
Publicado:	2018
Materias:	Ciencias Informáticas interactive reinforcement learning policy shape artificial trainer-agent cleaning scenario
Acceso en línea:	http://sedici.unlp.edu.ar/handle/10915/70699 http://47jaiio.sadio.org.ar/sites/default/files/ASAI-09.pdf
Aporte de:	SEDICI (UNLP) de Universidad Nacional de La Plata

id	I19-R120-10915-70699
record_format	dspace
institution	Universidad Nacional de La Plata
institution_str	I-19
repository_str	R-120
collection	SEDICI (UNLP)
language	Inglés
topic	Ciencias Informáticas interactive reinforcement learning policy shape artificial trainer-agent cleaning scenario
spellingShingle	Ciencias Informáticas interactive reinforcement learning policy shape artificial trainer-agent cleaning scenario Cruz, Francisco Magg, Sven Naga, Yukie Wermter, Stefan Improving interactive reinforcement learning: What makes a good teacher?
topic_facet	Ciencias Informáticas interactive reinforcement learning policy shape artificial trainer-agent cleaning scenario
description	Interactive reinforcement learning has become an important apprenticeship approach to speed up convergence in classic reinforcement learning problems. In this regard, a variant of interactive reinforcement learning is policy shaping which uses a parent-like trainer to propose the next action to be performed and by doing so reduces the search space by advice. On some occasions, the trainer may be another artificial agent which in turn was trained using reinforcement learning methods to afterward becoming an advisor for other learner-agents. In this work, we analyze internal representations and characteristics of artificial agents to determine which agent may outperform others to become a better trainer-agent. Using a polymath agent, as compared to a specialist agent, an advisor leads to a larger reward and faster convergence of the reward signal and also to a more stable behavior in terms of the state visit frequency of the learner-agents. Moreover, we analyze system interaction parameters in order to determine how influential they are in the apprenticeship process, where the consistency of feedback is much more relevant when dealing with different learner obedience parameters.
format	Objeto de conferencia Resumen
author	Cruz, Francisco Magg, Sven Naga, Yukie Wermter, Stefan
author_facet	Cruz, Francisco Magg, Sven Naga, Yukie Wermter, Stefan
author_sort	Cruz, Francisco
title	Improving interactive reinforcement learning: What makes a good teacher?
title_short	Improving interactive reinforcement learning: What makes a good teacher?
title_full	Improving interactive reinforcement learning: What makes a good teacher?
title_fullStr	Improving interactive reinforcement learning: What makes a good teacher?
title_full_unstemmed	Improving interactive reinforcement learning: What makes a good teacher?
title_sort	improving interactive reinforcement learning: what makes a good teacher?
publishDate	2018
url	http://sedici.unlp.edu.ar/handle/10915/70699 http://47jaiio.sadio.org.ar/sites/default/files/ASAI-09.pdf
work_keys_str_mv	AT cruzfrancisco improvinginteractivereinforcementlearningwhatmakesagoodteacher AT maggsven improvinginteractivereinforcementlearningwhatmakesagoodteacher AT nagayukie improvinginteractivereinforcementlearningwhatmakesagoodteacher AT wermterstefan improvinginteractivereinforcementlearningwhatmakesagoodteacher
bdutipo_str	Repositorios
_version_	1764820481696858112

Improving interactive reinforcement learning: What makes a good teacher?

Ejemplares similares