A parallel implementation of Q-learning based on communication with cache

Q-Learning is a Reinforcement Learning method for solving sequential decision problems, where the utility of actions depends on a sequence of decisions and there exists uncertainty about the dynamics of the environment the agent is situated on. This general framework has allowed that Q-Learning and...

Descripción completa

Guardado en:

Detalles Bibliográficos
Autores principales:	Printista, Alicia Marcela, Errecalde, Marcelo Luis, Montoya, Cecilia Inés
Formato:	Articulo
Lenguaje:	Inglés
Publicado:	2002
Materias:	Ciencias Informáticas Parallel programming Redes de Comunicación de Computadores Informática Aprendizaje communication based on cache reinforcement learning asynchronous dynamic programming
Acceso en línea:	http://sedici.unlp.edu.ar/handle/10915/9432 http://journal.info.unlp.edu.ar/wp-content/uploads/p41.pdf
Aporte de:	SEDICI (UNLP) de Universidad Nacional de La Plata

id	I19-R120-10915-9432
record_format	dspace
institution	Universidad Nacional de La Plata
institution_str	I-19
repository_str	R-120
collection	SEDICI (UNLP)
language	Inglés
topic	Ciencias Informáticas Parallel programming Redes de Comunicación de Computadores Informática Aprendizaje communication based on cache reinforcement learning asynchronous dynamic programming
spellingShingle	Ciencias Informáticas Parallel programming Redes de Comunicación de Computadores Informática Aprendizaje communication based on cache reinforcement learning asynchronous dynamic programming Printista, Alicia Marcela Errecalde, Marcelo Luis Montoya, Cecilia Inés A parallel implementation of Q-learning based on communication with cache
topic_facet	Ciencias Informáticas Parallel programming Redes de Comunicación de Computadores Informática Aprendizaje communication based on cache reinforcement learning asynchronous dynamic programming
description	Q-Learning is a Reinforcement Learning method for solving sequential decision problems, where the utility of actions depends on a sequence of decisions and there exists uncertainty about the dynamics of the environment the agent is situated on. This general framework has allowed that Q-Learning and other Reinforcement Learning methods to be applied to a broad spectrum of complex real world problems such as robotics, industrial manufacturing, games and others. Despite its interesting properties, Q-learning is a very slow method that requires a long period of training for learning an acceptable policy. In order to solve or at least reduce this problem, we propose a parallel implementation model of Q-learning using a tabular representation and via a communication scheme based on cache. This model is applied to a particular problem and the results obtained with different processor configurations are reported. A brief discussion about the properties and current limitations of our approach is finally presented.
format	Articulo Articulo
author	Printista, Alicia Marcela Errecalde, Marcelo Luis Montoya, Cecilia Inés
author_facet	Printista, Alicia Marcela Errecalde, Marcelo Luis Montoya, Cecilia Inés
author_sort	Printista, Alicia Marcela
title	A parallel implementation of Q-learning based on communication with cache
title_short	A parallel implementation of Q-learning based on communication with cache
title_full	A parallel implementation of Q-learning based on communication with cache
title_fullStr	A parallel implementation of Q-learning based on communication with cache
title_full_unstemmed	A parallel implementation of Q-learning based on communication with cache
title_sort	parallel implementation of q-learning based on communication with cache
publishDate	2002
url	http://sedici.unlp.edu.ar/handle/10915/9432 http://journal.info.unlp.edu.ar/wp-content/uploads/p41.pdf
work_keys_str_mv	AT printistaaliciamarcela aparallelimplementationofqlearningbasedoncommunicationwithcache AT errecaldemarceloluis aparallelimplementationofqlearningbasedoncommunicationwithcache AT montoyaceciliaines aparallelimplementationofqlearningbasedoncommunicationwithcache AT printistaaliciamarcela parallelimplementationofqlearningbasedoncommunicationwithcache AT errecaldemarceloluis parallelimplementationofqlearningbasedoncommunicationwithcache AT montoyaceciliaines parallelimplementationofqlearningbasedoncommunicationwithcache
bdutipo_str	Repositorios
_version_	1764820491166547969

A parallel implementation of Q-learning based on communication with cache

Ejemplares similares