Dynamic Programming Principle for tug-of-war games with noise

We consider a two-player zero-sum-game in a bounded open domain Ω described as follows: at a point x Ω, Players I and II play an ε-step tug-of-war game with probability α, and with probability β (α + β = 1), a random point in the ball of radius ε centered at x is chosen. Once the game position reach...

Descripción completa

Detalles Bibliográficos
Autor principal:	Rossi, Julio Daniel
Publicado:	2012
Materias:	Dirichlet boundary conditions Dynamic Programming Principle P-Laplacian Stochastic games Two-player zero-sum games Dirichlet boundary condition Stochastic game Zero-sum game Boundary conditions Dynamic programming
Acceso en línea:	https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_12928119_v18_n1_p81_Manfredi http://hdl.handle.net/20.500.12110/paper_12928119_v18_n1_p81_Manfredi
Aporte de:	Biblioteca Digital - Facultad de Ciencias Exactas y Naturales (UBA) de Universidad de Buenos Aires

id	paper:paper_12928119_v18_n1_p81_Manfredi
record_format	dspace
spelling	paper:paper_12928119_v18_n1_p81_Manfredi2023-06-08T16:10:14Z Dynamic Programming Principle for tug-of-war games with noise Rossi, Julio Daniel Dirichlet boundary conditions Dynamic Programming Principle P-Laplacian Stochastic games Two-player zero-sum games Dirichlet boundary condition Dynamic Programming Principle P-Laplacian Stochastic game Zero-sum game Boundary conditions Dynamic programming We consider a two-player zero-sum-game in a bounded open domain Ω described as follows: at a point x Ω, Players I and II play an ε-step tug-of-war game with probability α, and with probability β (α + β = 1), a random point in the ball of radius ε centered at x is chosen. Once the game position reaches the boundary, Player II pays Player I the amount given by a fixed payoff function F. We give a detailed proof of the fact that the value functions of this game satisfy the Dynamic Programming Principle\\begin{equation} u(x) = α{2} in ol Bε(x) u (y) + y in ol Bε(x) u (y) + β kint Bε(x) u(y) ud y, end equation for x Ω with u(y) = F(y) when y Ω. This principle implies the existence of quasioptimal Markovian strategies. © 2010 EDP Sciences, SMAI. Fil:Rossi, J.D. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales; Argentina. 2012 https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_12928119_v18_n1_p81_Manfredi http://hdl.handle.net/20.500.12110/paper_12928119_v18_n1_p81_Manfredi
institution	Universidad de Buenos Aires
institution_str	I-28
repository_str	R-134
collection	Biblioteca Digital - Facultad de Ciencias Exactas y Naturales (UBA)
topic	Dirichlet boundary conditions Dynamic Programming Principle P-Laplacian Stochastic games Two-player zero-sum games Dirichlet boundary condition Dynamic Programming Principle P-Laplacian Stochastic game Zero-sum game Boundary conditions Dynamic programming
spellingShingle	Dirichlet boundary conditions Dynamic Programming Principle P-Laplacian Stochastic games Two-player zero-sum games Dirichlet boundary condition Dynamic Programming Principle P-Laplacian Stochastic game Zero-sum game Boundary conditions Dynamic programming Rossi, Julio Daniel Dynamic Programming Principle for tug-of-war games with noise
topic_facet	Dirichlet boundary conditions Dynamic Programming Principle P-Laplacian Stochastic games Two-player zero-sum games Dirichlet boundary condition Dynamic Programming Principle P-Laplacian Stochastic game Zero-sum game Boundary conditions Dynamic programming
description	We consider a two-player zero-sum-game in a bounded open domain Ω described as follows: at a point x Ω, Players I and II play an ε-step tug-of-war game with probability α, and with probability β (α + β = 1), a random point in the ball of radius ε centered at x is chosen. Once the game position reaches the boundary, Player II pays Player I the amount given by a fixed payoff function F. We give a detailed proof of the fact that the value functions of this game satisfy the Dynamic Programming Principle\\begin{equation} u(x) = α{2} in ol Bε(x) u (y) + y in ol Bε(x) u (y) + β kint Bε(x) u(y) ud y, end equation for x Ω with u(y) = F(y) when y Ω. This principle implies the existence of quasioptimal Markovian strategies. © 2010 EDP Sciences, SMAI.
author	Rossi, Julio Daniel
author_facet	Rossi, Julio Daniel
author_sort	Rossi, Julio Daniel
title	Dynamic Programming Principle for tug-of-war games with noise
title_short	Dynamic Programming Principle for tug-of-war games with noise
title_full	Dynamic Programming Principle for tug-of-war games with noise
title_fullStr	Dynamic Programming Principle for tug-of-war games with noise
title_full_unstemmed	Dynamic Programming Principle for tug-of-war games with noise
title_sort	dynamic programming principle for tug-of-war games with noise
publishDate	2012
url	https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_12928119_v18_n1_p81_Manfredi http://hdl.handle.net/20.500.12110/paper_12928119_v18_n1_p81_Manfredi
work_keys_str_mv	AT rossijuliodaniel dynamicprogrammingprinciplefortugofwargameswithnoise
_version_	1768546034237046784

Dynamic Programming Principle for tug-of-war games with noise

Ejemplares similares