Agent behavior monitoring using optimal action selection and twin gaussian processes
The increasing trend towards delegating complex tasks to autonomous artificial agents in safety-critical socio-technical systems makes agent behavior monitoring of paramount importance. In this work, a probabilistic approach for on-line monitoring using optimal action selection and twin Gaussian pro...
Guardado en:
| Autores principales: | , |
|---|---|
| Formato: | Objeto de conferencia |
| Lenguaje: | Inglés |
| Publicado: |
2014
|
| Materias: | |
| Acceso en línea: | http://sedici.unlp.edu.ar/handle/10915/41659 http://43jaiio.sadio.org.ar/proceedings/ASAI/1.pdf |
| Aporte de: |
| id |
I19-R120-10915-41659 |
|---|---|
| record_format |
dspace |
| institution |
Universidad Nacional de La Plata |
| institution_str |
I-19 |
| repository_str |
R-120 |
| collection |
SEDICI (UNLP) |
| language |
Inglés |
| topic |
Ciencias Informáticas agent monitoring gaussian processes optimal selection |
| spellingShingle |
Ciencias Informáticas agent monitoring gaussian processes optimal selection Avila, Luis Martínez, Ernesto Agent behavior monitoring using optimal action selection and twin gaussian processes |
| topic_facet |
Ciencias Informáticas agent monitoring gaussian processes optimal selection |
| description |
The increasing trend towards delegating complex tasks to autonomous artificial agents in safety-critical socio-technical systems makes agent behavior monitoring of paramount importance. In this work, a probabilistic approach for on-line monitoring using optimal action selection and twin Gaussian processes (TGP) is proposed. A Kullback-Leibler (KL) based metric is proposed to characterize the deviation of an agent behavior (modeled as a controlled stochastic process) to its specification. The optimal behavior specification is obtained using Linearly Solvable Markov Decision Processes (LSMDP) whereby the Bellman equation is made linear through an exponential transformation such that the optimal control policy is obtained in an explicit form. |
| format |
Objeto de conferencia Objeto de conferencia |
| author |
Avila, Luis Martínez, Ernesto |
| author_facet |
Avila, Luis Martínez, Ernesto |
| author_sort |
Avila, Luis |
| title |
Agent behavior monitoring using optimal action selection and twin gaussian processes |
| title_short |
Agent behavior monitoring using optimal action selection and twin gaussian processes |
| title_full |
Agent behavior monitoring using optimal action selection and twin gaussian processes |
| title_fullStr |
Agent behavior monitoring using optimal action selection and twin gaussian processes |
| title_full_unstemmed |
Agent behavior monitoring using optimal action selection and twin gaussian processes |
| title_sort |
agent behavior monitoring using optimal action selection and twin gaussian processes |
| publishDate |
2014 |
| url |
http://sedici.unlp.edu.ar/handle/10915/41659 http://43jaiio.sadio.org.ar/proceedings/ASAI/1.pdf |
| work_keys_str_mv |
AT avilaluis agentbehaviormonitoringusingoptimalactionselectionandtwingaussianprocesses AT martinezernesto agentbehaviormonitoringusingoptimalactionselectionandtwingaussianprocesses |
| bdutipo_str |
Repositorios |
| _version_ |
1764820472775573507 |