Moscow International School of Physics

Name: Moscow International School of Physics
Start: 2019-02-20T11:00:00+03:00
End: 2019-02-27T11:30:00+03:00
Location: HSE Study Center “Voronovo”

20-27 February 2019

HSE Study Center “Voronovo”

Europe/Moscow timezone

Organizing Committee

The master equation for the reinforcement learning

26 Feb 2019, 20:36

12m

HSE Study Center “Voronovo”

Voronovskoe, Moscow Russian Federation

Talk [10+2 min] Young Scientist Forum

Edgar Vardanyan (Yerevan Physics Institute, Yerevan State University)

We look the reinforcement learning dynamics. As the dynamics is a stochastic process, the adequate mathematical tool is the master equation. We introduce the probability distributions for the actions and value functions, then get a master equation, describing the reinforcement learning process. We derived a Hamilton-Jacobi equation for the latter equation. We verify a unique feature of the model (compared to the Master equation of the chemical reaction with few molecules or evolution models with finite population): the variance of distribution disappeared at the steady state, which gives a good credit for the application of the moment closing approximation. Our method (recursive equations) gives accurate expressions both for the mean and variance of variables, while HJE provides only correct results for the mean values. Looking the recursive equations, we express the value function distribution via the solution of a system of ordinary differential equations.

Edgar Vardanyan (Yerevan Physics Institute, Yerevan State University) Dr David Saakian (Yerevan Physics Institute) Dr Ricard Sole (Universitat Pompeu Fabra)

The master equation for the reinforcement learning.pptx

Moscow International School of Physics

Organizing Committee

The master equation for the reinforcement learning

HSE Study Center “Voronovo”

Speaker

Description

Primary authors

Presentation Materials

Your browser is out of date!