ODT - témakiírás: Csáji Balázs Csanád: Reinforcement learning

Reinforcement learning

TÉMAKIÍRÁS

Intézmény: Budapesti Műszaki és Gazdaságtudományi Egyetem
matematika- és számítástudományok
Matematika- és Számítástudományok Doktori Iskola

témavezető: Csáji Balázs Csanád
helyszín (magyar oldal): SZTAKI(Institute for Computer Science and Control)
helyszín rövidítés: BME

A kutatási téma leírása:

Reinforcement learning (RL) is one the main branchesof machine learning and it deals with the problem of learning from sequential interactions with an uncertain, dynamic environment based on feedbacks(e.g., states and immediate costs). Markov decision processes (MDPs) constitute the main mathematical background ofRL.However, unlike in classical MDP studies, in RL the model of the system is typically unavailable, therefore, the dynamicsand the costs have tobe learned (estimated) while the decision makertries to workefficiently. These two goals(exploring the environment and exploiting the information gathered so far) are working against each other leading to the fundamental problem of exploration vs exploitation(estimation vs control). Theoretical support for classical RL methods, such as Q-learning and TD(lambda), are usuallyasymptotic and presuppose eitheratabular representation of the value function or a linear function approximation. Novelchallenges in RL include providing methodswith non-asymptotic (and distribution-free) guarantees, handling partial observability andchanging environments, as well asstudying the notorious exploration-exploitation trade-off (even in simplified problems, such as multi-armed-or contextual bandits). Distributed RL methods is another possibleresearch direction. The theory of stochastic approximation(especially in Markovian environments)andvariousdistribution-free statistical methods are of high importance to provide guarantees for RL.

előírt nyelvtudás: angol
további elvárások:
Solid background in probability and statistics, programming skills (e.g., Matlab, Python)

felvehető hallgatók száma: 1

Jelentkezési határidő: 2024-05-31