ODT - THESIS TOPIC: Bálint Gyires-Tóth: Deep Reinforcement Learning based ...

Deep Reinforcement Learning based Optimization Methods

THESIS TOPIC PROPOSAL

Institute: Budapest University of Technology and Economics
electrical engineering
Doctoral School of Electrical Engineering

Thesis supervisor: Bálint Gyires-Tóth
Location of studies (in Hungarian): Department of Telecommunications and Media Informatics
Abbreviation of location of studies: TMIT

Description of the research topic:

Deep neural networks have proven to effectively solve particular problems when large amount of high quality training data is available. Training supervised deep learning models utilizes mathematical operations to minimize or maximize an objective function, such as mean squared error or cross entropy. However, these models are unable to develop complex strategies. In contrast, by defining an environment, possible actions in this environment and by assigning rewards and penalties to these actions in different states reinforcement learning is capable of elaborating novel strategies indirectly.
Deep learning paradigm showed outstanding results in reinforcement learning. Recently a number of deep reinforcement learning strategies (like Deep Deterministic Policy Gradient Asynchronous Advantage Actor-Critic; Actor Critic using Kronecker-Factored Trust Region; etc. methods) evolved that achieve faster convergence than the baseline, Deep Q Network algorithm. Still, deep reinforcement learning faces all the difficulties of deep neural network design (data representation, hyperparameter optimization, etc.) and it requires a proper reward function design as well.
The goal of this PhD research is to elaborate novel algorithms towards establishing complex and flexible strategies and understanding the possible current and future contexts and outcomes as well.
The effectiveness of the elaborated method must be proven at least in one application scenario. Such an application scenario can be (1) intelligent chatbot, (2) model design, (3) robotics, etc.
The research can be conducted both in English and in Hungarian. For training the models public and private databases and high performance GPUs are available.

The possible research tasks of the PhD student are the following:
- Overview the related scientific papers, including the basic deep neural network elements, the basics and the recent results of reinforcement learning, focusing on deep reinforcement learning.
- Design and implement baseline methods of deep reinforcement learning in various application scenarios. Elaborate a simulation environment.
- Conduct research on the potentials of deep reinforcement learning algorithms.
- Propose novel method(s) that elaborates complex and flexible strategies. Take into consideration the possible contexts and outcomes.
- Demonstrate the effectiveness of the results at least in one application scenario.
- Objective and subjective evaluation.

Required language skills: english
Number of students who can be accepted: 1

Deadline for application: 2018-07-30