Sadegh Talebi
Tenure Track Assistant Professor
Machine Learning
Universitetsparken 1
2100 København Ø
- 2021
- Published
Improved Exploration in Factored Average-Reward MDPs
Talebi, Mohammad Sadegh, Jonsson, A. & Maillard, O., 2021, Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS). PMLR, p. 3988-3996 (Proceedings of Machine Learning Research, Vol. 130).
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Most downloads
- Adversarial Bandits with Corruptions (48 downloads)
  Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
  Published
- Tightening Exploration in Upper Confidence Reinforcement Learning (39 downloads)
  Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
  Published
- Scaling Up Q-Learning via Exploiting State–Action Equivalence (18 downloads)
  Research output: Contribution to journal › Journal article › peer-review
  Published