Sadegh Talebi
Tenure Track Assistant Professor
Machine Learning
Universitetsparken 1
2100 København Ø
11 - 11 out of 11Page size: 10
- 2020
- Published
Tightening Exploration in Upper Confidence Reinforcement Learning
Bourel, H., Maillard, O. & Talebi, Mohammad Sadegh, 2020, Proceedings of the 37th International Conference on Machine Learning. PMLR, p. 1056-1066 (Proceedings of Machine Learning Research, Vol. 119).Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
ID: 235125478
Most downloads
-
51
downloads
Adversarial Bandits with Corruptions
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Published -
40
downloads
Tightening Exploration in Upper Confidence Reinforcement Learning
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Published -
22
downloads
Scaling Up Q-Learning via Exploiting State–Action Equivalence
Research output: Contribution to journal › Journal article › Research › peer-review
Published