Sadegh Talebi

Tenure Track Assistant Professor

2020
Published
Adversarial Bandits with Corruptions
Yang, L., Hajiesmaili, M. H., Talebi, Mohammad Sadegh, Lui, J. C. S. & Wong, W. S., 2020, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. NeurIPS Proceedings, 10 p. (Advances in Neural Information Processing Systems, Vol. 33).
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Published
Bandit-based relay selection in cooperative networks over unknown stationary channels
Nomikos, N., Talebi, Mohammad Sadegh, Wichman, R. & Charalambous, T., 2020, Proceedings of the 2020 IEEE 30th International Workshop on Machine Learning for Signal Processing, MLSP 2020. IEEE, 9231604
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Published
Tightening Exploration in Upper Confidence Reinforcement Learning
Bourel, H., Maillard, O. & Talebi, Mohammad Sadegh, 2020, Proceedings of the 37th International Conference on Machine Learning. PMLR, p. 1056-1066 (Proceedings of Machine Learning Research, Vol. 119).
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review

49 downloads
Adversarial Bandits with Corruptions
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Published
40 downloads
Tightening Exploration in Upper Confidence Reinforcement Learning
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Published
20 downloads
Scaling Up Q-Learning via Exploiting State–Action Equivalence
Research output: Contribution to journal › Journal article › Research › peer-review
Published