Sadegh Talebi

Sadegh Talebi

Tenure Track Assistant Professor


Publication year:
  1. 2024
  2. Published

    Logarithmic regret in communicating MDPs: Leveraging known dynamics with bandits

    Saber, H., Pesquerel, F., Maillard, O. & Talebi, Mohammad Sadegh, 2024, Proceedings of the 15th Asian Conference on Machine Learning. PMLR, p. 1167-1182 (Proceedings of Machine Learning Research, Vol. 222).

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

  3. E-pub ahead of print

    Scaling Power Management in Cloud Data Centers: A Multi-Level Continuous-Time MDP Approach

    Chitsaz, B., Khonsari, A., Moradian, M., Dadlani, A. & Talebi, Mohammad Sadegh, 2024, (E-pub ahead of print) In: IEEE Transactions on Services Computing. 12 p.

    Research output: Contribution to journalJournal articlepeer-review

  4. 2023
  5. Published

    Double Graph Attention Networks for Visual Semantic Navigation

    Lyu, Y. & Talebi, Mohammad Sadegh, 2023, In: Neural Processing Letters. 55, 7, p. 9019-9040

    Research output: Contribution to journalJournal articlepeer-review

  6. Published

    Exploration in Reward Machines with Low Regret

    Bourel, Hippolyte Raymond, Jonsson, A., Maillard, O. A. & Talebi, Mohammad Sadegh, 2023, Proceedings of The 26th International Conference on Artificial Intelligence and Statistics. PMLR, Vol. 206. p. 4114-4146 33 p. (Proceedings of Machine Learning Research, Vol. 206).

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

  7. Published

    Provably Efficient Offline Reinforcement Learning in Regular Decision Processes

    Cipollone, R., Jonsson, A., Ronca, A. & Talebi, Mohammad Sadegh, 2023, Advances in Neural Information Processing Systems 36 (NeurIPS 2023). NeurIPS Proceedings, 34 p. (Advances in Neural Information Processing Systems, Vol. 36).

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

  8. Published

    Scaling Up Q-Learning via Exploiting State–Action Equivalence

    Lyu, Y., Côme, A., Zhang, Yijie & Talebi, Mohammad Sadegh, 2023, In: Entropy. 25, 4, 584.

    Research output: Contribution to journalJournal articlepeer-review

  9. 2022
  10. Published

    Bandit-Based Power Control in Full-Duplex Cooperative Relay Networks with Strict-Sense Stationary and Non-Stationary Wireless Communication Channels

    Nomikos, N., Talebi, Mohammad Sadegh, Charalambous, T. & Wichman, R., 2022, In: IEEE Open Journal of the Communications Society. 3, p. 366-378 13 p.

    Research output: Contribution to journalJournal articlepeer-review

  11. 2021
  12. Published

    Improved Exploration in Factored Average-Reward MDPs

    Talebi, Mohammad Sadegh, Jonsson, A. & Maillard, O., 2021, Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS). PMLR, p. 3988-3996 (Proceedings of Machine Learning Research, Vol. 130).

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

  13. 2020
  14. Published

    Adversarial Bandits with Corruptions

    Yang, L., Hajiesmaili, M. H., Talebi, Mohammad Sadegh, Lui, J. C. S. & Wong, W. S., 2020, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtua. NeurIPS Proceedings, 10 p. (Advances in Neural Information Processing Systems, Vol. 33).

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

  15. Published

    Bandit-based relay selection in cooperative networks over unknown stationary channels

    Nomikos, N., Talebi, Mohammad Sadegh, Wichman, R. & Charalambous, T., 2020, Proceedings of the 2020 IEEE 30th International Workshop on Machine Learning for Signal Processing, MLSP 2020. IEEE, 9231604

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

  16. Published

    Tightening Exploration in Upper Confidence Reinforcement Learning

    Bourel, H., Maillard, O. & Talebi, Mohammad Sadegh, 2020, Proceedings of the 37th International Conference on Machine Learning. PMLR, p. 1056-1066 (Proceedings of Machine Learning Research, Vol. 119).

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

ID: 235125478