Sadegh Talebi
Tenure Track Assistant Professor
Machine Learning
Universitetsparken 1
2100 København Ø
Most downloads
-
50 downloadsPublished
Adversarial Bandits with Corruptions
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
-
40 downloadsPublished
Tightening Exploration in Upper Confidence Reinforcement Learning
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
-
22 downloadsPublished
Scaling Up Q-Learning via Exploiting State–Action Equivalence
Research output: Contribution to journal › Journal article › Research › peer-review
-
14 downloadsPublished
Bandit-Based Power Control in Full-Duplex Cooperative Relay Networks with Strict-Sense Stationary and Non-Stationary Wireless Communication Channels
Research output: Contribution to journal › Journal article › Research › peer-review
ID: 235125478
Most downloads
-
50
downloads
Adversarial Bandits with Corruptions
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Published -
40
downloads
Tightening Exploration in Upper Confidence Reinforcement Learning
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Published -
22
downloads
Scaling Up Q-Learning via Exploiting State–Action Equivalence
Research output: Contribution to journal › Journal article › Research › peer-review
Published