Reading Group @ SICE, IUB

Welcome to the homepage of our reading group for reinforcement learning.

Info

Time: bi-weekly on 1:00pm - 3:00pm

Place: Luddy Hall 3069

Keywords

Markov Decision Process (MDP); Value Functions; Bellman Equations; State Occupancy; Q Learning;

Keywords

Tabular Episodic MDP; Model-based; Optimistic Algorithm; Frequentist Regret; $\widetilde{\mathcal{O}}(H\sqrt{SAT})$;

Keywords

Tabular Episodic MDP; Model-free; Optimistic Algorithm; Frequentist Regret; $\widetilde{\mathcal{O}}(H^2\sqrt{SAT})$;

Keywords

Tabular Episodic MDP; Thompson Sampling; Bayesian Regret; $\widetilde{\mathcal{O}}(HS\sqrt{AT})$;

Keywords

Tabular Infinite Undiscounted MDP; Weakly Communicating; Thompson Sampling; Bayesian Regret; $\widetilde{\mathcal{O}}(D'S\sqrt{AT})$;