https://sreejeetm1729.github.io/posts/why-vanilla-q-learning-breaks-under-corrupted-rewards/ 2026-04-12T23:15:00+00:00
https://sreejeetm1729.github.io/posts/why-gradient-descent-works/ 2026-05-12T04:00:00+00:00
https://sreejeetm1729.github.io/posts/stochastic-gradient-descent/ 2026-05-13T00:00:00+00:00
https://sreejeetm1729.github.io/posts/td-without-projection/ 2026-05-26T00:00:00+00:00
https://sreejeetm1729.github.io/posts/function-approximation-rl/ 2026-05-27T05:00:00+00:00
https://sreejeetm1729.github.io/posts/neural_td/ 2026-06-02T00:00:00+00:00
https://sreejeetm1729.github.io/posts/one-pixel-attack/ 2026-06-04T04:00:00+00:00
https://sreejeetm1729.github.io/posts/concentration-inequalities-for-rl-research/ 2026-06-09T00:00:00+00:00
https://sreejeetm1729.github.io/posts/discounted-vs-average-reward-rl/ 2026-06-09T00:00:00+00:00
https://sreejeetm1729.github.io/posts/bellman-operators-and-optimality/ 2026-06-10T00:00:00+00:00
https://sreejeetm1729.github.io/posts/td-learning-almost-gradient-descent-jekyll-fixed/ 2026-06-11T00:00:00+00:00
https://sreejeetm1729.github.io/posts/turbo-snail-robust-pomdp/ 2026-06-11T00:00:00+00:00
https://sreejeetm1729.github.io/posts/linear-td-vs-neural-td-blog/ 2026-06-12T00:00:00+00:00
https://sreejeetm1729.github.io/posts/td-2norm-vs-infinity-norm/ 2026-06-13T00:00:00+00:00
https://sreejeetm1729.github.io/who-am-i/ 2026-08-03T03:19:06+00:00
https://sreejeetm1729.github.io/Research/ 2026-08-03T03:19:06+00:00
https://sreejeetm1729.github.io/Publications/ 2026-08-03T03:19:06+00:00
https://sreejeetm1729.github.io/Recent-News/ 2026-08-03T03:19:06+00:00
https://sreejeetm1729.github.io/RL-Blogs/ 2026-08-03T03:19:06+00:00
https://sreejeetm1729.github.io/Resume/ 2026-08-03T03:19:06+00:00
https://sreejeetm1729.github.io/categories/
https://sreejeetm1729.github.io/
https://sreejeetm1729.github.io/tags/
https://sreejeetm1729.github.io/tags/reinforcement-learning/
https://sreejeetm1729.github.io/tags/robust-rl/
https://sreejeetm1729.github.io/tags/q-learning/
https://sreejeetm1729.github.io/tags/corruption/
https://sreejeetm1729.github.io/tags/theory/
https://sreejeetm1729.github.io/tags/optimization/
https://sreejeetm1729.github.io/tags/gradient-descent/
https://sreejeetm1729.github.io/tags/machine-learning/
https://sreejeetm1729.github.io/tags/stochastic-gradient-descent/
https://sreejeetm1729.github.io/tags/td-learning/
https://sreejeetm1729.github.io/tags/function-approximation/
https://sreejeetm1729.github.io/tags/markovian-sampling/
https://sreejeetm1729.github.io/tags/stochastic-approximation/
https://sreejeetm1729.github.io/tags/rl/
https://sreejeetm1729.github.io/tags/function-approximation/
https://sreejeetm1729.github.io/tags/deep-rl/
https://sreejeetm1729.github.io/tags/neural-networks/
https://sreejeetm1729.github.io/tags/temporal-difference-learning/
https://sreejeetm1729.github.io/tags/neural-td/
https://sreejeetm1729.github.io/tags/overparameterization/
https://sreejeetm1729.github.io/tags/adversarial-ml/
https://sreejeetm1729.github.io/tags/markov-inequality/
https://sreejeetm1729.github.io/tags/chebyshev-inequality/
https://sreejeetm1729.github.io/tags/hoeffding-inequality/
https://sreejeetm1729.github.io/tags/chernoff-bound/
https://sreejeetm1729.github.io/tags/bernstein-inequality/
https://sreejeetm1729.github.io/tags/azuma-hoeffding-inequality/
https://sreejeetm1729.github.io/tags/freedman-inequality/
https://sreejeetm1729.github.io/tags/martingales/
https://sreejeetm1729.github.io/tags/reinforcement-learning/
https://sreejeetm1729.github.io/tags/mdp/
https://sreejeetm1729.github.io/tags/discounted-rl/
https://sreejeetm1729.github.io/tags/average-reward/
https://sreejeetm1729.github.io/tags/bellman-equation/
https://sreejeetm1729.github.io/tags/bellman-operator/
https://sreejeetm1729.github.io/tags/dynamic-programming/
https://sreejeetm1729.github.io/tags/value-iteration/
https://sreejeetm1729.github.io/tags/policy-iteration/
https://sreejeetm1729.github.io/tags/td-learning/
https://sreejeetm1729.github.io/tags/linear-function-approximation/
https://sreejeetm1729.github.io/tags/finite-time-analysis/
https://sreejeetm1729.github.io/tags/markovian-noise/
https://sreejeetm1729.github.io/tags/reinforcement-learning-theory/
https://sreejeetm1729.github.io/tags/pomdp/
https://sreejeetm1729.github.io/tags/exploration/
https://sreejeetm1729.github.io/tags/minimax/
https://sreejeetm1729.github.io/tags/gridworld/
https://sreejeetm1729.github.io/tags/linear-td/
https://sreejeetm1729.github.io/tags/bellman-operators/
https://sreejeetm1729.github.io/tags/mspbe/
https://sreejeetm1729.github.io/tags/stochastic-approximation/
https://sreejeetm1729.github.io/tags/norms/
https://sreejeetm1729.github.io/categories/rl-blogs/
https://sreejeetm1729.github.io/page2/
https://sreejeetm1729.github.io/assets/html/living_city_learning_dashboard_width_fixed.html 2026-08-03T03:18:56+00:00
https://sreejeetm1729.github.io/assets/html/rl3d_drone_playground_compact.html 2026-08-03T03:18:56+00:00
https://sreejeetm1729.github.io/assets/pdf/2026_Resume.pdf 2026-08-03T03:18:56+00:00
https://sreejeetm1729.github.io/google7ebe9e03cc635e54.html 2026-08-03T03:18:56+00:00