Ph.D. Candidate at North Carolina State University


Tag: mdp (1 post)

Discounted vs. Average Reward Reinforcement Learning (Jun 9, 2026)



© 2026 Sreejeet Maity. Some rights reserved.

