corruption1 deep-rl1 Function Approximation1 function-approximation2 gradient-descent2 machine-learning2 Markovian Sampling1 neural-networks1 neural-td1 optimization2 overparameterization1 q-learning3 reinforcement-learning1 rl1 robust-rl1 Stochastic Approximation1 stochastic-gradient-descent1 TD Learning1 temporal-difference-learning1 theory2