Resume

Sreejeet Maity

Ph.D. Student in Electrical Engineering · North Carolina State University · Raleigh, NC, U.S.A

I develop provably robust finite-sample guarantees for reinforcement learning (RL) under uncertainty and adversarial corruption. My current research interests include corruption-tolerant reinforcement learning, robust policy evaluation, distributed and federated reinforcement learning, and minimax lower bounds that characterize the fundamental limits of robust learning.

Google Scholar GitHub LinkedIn Download CV

Robust RL Statistical Learning Theory Control Theory Federated RL Robust Statistics

Collaboration Note. I am always happy to engage with researchers working on broad areas of reinforcement learning, control theory, federated learning, and trustworthy machine learning. If you are interested in discussing possible collaborations or exchanging research ideas, please reach out to .

Prospective Opportunities. I will be entering the academic and industry research job market next year (2027), and I would be grateful to hear about opportunities aligned with my interests in broad areas of robust and safe RL, and reliable decision-making and control. I am also open to postdoctoral opportunities beginning in Fall 2027, as well as opportunities in subsequent academic cycles. Prospective recruiters, search committees, and researchers are warmly welcome to reach out.

Education

North Carolina State University

Ph.D. in Electrical Engineering · Aug 2023--Present · Raleigh, NC

Advisor: Dr. Aritra Mitra.

Indian Institute of Science, Bangalore

M.Tech. in Robotics and Autonomous Systems · Aug 2021--June 2023 · Bangalore, India

Jadavpur University

B.E. in Electrical Engineering · Aug 2017--July 2021 · Kolkata, India

Research Experience

Graduate Research / Teaching Assistant

North Carolina State University · Aug 2023--Present

Robust optimal policy learning from corrupted and correlated observations. Showed that vanilla Q-Learning is provably fragile under reward corruption, designed robust Bellman-update methods, and established finite-time convergence with matching minimax lower bounds. Results disseminated across ICML 2026, NeurIPS 2025, and IEEE CDC 2024, CDC 2026.
Robust policy evaluation under adversarial influences and Markovian data. Developed finite-time theory for robust temporal-difference learning with Markovian noise and function approximation, including upper bounds and near-tight lower bounds. This research is published in AISTATS 2025.
Robust federated and multi-agent reinforcement learning. Developed adversarially robust and communication-efficient reinforcement learning algorithms for federated multi-agent settings, including Byzantine-resilient methods with collaborative speedups. Two papers published at ACC 2026.

Research Collaboration · Neuromuscular Rehabilitation Engineering Laboratory

University of North Carolina at Chapel Hill · April 2026--Present

Developing personalized reinforcement learning methods for subject-specific tuning of commercial powered knee prostheses.
Formulating personalized MDP/CMDP models with user-specific dynamics, time-varying reward design, and safety-comfort constraints.
Exploring offline-to-online adaptation schemes that warm-start from population priors and enable safe, data-efficient personalization.