Derived and Implemented 6 RL algorithms from scratch in Python and simulated with GYM environments  Topics

RL Algorithms from scratch using Open AI Gym

Overview

In this project, I implemented and analyzed 9 reinforcement learning algorithms from scratch—ranging from tabular methods like Q-Learning and SARSA to deep and actor–critic approaches such as DQN, DDQN, Actor-Critic, A2C, PPO, DDPG, and TD3.

For each algorithm, I:

  • Derived the mathematical foundations and update equations

  • Implemented the model from scratch in Python (PyTorch + NumPy)

  • Tested and benchmarked across multiple OpenAI Gym environments

  • Analyzed learning curves, policy behavior, and algorithmic trade-offs

This project deepened my understanding of reinforcement learning principles, bridging theoretical derivations with practical implementations and highlighting the evolution from simple value-based agents to sophisticated actor–critic frameworks.

Tools

Python

Open AI Gym

Semester

7

Grade

A+

RL Algorithms from scratch using Open AI Gym

Overview

In this project, I implemented and analyzed 9 reinforcement learning algorithms from scratch—ranging from tabular methods like Q-Learning and SARSA to deep and actor–critic approaches such as DQN, DDQN, Actor-Critic, A2C, PPO, DDPG, and TD3.

For each algorithm, I:

  • Derived the mathematical foundations and update equations

  • Implemented the model from scratch in Python (PyTorch + NumPy)

  • Tested and benchmarked across multiple OpenAI Gym environments

  • Analyzed learning curves, policy behavior, and algorithmic trade-offs

This project deepened my understanding of reinforcement learning principles, bridging theoretical derivations with practical implementations and highlighting the evolution from simple value-based agents to sophisticated actor–critic frameworks.

Tools

Python

Open AI Gym

Semester

7

Grade

A+

Create a free website with Framer, the website builder loved by startups, designers and agencies.