GitHub - Shubham1965/rl-projects

Let's explore the world of Reinforcement Learning through implementation using python as simplified as possible.

I'm assuming you have basic foundational knowledge of Markov Decision Processes (MDPs) and Dynamic Programming (DP). Most RL algorithms can be viewed as attempts to achieve much the same effect as DP, only with less computation.

You'll see the implementation of the classical reinforcement learning algorithms from Reinforcement Learning: An Introduction on various environments

Dynamic Programming (Policy and Value Iteration)
Monte Carlo Methods (Prediction and Control)
Temporal Difference (SARSA and Q-Learning)
Value Function Approximation (DQN, DDQN)
Policy gradient methods (REINFORCE)
Actor Critic methods (DDPG, PPO, TRPO, A2C, TD3, SAC, RPO, AMP)
Model Based methods (Dyna-Q, PETS)

Project layout

core/algorithms/monte_carlo: Blackjack Monte Carlo prediction and control.
core/algorithms/tabular: Dynamic programming and temporal-difference methods for grid worlds.
core/env: The grid world environment and configs.
core/utils: Small numeric helpers.
examples: Runnable scripts demonstrating the algorithms.
tests: Placeholder suite ready for real unit tests.
results: Saved figures produced by the examples.

Setup

Install dependencies: pip install -r requirements.txt
(Optional) Editable install with dev tools: pip install -e .[dev]
Lint/format/test: ruff format --check . && ruff check . && pytest

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.github/workflows		.github/workflows
core		core
tests		tests
.gitignore		.gitignore
Dockerfile		Dockerfile
ReadMe.md		ReadMe.md
environment.yml		environment.yml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project layout

Setup

About

Uh oh!

Releases

Packages

Languages

Shubham1965/rl-projects

Folders and files

Latest commit

History

Repository files navigation

Project layout

Setup

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages