| thu-ml/tianshou | 7,125 | | 0 | 10 | about 2 years ago | 33 | August 22, 2023 | 97 | mit | Python |
| An elegant PyTorch deep reinforcement learning library. |
| udacity/deep-reinforcement-learning | 4,635 | | 0 | 0 | over 2 years ago | 0 | | 5 | mit | Jupyter Notebook |
| Repo for the Deep Reinforcement Learning Nanodegree program |
| vwxyzjn/cleanrl | 3,947 | | 0 | 0 | about 2 years ago | 0 | | 49 | other | Python |
| High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG) |
| simoninithomas/Deep_reinforcement_learning_Course | 3,581 | | 0 | 0 | almost 3 years ago | 0 | | 46 | | Jupyter Notebook |
| Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch |
| ikostrikov/pytorch-a2c-ppo-acktr-gail | 3,396 | | 0 | 0 | almost 4 years ago | 0 | | 82 | mit | Python |
| PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL). |
| AI4Finance-Foundation/ElegantRL | 3,229 | | 0 | 2 | over 2 years ago | 5 | January 08, 2022 | 107 | other | Python |
| Massively Parallel Deep Reinforcement Learning. 🔥 |
| ShangtongZhang/DeepRL | 2,834 | | 0 | 0 | over 3 years ago | 0 | | 5 | mit | Python |
| Modularized Implementation of Deep RL Algorithms in PyTorch |
| sweetice/Deep-reinforcement-learning-with-pytorch | 2,741 | | 0 | 0 | about 3 years ago | 0 | | 26 | mit | Python |
| PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and .... |
| seungeunrho/minimalRL | 2,417 | | 0 | 0 | almost 3 years ago | 0 | | 21 | mit | Python |
| Implementations of basic RL algorithms with minimal lines of codes! (pytorch based) |
| DLR-RM/rl-baselines3-zoo | 1,640 | | 0 | 2 | about 2 years ago | 18 | November 17, 2023 | 54 | mit | Python |
| A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included. |