| MorvanZhou/Reinforcement-learning-with-tensorflow |
8,174 |
|
0 |
0 |
over 2 years ago |
0 |
|
63 |
mit |
Python |
| Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学 |
| vwxyzjn/cleanrl |
3,947 |
|
0 |
0 |
about 2 years ago |
0 |
|
49 |
other |
Python |
| High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG) |
| ikostrikov/pytorch-a2c-ppo-acktr-gail |
3,396 |
|
0 |
0 |
almost 4 years ago |
0 |
|
82 |
mit |
Python |
| PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL). |
| nikhilbarhate99/PPO-PyTorch |
1,270 |
|
0 |
0 |
over 2 years ago |
0 |
|
10 |
mit |
Python |
| Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch |
| uvipen/Super-mario-bros-PPO-pytorch |
692 |
|
0 |
0 |
over 4 years ago |
0 |
|
7 |
mit |
Python |
| Proximal Policy Optimization (PPO) algorithm for Super Mario Bros |
| Khrylx/PyTorch-RL |
638 |
|
0 |
0 |
about 5 years ago |
0 |
|
6 |
mit |
Python |
| PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO. |
| cpnota/autonomous-learning-library |
616 |
|
0 |
0 |
over 2 years ago |
16 |
November 16, 2023 |
26 |
mit |
Python |
| A PyTorch library for building deep reinforcement learning agents. |
| TianhongDai/reinforcement-learning-algorithms |
407 |
|
0 |
0 |
about 5 years ago |
0 |
|
4 |
|
Python |
| This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress) |
| ChenglongChen/pytorch-DRL |
387 |
|
0 |
0 |
over 8 years ago |
0 |
|
|
mit |
Python |
| PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent. |
| zuoxingdong/lagom |
365 |
|
0 |
0 |
about 6 years ago |
0 |
|
5 |
mit |
Jupyter Notebook |
| lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms. |