| Unity-Technologies/ml-agents | 15,707 | | 12 | 17 | over 2 years ago | 46 | October 09, 2023 | 48 | other | C# |
| The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning. |
| kmario23/deep-learning-drizzle | 10,767 | | 0 | 0 | over 3 years ago | 0 | | 6 | | HTML |
| Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!! |
| google/trax | 7,818 | | 0 | 1 | over 2 years ago | 24 | October 26, 2021 | 120 | apache-2.0 | Python |
| Trax — Deep Learning with Clear Code and Speed |
| datawhalechina/easy-rl | 7,643 | | 0 | 0 | about 2 years ago | 0 | | 47 | other | Jupyter Notebook |
| Reinforcement learning tutorial in Chinese (the "Mushroom Book"); read online at: https://datawhalechina.github.io/easy-rl/ |
| yandexdataschool/Practical_RL | 5,572 | | 0 | 0 | over 2 years ago | 0 | | 40 | unlicense | Jupyter Notebook |
| A course in reinforcement learning in the wild |
| udacity/deep-reinforcement-learning | 4,635 | | 0 | 0 | over 2 years ago | 0 | | 5 | mit | Jupyter Notebook |
| Repo for the Deep Reinforcement Learning Nanodegree program |
| vwxyzjn/cleanrl | 3,947 | | 0 | 0 | about 2 years ago | 0 | | 49 | other | Python |
| High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG) |
| andri27-ts/Reinforcement-Learning | 3,928 | | 0 | 0 | almost 6 years ago | 0 | | 2 | mit | Jupyter Notebook |
| Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning |
| huggingface/deep-rl-class | 3,459 | | 0 | 0 | about 2 years ago | 0 | | 27 | apache-2.0 | MDX |
| This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course. |
| ikostrikov/pytorch-a2c-ppo-acktr-gail | 3,396 | | 0 | 0 | almost 4 years ago | 0 | | 82 | mit | Python |
| PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL). |