| LantaoYu/SeqGAN |
1,801 |
|
0 |
0 |
about 7 years ago |
0 |
|
33 |
|
Python |
| Implementation of Sequence Generative Adversarial Nets with Policy Gradient |
| wangshusen/DRL |
1,542 |
|
0 |
0 |
over 3 years ago |
0 |
|
37 |
other |
|
| Deep Reinforcement Learning |
| kh-kim/stock_market_reinforcement_learning |
630 |
|
0 |
0 |
over 9 years ago |
0 |
|
16 |
|
Python |
| This project provides a stock market environment using OpenAI Gym with Deep Q-learning and Policy Gradient. |
| yanpanlau/DDPG-Keras-Torcs |
595 |
|
0 |
0 |
over 8 years ago |
0 |
|
54 |
|
Python |
| Using Keras and Deep Deterministic Policy Gradient to play TORCS |
| salesforce/MultiHopKG |
252 |
|
0 |
0 |
over 3 years ago |
0 |
|
8 |
bsd-3-clause |
Jupyter Notebook |
| Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout |
| xuehy/pytorch-maddpg |
231 |
|
0 |
0 |
almost 8 years ago |
0 |
|
1 |
|
Python |
| A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient) |
| ghliu/pytorch-ddpg |
207 |
|
0 |
0 |
over 7 years ago |
0 |
|
2 |
apache-2.0 |
Python |
| Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch |
| openai/phasic-policy-gradient |
175 |
|
0 |
0 |
about 4 years ago |
0 |
|
6 |
mit |
Python |
| Code for the paper "Phasic Policy Gradient" |
| kvfrans/openai-cartpole |
129 |
|
0 |
0 |
over 7 years ago |
0 |
|
8 |
|
Python |
| random search, hill climbing, policy gradient |
| jcwleo/Reinforcement_Learning |
89 |
|
0 |
0 |
over 7 years ago |
0 |
|
0 |
|
Python |
| Implementations of basic reinforcement learning algorithms |