| philtabor/Youtube-Code-Repository |
687 |
|
0 |
0 |
about 3 years ago |
0 |
|
41 |
|
Python |
| Repository for most of the code from my YouTube channel |
| yg211/summary-reward-no-reference |
28 |
|
0 |
0 |
over 3 years ago |
0 |
|
9 |
apache-2.0 |
Python |
| A reference-free metric for measuring summary quality, learned from human ratings. |
| maxgillham/ReinforcementLearning-AlgoTrading |
23 |
|
0 |
0 |
over 6 years ago |
0 |
|
1 |
|
Python |
| Code for thesis project on applying reinforcement learning to algorithmic trading |
| ir-uam/kNNBandit |
14 |
|
0 |
0 |
about 5 years ago |
0 |
|
1 |
mpl-2.0 |
Java |
| Software for the experiments reported in the RecSys 2019 paper "A Simple Multi-Armed Nearest-Neighbor Bandit for Interactive Recommendation" |
| sadighian/recommendation-gym |
9 |
|
0 |
0 |
almost 6 years ago |
0 |
|
0 |
apache-2.0 |
Python |
| MovieLens recommendation system using reinforcement learning (GYM + PPO) |
| benjamin-dupuis/DQN-snake |
5 |
|
0 |
0 |
about 6 years ago |
0 |
|
1 |
|
Python |
| Deep Reinforcement Learning algorithm to learn to play Snake |
| marcelloaborges/Soccer-PPO |
5 |
|
0 |
0 |
almost 7 years ago |
0 |
|
1 |
|
ASP |
| Udacity Deep Reinforcement Learning Nanodegree Program |