| google-research/batch_rl | 443 | | 0 | 0 | almost 3 years ago | 0 | | 9 | apache-2.0 | Python | Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games |
| apexrl/Batch-Offline--RL-Paper-Lists | 47 | | 0 | 0 | about 4 years ago | 0 | | 0 | | | Paper Collection for Batch RL with brief introductions. |
| kzl/lifelong_rl | 42 | | 0 | 0 | almost 5 years ago | 0 | | 1 | mit | Python | PyTorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Reset-Free Lifelong Learning with Skill-Space Planning. |
| google-research/deep_ope | 42 | | 0 | 0 | almost 5 years ago | 0 | | 1 | apache-2.0 | Jupyter Notebook | |
| hari-sikchi/AWAC | 25 | | 0 | 0 | over 3 years ago | 0 | | 3 | mit | Python | Advantage Weighted Actor Critic for Offline RL |
| thu-ml/SRPO | 11 | | 0 | 0 | over 2 years ago | 0 | | 0 | mit | Python | Code accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" |