| JuliaPOMDP/POMDPs.jl |
616 |
|
0 |
0 |
over 2 years ago |
0 |
|
25 |
other |
Julia |
| MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces. |
| Nth-iteration-labs/contextual |
47 |
|
0 |
0 |
over 5 years ago |
6 |
July 25, 2020 |
2 |
|
R |
| Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies |
| apexrl/Batch-Offline--RL-Paper-Lists |
47 |
|
0 |
0 |
about 4 years ago |
0 |
|
0 |
|
|
| Paper Collection for Batch RL with brief introductions. |
| YiqinYang/ICQ |
41 |
|
0 |
0 |
over 3 years ago |
0 |
|
3 |
|
Python |
| Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS 2021 Spotlight https://arxiv.org/abs/2106.03400) |
| ChenDRAG/SfBC |
33 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
|
Python |
| Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548 |
| ChenDRAG/CEP-energy-guided-diffusion |
24 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
mit |
Python |
| Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023) |
| thu-ml/SRPO |
11 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
mit |
Python |
| Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" |
| massquantity/DBRL |
5 |
|
0 |
0 |
over 5 years ago |
0 |
|
0 |
|
Python |
| Dataset Batch(offline) Reinforcement Learning for recommender system |
| avisingh599/cog |
5 |
|
0 |
0 |
over 5 years ago |
0 |
|
0 |
mit |
Python |
| [CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning |