thefartchild/PARL: PARL is a high-performance, flexible reinforcement learning framework - PARL - OpenI

Reproduce CQL with PARL

This example reproduces the CQL (Conservative Q-Learning) deep reinforcement learning algorithm with PARL. On continuous control datasets from the D4RL benchmark, it reaches performance comparable to the results reported in the paper.

Paper: CQL in Conservative Q-Learning for Offline Reinforcement Learning

Env and dataset introduction

D4RL datasets: The algorithm is tested on D4RL, one of the most commonly used collections of datasets for offline RL. See the D4RL project page for more about the datasets. D4RL requires MuJoCo as a dependency. For more on using D4RL, refer to its guide.
MuJoCo simulator: See the MuJoCo website for more about the simulator and for how to obtain a license.
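As a rough illustration of how one of these datasets can be pulled into Python, the sketch below uses `gym.make` together with D4RL's `qlearning_dataset`. It is not taken from `train.py`: the helper names `load_d4rl_transitions` and `summarize` are hypothetical, and the loader only works once gym, mujoco-py and d4rl are installed.

```python
def load_d4rl_transitions(env_name="halfcheetah-medium-expert-v0"):
    """Load one D4RL dataset as a dict of arrays (needs gym, mujoco-py, d4rl)."""
    # Imported lazily so that summarize() below works without these packages.
    import gym
    import d4rl  # importing d4rl registers its environments with gym

    env = gym.make(env_name)
    # qlearning_dataset returns observations, actions, next_observations,
    # rewards and terminals, aligned per transition.
    return d4rl.qlearning_dataset(env)


def summarize(dataset):
    """Return {field: number of entries}; fail if the fields are misaligned."""
    sizes = {key: len(values) for key, values in dataset.items()}
    if len(set(sizes.values())) > 1:
        raise ValueError("fields have different lengths: %r" % (sizes,))
    return sizes
```

With the dependencies in place, `summarize(load_d4rl_transitions())` gives a quick sanity check that every field holds the same number of transitions before the data is handed to a replay buffer.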

Benchmark result

How to use

Dependencies:

python3.5+
parl>2.0.3
paddlepaddle>=2.0.4
gym==0.20.0
mujoco-py==2.0.2.8
d4rl (install from source)
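The pins above can be checked against the active environment before training. The following is a minimal sketch and not part of the PARL example: `REQUIREMENTS`, `version_tuple` and `report` are names made up for illustration, the version comparison is deliberately simple (leading numeric components only), and `importlib.metadata` needs Python 3.8 or newer even though the list above allows python3.5+.

```python
import operator
import re
from importlib import metadata  # standard library from Python 3.8

# (distribution name, comparison, pinned version), copied from the list above.
# d4rl is installed from source without a pin, so only its presence is checked.
REQUIREMENTS = [
    ("parl", operator.gt, "2.0.3"),
    ("paddlepaddle", operator.ge, "2.0.4"),
    ("gym", operator.eq, "0.20.0"),
    ("mujoco-py", operator.eq, "2.0.2.8"),
    ("d4rl", None, None),
]


def version_tuple(text):
    """Turn '2.0.2.8' into (2, 0, 2, 8); stop at the first non-numeric part."""
    parts = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    # Drop trailing zeros so that '2.1.0' and '2.1' compare as equal.
    while parts and parts[-1] == 0:
        parts.pop()
    return tuple(parts)


def report(get_version=metadata.version):
    """Return a list of human-readable problems; an empty list means all good."""
    problems = []
    for name, compare, required in REQUIREMENTS:
        try:
            installed = get_version(name)
        except metadata.PackageNotFoundError:
            problems.append("%s: not installed" % name)
            continue
        if compare is None:
            continue
        if not compare(version_tuple(installed), version_tuple(required)):
            problems.append(
                "%s: found %s, pinned against %s" % (name, installed, required)
            )
    return problems
```

Calling `report()` with no arguments inspects the current interpreter's installed packages. The `get_version` parameter exists only so the check can be exercised without touching the real environment.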

Start Training:

Train

# Train on halfcheetah-medium-expert-v0 (the default), or on any dataset named
# [halfcheetah/hopper/walker/ant]-[random/medium/expert/medium-expert/medium-replay]-[v0/v2]
python train.py --env [ENV_NAME]

# Reproduce the benchmark performance (adds the --with_automatic_entropy_tuning flag)
python train.py --env [ENV_NAME] --with_automatic_entropy_tuning
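To run several datasets in a row, the bracket pattern in the comment above can be expanded programmatically. This is a sketch under assumptions: `env_names` and `training_command` are hypothetical helpers, the names are a literal expansion of the pattern as written here, and nothing checks that each combination is actually registered by your D4RL installation (D4RL's own registry may spell some tasks differently, for example walker2d).

```python
from itertools import product

# Literal expansion of the pattern from the training comment above.
TASKS = ("halfcheetah", "hopper", "walker", "ant")
QUALITIES = ("random", "medium", "expert", "medium-expert", "medium-replay")
VERSIONS = ("v0", "v2")
DEFAULT_ENV = "halfcheetah-medium-expert-v0"


def env_names():
    """All task-quality-version combinations described by the pattern."""
    return ["-".join(parts) for parts in product(TASKS, QUALITIES, VERSIONS)]


def training_command(env_name=DEFAULT_ENV, reproduce=False):
    """Build the argument list for one run of train.py."""
    if env_name not in env_names():
        raise ValueError("unknown dataset name: %s" % env_name)
    command = ["python", "train.py", "--env", env_name]
    if reproduce:
        # Same flag as the "reproduce the performance" command above.
        command.append("--with_automatic_entropy_tuning")
    return command
```

Each returned list can be passed directly to `subprocess.run`, which avoids shell quoting issues when looping over many dataset names.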
