OpenI/PARL: PARL 是一个高性能、灵活的强化学习框架 - PARL - OpenI

关于GCU、沐曦GPGPU、MLU、0卡V100资源4月7日恢复上架的公告>>> 关于共建具身智能开源数据集的倡议>>> 关于云脑任务中统一路径访问方式的公告>>> 关于将启智集群GPU资源迁移至智算集群的公告>>>

Happy 6c85147db7 Update MADDPG README.md (#1020 ) * Update README.md * Update README.md		1 year ago
..
README.md	Update MADDPG README.md (#1020)	1 year ago

requirements.txt	dev 2.1.1 (#1009)	1 year ago

simple_agent.py	support zero-dim tensor for paddle future version (#986)	1 year ago

simple_model.py	add continuous actions option in maddpg (#828)	2 years ago

train.py	MADDPG-paddle sample/predict action (#942)	1 year ago

Reproduce MADDPG with PARL

Based on PARL, the MADDPG algorithm of deep reinforcement learning has been reproduced.

Paper: MADDPG in Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

Multi-agent particle environment introduction

A simple multi-agent particle world based on gym. Please see here to install and know more about the environment.

Benchmark result

Mean episode reward in training process (totally 25000 episodes).

result

Experiments result

simple	simple_adversary	simple_push	simple_crypto
simple_speaker_listener	simple_spread	simple_tag	simple_world_comm

How to use

Dependencies:

python3.7+
paddlepaddle>=2.0.0
parl>=2.1.1
PettingZoo==1.17.0
gym==0.23.1

Start Training:

# To train an agent for simple_speaker_listener scenario
python train.py

# To train for other scenario, model is automatically saved every 1000 episodes
python train.py --env [ENV_NAME]

# To show animation effects after training
python train.py --env [ENV_NAME] --show --restore

# To train and evaluate scenarios with continuous action spaces
python train.py --env [ENV_NAME] --continuous_actions
python train.py --env [ENV_NAME] --continuous_actions --show --restore

PARL 是一个高性能、灵活的强化学习框架

https://parl.readthedocs.io

ai开发工具

Python C++ JavaScript Shell Markdown other

2466956298@qq.com zenghongsheng@baidu.com likejiao@baidu.com 39279048+Banmahhhh@users.noreply.github.com lsb19@tsinghua.org.cn 68997378+swag1ong@users.noreply.github.com zhoubo01@baidu.com 76139596+ShuaibinLi@users.noreply.github.com 52879090+YuechengLiu@users.noreply.github.com wangzelong0663@gmail.com royxroy@163.com zenghsh3@gmail.com tan_ze@outlook.com 52879090+liuyuecheng-github@users.noreply.github.com 915647399@qq.com haonanyu@baidu.com cclauss@me.com yu239@users.noreply.github.com tangzhiyi11@users.noreply.github.com 50344320+ZiyuanMa@users.noreply.github.com 115619013+Aidilele@users.noreply.github.com 49400846+Jiukaishi@users.noreply.github.com 58016616+ljy2222@users.noreply.github.com bestwanglei@gmail.com skylian@users.noreply.github.com

How to access data resources in code