openrl



How to Use

Users can train CartPole via:

python train_ppo.py --config ppo.yaml
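For orientation, a script of this kind might look roughly like the sketch below. It is not the contents of `train_ppo.py`. It follows the OpenRL quick-start pattern (`make`, `PPONet`, `PPOAgent`), does not read `ppo.yaml`, and the function name `train_cartpole` and its default values are assumptions chosen for illustration.

```python
def train_cartpole(total_time_steps=20000, env_num=9):
    """Train a PPO agent on CartPole-v1 and return the trained agent.

    Hypothetical helper: the name and the default values are illustrative.
    """
    # OpenRL is imported inside the function so that this file can still be
    # imported on a machine where OpenRL is not installed.
    from openrl.envs.common import make
    from openrl.modules.common import PPONet
    from openrl.runners.common import PPOAgent

    env = make("CartPole-v1", env_num=env_num)  # env_num parallel environments
    net = PPONet(env)                           # policy and value networks
    agent = PPOAgent(net)                       # PPO trainer
    agent.train(total_time_steps=total_time_steps)
    env.close()
    return agent
```

Calling `train_cartpole()` requires OpenRL to be installed. The settings in `ppo.yaml` are not reproduced here.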

To train with Dual-clip PPO:

python train_ppo.py --config dual_clip_ppo.yaml
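As background, Dual-clip PPO adds a second clip to the PPO surrogate objective: when the advantage is negative, the objective is bounded from below by `c * advantage` with `c > 1`, so a very large probability ratio cannot produce an unbounded penalty. The sketch below shows the per-sample objective in plain Python. It is an illustration of the published formula, not OpenRL's implementation, and the names `clip_eps` and `dual_clip` as well as their default values are assumptions.

```python
def dual_clip_ppo_objective(ratio, advantage, clip_eps=0.2, dual_clip=3.0):
    """Per-sample Dual-clip PPO surrogate objective (to be maximized).

    ratio:     pi_new(a|s) / pi_old(a|s)
    advantage: advantage estimate for the sampled action
    clip_eps:  standard PPO clip range (assumed default)
    dual_clip: lower-bound constant c > 1 (assumed default)
    """
    # Standard PPO: clip the ratio to [1 - eps, 1 + eps] and take the
    # pessimistic (smaller) of the clipped and unclipped terms.
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    ppo_objective = min(ratio * advantage, clipped_ratio * advantage)

    if advantage < 0:
        # Dual clip: for negative advantages, never go below c * advantage.
        return max(ppo_objective, dual_clip * advantage)
    return ppo_objective
```

With the defaults above, a sample with `ratio=10.0` and `advantage=-1.0` is bounded at `dual_clip * advantage`, whereas the standard PPO objective would evaluate to `ratio * advantage`.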

To train with A2C algorithm:

python train_a2c.py

To evaluate the agent during training, save the best model, and save checkpoints, train with callbacks:

python train_ppo.py --config callbacks.yaml

More details about callbacks can be found in Callbacks.

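OpenRL is an open-source, general-purpose reinforcement learning framework that supports single-agent and multi-agent training, self-play, offline reinforcement learning, and large language model training.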

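Topics: deep learning, artificial intelligence, robotics, embodied intelligence, artificial general intelligence, autonomous driving, deep reinforcement learning, adversarial agents, multi-agent, game-playing agents, reinforcement learning, game agents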
