Discovering Reinforcement Learning Algorithms. paper link
Junhyuk Oh, Matteo Hessel, Wojciech M. Czarnecki, Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver
Revisiting Fundamentals of Experience Replay. paper link
William Fedus, Prajit Ramachandran, Rishabh Agarwal, Yoshua Bengio, Hugo Larochelle, Mark Rowland, Will Dabney
Towards General and Autonomous Learning of Core Skills: A Case Study in Locomotion. paper link
Roland Hafner, Tim Hertweck, Philipp Kloppner, Michael Bloesch, Michael Neunert, Markus Wulfmeier, Saran Tunyasuvunakool, Nicolas Heess, Martin Riedmiller
Agent57: Outperforming the Atari Human Benchmark. paper link
Adrià Puigdomènech Badia, Bilal Piot, Steven Kapturowski, Pablo Sprechmann, Alex Vitvitsky, Daniel Gu, Charles Blundel
Hyperparameter Selection for Offline Reinforcement Learning. paper link
Tom Le Paine, Cosmin Paduraru, Andrea Michi , Caglar Gulcehre , Konrad Żołna, Alexander Novikov , Ziyu Wang and Nando de Freitas
Importance Weighted Policy Learning and Adaption. paper link
Alexandre Galashov, Jakub Sygnowski, Guillaume Desjardins, Jan Humplik, Leonard Hasenclever, Rae Jeong, Yee Whye Teh, Nicolas Heess
Dear OpenI User
Thank you for your continuous support to the Openl Qizhi Community AI Collaboration Platform. In order to protect your usage rights and ensure network security, we updated the Openl Qizhi Community AI Collaboration Platform Usage Agreement in January 2024. The updated agreement specifies that users are prohibited from using intranet penetration tools. After you click "Agree and continue", you can continue to use our services. Thank you for your cooperation and understanding.
For more agreement content, please refer to the《Openl Qizhi Community AI Collaboration Platform Usage Agreement》