CAUSAL DISCOVERY WITH REINFORCEMENT LEARNING ICLR 2020. paper
Shengyu Zhu, Ignavier Ng, Zhitang Chen
Posterior sampling for multi-agent reinforcement learning: solving extensive games with imperfect informatio ICLR 2020. paper
Yichi Zhou , Jialian Li, Jun Zhu
Harnessing Structures for Value-Based Planning and Reinforcement Learning ICLR2020. paper
Yuzhe Yang , Guo Zhang, Zhi Xu, Dina Katabi
A Closer Look at Deep Policy Gradients ICLR 2020. paper
Andrew Ilyas, Logan Engstrom, Shibani Santurkar, Dimitris Tsipras, Firdaus Janoos, Larry Rudolph, Aleksander Madry
Implementation Matters in Deep RL: A Case Study on PPO and TRPO ICLR 2020. paper
Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Firdaus Janoos, Larry Rudolph, Aleksander Madry
A Generalized Training Approach for Multiagent Learning ICLR 2020. paper
Paul Muller, Shayegan Omidshafiei, Mark Rowland, Karl Tuyls, Julien Perolat, Siqi Liu, Daniel Hennes, Luke Marris, Marc Lanctot, Edward Hughes, Zhe Wang, Guy Lever, Nicolas Heess, Thore Graepel, Remi Munos
Meta-Q-Learning ICLR 2020. paper
Rasool Fakoor, Pratik Chaudhari, Stefano Soatto, Alexander J. Smola
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference ICLR 2020. paper
Lasse Espeholt, Raphaël Marinier, Piotr Stanczyk, Ke Wang, Marcin Michalski
Dear OpenI User
Thank you for your continuous support to the Openl Qizhi Community AI Collaboration Platform. In order to protect your usage rights and ensure network security, we updated the Openl Qizhi Community AI Collaboration Platform Usage Agreement in January 2024. The updated agreement specifies that users are prohibited from using intranet penetration tools. After you click "Agree and continue", you can continue to use our services. Thank you for your cooperation and understanding.
For more agreement content, please refer to the《Openl Qizhi Community AI Collaboration Platform Usage Agreement》