Cleanrl: High-quality single-file implementations of deep reinforcement learning algorithms S Huang, RFJ Dossa, C Ye, J Braga, D Chakraborty, K Mehta, ... Journal of Machine Learning Research 23 (274), 1-18, 2022 | 129 | 2022 |
The 37 implementation details of proximal policy optimization S Huang, RFJ Dossa, A Raffin, A Kanervisto, W Wang The ICLR Blog Track 2023, 2022 | 75 | 2022 |
A2C is a special case of PPO S Huang, A Kanervisto, A Raffin, W Wang, S Ontañón, RFJ Dossa arXiv preprint arXiv:2205.09123, 2022 | 15 | 2022 |
An empirical investigation of early stopping optimizations in proximal policy optimization RFJ Dossa, S Huang, S Ontañón, T Matsubara IEEE Access 9, 117981-117992, 2021 | 11 | 2021 |
A human-like agent based on a hybrid of reinforcement and imitation learning RFJ Dossa, X Lian, H Nomoto, T Matsubara, K Uehara 2019 international joint conference on neural networks (IJCNN), 1-8, 2019 | 10 | 2019 |
Hybrid of reinforcement and imitation learning for human-like agents RFJ Dossa, X Lian, H Nomoto, T Matsubara, K Uehara IEICE TRANSACTIONS on Information and Systems 103 (9), 1960-1970, 2020 | 4 | 2020 |
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning S Huang, Q Gallouédec, F Felten, A Raffin, RFJ Dossa, Y Zhao, ... arXiv preprint arXiv:2402.03046, 2024 | 1 | 2024 |
Toward Human Cognition-inspired High-Level Decision Making For Hierarchical Reinforcement Learning Agents RFJ Dossa, T Matsubara Decision Awareness in Reinforcement Learning Workshop at ICML 2022, 2022 | | 2022 |
Hybrid Reinforcement and Imitation Learning for Human-Like Agents RFJ Dossa, X Lian, H Nomoto, T Matsubara, K Uehara IEICE Technical Report; IEICE Tech. Rep. 119 (88), 69-74, 2019 | | 2019 |
A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning X Lian, RFJ Dossa, H Nomoto, T Matsubara, K Uehara IEICE Technical Report; IEICE Tech. Rep. 118 (316), 45-50, 2018 | | 2018 |
A Comparison of Visual and Auditory EEG Interfaces for Robot Multi-stage Task Control K Arulkumaran, M Di Vincenzo, RFJ Dossa, S Akiyama, D Ogawa Lillrank, ... Frontiers in Robotics and AI 11, 1329270, 0 | | |
Open-loop VLM Robot Planning: An Investigation of Fine-tuning and Prompt Engineering Strategies S Akiyama, RFJ Dossa, K Arulkumaran, S Sujit, E Johns First Workshop on Vision-Language Models for Navigation and Manipulation at …, 0 | | |