Hybrid reward architecture for reinforcement learning H Van Seijen, M Fatemi, J Romoff, R Laroche, T Barnes, J Tsang Advances in Neural Information Processing Systems 30, 2017 | 256 | 2017 |
Safe policy improvement with baseline bootstrapping R Laroche, P Trichelair, RT Des Combes International conference on machine learning, 3652-3661, 2019 | 214 | 2019 |
Learning dynamic belief graphs to generalize on text-based games A Adhikari, X Yuan, MA Côté, M Zelinka, MA Rondeau, R Laroche, ... Advances in Neural Information Processing Systems 33, 3045-3057, 2020 | 102 | 2020 |
Contextual bandit for active learning: Active thompson sampling D Bouneffouf, R Laroche, T Urvoy, R Féraud, R Allesiardo Neural Information Processing: 21st International Conference, ICONIP 2014 …, 2014 | 93 | 2014 |
Counting to explore and generalize in text-based games X Yuan, MA Côté, A Sordoni, R Laroche, RT Combes, M Hausknecht, ... arXiv preprint arXiv:1806.11525, 2018 | 60 | 2018 |
Transfer reinforcement learning with shared dynamics R Laroche, M Barlier Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017 | 59 | 2017 |
When does return-conditioned supervised learning work for offline reinforcement learning? D Brandfonbrener, A Bietti, J Buckman, R Laroche, J Bruna Advances in Neural Information Processing Systems 35, 1542-1553, 2022 | 53 | 2022 |
Score-based inverse reinforcement learning L El Asri, B Piot, M Geist, R Laroche, O Pietquin International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2016 | 45 | 2016 |
Reinforcement learning algorithm selection R Laroche, R Feraud ICLR, 2018 | 39 | 2018 |
Hybrid reward architecture for reinforcement learning HH Van Seijen, SMF Booshehri, RMH Laroche, JS Romoff US Patent 10,977,551, 2021 | 38 | 2021 |
Safe policy improvement with soft baseline bootstrapping K Nadjahi, R Laroche, R Tachet des Combes Machine Learning and Knowledge Discovery in Databases: European Conference …, 2020 | 35 | 2020 |
Transfer Learning for User Adaptation in Spoken Dialogue Systems. A Genevay, R Laroche AAMAS, 975-983, 2016 | 33 | 2016 |
Human-machine dialogue as a stochastic game M Barlier, J Perolat, R Laroche, O Pietquin 16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), 2015 | 31 | 2015 |
NASTIA: Negotiating Appointment Setting Interface. L El Asri, R Lemonnier, R Laroche, O Pietquin, H Khouzaimi LREC, 266-271, 2014 | 30 | 2014 |
Reward function learning for dialogue management L El Asri, R Laroche, O Pietquin STAIRS 2012, 95-106, 2012 | 29 | 2012 |
Reward shaping for statistical optimisation of dialogue management L El Asri, R Laroche, O Pietquin Statistical Language and Speech Processing: First International Conference …, 2013 | 28 | 2013 |
Decentralized exploration in multi-armed bandits R Féraud, R Alami, R Laroche International Conference on Machine Learning, 1901-1909, 2019 | 27 | 2019 |
Safe policy improvement with an estimated baseline policy TD Simão, R Laroche, RT Combes International Foundation for Autonomous Agents and Multi-Agent Systems, 2019 | 26 | 2019 |
On value function representation of long horizon problems L Lehnert, R Laroche, H van Seijen Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 25 | 2018 |
Multi-advisor reinforcement learning R Laroche, M Fatemi, J Romoff, H van Seijen arXiv preprint arXiv:1704.00756, 2017 | 25 | 2017 |