Diana Borsa

인용

	전체	2019년 이후
서지정보	1039	934
h-index	14	14
i10-index	18	18

240

120

180

201420152016201720182019202020212022202320244 5 17 28 48 78 134 202 218 237 65

공개 액세스

모두 보기

자료 2개

자료 0개

공개

비공개

재정 지원 요구사항 기준

공동 저자

Andre BarretoResearch Scientist, Google DeepMindgoogle.com의 이메일 확인됨
Tom SchaulSenior Staff Scientist, DeepMindnyu.edu의 이메일 확인됨
Rémi MunosDeepMindinria.fr의 이메일 확인됨
David SilverDeepMind, UCLgoogle.com의 이메일 확인됨
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLgoogle.com의 이메일 확인됨
Doina PrecupDeepMind and McGill Universitycs.mcgill.ca의 이메일 확인됨
Will DabneyDeepMindgoogle.com의 이메일 확인됨
Matteo HesselResearch Engineer, Google DeepMindgoogle.com의 이메일 확인됨
Daniel J. MankowitzGoogle Deepmindgoogle.com의 이메일 확인됨
Ingemar J. CoxDepartment of Computer Science, University College London / University of Copenhagenucl.ac.uk의 이메일 확인됨
Elad Yom-TovBar Ilan Universityyom-tov.info의 이메일 확인됨
Augustin ZidekResearch Engineer, DeepMindgoogle.com의 이메일 확인됨
Nicolas HeessDeepMindgoogle.com의 이메일 확인됨
Anna HarutyunyanDeepMindgoogle.com의 이메일 확인됨
Thore GraepelGlobal Lead Computational Science, AI & ML at Altos Labs and Chair of Machine Learning, UCLucl.ac.uk의 이메일 확인됨
GHEORGHE COMANICIResearch Scientist, DeepMinddeepmind.com의 이메일 확인됨
Bilal PiotGoogle Deepmindgoogle.com의 이메일 확인됨
Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)univ-lille.fr의 이메일 확인됨
John Shawe-TaylorUCLcs.ucl.ac.uk의 이메일 확인됨
Mark RowlandResearch Scientist, Google DeepMindgoogle.com의 이메일 확인됨

팔로우

Diana Borsa

DeepMind

google.com의 이메일 확인됨

Reinforcement Learning Machine Learning Artificial Intelligence Exploration.


제목 서지정보순 정렬 연도순 정렬 제목순 정렬	인용 인용	연도
Transfer in deep reinforcement learning using successor features and generalised policy improvement A Barreto, D Borsa, J Quan, T Schaul, D Silver, M Hessel, D Mankowitz, ... International Conference on Machine Learning, 501-510, 2018	180	2018
Fast reinforcement learning with generalized policy updates A Barreto, S Hou, D Borsa, D Silver, D Precup Proceedings of the National Academy of Sciences 117 (48), 30079-30087, 2020	121	2020
Universal successor features approximators D Borsa, A Barreto, J Quan, D Mankowitz, R Munos, H Van Hasselt, ... arXiv preprint arXiv:1812.07626, 2018	118	2018
The option keyboard: Combining skills in reinforcement learning A Barreto, D Borsa, S Hou, G Comanici, E Aygün, P Hamel, D Toyama, ... Advances in Neural Information Processing Systems 32, 2019	91	2019
Detecting disease outbreaks in mass gatherings using Internet data E Yom-Tov, D Borsa, IJ Cox, RA McKendry Journal of medical Internet research 16 (6), e154, 2014	73	2014
Observational learning by reinforcement learning D Borsa, B Piot, R Munos, O Pietquin arXiv preprint arXiv:1706.06617, 2017	68	2017
Ray interference: a source of plateaus in deep reinforcement learning T Schaul, D Borsa, J Modayil, R Pascanu arXiv preprint arXiv:1904.11455, 2019	66	2019
The termination critic A Harutyunyan, W Dabney, D Borsa, N Heess, R Munos, D Precup arXiv preprint arXiv:1902.09996, 2019	53	2019
Learning shared representations in multi-task reinforcement learning D Borsa, T Graepel, J Shawe-Taylor arXiv preprint arXiv:1603.02041, 2016	44	2016
Expected eligibility traces H van Hasselt, S Madjiheurem, M Hessel, D Silver, A Barreto, D Borsa Proceedings of the AAAI Conference on Artificial Intelligence 35 (11), 9997 …, 2021	42	2021
Automatic identification of web-based risk markers for health events E Yom-Tov, D Borsa, AC Hayward, RA McKendry, IJ Cox Journal of medical Internet research 17 (1), e29, 2015	33	2015
Training deep neural nets to aggregate crowdsourced responses A Gaunt, D Borsa, Y Bachrach Proceedings of the Thirty-Second Conference on Uncertainty in Artificial …, 2016	32	2016
When should agents explore? M Pislar, D Szepesvari, G Ostrovski, D Borsa, T Schaul arXiv preprint arXiv:2108.11811, 2021	26	2021
Adapting behaviour for learning progress T Schaul, D Borsa, D Ding, D Szepesvari, G Ostrovski, W Dabney, ... arXiv preprint arXiv:1912.06910, 2019	15	2019
Temporal difference uncertainties as a signal for exploration S Flennerhag, JX Wang, P Sprechmann, F Visin, A Galashov, ... arXiv preprint arXiv:2010.02255, 2020	14	2020
Return-based scaling: Yet another normalisation trick for deep rl T Schaul, G Ostrovski, I Kemaev, D Borsa arXiv preprint arXiv:2105.05347, 2021	13	2021
Conditional importance sampling for off-policy learning M Rowland, A Harutyunyan, H Hasselt, D Borsa, T Schaul, R Munos, ... International Conference on Artificial Intelligence and Statistics, 45-55, 2020	12	2020
General non-linear bellman equations H van Hasselt, J Quan, M Hessel, Z Xu, D Borsa, A Barreto arXiv preprint arXiv:1907.03687, 2019	10	2019
Model-value inconsistency as a signal for epistemic uncertainty A Filos, E Vértes, Z Marinho, G Farquhar, D Borsa, A Friesen, ... arXiv preprint arXiv:2112.04153, 2021	9	2021
Generalised policy improvement with geometric policy composition S Thakoor, M Rowland, D Borsa, W Dabney, R Munos, A Barreto International Conference on Machine Learning, 21272-21307, 2022	6	2022

현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.

학술자료 1–20

연간 인용횟수

중복된 서지정보

병합된 서지정보

공동 저자 추가공동 저자

팔로우

인용

공동 저자