Adam White

인용

	전체	2019년 이후
서지정보	1993	1289
h-index	22	19
i10-index	35	32

320

160

240

2007200820092010201120122013201420152016201720182019202020212022202320247 5 8 16 39 58 69 77 70 77 92 157 164 199 229 286 312 96

공개 액세스

모두 보기

자료 10개

자료 0개

공개

비공개

재정 지원 요구사항 기준

공동 저자

Martha WhiteUniversity of Albertaualberta.ca의 이메일 확인됨
Joseph ModayilOpenmind Research Institute & Keen AGIopenmindresearch.org의 이메일 확인됨
Patrick M. PilarskiUniversity of Alberta, Amii (Alberta Machine Intelligence Institute)ualberta.ca의 이메일 확인됨
Thomas DegrisDeepMindgoogle.com의 이메일 확인됨
Marlos C. MachadoUniversity of Albertaualberta.ca의 이메일 확인됨
Doina PrecupDeepMind and McGill Universitycs.mcgill.ca의 이메일 확인됨
Nathan SturtevantUniversity of Alberta, Alberta Machine Intelligence Institute (Amii)ualberta.ca의 이메일 확인됨
Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, Waymocs.ox.ac.uk의 이메일 확인됨
Craig SherstanResearch Scientist, Sony AIsony.com의 이메일 확인됨

팔로우

Adam White

University of Alberta, Amii (Alberta Machine Intelligence Institute)

ualberta.ca의 이메일 확인됨 - 홈페이지

Artificial Intelligence Reinforcement Learning


제목 서지정보순 정렬 연도순 정렬 제목순 정렬	인용 인용	연도
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction RS Sutton, J Modayil, M Delp, T Degris, PM Pilarski, A White, D Precup The 10th International Conference on Autonomous Agents and Multiagent …, 2011	577	2011
RL-Glue: Language-independent software for reinforcement-learning experiments B Tanner, A White The Journal of Machine Learning Research 10, 2133-2136, 2009	169	2009
Multi-timescale nexting in a reinforcement learning robot J Modayil, A White, RS Sutton Adaptive Behavior 22 (2), 146-160, 2014	141	2014
Feature construction for reinforcement learning in hearts NR Sturtevant, AM White Computers and Games: 5th International Conference, CG 2006, Turin, Italy …, 2007	80	2007
Developing a predictive approach to knowledge A White University of Alberta, 2015	78	2015
Report on the 2008 reinforcement learning competition S Whiteson, B Tanner, A White AI Magazine 31 (2), 81-81, 2010	57	2010
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains Y Pan, M Zaheer, A White, A Patterson, M White arXiv preprint arXiv:1806.04624, 2018	53	2018
Adapting behavior via intrinsic reward: A survey and empirical study C Linke, NM Ady, M White, T Degris, A White Journal of artificial intelligence research 69, 1287-1332, 2020	45	2020
Gradient temporal-difference learning with regularized corrections S Ghiassian, A Patterson, S Garg, D Gupta, A White, M White International Conference on Machine Learning, 3524-3534, 2020	43	2020
A greedy approach to adapting the trace parameter for temporal difference learning M White, A White arXiv preprint arXiv:1607.00446, 2016	43	2016
General value function networks M Schlegel, A Jacobsen, Z Abbas, A Patterson, A White, M White Journal of Artificial Intelligence Research 70, 497-543, 2021	40	2021
Investigating practical linear temporal difference learning A White, M White arXiv preprint arXiv:1602.08771, 2016	40	2016
Interval Estimation for Reinforcement-Learning Algorithms in Continuous-State Domains M White, A White Advances in Neural Information Processing Systems, 2010	38	2010
Surprise and curiosity for big data robotics A White, J Modayil, RS Sutton Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014	36	2014
Improving performance in reinforcement learning by breaking generalization in neural networks S Ghiassian, B Rafiee, YL Lo, A White arXiv preprint arXiv:2003.07417, 2020	33	2020
Accelerated gradient temporal difference learning Y Pan, A White, M White Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017	31	2017
Scaling life-long off-policy learning RSS Adam White, Joseph Modayil 2012 IEEE International Conference on Development and Learning and …, 2013	31*	2013
Online off-policy prediction S Ghiassian, A Patterson, M White, RS Sutton, A White arXiv preprint arXiv:1811.02597, 2018	30	2018
Reinforcement learning benchmarks and bake-offs II A Dutech, T Edmunds, J Kok, M Lagoudakis, M Littman, M Riedmiller, ... Advances in Neural Information Processing Systems (NIPS) 17, 6, 2005	30	2005
Loss of plasticity in continual deep reinforcement learning Z Abbas, R Zhao, J Modayil, A White, MC Machado Conference on Lifelong Learning Agents, 620-636, 2023	28	2023

현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.

학술자료 1–20

연간 인용횟수

중복된 서지정보

병합된 서지정보

공동 저자 추가공동 저자

팔로우

인용

공동 저자