W Bradley Knox

Cited by

	All	Since 2019
Citations	3733	2379
h-index	23	18
i10-index	33	25

Citations per year

Year	Citations
2009	23
2010	27
2011	54
2012	69
2013	119
2014	147
2015	139
2016	214
2017	215
2018	294
2019	298
2020	404
2021	466
2022	462
2023	562
2024	182

Public access

4 articles available
0 articles not available

Based on funding mandates

Co-authors

Peter Stone - Professor of Computer Science, The University of Texas at Austin - Verified email at cs.utexas.edu
Cynthia Breazeal - Professor Media Arts and Sciences, MIT Media Lab - Verified email at media.mit.edu
Maya Cakmak - University of Washington - Verified email at cs.washington.edu
Bradley C. Love - Professor of Cognitive and Decision Sciences, University College London - Verified email at ucl.ac.uk
Todd Kulesza - User Experience Researcher, Google - Verified email at google.com
Saleema Amershi - Microsoft Research - Verified email at microsoft.com
Alessandro Allievi - Imperial College London - Verified email at imperial.ac.uk
Scott Niekum - Associate Professor, University of Massachusetts Amherst - Verified email at cs.umass.edu
Hayley Hung - Associate Professor, Delft University of Technology - Verified email at tudelft.nl
Shimon Whiteson - Professor of Computer Science, University of Oxford / Senior Staff Research Scientist, Waymo - Verified email at cs.ox.ac.uk
Guangliang Li - Associate Professor, College of Electrical Engineering, Ocean University of China, Qingdao, China - Verified email at ouc.edu.cn
Ross Otto - Department of Psychology, McGill University - Verified email at mcgill.ca
Jin Joo Lee, PhD - Amazon Lab126 - Verified email at amazon.com
W. Todd Maddox - Wayne Holtzman Chair and Professor of Psychology, University of Texas - Verified email at utexas.edu
Serena Booth - MIT - Verified email at mit.edu
Felix Schmitt - Bosch Center for Artificial Intelligence - Verified email at de.bosch.com
Jolie Baumann Wormwood - University of New Hampshire - Verified email at unh.edu
David DeSteno - Northeastern University - Verified email at northeastern.edu
Brian Glass - Postdoctoral Researcher of Psychology and Computer Science, University College London, University of - Verified email at qmul.ac.uk
Samuel Spaulding - Media Lab, Massachusetts Institute of Technology - Verified email at media.mit.edu

W Bradley Knox

Research Scientist at UT Austin

Verified email at cs.utexas.edu

Reward functions, Alignment, RLHF, Reinforcement Learning, Human-Robot Interaction


Title	Cited by	Year
Power to the people: The role of humans in interactive machine learning. S Amershi, M Cakmak, WB Knox, T Kulesza. AI Magazine 35 (4), 105-120, 2014	1126	2014
Interactively shaping agents via human reinforcement: The TAMER framework. WB Knox, P Stone. Proceedings of the 5th International Conference on Knowledge Capture (K-CAP …, 2009	580	2009
Combining manual feedback with subsequent MDP reward signals for reinforcement learning. WB Knox, P Stone. Proceedings of the 9th International Conference on Autonomous Agents and …, 2010	266	2010
Reinforcement learning from simultaneous human and MDP reward. WB Knox, P Stone. Proceedings of the 11th International Conference on Autonomous Agents and …, 2012	253*	2012
TAMER: Training an agent manually via evaluative reinforcement. WB Knox, P Stone. 2008 7th IEEE International Conference on Development and Learning, 292-297, 2008	200	2008
Training a robot via human feedback: A case study. WB Knox, P Stone, C Breazeal. International Conference on Social Robotics (ICSR), 460-470, 2013	170	2013
Computationally modeling interpersonal trust. JJ Lee, B Knox, J Baumann, C Breazeal, D DeSteno. Frontiers in Psychology 4, 56004, 2013	123	2013
The nature of belief-directed exploratory choice in human decision-making. WB Knox, AR Otto, P Stone, B Love. Frontiers in Psychology 2, 2012	96	2012
How humans teach agents: A new experimental perspective. WB Knox, BD Glass, BC Love, WT Maddox, P Stone. International Journal of Social Robotics 4 (4), 409-421, 2012	95	2012
Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance. WB Knox, P Stone. Artificial Intelligence 225, 24-50, 2015	76	2015
Reinforcement Learning from Human Reward: Discounting in Episodic Tasks. WB Knox, P Stone. 21st IEEE International Symposium on Robot and Human Interactive …, 2012	70	2012
Reward (Mis)design for Autonomous Driving. WB Knox, A Allievi, H Banzhaf, F Schmitt, P Stone. arXiv preprint arXiv:2104.13906, 2021	66	2021
The EMPATHIC Framework for Task Learning from Implicit Human Feedback. Y Cui, Q Zhang, A Allievi, P Stone, S Niekum, WB Knox. Conference on Robot Learning (CoRL), 2020	56	2020
Learning from Human-Generated Reward. WB Knox. University of Texas at Austin, 2012	56	2012
Know thine enemy: A champion RoboCup coach agent. G Kuhlmann, WB Knox, P Stone. Proceedings of the National Conference on Artificial Intelligence 21 (2), 1463, 2006	48	2006
Using informative behavior to increase engagement in the TAMER framework. G Li, H Hung, S Whiteson, WB Knox. Proceedings of the 2013 International Conference on Autonomous Agents and …, 2013	42	2013
Learning non-myopically from human-generated reward. WB Knox, P Stone. Proceedings of the 2013 International Conference on Intelligent User …, 2013	42	2013
Design Principles for Creating Human-Shapable Agents. WB Knox, IR Fasel, P Stone. AAAI Spring Symposium: Agents that Learn from Human Teachers, 79-86, 2009	34	2009
Physiological and behavioral signatures of reflective exploratory choice. AR Otto, WB Knox, AB Markman, BC Love. Cognitive, Affective, & Behavioral Neuroscience 14, 1167-1183, 2014	27	2014
The perils of trial-and-error reward design: misdesign through overfitting and invalid task specifications. S Booth, WB Knox, J Shah, S Niekum, P Stone, A Allievi. Proceedings of the AAAI Conference on Artificial Intelligence 37 (5), 5920-5929, 2023	25	2023
