Philip Thomas

Citations

	All	Since 2019
Citations	4541	3504
h-index	32	28
i10-index	55	48

Citations per year

Year	Citations
2011	16
2012	27
2013	28
2014	41
2015	68
2016	137
2017	177
2018	258
2019	413
2020	568
2021	681
2022	724
2023	810
2024	305

Public access (based on funding mandates)

Available: 27 articles
Not available: 0 articles

Co-authors

Emma Brunskill, Associate Professor of Computer Science, Stanford University (verified email at cs.stanford.edu)
Georgios Theocharous, Adobe Research (verified email at adobe.com)
Bruno Castro da Silva, University of Massachusetts (verified email at cs.umass.edu)
Scott M. Jordan, Postdoctoral Fellow, University of Alberta (verified email at ualberta.ca)
George Konidaris, Brown (verified email at cs.brown.edu)
Scott Niekum, Associate Professor, University of Massachusetts Amherst (verified email at cs.umass.edu)
Stephen Giguere, University of Massachusetts (verified email at cs.umass.edu)
Antonie J. (Ton) van den Bogert, Professor of Mechanical Engineering, Cleveland State University (verified email at csuohio.edu)
Yuriy Brun, Manning College of Information and Computer Sciences, University of Massachusetts Amherst (verified email at cs.umass.edu)
Chris Nota, University of Massachusetts, Amherst (verified email at cs.umass.edu)
Michael Branicky, Professor of Electrical Engineering & Computer Science, University of Kansas (verified email at ku.edu)
Sarah Osentoski, Vinci4d (verified email at vinci4d.ai)
Erik Learned-Miller, Professor of Computer Science, University of Massachusetts Amherst (verified email at cs.umass.edu)
Sridhar Mahadevan, Director, Data Science Lab, Adobe Research & Professor, University of Massachusetts, Amherst (verified email at cs.umass.edu)
Blossom Metevier, University of Massachusetts Amherst (verified email at umass.edu)
Will Dabney, DeepMind (verified email at google.com)
Francisco M. Garcia, University of Massachusetts - Amherst (verified email at cs.umass.edu)
Robert Kirsch, Professor and Chair of Biomedical Engineering, Case Western Reserve University (verified email at case.edu)
Arthur Guez, Google DeepMind (verified email at google.com)
Rémi Munos, DeepMind (verified email at inria.fr)

Philip Thomas

University of Massachusetts Amherst

Verified email at cs.umass.edu

Artificial Intelligence, Reinforcement Learning, AI Safety


Title	Cited by	Year
Data-efficient off-policy policy evaluation for reinforcement learning. P Thomas, E Brunskill. International Conference on Machine Learning, 2139-2148, 2016	738	2016
Value function approximation in reinforcement learning using the Fourier basis. G Konidaris, S Osentoski, P Thomas. Proceedings of the AAAI Conference on Artificial Intelligence 25 (1), 380-385, 2011	566	2011
High-confidence off-policy evaluation. P Thomas, G Theocharous, M Ghavamzadeh. Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015	305	2015
High confidence policy improvement. P Thomas, G Theocharous, M Ghavamzadeh. International Conference on Machine Learning, 2380-2388, 2015	216	2015
Ad recommendation systems for life-time value optimization. G Theocharous, PS Thomas, M Ghavamzadeh. Proceedings of the 24th International Conference on World Wide Web, 1305-1310, 2015	192	2015
Preventing undesirable behavior of intelligent machines. P Thomas, B Castro da Silva, A Barto, S Giguere, Y Brun, E Brunskill. Science 366 (6468), 999-1004, 2019	189	2019
Learning action representations for reinforcement learning. Y Chandak, G Theocharous, J Kostas, S Jordan, P Thomas. International Conference on Machine Learning, 941-950, 2019	181	2019
Increasing the action gap: New operators for reinforcement learning. MG Bellemare, G Ostrovski, A Guez, P Thomas, R Munos. Proceedings of the AAAI Conference on Artificial Intelligence 30 (1), 2016	168	2016
Bias in natural actor-critic algorithms. P Thomas. International Conference on Machine Learning, 441-448, 2014	158	2014
Safe reinforcement learning. PS Thomas	115	2015
Is the policy gradient a gradient? C Nota, PS Thomas. arXiv preprint arXiv:1906.07073, 2019	68	2019
Training an actor-critic reinforcement learning controller for arm movement using human-generated rewards. KM Jagodnik, PS Thomas, AJ van den Bogert, MS Branicky, RF Kirsch. IEEE Transactions on Neural Systems and Rehabilitation Engineering 25 (10 …, 2017	67	2017
Proximal reinforcement learning: A new theory of sequential decision making in primal-dual spaces. S Mahadevan, B Liu, P Thomas, W Dabney, S Giguere, N Jacek, I Gemp, ... arXiv preprint arXiv:1405.6757, 2014	66	2014
Optimizing for the future in non-stationary MDPs. Y Chandak, G Theocharous, S Shankar, M White, S Mahadevan, ... International Conference on Machine Learning, 1414-1425, 2020	64	2020
Predictive off-policy policy evaluation for nonstationary decision problems, with applications to digital marketing. P Thomas, G Theocharous, M Ghavamzadeh, I Durugkar, E Brunskill. Proceedings of the AAAI Conference on Artificial Intelligence 31 (2), 4740-4745, 2017	63	2017
Policy gradient methods for reinforcement learning with function approximation and action-dependent baselines. PS Thomas, E Brunskill. arXiv preprint arXiv:1706.06643, 2017	62	2017
Evaluating the performance of reinforcement learning algorithms. S Jordan, Y Chandak, D Cohen, M Zhang, P Thomas. International Conference on Machine Learning, 4962-4973, 2020	61	2020
Risk Quantification for Policy Deployment. PS Thomas, G Theocharous, M Ghavamzadeh. US Patent App. 14/552,047, 2016	54	2016
Importance Sampling for Fair Policy Selection. S Doroudi, PS Thomas, E Brunskill. Grantee Submission, 2017	53	2017
Some recent applications of reinforcement learning. AG Barto, PS Thomas, RS Sutton. Proceedings of the Eighteenth Yale Workshop on Adaptive and Learning Systems, 2017	51	2017
