Shimon Whiteson
Professor of Computer Science, University of Oxford / Head of Research, Waymo UK
Verified email at cs.ox.ac.uk
Title
Cited by
Year
Counterfactual multi-agent policy gradients
J Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
Cited by: 1230 · Year: 2018
Learning to communicate with deep multi-agent reinforcement learning
J Foerster, IA Assael, N De Freitas, S Whiteson
Advances in neural information processing systems 29, 2016
Cited by: 1190 · Year: 2016
QMIX: Monotonic value function factorisation for deep multi-agent reinforcement learning
T Rashid, M Samvelyan, C Schroeder, G Farquhar, J Foerster, S Whiteson
International conference on machine learning, 4295-4304, 2018
Cited by: 987 · Year: 2018
Stabilising experience replay for deep multi-agent reinforcement learning
J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ...
International conference on machine learning, 1146-1155, 2017
Cited by: 528 · Year: 2017
A survey of multi-objective sequential decision-making
DM Roijers, P Vamplew, S Whiteson, R Dazeley
Journal of Artificial Intelligence Research 48, 67-113, 2014
Cited by: 518 · Year: 2014
Learning with opponent-learning awareness
JN Foerster, RY Chen, M Al-Shedivat, S Whiteson, P Abbeel, I Mordatch
arXiv preprint arXiv:1709.04326, 2017
Cited by: 393 · Year: 2017
The StarCraft multi-agent challenge
M Samvelyan, T Rashid, CS De Witt, G Farquhar, N Nardelli, TGJ Rudner, ...
arXiv preprint arXiv:1902.04043, 2019
Cited by: 376 · Year: 2019
Evolutionary Function Approximation for Reinforcement Learning
S Whiteson, P Stone
Journal of Machine Learning Research 7, 877-917, 2006
Cited by: 345 · Year: 2006
LipNet: End-to-end sentence-level lipreading
YM Assael, B Shillingford, S Whiteson, N De Freitas
arXiv preprint arXiv:1611.01599, 2016
Cited by: 283 · Year: 2016
Multiagent reinforcement learning for urban traffic control using coordination graphs
L Kuyer, S Whiteson, B Bakker, N Vlassis
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2008
Cited by: 244 · Year: 2008
Fast context adaptation via meta-learning
L Zintgraf, K Shiarli, V Kurin, K Hofmann, S Whiteson
International Conference on Machine Learning, 7693-7702, 2019
Cited by: 242 · Year: 2019
A theoretical and empirical analysis of Expected Sarsa
H Van Seijen, H Van Hasselt, S Whiteson, M Wiering
2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement …, 2009
Cited by: 204 · Year: 2009
Deep variational reinforcement learning for POMDPs
M Igl, L Zintgraf, TA Le, F Wood, S Whiteson
International Conference on Machine Learning, 2117-2126, 2018
Cited by: 197 · Year: 2018
MAVEN: Multi-agent variational exploration
A Mahajan, T Rashid, M Samvelyan, S Whiteson
Advances in Neural Information Processing Systems 32, 2019
Cited by: 182 · Year: 2019
LipNet: Sentence-level lipreading
YM Assael, B Shillingford, S Whiteson, N De Freitas
arXiv preprint arXiv:1611.01599 2 (8), 2016
Cited by: 163 · Year: 2016
A survey of reinforcement learning informed by natural language
J Luketina, N Nardelli, G Farquhar, J Foerster, J Andreas, E Grefenstette, ...
arXiv preprint arXiv:1906.03926, 2019
Cited by: 150 · Year: 2019
Exploiting locality of interaction in factored Dec-POMDPs
FA Oliehoek, MTJ Spaan, N Vlassis, S Whiteson
Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems, 517-524, 2008
Cited by: 145 · Year: 2008
Balancing exploration and exploitation in listwise and pairwise online learning to rank for information retrieval
K Hofmann, S Whiteson, M de Rijke
Information Retrieval 16 (1), 63-90, 2013
Cited by: 143 · Year: 2013
Learning to communicate to solve riddles with deep distributed recurrent Q-networks
JN Foerster, YM Assael, N de Freitas, S Whiteson
arXiv preprint arXiv:1602.02672, 2016
Cited by: 140 · Year: 2016
Transfer via inter-task mappings in policy search reinforcement learning
ME Taylor, S Whiteson, P Stone
Proceedings of the 6th international joint conference on Autonomous agents …, 2007
Cited by: 135 · Year: 2007