Kamil Ciosek

인용

	전체	2019년 이후
서지정보	967	887
h-index	15	15
i10-index	19	18

260

130

195

201420152016201720182019202020212022202320246 3 8 10 47 66 84 205 197 252 81

공개 액세스

모두 보기

자료 9개

자료 0개

공개

비공개

재정 지원 요구사항 기준

팔로우

Kamil Ciosek

Spotify

spotify.com의 이메일 확인됨 - 홈페이지

Reinforcement Learning Machine Learning


제목 서지정보순 정렬 연도순 정렬 제목순 정렬	인용 인용	연도
Generalization in reinforcement learning with selective noise injection and information bottleneck M Igl, K Ciosek, Y Li, S Tschiatschek, C Zhang, S Devlin, K Hofmann Advances in neural information processing systems 32, 2019	172	2019
Better exploration with optimistic actor critic K Ciosek, Q Vuong, R Loftin, K Hofmann Advances in Neural Information Processing Systems 32, 2019	151	2019
Expected policy gradients K Ciosek, S Whiteson Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	84	2018
Compositional planning using optimal option models D Silver, K Ciosek arXiv preprint arXiv:1206.6473, 2012	66	2012
Offer: Off-environment reinforcement learning K Ciosek, S Whiteson Proceedings of the aaai conference on artificial intelligence 31 (1), 2017	60	2017
Discount factor as a regularizer in reinforcement learning R Amit, R Meir, K Ciosek International conference on machine learning, 269-278, 2020	59	2020
Conservative uncertainty estimation by fitting prior networks K Ciosek, V Fortuin, R Tomioka, K Hofmann, R Turner International Conference on Learning Representations, 2019	58	2019
Multi-task batch reinforcement learning with metric learning J Li, Q Vuong, S Liu, M Liu, K Ciosek, H Christensen, H Su Advances in Neural Information Processing Systems 33, 6197-6210, 2020	49	2020
Expected policy gradients for reinforcement learning K Ciosek, S Whiteson Journal of Machine Learning Research 21 (52), 1-51, 2020	44	2020
Deep interactive bayesian reinforcement learning via meta-learning L Zintgraf, S Devlin, K Ciosek, S Whiteson, K Hofmann arXiv preprint arXiv:2101.03864, 2021	37	2021
Evaluating the robustness of collaborative agents P Knott, M Carroll, S Devlin, K Ciosek, K Hofmann, AD Dragan, R Shah arXiv preprint arXiv:2101.05507, 2021	25	2021
Alternating optimisation and quadrature for robust control S Paul, K Chatzilygeroudis, K Ciosek, JB Mouret, M Osborne, S Whiteson Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	23	2018
Imitation learning by reinforcement learning K Ciosek arXiv preprint arXiv:2108.04763, 2021	21	2021
Amrl: Aggregated memory for reinforcement learning J Beck, K Ciosek, S Devlin, S Tschiatschek, C Zhang, K Hofmann International Conference on Learning Representations, 2019	20	2019
Regularized policies are reward robust H Husain, K Ciosek, R Tomioka International Conference on Artificial Intelligence and Statistics, 64-72, 2021	17	2021
Information directed reward learning for reinforcement learning D Lindner, M Turchetta, S Tschiatschek, K Ciosek, A Krause Advances in Neural Information Processing Systems 34, 3850-3862, 2021	14	2021
Fourier policy gradients M Fellows, K Ciosek, S Whiteson International Conference on Machine Learning, 1486-1495, 2018	14	2018
Drift: Deep reinforcement learning for functional software testing L Harries, RS Clarke, T Chapman, SV Nallamalli, L Ozgur, S Jain, ... arXiv preprint arXiv:2007.08220, 2020	13	2020
Value iteration with options and state aggregation K Ciosek, D Silver arXiv preprint arXiv:1501.03959, 2015	12	2015
Alternating optimisation and quadrature for robust reinforcement learning S Paul, K Ciosek, MA Osborne, S Whiteson arXiv preprint arXiv:1605.07496, 2016	8	2016

현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.

학술자료 1–20

연간 인용횟수

중복된 서지정보

병합된 서지정보

공동 저자 추가공동 저자

팔로우

인용