팔로우
Pablo Samuel Castro
제목
인용
인용
연도
Rigging the lottery: Making all tickets winners
U Evci, T Gale, J Menick, PS Castro, E Elsen
International Conference on Machine Learning, 2943-2952, 2020
3952020
Deep reinforcement learning at the edge of the statistical precipice
R Agarwal, M Schwarzer, PS Castro, AC Courville, M Bellemare
Advances in neural information processing systems 34, 29304-29320, 2021
3552021
From taxi GPS traces to social and community dynamics: A survey
PS Castro, D Zhang, C Chen, S Li, G Pan
ACM Computing Surveys (CSUR) 46 (2), 1-34, 2013
3332013
Urban traffic modelling and prediction using large scale taxi GPS traces
PS Castro, D Zhang, S Li
International Conference on Pervasive Computing, 57-72, 2012
3332012
Autonomous navigation of stratospheric balloons using reinforcement learning
MG Bellemare, S Candido, PS Castro, J Gong, MC Machado, S Moitra, ...
Nature 588 (7836), 77-82, 2020
2822020
Dopamine: A research framework for deep reinforcement learning
PS Castro, S Moitra, C Gelada, S Kumar, MG Bellemare
arXiv preprint arXiv:1812.06110, 2018
2562018
iBOAT: Isolation-based online anomalous trajectory detection
C Chen, D Zhang, PS Castro, N Li, L Sun, S Li, Z Wang
IEEE Transactions on Intelligent Transportation Systems 14 (2), 806-818, 2013
2012013
Contrastive behavioral similarity embeddings for generalization in reinforcement learning
R Agarwal, MC Machado, PS Castro, MG Bellemare
arXiv preprint arXiv:2101.05265, 2021
1532021
TF-Agents: A library for reinforcement learning in tensorflow
S Guadarrama, A Korattikara, O Ramirez, P Castro, E Holly, S Fishman, ...
see https://github. com/tensorflow/agents, 2018
1532018
Real-time detection of anomalous taxi trajectories from GPS traces
C Chen, D Zhang, P Samuel Castro, N Li, L Sun, S Li
International Conference on Mobile and Ubiquitous Systems: Computing …, 2011
1342011
Scalable methods for computing state similarity in deterministic markov decision processes
PS Castro
Proceedings of the AAAI Conference on Artificial Intelligence 34 (06), 10069 …, 2020
1092020
Revisiting rainbow: Promoting more insightful and inclusive deep reinforcement learning research
JSO Ceron, PS Castro
International Conference on Machine Learning, 1373-1383, 2021
94*2021
A geometric perspective on optimal representations for reinforcement learning
M Bellemare, W Dabney, R Dadashi, A Ali Taiga, PS Castro, N Le Roux, ...
Advances in neural information processing systems 32, 2019
922019
Methods for computing state similarity in Markov decision processes
N Ferns, PS Castro, D Precup, P Panangaden
arXiv preprint arXiv:1206.6836, 2012
892012
A comparative analysis of expected and distributional reinforcement learning
C Lyle, MG Bellemare, PS Castro
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 4504-4511, 2019
822019
Using bisimulation for policy transfer in MDPs
P Castro, D Precup
Proceedings of the AAAI conference on artificial intelligence 24 (1), 1065-1070, 2010
602010
An atari model zoo for analyzing, visualizing, and comparing deep reinforcement learning agents
FP Such, V Madhavan, R Liu, R Wang, PS Castro, Y Li, J Zhi, L Schubert, ...
arXiv preprint arXiv:1812.07069, 2018
552018
Real time anomalous trajectory detection and analysis
L Sun, D Zhang, C Chen, PS Castro, S Li, Z Wang
Mobile Networks and Applications 18, 341-356, 2013
472013
Equivalence Relations in Fully and Partially Observable Markov Decision Processes.
PS Castro, P Panangaden, D Precup
IJCAI 9, 1653-1658, 2009
39*2009
Automatic construction of temporally extended actions for mdps using bisimulation metrics
PS Castro, D Precup
European workshop on reinforcement learning, 140-152, 2011
342011
현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.
학술자료 1–20