Bernardo Ávila Pires
Research Scientist, DeepMind
Verified email at google.com
Title
Cited by
Year
Bootstrap your own latent: A new approach to self-supervised learning
JB Grill, F Strub, F Altché, C Tallec, PH Richemond, E Buchatskaya, ...
arXiv preprint arXiv:2006.07733, 2020
Cited by 218, 2020
Cost-sensitive multiclass classification risk bounds
BA Pires, C Szepesvari, M Ghavamzadeh
International Conference on Machine Learning, 1391-1399, 2013
Cited by 44, 2013
Neural predictive belief representations
ZD Guo, MG Azar, B Piot, BA Pires, R Munos
arXiv preprint arXiv:1811.06407, 2018
Cited by 28, 2018
Statistical linear estimation with penalized estimators: an application to reinforcement learning
BA Pires, C Szepesvári
arXiv preprint arXiv:1206.6444, 2012
Cited by 25, 2012
Policy error bounds for model-based reinforcement learning with factored linear models
BÁ Pires, C Szepesvári
Conference on Learning Theory, 121-151, 2016
Cited by 18, 2016
Bootstrap your own latent - a new approach to self-supervised learning
JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ..., K Kavukcuoglu, R Munos, M Valko
Advances in Neural Information Processing Systems 33, 21271-21284, 2020
Cited by 17, 2020
Bootstrap latent-predictive representations for multitask reinforcement learning
ZD Guo, BA Pires, B Piot, JB Grill, F Altché, R Munos, MG Azar
International Conference on Machine Learning, 3875-3886, 2020
Cited by 16, 2020
World discovery models
MG Azar, B Piot, BA Pires, JB Grill, F Altché, R Munos
arXiv preprint arXiv:1902.07685, 2019
Cited by 13, 2019
Pseudo-MDPs and factored linear action models
H Yao, C Szepesvári, BA Pires, X Zhang
2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement …, 2014
Cited by 13, 2014
Multiclass classification calibration functions
BÁ Pires, C Szepesvári
arXiv preprint arXiv:1609.06385, 2016
Cited by 12, 2016
Neural belief states for partially observed domains
P Moreno, J Humplik, G Papamakarios, BA Pires, L Buesing, N Heess, ...
NeurIPS 2018 workshop on Reinforcement Learning under Partial Observability, 2018
Cited by 11, 2018
Statistical analysis of l1-penalized linear estimation with applications
B Ávila Pires
Cited by 6, 2012
Clause Identification Using Entropy Guided Transformation Learning
ER Fernandes, B Pires, CN dos Santos, RL Milidiú
Information and Human Language Technology (STIL), 2009 Seventh Brazilian …, 2009
Cited by 5, 2009
Geometric Entropic Exploration
ZD Guo, MG Azar, A Saade, S Thakoor, B Piot, BA Pires, M Valko, ...
arXiv preprint arXiv:2101.02055, 2021
Cited by 2, 2021
Pathological effects of variance on classification-based policy iteration
BÁ Pires, C Szepesvári
AAAI Workshop: Learning for General Competency in Video Games, 2015
Cited by 1, 2015
Neural Recursive Belief States in Multi-Agent Reinforcement Learning
P Moreno, E Hughes, KR McKee, BA Pires, T Weber
arXiv preprint arXiv:2102.02274, 2021
2021
Toward Practical Reinforcement Learning Algorithms: Classification Based Policy Iteration and Model-Based Learning
B Ávila Pires
2017
Using random projections to estimate condition numbers and solve linear systems (CMPUT 501 project)
BA Pires
2012
Classification calibration
BÁ Pires
2012