팔로우
Aurélien Garivier
제목
인용
인용
연도
The KL-UCB algorithm for bounded stochastic bandits and beyond
A Garivier, O Cappé
Conference On Learning Theory COLT n°24 (arXiv preprint arXiv:1102.2490 …, 2011
7032011
On the complexity of best arm identification in multi-armed bandit models
E Kaufmann, O Cappé, A Garivier
Journal of Machine Learning Research 17, 1-42, 2016
5542016
On upper-confidence bound policies for switching bandit problems
A Garivier, E Moulines
International Conference on Algorithmic Learning Theory, 174-188, 2011
5172011
Parametric bandits: The generalized linear case
S Filippi, O Cappe, A Garivier, C Szepesvári
Advances in neural information processing systems 23, 2010
5012010
On Bayesian upper confidence bounds for bandit problems
E Kaufmann, O Cappé, A Garivier
Artificial intelligence and statistics, 592-600, 2012
4312012
Kullback-Leibler upper confidence bounds for optimal sequential allocation
O Cappé, A Garivier, OA Maillard, R Munos, G Stoltz
Annals of Statistics (and ArXiv:1210.1136) 41 (3), 1516-1541, 2012
4102012
Optimal best arm identification with fixed confidence
A Garivier, E Kaufmann
Proceedings of the 29th Conference On Learning Theory 49, 1-30, 2016
3322016
On upper-confidence bound policies for non-stationary bandit problems
A Garivier, E Moulines
arXiv preprint arXiv:0805.3415, 2008
2442008
Sequential Monte Carlo smoothing for general state space hidden Markov models
R Douc, A Garivier, E Moulines, J Olsson
1902011
Explore first, exploit next: The true shape of regret in bandit problems
A Garivier, P Ménard, G Stoltz
Mathematics of Operations Research 44 (2), 377-399, 2019
1882019
Optimism in reinforcement learning and Kullback-Leibler divergence
S Filippi, O Cappé, A Garivier
2010 48th Annual Allerton Conference on Communication, Control, and …, 2010
1282010
On Explore-Then-Commit Strategies
A Garivier, E Kaufmann, T Lattimore
NIPS 2016, arXiv preprint arXiv:1605.08988, 2016
1082016
On the Complexity of A/B Testing
E Kaufmann, O Cappé, A Garivier
JMLR Workshop and Conference Proceedings 35, 339-355, 2014
702014
An event spacing experiment
AJ Winstanley, A Garivier, MR Greenstreet
Proceedings Eighth International Symposium on Asynchronous Circuits and …, 2002
612002
Coding on countably infinite alphabets
S Boucheron, A Garivier, E Gassiat
IEEE Transactions on Information Theory 55 (1), 358-373, 2008
572008
A minimax and asymptotically optimal algorithm for stochastic bandits
P Ménard, A Garivier
International Conference on Algorithmic Learning Theory, 223-237, 2017
562017
Context tree selection: A unifying view
A Garivier, F Leonardi
Stochastic Processes and their Applications 121 (11), 2488-2506, 2011
462011
Consistency of the unlimited BIC context tree estimator
A Garivier
IEEE Transactions on Information theory 52 (10), 4630-4635, 2006
442006
KL-UCB-switch: optimal regret bounds for stochastic bandits from both a distribution-dependent and a distribution-free viewpoints
A Garivier, H Hadiji, P Menard, G Stoltz
The Journal of Machine Learning Research 23 (1), 8049-8114, 2022
432022
A MDL approach to hmm with Poisson and Gaussian emissions. Application to order identification.
A Chambaz, A Garivier, E Gassiat
Journal of Statistical Planning and Inference 139 (3), 962-977, 2009
42*2009
현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.
학술자료 1–20