팔로우
Sebastien Bubeck
Sebastien Bubeck
Sr Principal Research Manager, ML Foundations group, Microsoft Research
microsoft.com의 이메일 확인됨 - 홈페이지
제목
인용
인용
연도
Regret analysis of stochastic and nonstochastic multi-armed bandit problems
S Bubeck, N Cesa-Bianchi
Foundations and Trends in Machine Learning 5, 1-122, 2012
29512012
Convex optimization: Algorithms and complexity
S Bubeck
Foundations and Trends in Machine Learning 8, 231-357, 2014
21692014
Sparks of artificial general intelligence: Early experiments with gpt-4
S Bubeck, V Chandrasekaran, R Eldan, J Gehrke, E Horvitz, E Kamar, ...
arXiv preprint arXiv:2303.12712, 2023
11082023
Best arm identification in multi-armed bandits
JY Audibert, S Bubeck, R Munos
COLT 2010, 2010
8532010
Is Q-learning provably efficient?
C Jin, Z Allen-Zhu, S Bubeck, MI Jordan
Advances in neural information processing systems 31, 2018
8242018
Pure exploration in multi-armed bandits problems
S Bubeck, R Munos, G Stoltz
Algorithmic Learning Theory, 23-37, 2009
5672009
X-armed bandits
S Bubeck, R Munos, G Stoltz, C Szepesvári
Journal of Machine Learning Research 12, 1587-1627, 2011
4852011
Minimax policies for adversarial and stochastic bandits
JY Audibert, S Bubeck
COLT 2009, 2009
4712009
Provably robust deep learning via adversarially trained smoothed classifiers
H Salman, J Li, I Razenshteyn, P Zhang, H Zhang, S Bubeck, G Yang
Advances in Neural Information Processing Systems 32, 2019
4672019
lil'UCB: An Optimal Exploration Algorithm for Multi-Armed Bandits
K Jamieson, M Malloy, R Nowak, S Bubeck
COLT 2014, 2013
4352013
Benefits, limits, and risks of GPT-4 as an AI chatbot for medicine
P Lee, S Bubeck, J Petro
New England Journal of Medicine 388 (13), 1233-1239, 2023
3982023
Optimal algorithms for smooth and strongly convex distributed optimization in networks
K Scaman, F Bach, S Bubeck, YT Lee, L Massoulié
international conference on machine learning, 3027-3036, 2017
3172017
Pure exploration in finitely-armed and continuous-armed bandits
S Bubeck, R Munos, G Stoltz
Theoretical Computer Science 412, 1832-1852, 2010
2992010
Bandits with heavy tail
S Bubeck, N Cesa-Bianchi, G Lugosi
IEEE Transactions on Information Theory 59 (11), 7711-7717, 2013
2802013
Regret bounds and minimax policies under partial monitoring
JY Audibert, S Bubeck
The Journal of Machine Learning Research 11, 2635-2686, 2010
2622010
Online Optimization in X-Armed Bandits
S Bubeck, G Stoltz, C Szepesvári, R Munos
Advances in Neural Information Processing Systems 21, 201-208, 2008
2582008
Regret in online combinatorial optimization
JY Audibert, S Bubeck, G Lugosi
Mathematics of Operations Research 39 (1), 31-45, 2014
2422014
The best of both worlds: Stochastic and adversarial bandits
S Bubeck, A Slivkins
COLT 2012, 2012
2322012
Multiple identifications in multi-armed bandits
S Bubeck, T Wang, N Viswanathan
ICML 2012, 2012
2262012
Adversarial examples from computational constraints
S Bubeck, YT Lee, E Price, I Razenshteyn
International Conference on Machine Learning, 831-840, 2019
2212019
현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.
학술자료 1–20