Botao Hao

인용

	전체	2019년 이후
서지정보	656	647
h-index	15	15
i10-index	21	19

240

120

180

20182019202020212022202320244 16 42 111 161 225 87

공개 액세스

모두 보기

자료 11개

자료 0개

공개

비공개

재정 지원 요구사항 기준

공동 저자

Csaba SzepesvariDeepMind & University of Albertacs.ualberta.ca의 이메일 확인됨
Tor LattimoreDeepMindgoogle.com의 이메일 확인됨
Zheng WenGoogle DeepMindgoogle.com의 이메일 확인됨
Yasin Abbasi YadkoriDeepMindgoogle.com의 이메일 확인됨
Mengdi WangCenter for Statistics & Machine Learning, ECE, Princeton Universityprinceton.edu의 이메일 확인됨
Will Wei SunAssociate Professor, Daniels School of Business, Purdue Universitypurdue.edu의 이메일 확인됨
Nevena LazicDeepMindgoogle.com의 이메일 확인됨
Benjamin Van RoyStanford Universitystanford.edu의 이메일 확인됨
Jingfei ZhangEmory Univeristyemory.edu의 이메일 확인됨
Anru ZhangDuke Universityduke.edu의 이메일 확인됨
尚作峰 (Zuofeng Shang)New Jersey Institute of Technologynjit.edu의 이메일 확인됨
Yufeng LiuUniversity of North Carolina at Chapel Hillemail.unc.edu의 이메일 확인됨

팔로우

Botao Hao

Deepmind

google.com의 이메일 확인됨 - 홈페이지

reinforcement learning multi-armed bandits RLHF


제목 서지정보순 정렬 연도순 정렬 제목순 정렬	인용 인용	연도
Simultaneous clustering and estimation of heterogeneous graphical models B Hao, WW Sun, Y Liu, G Cheng Journal of Machine Learning Research 18 (217), 1-58, 2018	70	2018
Adaptive exploration in linear contextual bandit B Hao, T Lattimore, C Szepesvari International Conference on Artificial Intelligence and Statistics, 3536-3545, 2020	60	2020
Sparse and low-rank tensor estimation via cubic sketchings B Hao, AR Zhang, G Cheng International Conference on Artificial Intelligence and Statistics, 1319-1330, 2020	57	2020
High-dimensional sparse linear bandits B Hao, T Lattimore, M Wang 34th Conference on Neural Information Processing Systems, 2020	56	2020
Bootstrapping upper confidence bound B Hao, Y Abbasi-Yadkori, Z Wen, G Cheng 33rd Conference on Neural Information Processing Systems, 2019	56	2019
Sparse feature selection makes batch reinforcement learning more sample efficient B Hao, Y Duan, T Lattimore, C Szepesvári, M Wang International Conference on Machine Learning, 4063-4073, 2021	35	2021
Bootstrapping fitted q-evaluation for off-policy inference B Hao, X Ji, Y Duan, H Lu, C Szepesvari, M Wang International Conference on Machine Learning, 4074-4084, 2021	31	2021
Online sparse reinforcement learning B Hao, T Lattimore, C Szepesvári, M Wang International Conference on Artificial Intelligence and Statistics, 316-324, 2021	29	2021
Adaptive approximate policy iteration B Hao, N Lazic, Y Abbasi-Yadkori, P Joulani, C Szepesvari Proceedings of the 24th International Conference on Artificial Intelligence …, 2020	27*	2020
Sparse tensor additive regression B Hao, B Wang, P Wang, J Zhang, J Yang, WW Sun Journal of machine learning research 22 (64), 1-43, 2021	26	2021
Efficient local planning with linear function approximation D Yin, B Hao, Y Abbasi-Yadkori, N Lazić, C Szepesvári International Conference on Algorithmic Learning Theory, 1165-1192, 2022	24	2022
Residual bootstrap exploration for bandit algorithms CH Wang, Y Yu, B Hao, G Cheng arXiv preprint arXiv:2002.08436, 2020	19	2020
Information directed sampling for sparse linear bandits B Hao, T Lattimore, W Deng Advances in Neural Information Processing Systems 34, 16738-16750, 2021	16	2021
Bootstrapping Statistical Inference for Off-Policy Evaluation B Hao, X Ji, Y Duan, H Lu, C Szepesvári, M Wang arXiv preprint arXiv:2102.03607, 2021	16	2021
The neural testbed: Evaluating joint predictions I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, M Ibrahimi, ... Advances in Neural Information Processing Systems 35, 12554-12565, 2022	15	2022
Contextual information-directed sampling B Hao, T Lattimore, C Qin International Conference on Machine Learning, 8446-8464, 2022	12	2022
Regret Bounds for Information-Directed Reinforcement Learning B Hao, T Lattimore Advances in Neural Information Processing Systems, 2022	12	2022
Bandit phase retrieval T Lattimore, B Hao Advances in Neural Information Processing Systems 34, 18801-18811, 2021	11	2021
Optimization issues in kl-constrained approximate policy iteration N Lazić, B Hao, Y Abbasi-Yadkori, D Schuurmans, C Szepesvári arXiv preprint arXiv:2102.06234, 2021	10	2021
Tensors in modern statistical learning WW Sun, B Hao, L Li Wiley StatsRef: Statistics Reference Online. Hoboken, NJ. John Wiley & Sons …, 2021	10	2021

현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.

학술자료 1–20

연간 인용횟수

중복된 서지정보

병합된 서지정보

공동 저자 추가공동 저자

팔로우

인용

공동 저자