Zhaoran Wang

Cited by

	All	Since 2019
Citations	9000	8344
h-index	48	47
i10-index	109	107

Citations per year

Year	Citations
2014	33
2015	66
2016	134
2017	198
2018	203
2019	305
2020	658
2021	1267
2022	1701
2023	2126
2024	2281

Public access

52 articles available, 0 articles not available

Based on funding mandates


Associate Professor at Northwestern University

Verified email at northwestern.edu

Deep Reinforcement Learning, Data-Driven Decision-Making, Optimization Under Uncertainty, Nonconvex


Title	Authors	Venue	Cited by	Year
Provably Efficient Reinforcement Learning with Linear Function Approximation	C Jin, Z Yang, Z Wang, MI Jordan	Mathematics of Operations Research/Annual Conference on Learning Theory, 2022	829	2022
A Theoretical Analysis of Deep Q-Learning	J Fan, Z Wang, Y Xie, Z Yang	Learning for Dynamics and Control, 2020	790	2020
Is Pessimism Provably Efficient for Offline RL?	Y Jin, Z Yang, Z Wang	International Conference on Machine Learning, 2021	408	2021
Provably Efficient Exploration in Policy Optimization	Q Cai, Z Yang, C Jin, Z Wang	International Conference on Machine Learning, 2020	306	2020
A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic	M Hong, HT Wai, Z Wang, Z Yang	SIAM Journal on Optimization, 2022	286*	2022
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence	L Wang, Q Cai, Z Yang, Z Wang	International Conference on Learning Representations, 2020	258	2020
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy	B Liu, Q Cai, Z Yang, Z Wang	Advances in Neural Information Processing Systems, 2019	221*	2019
Optimal Computational and Statistical Rates of Convergence for Sparse Nonconvex Learning Problems	Z Wang, H Liu, T Zhang	Annals of Statistics, 2014	206	2014
Multi-Agent Reinforcement Learning via Double-Averaging Primal-Dual Optimization	HT Wai, Z Yang, Z Wang, M Hong	Advances in Neural Information Processing Systems, 2018	204	2018
A Strictly Contractive Peaceman–Rachford Splitting Method for Convex Programming	B He, H Liu, Z Wang, X Yuan	SIAM Journal on Optimization, 2014	195	2014
A Nonconvex Optimization Framework for Low Rank Matrix Estimation	T Zhao, Z Wang, H Liu	Advances in Neural Information Processing Systems, 2015	194*	2015
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization	D Ding, X Wei, Z Yang, Z Wang, MR Jovanović	International Conference on Artificial Intelligence and Statistics, 2021	179	2021
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium	Q Xie, Y Chen, Z Wang, Z Yang	Mathematics of Operations Research/Annual Conference on Learning Theory, 2022	155	2022
Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima	Q Cai, Z Yang, JD Lee, Z Wang	Mathematics of Operations Research/Advances in Neural Information Processing …, 2019	153*	2019
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost	Z Yang, Y Chen, M Hong, Z Wang	Advances in Neural Information Processing Systems, 2019	149	2019
A Near-Optimal Algorithm for Stochastic Bilevel Optimization via Double-Momentum	P Khanduri, S Zeng, M Hong, HT Wai, Z Wang, Z Yang	Advances in Neural Information Processing Systems, 2021	141	2021
Bridging Exploration and General Function Approximation in Reinforcement Learning: Provably Efficient Kernel and Neural Value Iterations	Z Yang, C Jin, Z Wang, M Wang, MI Jordan	Advances in Neural Information Processing Systems, 2020	141*	2020
High-Dimensional Expectation-Maximization Algorithm: Statistical Optimization and Asymptotic Normality	Z Wang, Q Gu, Y Ning, H Liu	Advances in Neural Information Processing Systems, 2015	141	2015
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning	C Bai, L Wang, Z Yang, Z Deng, A Garg, P Liu, Z Wang	International Conference on Learning Representations, 2022	135	2022
Convergent Policy Optimization for Safe Reinforcement Learning	M Yu, Z Yang, M Kolar, Z Wang	Advances in Neural Information Processing Systems, 2019	129	2019
