팔로우
Alec Koppel
Alec Koppel
AI Research Lead, JP Morgan AI Research
jpmchase.com의 이메일 확인됨 - 홈페이지
제목
인용
인용
연도
Global convergence of policy gradient methods to (almost) locally optimal policies
K Zhang, A Koppel, H Zhu, T Basar
SIAM Journal on Control and Optimization 58 (6), 3586-3612, 2020
1812020
A saddle point algorithm for networked online convex optimization
A Koppel, FY Jakubiec, A Ribeiro
IEEE Transactions on Signal Processing 63 (19), 5149-5164, 2015
1812015
A Class of Prediction-Correction Methods for Time-Varying Convex Optimization
A Simonetto, A Mokhtari, A Koppel, G Leus, A Ribeiro
IEEE Transactions on Signal Processing (submitted), 0
140*
Variational policy gradient method for reinforcement learning with general utilities
J Zhang, A Koppel, AS Bedi, C Szepesvari, M Wang
Advances in Neural Information Processing Systems 33, 4572-4583, 2020
1242020
On the sample complexity of actor-critic method for reinforcement learning with function approximation
H Kumar, A Koppel, A Ribeiro
Machine Learning, 1-35, 2023
902023
Proximity without consensus in online multi-agent optimization
A Koppel, BM Sadler, A Ribeiro
Proc. Int. Conf. Accoustics Speech Signal Proces (submitted),, 2016
842016
A Decentralized Prediction-Correction Method for Networked Time-Varying Convex Optimization
A Simonetto, A Mokhtari, A Koppel, G Leus, A Ribeiro
Computational Advances in Multi-Sensor Adaptive Processing, IEEE …, 2015
812015
Decentralized online learning with kernels
A Koppel, S Paternain, C Richard, A Ribeiro
IEEE Transactions on Signal Processing 66 (12), 3240-3255, 2018
602018
Parsimonious online learning with kernels via sparse projections in function space
A Koppel, G Warnell, E Stump, A Ribeiro
The Journal of Machine Learning Research 20 (1), 83-126, 2019
51*2019
Achieving zero constraint violation for constrained reinforcement learning via primal-dual approach
Q Bai, AS Bedi, M Agarwal, A Koppel, V Aggarwal
Proceedings of the AAAI Conference on Artificial Intelligence 36 (4), 3682-3689, 2022
472022
Parsimonious online learning with kernels via sparse projections in function space
A Koppel, G Warnell, E Stump, A Ribeiro
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
432017
D4L: Decentralized Dynamic Discrminative Dictionary Learning
A Koppel, G Warnell, E Stump, A Ribeiro
IEEE Transactions on Signal and Info. Processing over Networks, 2015
402015
Consistent online gaussian process regression without the sample complexity bottleneck
A Koppel, H Pradhan, K Rajawat
Statistics and Computing 31, 1-18, 2021
372021
Asynchronous and parallel distributed pose graph optimization
Y Tian, A Koppel, AS Bedi, JP How
IEEE Robotics and Automation Letters 5 (4), 5819-5826, 2020
322020
Policy Evaluation in Continuous MDPs with Efficient Kernelized Gradient Temporal Difference
A Koppel, G Warnell, E Stump, P Stone, A Ribeiro.
IEEE Transactions on Automatic Control 66 (4), 2020
31*2020
Cautious reinforcement learning via distributional risk in the dual domain
J Zhang, AS Bedi, M Wang, A Koppel
arXiv preprint arXiv:2002.12475, 2020
292020
Online learning for characterizing unknown environments in ground robotic vehicle models
A Koppel, J Fink, G Warnell, E Stump, A Ribeiro
2016 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2016
262016
Asynchronous Decentralized Stochastic Optimization in Heterogeneous Networks
AS Bedi, A Koppel, K Rajawat
IEEE Trans. Signal Process (submitted)., 2017
25*2017
A variational approach to dual methods for constrained convex optimization
M Fazlyab, A Koppel, VM Preciado, A Ribeiro
2017 American Control Conference (ACC), 5269-5275, 2017
252017
Asynchronous online learning in multi-agent systems with proximity constraints
AS Bedi, A Koppel, K Rajawat
IEEE Transactions on Signal and Information Processing over Networks 5 (3 …, 2019
242019
현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.
학술자료 1–20