Yao Liu
Title
Cited by
Cited by
Year
Off-policy policy gradient with state distribution correction
Y Liu, A Swaminathan, A Agarwal, E Brunskill
Uncertainty in Artificial Intelligence, 2019
472019
Representation balancing mdps for off-policy policy evaluation
Y Liu, O Gottesman, A Raghu, M Komorowski, AA Faisal, F Doshi-Velez, ...
Advances in Neural Information Processing Systems, 2644-2653, 2018
382018
Behaviour policy estimation in off-policy policy evaluation: Calibration matters
A Raghu, O Gottesman, Y Liu, M Komorowski, A Faisal, F Doshi-Velez, ...
arXiv preprint arXiv:1807.01066, 2018
142018
Combining parametric and nonparametric models for off-policy evaluation
O Gottesman, Y Liu, S Sussex, E Brunskill, F Doshi-Velez
arXiv preprint arXiv:1905.05787, 2019
82019
Pac continuous state online multitask reinforcement learning with identification
Y Liu, Z Guo, E Brunskill
Proceedings of the 2016 International Conference on Autonomous Agents …, 2016
72016
Understanding the curse of horizon in off-policy evaluation via conditional importance sampling
Y Liu, PL Bacon, E Brunskill
International Conference on Machine Learning, 6184-6193, 2020
52020
When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms
Y Liu, E Brunskill
arXiv preprint arXiv:1805.09045, 2018
42018
Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions
O Gottesman, J Futoma, Y Liu, S Parbhoo, E Brunskill, F Doshi-Velez
arXiv preprint arXiv:2002.03478, 2020
22020
Provably good batch reinforcement learning without great exploration
Y Liu, A Swaminathan, A Agarwal, E Brunskill
arXiv preprint arXiv:2007.08202, 2020
12020
Nonlinear Dimensionality Reduction by Local Orthogonality Preserving Alignment
T Lin, Y Liu, B Wang, LW Wang, HB Zha
Journal of Computer Science and Technology 31 (3), 512-524, 2016
12016
All-Action Policy Gradient Methods: A Numerical Integration Approach
B Petit, L Amdahl-Culleton, Y Liu, J Smith, PL Bacon
arXiv preprint arXiv:1910.09093, 2019
2019
Stitched Trajectories for Off-Policy Learning
S Sussex, O Gottesman, Y Liu, S Murphy, E Brunskill, F Doshi-Velez
Model Selection for Off-Policy Policy Evaluation
Y Liu, PS Thomas, E Brunskill
The system can't perform the operation now. Try again later.
Articles 1–13