Zeyu Zheng

Cited by

	All	Since 2019
Citations	1345	1289
h-index	7	7
i10-index	7	7

Citations per year

Year	Citations
2017	12
2018	43
2019	93
2020	137
2021	193
2022	170
2023	206
2024	487

Public access

5 articles available
0 articles not available

Based on funding mandates

Co-authors

Satinder Singh	Google DeepMind / U. of Michigan	Verified email at umich.edu
Junhyuk Oh	Research Scientist, DeepMind	Verified email at google.com
Eric Xing	President at Mohamed bin Zayed University of AI, Professor of Computer Science, Carnegie Mellon U	Verified email at cs.cmu.edu
Hao Zhang	UC San Diego	Verified email at ucsd.edu
Hado van Hasselt	Research Scientist, DeepMind; Honorary Professor, UCL	Verified email at google.com
Will Dabney	DeepMind	Verified email at google.com
Razvan Pascanu	Google DeepMind	Verified email at google.com
Wenfei Fan	Professor of Web Data Management, University of Edinburgh	Verified email at inf.ed.ac.uk
Richard L. Lewis	Professor of Psychology, Linguistics and Cognitive Science, University of Michigan	Verified email at umich.edu
Zhongwen Xu	Tencent	Verified email at tencent.com
David Silver	DeepMind, UCL	Verified email at google.com
Matteo Hessel	Research Engineer, Google DeepMind	Verified email at google.com
Clare Lyle	Google DeepMind	Verified email at deepmind.com
Risto Vuorio	University of Oxford	Verified email at cs.ox.ac.uk
Haozhu Wang	Amazon	Verified email at amazon.com
Chengang Ji	PhD, University of Michigan-Ann Arbor	Verified email at umich.edu
L. Jay Guo	Professor of Electrical Engineering and Computer Science, The University of Michigan	Verified email at umich.edu
Evgenii Nikishin	PhD student, Mila, University of Montreal	Verified email at umontreal.ca
Vivek Veeriah	Google DeepMind	Verified email at google.com
Mengda Xu	Columbia University; J.P.Morgan AI Research	Verified email at columbia.edu

Zeyu Zheng

DeepMind

Verified email at deepmind.com

Interests: artificial intelligence, machine learning, reinforcement learning, deep learning


Title	Authors	Venue	Cited by	Year
Gemini: a family of highly capable multimodal models	G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...	arXiv preprint arXiv:2312.11805, 2023	443	2023
Poseidon: An efficient communication architecture for distributed deep learning on GPU clusters	H Zhang, Z Zheng, S Xu, W Dai, Q Ho, X Liang, Z Hu, J Wei, P Xie, ...	2017 USENIX Annual Technical Conference (USENIX ATC 17), 181-193, 2017	397	2017
On learning intrinsic rewards for policy gradient methods	Z Zheng, J Oh, S Singh	Advances in Neural Information Processing Systems, 4644-4654, 2018	193	2018
Parallelizing sequential graph computations	W Fan, J Xu, Y Wu, W Yu, J Jiang, Z Zheng, B Zhang, Y Cao, C Tian	Proceedings of the 2017 ACM International Conference on Management of Data …, 2017	116	2017
What Can Learned Intrinsic Rewards Capture?	Z Zheng, J Oh, M Hessel, Z Xu, M Kroiss, H Van Hasselt, D Silver, S Singh	International Conference on Machine Learning, 11436-11446, 2020	86	2020
Automated multi-layer optical design via deep reinforcement learning	H Wang, Z Zheng, C Ji, LJ Guo	Machine Learning: Science and Technology 2 (2), 025013, 2021	55	2021
Understanding plasticity in neural networks	C Lyle, Z Zheng, E Nikishin, BA Pires, R Pascanu, W Dabney	International Conference on Machine Learning, 23190-23211, 2023	33	2023
Adaptive Pairwise Weights for Temporal Credit Assignment	Z Zheng, R Vuorio, R Lewis, S Singh	Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 9225-9232, 2022	7*	2022
Learning State Representations from Random Deep Action-conditional Predictions	Z Zheng, V Veeriah, R Vuorio, RL Lewis, S Singh	Advances in Neural Information Processing Systems 34, 23679-23691, 2021	7	2021
Towards multi‐agent reinforcement learning‐driven over‐the‐counter market simulations	N Vadori, L Ardon, S Ganesh, T Spooner, S Amrouni, J Vann, M Xu, ...	Mathematical Finance 34 (2), 262-347, 2024	4	2024
GrASP: Gradient-Based Affordance Selection for Planning	V Veeriah, Z Zheng, R Lewis, S Singh	arXiv preprint arXiv:2202.04772, 2022	3	2022
Disentangling the Causes of Plasticity Loss in Neural Networks	C Lyle, Z Zheng, K Khetarpal, H van Hasselt, R Pascanu, J Martens, ...	arXiv preprint arXiv:2402.18762, 2024	1	2024
Human Alignment of Large Language Models through Online Preference Optimisation	D Calandriello, D Guo, R Munos, M Rowland, Y Tang, BA Pires, ...	arXiv preprint arXiv:2403.08635, 2024		2024
Generalized Preference Optimization: A Unified Approach to Offline Alignment	Y Tang, ZD Guo, Z Zheng, D Calandriello, R Munos, M Rowland, ...	arXiv preprint arXiv:2402.05749, 2024		2024
Towards Perpetually Trainable Neural Networks	C Lyle, Z Zheng, K Khetarpal, R Pascanu, J Martens, H van Hasselt, ...			2023
Advances in Deep Reinforcement Learning: Intrinsic Rewards, Temporal Credit Assignment, State Representations, and Value-equivalent Models	Z Zheng			2022
Reinforcement learning using meta-learned intrinsic rewards	Z Zheng, J Oh, SS Baveja	US Patent App. 17/033,410, 2021		2021
Supplementary Material: On Learning Intrinsic Rewards for Policy Gradient Methods	Z Zheng, J Oh, S Singh			


