Abbas Abdolmaleki

Cited by

	All	Since 2019
Citations	3746	3502
h-index	27	25
i10-index	43	36

1200

600

300

900

20132014201520162017201820192020202120222023202410 14 28 27 57 83 175 384 538 893 1114 389

Public access

View all

12 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Martin RiedmillerDeepMindVerified email at google.com
Nicolas HeessDeepMindVerified email at google.com
Michael NeunertGoogle DeepMindVerified email at google.com
Luis Paulo ReisAssociate Professor, University of PortoVerified email at fe.up.pt
Nuno LauUniversidade de AveiroVerified email at ua.pt
Thomas LampeDeepMindVerified email at google.com
Yuval TassaSenior Research Scientist, Google DeepMindVerified email at google.com
Roland HafnerDeepMindVerified email at google.com
Gerhard NeumannProfessor, Karlsruhe Institute of Technology (KIT)Verified email at robot-learning.de
Noah Y. SiegelDeepMindVerified email at google.com
Josh MerelVerified email at google.com
Steven BohezGoogle DeepMindVerified email at google.com
Nima ShafiiNVIDIAVerified email at nvidia.com
Jan PetersProfessor for Intelligent Autonomous Systems/TU Darmstadt, Dept. Head/German AI Research Center DFKIVerified email at ias.tu-darmstadt.de
Rudolf LioutikovTT-Professor, Intuitive Robots Lab, Karlsruhe Institute of TechnologyVerified email at kit.edu
Jost Tobias SpringenbergGoogle DeepMind

Abbas Abdolmaleki

Deepmind

Verified email at google.com

Artificial Intelligence Reinforcement Learning Robotics


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Magnetic control of tokamak plasmas through deep reinforcement learning J Degrave, F Felici, J Buchli, M Neunert, B Tracey, F Carpanese, T Ewalds, ... Nature 602 (7897), 414-419, 2022	614	2022
Deepmind control suite Y Tassa, Y Doron, A Muldal, T Erez, Y Li, DL Casas, D Budden, ... arXiv preprint arXiv:1801.00690, 2018	545	2018
Maximum a posteriori policy optimisation A Abdolmaleki, JT Springenberg, Y Tassa, R Munos, N Heess, ... arXiv preprint arXiv:1806.06920, 2018	482	2018
Keep doing what worked: Behavioral modelling priors for offline reinforcement learning NY Siegel, JT Springenberg, F Berkenkamp, A Abdolmaleki, M Neunert, ... arXiv preprint arXiv:2002.08396, 2020	275	2020
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	231	2020
Robust reinforcement learning for continuous control with model misspecification DJ Mankowitz, N Levine, R Jeong, Y Shi, J Kay, A Abdolmaleki, ... arXiv preprint arXiv:1906.07516, 2019	109	2019
V-mpo: On-policy maximum a posteriori policy optimization for discrete and continuous control HF Song, A Abdolmaleki, JT Springenberg, A Clark, H Soyer, JW Rae, ... arXiv preprint arXiv:1909.12238, 2019	105	2019
From motor control to team play in simulated humanoid football S Liu, G Lever, Z Wang, J Merel, SMA Eslami, D Hennes, WM Czarnecki, ... Science Robotics 7 (69), eabo0235, 2022	102	2022
Model-based relative entropy stochastic search A Abdolmaleki, R Lioutikov, JR Peters, N Lau, L Pualo Reis, G Neumann Advances in Neural Information Processing Systems 28, 2015	87	2015
Continuous-discrete reinforcement learning for hybrid control in robotics M Neunert, A Abdolmaleki, M Wulfmeier, T Lampe, T Springenberg, ... Conference on Robot Learning, 735-751, 2020	85	2020
Beyond pick-and-place: Tackling robotic stacking of diverse shapes AX Lee, CM Devin, Y Zhou, T Lampe, K Bousmalis, JT Springenberg, ... 5th Annual Conference on Robot Learning, 2021	78	2021
A distributional view on multi-objective policy optimization A Abdolmaleki, S Huang, L Hasenclever, M Neunert, F Song, M Zambelli, ... International conference on machine learning, 11-22, 2020	71	2020
Relative entropy regularized policy iteration A Abdolmaleki, JT Springenberg, J Degrave, S Bohez, Y Tassa, D Belov, ... arXiv preprint arXiv:1812.02256, 2018	65	2018
Value constrained model-free continuous control S Bohez, A Abdolmaleki, M Neunert, J Buchli, N Heess, R Hadsell arXiv preprint arXiv:1902.04623, 2019	64	2019
Model-free trajectory optimization for reinforcement learning R Akrour, G Neumann, H Abdulsamad, A Abdolmaleki International Conference on Machine Learning, 2961-2970, 2016	48	2016
Imagined value gradients: Model-based policy optimization with tranferable latent dynamics models A Byravan, JT Springenberg, A Abdolmaleki, R Hafner, M Neunert, ... Conference on Robot Learning, 566-589, 2020	42	2020
Data-efficient hindsight off-policy option learning M Wulfmeier, D Rao, R Hafner, T Lampe, A Abdolmaleki, T Hertweck, ... International Conference on Machine Learning, 11340-11350, 2021	41	2021
An optimized gait generator based on fourier series towards fast and robust biped locomotion involving arms swing N Shafii, A Khorsandian, A Abdolmaleki, B Jozi 2009 IEEE International Conference on Automation and Logistics, 2018-2023, 2009	40	2009
Deriving and improving cma-es with information geometric trust regions A Abdolmaleki, B Price, N Lau, LP Reis, G Neumann Proceedings of the Genetic and Evolutionary Computation Conference, 657-664, 2017	39	2017
Omnidirectional walking and active balance for soccer humanoid robot N Shafii, A Abdolmaleki, R Ferreira, N Lau, LP Reis Progress in Artificial Intelligence: 16th Portuguese Conference on …, 2013	38	2013

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors