Abbas Abdolmaleki
Abbas Abdolmaleki
Verified email at
Cited by
Cited by
Maximum a posteriori policy optimisation
A Abdolmaleki, JT Springenberg, Y Tassa, R Munos, N Heess, ...
arXiv preprint arXiv:1806.06920, 2018
Deepmind control suite
Y Tassa, Y Doron, A Muldal, T Erez, Y Li, DL Casas, D Budden, ...
arXiv preprint arXiv:1801.00690, 2018
Model-based relative entropy stochastic search
A Abdolmaleki, R Lioutikov, JR Peters, N Lau, L Pualo Reis, G Neumann
Advances in Neural Information Processing Systems 28, 3537-3545, 2015
Keep doing what worked: Behavioral modelling priors for offline reinforcement learning
NY Siegel, JT Springenberg, F Berkenkamp, A Abdolmaleki, M Neunert, ...
arXiv preprint arXiv:2002.08396, 2020
An optimized gait generator based on fourier series towards fast and robust biped locomotion involving arms swing
N Shafii, A Khorsandian, A Abdolmaleki, B Jozi
2009 IEEE International Conference on Automation and Logistics, 2018-2023, 2009
Acme: A research framework for distributed reinforcement learning
M Hoffman, B Shahriari, J Aslanides, G Barth-Maron, F Behbahani, ...
arXiv preprint arXiv:2006.00979, 2020
Model-free trajectory optimization for reinforcement learning
R Akrour, G Neumann, H Abdulsamad, A Abdolmaleki
International Conference on Machine Learning, 2961-2970, 2016
V-MPO: On-policy maximum a posteriori policy optimization for discrete and continuous control
HF Song, A Abdolmaleki, JT Springenberg, A Clark, H Soyer, JW Rae, ...
arXiv preprint arXiv:1909.12238, 2019
Omnidirectional walking and active balance for soccer humanoid robot
N Shafii, A Abdolmaleki, R Ferreira, N Lau, LP Reis
Portuguese Conference on Artificial Intelligence, 283-294, 2013
Relative entropy regularized policy iteration
A Abdolmaleki, JT Springenberg, J Degrave, S Bohez, Y Tassa, D Belov, ...
arXiv preprint arXiv:1812.02256, 2018
Learning a humanoid kick with controlled distance
A Abdolmaleki, D Simões, N Lau, LP Reis, G Neumann
Robot World Cup, 45-57, 2016
Deriving and improving CMA-ES with information geometric trust regions
A Abdolmaleki, B Price, N Lau, LP Reis, G Neumann
Proceedings of the Genetic and Evolutionary Computation Conference, 657-664, 2017
Robust reinforcement learning for continuous control with model misspecification
DJ Mankowitz, N Levine, R Jeong, Y Shi, J Kay, A Abdolmaleki, ...
arXiv preprint arXiv:1906.07516, 2019
Simultaneously learning vision and feature-based control policies for real-world ball-in-a-cup
D Schwab, T Springenberg, MF Martins, T Lampe, M Neunert, ...
arXiv preprint arXiv:1902.04706, 2019
Regularized covariance estimation for weighted maximum likelihood policy search methods
A Abdolmaleki, N Lau, LP Reis, G Neumann
2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids …, 2015
Value constrained model-free continuous control
S Bohez, A Abdolmaleki, M Neunert, J Buchli, N Heess, R Hadsell
arXiv preprint arXiv:1902.04623, 2019
Regularized hierarchical policies for compositional transfer in robotics
M Wulfmeier, A Abdolmaleki, R Hafner, JT Springenberg, M Neunert, ...
arXiv preprint arXiv:1906.11228, 2019
Imagined value gradients: Model-based policy optimization with tranferable latent dynamics models
A Byravan, JT Springenberg, A Abdolmaleki, R Hafner, M Neunert, ...
Conference on Robot Learning, 566-589, 2020
Guide actor-critic for continuous control
V Tangkaratt, A Abdolmaleki, M Sugiyama
arXiv preprint arXiv:1705.07606, 2017
Model-free trajectory-based policy optimization with monotonic improvement
R Akrour, A Abdolmaleki, H Abdulsamad, J Peters, G Neumann
The Journal of Machine Learning Research 19 (1), 565-589, 2018
The system can't perform the operation now. Try again later.
Articles 1–20