Penalized Proximal Policy Optimization for Safe Reinforcement Learning L Zhang, L Shen, L Yang, S Chen, B Yuan, X Wang, D Tao The 31st International Joint Conference on Artificial Intelligence (IJCAI), 2022 | 41 | 2022 |
Constrained Update Projection Approach to Safe Policy Optimization L Yang, J Ji, J Dai, L Zhang, B Zhou, P Li, Y Yang, G Pan 36th Conference on Neural Information Processing Systems (NeurIPS 2022), 2022 | 25 | 2022 |
Saformer: A conditional sequence modeling approach to offline safe reinforcement learning Q Zhang, L Zhang, H Xu, L Shen, B Wang, Y Chang, X Wang, B Yuan, ... arXiv preprint arXiv:2301.12203, 2023 | 12 | 2023 |
Evaluating Model-free Reinforcement Learning toward Safety-critical Tasks L Zhang, Q Zhang, L Shen, B Yuan, X Wang, D Tao 37th AAAI Conference on Artificial Intelligence (AAAI-23), 2022 | 9 | 2022 |
SafeRL-Kit: Evaluating Efficient Reinforcement Learning Methods for Safe Autonomous Driving L Zhang, Q Zhang, L Shen, B Yuan, X Wang Safe Learning for Autonomous Driving Workshop in ICML 2022, 2022 | 9 | 2022 |
Are Large Language Models Really Robust to Word-Level Perturbations? H Wang, G Ma, C Yu, N Gui, L Zhang, Z Huang, S Ma, Y Chang, S Zhang, ... Socially Responsible Language Modelling Research (SoLaR) Workshop at NeurIPS'23, 2023 | 8 | 2023 |
Safety Correction from Baseline: Towards the Risk-aware Policy in Robotics via Dual-agent Reinforcement Learning L Zhang, Z Yan, L Shen, S Li, X Wang, D Tao IROS 2022, 2022 | 3 | 2022 |
CAT: Closed-loop Adversarial Training for Safe End-to-End Driving L Zhang, Z Peng, Q Li, B Zhou 7th Annual Conference on Robot Learning (CoRL 2023), 2023 | 2 | 2023 |
Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning G Ma, L Zhang, H Wang, L Li, Z Wang, Z Wang, L Shen, X Wang, D Tao 37th Conference on Neural Information Processing Systems (NeurIPS 2023), 2023 | 2 | 2023 |