Hosein Hasanbeig
Microsoft Research, New York
Verified email at microsoft.com
Title | Cited by | Year
Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction Guarantees
M Hasanbeig, Y Kantaros, A Abate, D Kroening, GJ Pappas, I Lee
IEEE Conference on Decision and Control (CDC), 2019
Cited by: 132 | Year: 2019
Logically-Constrained Reinforcement Learning
M Hasanbeig, A Abate, D Kroening
arXiv preprint arXiv:1801.08099, 2018
Cited by: 110 | Year: 2018
Cautious Reinforcement Learning with Logical Constraints
M Hasanbeig, A Abate, D Kroening
AAMAS, 483-491, 2020
Cited by: 90 | Year: 2020
Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic
M Cai, M Hasanbeig, S Xiao, A Abate, Z Kan
IEEE Robotics and Automation and IROS, 2021
Cited by: 73 | Year: 2021
Certified reinforcement learning with logic guidance
H Hasanbeig, D Kroening, A Abate
Artificial Intelligence 322, 103949, 2023
Cited by: 62 | Year: 2023
Deep Reinforcement Learning with Temporal Logics
M Hasanbeig, D Kroening, A Abate
International Conference on Formal Modeling and Analysis of Timed Systems, 1-22, 2020
Cited by: 57 | Year: 2020
Deepsynth: Program Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning
M Hasanbeig, NY Jeppu, A Abate, T Melham, D Kroening
AAAI Conference on Artificial Intelligence (AAAI-21), 2021
Cited by: 52* | Year: 2021
Logically-Constrained Neural Fitted Q-iteration
M Hasanbeig, A Abate, D Kroening
AAMAS, 2012-2014, 2019
Cited by: 49 | Year: 2019
Modular Deep Reinforcement Learning with Temporal Logic Specifications
LZ Yuan, M Hasanbeig, A Abate, D Kroening
arXiv preprint arXiv:1909.11591, 2019
Cited by: 45 | Year: 2019
Towards Verifiable and Safe Model-free Reinforcement Learning
M Hasanbeig, D Kroening, A Abate
Workshop on Artificial Intelligence and Formal Verification, Logics …, 2020
Cited by: 28* | Year: 2020
Shielding Atari Games with Bounded Prescience
M Giacobbe, M Hasanbeig, D Kroening, H Wijk
International Conference on Autonomous Agents and Multiagent Systems, 2021
Cited by: 23 | Year: 2021
Deepsynth: Program synthesis for automatic task segmentation in deep reinforcement learning
M Hasanbeig, NY Jeppu, A Abate, T Melham, D Kroening
arXiv preprint arXiv:1911.10244, 2019
Cited by: 18 | Year: 2019
Evaluating cognitive maps in large language models with cogeval: No emergent planning
I Momennejad, H Hasanbeig, FV Frujeri, H Sharma, RO Ness, N Jojic, ...
Advances in neural information processing systems 37, 2023
Cited by: 17* | Year: 2023
LCRL: Certified Policy Synthesis via Logically-Constrained Reinforcement Learning
M Hasanbeig, D Kroening, A Abate
International Conference on Quantitative Evaluation of Systems, 217-231, 2022
Cited by: 15 | Year: 2022
On Synchronous Binary Log-Linear Learning and Second Order Q-learning
M Hasanbeig, L Pavel
IFAC World Congress 50 (1), 8987-8992, 2017
Cited by: 11 | Year: 2017
Distributed Coverage Control by Robot Networks in Unknown Environments using a Modified EM Algorithm
M Hasanbeig, L Pavel
International Journal of Computer and Information Engineering 11 (7), 815-823, 2017
Cited by: 8 | Year: 2017
From Game-theoretic Multi-agent Log Linear Learning to Reinforcement Learning
M Hasanbeig, L Pavel
arXiv preprint arXiv:1802.02277, 2018
Cited by: 7 | Year: 2018
Allure: A systematic protocol for auditing and improving llm-based evaluation of text using iterative in-context-learning
H Hasanbeig, H Sharma, L Betthauser, FV Frujeri, I Momennejad
arXiv preprint arXiv:2309.13701, 2023
Cited by: 6 | Year: 2023
Logically-correct reinforcement learning. CoRR abs/1801.08099
M Hasanbeig, A Abate, D Kroening
Cited by: 6 | Year: 2017
System III: Learning with domain knowledge for safety constraints
F Barez, H Hasanbeig, A Abate
arXiv preprint arXiv:2304.11593, 2023
Cited by: 5 | Year: 2023