Kavosh Asadi

인용

	전체	2019년 이후
서지정보	2296	2118
h-index	13	13
i10-index	16	15

560

280

140

420

2017201820192020202120222023202442 125 146 253 396 466 552 277

공개 액세스

모두 보기

자료 3개

자료 0개

공개

비공개

재정 지원 요구사항 기준

공동 저자

Michael LittmanBrown Universitybrown.edu의 이메일 확인됨
Alex SmolaBoson AIsmola.org의 이메일 확인됨
George KonidarisBrowncs.brown.edu의 이메일 확인됨
Dipendra MisraMicrosoft Research New Yorkmicrosoft.com의 이메일 확인됨
Rasool FakoorAmazon Web Servicesamazon.com의 이메일 확인됨
Jason D. WilliamsAppleapple.com의 이메일 확인됨
David AbelResearch Scientist, DeepMinddeepmind.com의 이메일 확인됨
Seungchan KimCarnegie Mellon Universitycs.cmu.edu의 이메일 확인됨
Cameron S. AllenPostdoc, UC Berkeleyberkeley.edu의 이메일 확인됨
Yuu JinnaiCyberAgent, Inc.cyberagent.co.jp의 이메일 확인됨
Dilip ArumugamPh.D. Candidate - Stanford Universitycs.stanford.edu의 이메일 확인됨
Shoham SabachAssociate Professor, Technion, Faculty of Data and Decision Sciencestechnion.ac.il의 이메일 확인됨
Omer GottesmanAmazonamazon.com의 이메일 확인됨
Abdelrahman MohamedResearch scientist, Facebook AI Researchfb.com의 이메일 확인됨
Ronald ParrProfessor of Computer Science, Duke Universitycs.duke.edu의 이메일 확인됨
Lawson L.S. WongAssistant Professor, CCIS, Northeastern Universityccs.neu.edu의 이메일 확인됨
Erwan LecarpentierPhD in Computer Scienceisae-supaero.fr의 이메일 확인됨
Yao LiuAmazonstanford.edu의 이메일 확인됨
Taesup KimAssistant Professor, Seoul National Universitysnu.ac.kr의 이메일 확인됨

팔로우

Kavosh Asadi

Research Scientist, Amazon

amazon.com의 이메일 확인됨 - 홈페이지

Reinforcement Learning AI Alignment Optimization


제목 서지정보순 정렬 연도순 정렬 제목순 정렬	인용 인용	연도
Dive into deep learning A Zhang, ZC Lipton, M Li, AJ Smola arXiv preprint arXiv:2106.11342, 2021	1072	2021
Hybrid code networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning JD Williams, K Asadi, G Zweig arXiv preprint arXiv:1702.03274, 2017	405	2017
An Alternative Softmax Operator for Reinforcement Learning K Asadi, ML Littman Proceedings of the 34th International Conference on Machine Learning, 243-252, 2017	221	2017
Lipschitz Continuity in Model-based Reinforcement Learning K Asadi, D Misra, ML Littman Proceedings of the 35th International Conference on Machine Learning, 2018	168	2018
Deepmellow: removing the need for a target network in deep Q-learning S Kim, K Asadi, M Littman, G Konidaris Proceedings of the Twenty Eighth International Joint Conference on …, 2019	76*	2019
State abstraction as compression in apprenticeship learning D Abel, D Arumugam, K Asadi, Y Jinnai, ML Littman, LLS Wong Proceedings of the AAAI Conference on Artificial Intelligence 33, 3134-3142, 2019	58	2019
Combating the Compounding-Error Problem with a Multi-step Model K Asadi, D Misra, S Kim, ML Littman arXiv preprint arXiv:1905.13320, 2019	55	2019
Lipschitz lifelong reinforcement learning E Lecarpentier, D Abel, K Asadi, Y Jinnai, E Rachelson, ML Littman Proceedings of the AAAI Conference on Artificial Intelligence 35 (9), 8270-8278, 2021	36	2021
Mean Actor Critic K Asadi, C Allen, M Roderick, A Mohamed, G Konidaris, M Littman arXiv preprint arXiv:1709.00503, 2017	35*	2017
Continuous doubly constrained batch reinforcement learning R Fakoor, J Mueller, K Asadi, P Chaudhari, AJ Smola arXiv preprint arXiv:2102.09225, 2021	28	2021
Deep radial-basis value functions for continuous control K Asadi, N Parikh, RE Parr, GD Konidaris, ML Littman Proceedings of the AAAI Conference on Artificial Intelligence, 2021	27*	2021
Sample-efficient Reinforcement Learning for Dialog Control K Asadi, JD Williams arXiv preprint arXiv:1612.06000, 2016	25	2016
Strengths, weaknesses, and combinations of model-based and model-free reinforcement learning K Asadi Department of Computing Science University of Alberta, 2015	14	2015
Mitigating Planner Overfitting in Model-Based Reinforcement Learning D Arumugam, D Abel, K Asadi, N Gopalan, C Grimm, JK Lee, L Lehnert, ... arXiv preprint arXiv:1812.01129, 2018	13	2018
Towards a Simple Approach to Multi-step Model-based Reinforcement Learning K Asadi, E Cater, D Misra, ML Littman arXiv preprint arXiv:1811.00128, 2018	13	2018
Equivalence between wasserstein and value-aware model-based reinforcement learning K Asadi, E Cater, D Misra, ML Littman FAIM Workshop on Prediction and Generative Modeling in Reinforcement Learning 3, 2018	13*	2018
Resetting the optimizer in deep RL: An empirical study K Asadi, R Fakoor, S Sabach Advances in Neural Information Processing Systems 36, 2023	9	2023
Fair E3: Efficient welfare-centric fair reinforcement learning C Cousins, K Asadi, ML Littman 5th Multidisciplinary Conference on Reinforcement Learning and Decision …, 2022	6	2022
Learning State Abstractions for Transfer in Continuous Control K Asadi, D Abel, ML Littman arXiv preprint arXiv:2002.05518, 2020	6	2020
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models Z Liu, J Zhang, K Asadi, Y Liu, D Zhao, S Sabach, R Fakoor arXiv preprint arXiv:2310.05905, 2023	4	2023

현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.

학술자료 1–20

연간 인용횟수

중복된 서지정보

병합된 서지정보

공동 저자 추가공동 저자

팔로우

인용

공동 저자