Geon-Hyeong Kim

Citations

	All	Since 2019
Citations	265	264
h-index	6	6
i10-index	6	6

Citations per year: 2019: 5, 2020: 7, 2021: 22, 2022: 52, 2023: 105, 2024: 73

Co-authors

Kee-Eung Kim, KAIST (verified email at kaist.ac.kr)
Jongmin Lee, UC Berkeley (verified email at berkeley.edu)
HyeongJoo Hwang, KAIST (verified email at ai.kaist.ac.kr)
Hongseok Yang, Professor, School of Computing, KAIST (verified email at kaist.ac.kr)
Wonseok Jeon, Qualcomm AI Research (verified email at qti.qualcomm.com)
Seunghoon Hong, Associate Professor, KAIST (verified email at kaist.ac.kr)
Youngsoo Jang, LG AI Research (verified email at lgresearch.ai)
Pascal Poupart, University of Waterloo (verified email at uwaterloo.ca)
Kanghoon Lee, LG AI Research (verified email at lgresearch.ai)
Daniel D. Lee, Tisch University Professor of ECE, Cornell University (verified email at alum.mit.edu)
Pedro A. Ortega, Artificial Intelligence & Machine Learning (verified email at adaptiveagents.org)

Geon-Hyeong Kim

LG AI Research

Verified email at lgresearch.ai

Imitation Learning, Reinforcement Learning


Title	Cited by	Year
DemoDICE: Offline imitation learning with supplementary imperfect demonstrations. GH Kim, S Seo, J Lee, W Jeon, HJ Hwang, H Yang, KE Kim. International Conference on Learning Representations, 2022	70	2022
Monte-Carlo tree search for constrained POMDPs. J Lee, GH Kim, P Poupart, KE Kim. Advances in Neural Information Processing Systems 31, 2018	69	2018
Variational interaction information maximization for cross-domain disentanglement. HJ Hwang, GH Kim, S Hong, KE Kim. Advances in Neural Information Processing Systems 33, 22479-22491, 2020	43	2020
Multi-view representation learning via total correlation objective. HJ Hwang, GH Kim, S Hong, KE Kim. Advances in Neural Information Processing Systems 34, 12194-12207, 2021	33	2021
Monte-Carlo tree search in continuous action spaces with value gradients. J Lee, W Jeon, GH Kim, KE Kim. Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4561-4568, 2020	23	2020
LobsDICE: Offline learning from observation via stationary distribution correction estimation. GH Kim, J Lee, Y Jang, H Yang, KE Kim. Advances in Neural Information Processing Systems 35, 8252-8264, 2022	17*	2022
Variational inference for sequential data with future likelihood estimates. GH Kim, Y Jang, H Yang, KE Kim. International Conference on Machine Learning, 5296-5305, 2020	4	2020
Prospector: Improving LLM agents with self-asking and trajectory ranking. B Kim, Y Jang, L Logeswaran, GH Kim, YJ Kim, H Lee, M Lee	2	2023
Trust region sequential variational inference. GH Kim, Y Jang, J Lee, W Jeon, H Yang, KE Kim. Asian Conference on Machine Learning, 1033-1048, 2019	2	2019
Bayesian optimistic Kullback–Leibler exploration. K Lee, GH Kim, P Ortega, DD Lee, KE Kim. Machine Learning 108, 765-783, 2019	2	2019
SafeDICE: Offline safe imitation learning with non-preferred demonstrations. Y Jang, GH Kim, J Lee, S Sohn, B Kim, H Lee, M Lee. Advances in Neural Information Processing Systems 36, 2024		2024
Information-theoretic state space model for multi-view reinforcement learning. HJ Hwang, S Seo, Y Jang, S Kim, GH Kim, S Hong, KE Kim		2023
Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration. Y Jang, GH Kim, B Kim, YJ Kim, H Lee, M Lee. Forty-first International Conference on Machine Learning
DfPO: Degeneration-free Policy Optimization via Action Masking in Natural Language Action Spaces. Y Jang, GH Kim, B Kim, H Lee, M Lee
