Geon-Hyeong Kim 个人学术档案

引用次数

	总计	2019 年至今
引用	269	268
h 指数	6	6
i10 指数	6	6

120

2019202020212022202320245 7 22 52 105 77

合著作者

Kee-Eung KimKAIST在 kaist.ac.kr 的电子邮件经过验证
Jongmin LeeUC Berkeley在 berkeley.edu 的电子邮件经过验证
HyeongJoo HwangKAIST在 ai.kaist.ac.kr 的电子邮件经过验证
Hongseok YangProfessor, School of Computing, KAIST在 kaist.ac.kr 的电子邮件经过验证
Wonseok JeonQualcomm AI Research在 qti.qualcomm.com 的电子邮件经过验证
Seunghoon HongAssociate Professor, KAIST在 kaist.ac.kr 的电子邮件经过验证
Youngsoo JangLG AI Research在 lgresearch.ai 的电子邮件经过验证
Pascal PoupartUniversity of Waterloo在 uwaterloo.ca 的电子邮件经过验证
Kanghoon LeeLG AI Research在 lgresearch.ai 的电子邮件经过验证
Daniel D. LeeTisch University Professor of ECE, Cornell University在 alum.mit.edu 的电子邮件经过验证
Pedro A. OrtegaArtificial Intelligence & Machine Learning在 adaptiveagents.org 的电子邮件经过验证

关注

Geon-Hyeong Kim

LG AI Research

在 lgresearch.ai 的电子邮件经过验证 - 首页

Imitation Learning Reinforcement Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Demodice: Offline imitation learning with supplementary imperfect demonstrations GH Kim, S Seo, J Lee, W Jeon, HJ Hwang, H Yang, KE Kim International Conference on Learning Representations, 2022	72	2022
Monte-Carlo tree search for constrained POMDPs J Lee, GH Kim, P Poupart, KE Kim Advances in Neural Information Processing Systems 31, 2018	69	2018
Variational interaction information maximization for cross-domain disentanglement HJ Hwang, GH Kim, S Hong, KE Kim Advances in Neural Information Processing Systems 33, 22479-22491, 2020	43	2020
Multi-view representation learning via total correlation objective HJ Hwang, GH Kim, S Hong, KE Kim Advances in Neural Information Processing Systems 34, 12194-12207, 2021	34	2021
Monte-carlo tree search in continuous action spaces with value gradients J Lee, W Jeon, GH Kim, KE Kim Proceedings of the AAAI conference on artificial intelligence 34 (04), 4561-4568, 2020	23	2020
Lobsdice: Offline learning from observation via stationary distribution correction estimation GH Kim, J Lee, Y Jang, H Yang, KE Kim Advances in Neural Information Processing Systems 35, 8252-8264, 2022	18*	2022
Variational inference for sequential data with future likelihood estimates GH Kim, Y Jang, H Yang, KE Kim International Conference on Machine Learning, 5296-5305, 2020	4	2020
Prospector: Improving LLM agents with self-asking and trajectory ranking B Kim, Y Jang, L Logeswaran, GH Kim, YJ Kim, H Lee, M Lee	2	2023
Trust region sequential variational inference GH Kim, Y Jang, J Lee, W Jeon, H Yang, KE Kim Asian conference on machine learning, 1033-1048, 2019	2	2019
Bayesian optimistic kullback–leibler exploration K Lee, GH Kim, P Ortega, DD Lee, KE Kim Machine Learning 108, 765-783, 2019	2	2019
SafeDICE: offline safe imitation learning with non-preferred demonstrations Y Jang, GH Kim, J Lee, S Sohn, B Kim, H Lee, M Lee Advances in Neural Information Processing Systems 36, 2024		2024
Information-theoretic state space model for multi-view reinforcement learning HJ Hwang, S Seo, Y Jang, S Kim, GH Kim, S Hong, KE Kim		2023
Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration Y Jang, GH Kim, B Kim, YJ Kim, H Lee, M Lee Forty-first International Conference on Machine Learning, 0
DfPO: Degeneration-free Policy Optimization via Action Masking in Natural Language Action Spaces Y Jang, GH Kim, B Kim, H Lee, M Lee

系统目前无法执行此操作，请稍后再试。

文章 1–14

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用