Youngsoo Jang: Academic Profile

Cited by

	All	Since 2019
Citations	360	350
h-index	8	7
i10-index	7	6

Citations per year

Year	2017	2018	2019	2020	2021	2022	2023	2024
Citations	2	8	11	20	68	90	114	47

Co-authors

Kee-Eung Kim, KAIST (verified email at kaist.ac.kr)
Jongmin Lee, UC Berkeley (verified email at berkeley.edu)
Geon-Hyeong Kim, LG AI Research (verified email at lgresearch.ai)
Donghoon Ham, Naver Clova (verified email at navercorp.com)
Jeong-Gwan Lee, KRAFTON Inc. (verified email at krafton.com)
Byung-Jun Lee, KAIST (verified email at kaist.ac.kr)
Hongseok Yang, Professor, School of Computing, KAIST (verified email at kaist.ac.kr)
Moontae Lee, Assistant Professor of Information Decision Sciences, UIC Business School, University of Illinois at Chicago (verified email at uic.edu)
Youngjae Chang, NCLab, KAIST (verified email at nclab.kaist.ac.kr)
Pascal Poupart, University of Waterloo (verified email at uwaterloo.ca)
Pierre Lison, Chief Research Scientist, Norsk Regnesentral (verified email at nr.no)
Byoungjip Kim, LG AI Research (verified email at lgresearch.ai)
Wonseok Jeon, Qualcomm AI Research (verified email at qti.qualcomm.com)
Honglak Lee, LG AI Research / U. Michigan (verified email at umich.edu)
Lajanugen Logeswaran, LG AI Research (verified email at lgresearch.ai)
HyeongJoo Hwang, KAIST (verified email at ai.kaist.ac.kr)
Seunghoon Hong, Associate Professor, KAIST (verified email at kaist.ac.kr)
Kyungjae Lee, LG AI Research (verified email at lgresearch.ai)
Dasol Hwang, LG AI Research (verified email at lgresearch.ai)
Sungryull Sohn, Research Scientist, LG AI Research (verified email at umich.edu)

Youngsoo Jang

LG AI Research

Verified email at lgresearch.ai

Research interests: Reinforcement Learning, Large Language Model, Dialogue System


Title, authors, venue	Cited by	Year
End-to-end neural pipeline for goal-oriented dialogue systems using GPT-2. D Ham, JG Lee, Y Jang, KE Kim. Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020	217	2020
GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems. Y Jang, J Lee, KE Kim. International Conference on Learning Representations, 2022	45	2022
Bayes-adaptive monte-carlo planning and learning for goal-oriented dialogues. Y Jang, J Lee, KE Kim. Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7994-8001, 2020	24	2020
Neural dialog state tracker for large ontologies by attention mechanism. Y Jang, J Ham, BJ Lee, Y Chang, KE Kim. 2016 IEEE Spoken Language Technology Workshop (SLT), 531-537, 2016	17	2016
Cross-language neural dialog state tracker for large ontologies using hierarchical attention. Y Jang, J Ham, BJ Lee, KE Kim. IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (11 …, 2018	12	2018
PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules. Y Jang, J Lee, J Park, KH Lee, P Lison, KE Kim. Proceedings of the 2019 Conference on Empirical Methods in Natural Language …, 2019	11	2019
Constrained Bayesian Reinforcement Learning via Approximate Linear Programming. J Lee, Y Jang, P Poupart, KE Kim. IJCAI, 2088-2095, 2017	11	2017
Monte-carlo planning and learning with language action value estimates. Y Jang, S Seo, J Lee, KE Kim. International Conference on Learning Representations, 2021	8	2021
LobsDICE: Offline Imitation Learning from Observation via Stationary Distribution Correction Estimation. GH Kim, J Lee, Y Jang, H Yang, KE Kim. Advances in Neural Information Processing Systems (NeurIPS), 2022	7	2022
Variational Inference for Sequential Data with Future Likelihood Estimates. GH Kim, Y Jang, H Yang, KE Kim. International Conference on Machine Learning, 5296-5305, 2020	4	2020
Prospector: Improving LLM Agents with Self-Asking and Trajectory Ranking. B Kim, Y Jang, L Logeswaran, GH Kim, YJ Kim, H Lee, M Lee. NeurIPS 2023 Foundation Models for Decision Making Workshop, 2023	2	2023
Trust Region Sequential Variational Inference. GH Kim, Y Jang, J Lee, W Jeon, H Yang, KE Kim. Asian Conference on Machine Learning, 1033-1048, 2019	2	2019
Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection. K Lee, D Hwang, S Park, Y Jang, M Lee. arXiv preprint arXiv:2403.14238, 2024		2024
Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration. Y Jang, GH Kim, B Kim, YJ Kim, H Lee, M Lee. Forty-first International Conference on Machine Learning, 2024		2024
SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations. Y Jang, GH Kim, J Lee, S Sohn, B Kim, H Lee, M Lee. Thirty-seventh Conference on Neural Information Processing Systems, 2023		2023
Information-Theoretic State Space Model for Multi-View Reinforcement Learning. HJ Hwang, S Seo, Y Jang, S Kim, GH Kim, S Hong, KE Kim. International Conference on Machine Learning, 2023		2023
