End-to-end neural pipeline for goal-oriented dialogue systems using GPT-2 | D Ham, JG Lee, Y Jang, KE Kim | Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020 | 217 | 2020 |
GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems | Y Jang, J Lee, KE Kim | International Conference on Learning Representations, 2022 | 45 | 2022 |
Bayes-adaptive Monte-Carlo planning and learning for goal-oriented dialogues | Y Jang, J Lee, KE Kim | Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7994-8001, 2020 | 24 | 2020 |
Neural dialog state tracker for large ontologies by attention mechanism | Y Jang, J Ham, BJ Lee, Y Chang, KE Kim | 2016 IEEE Spoken Language Technology Workshop (SLT), 531-537, 2016 | 17 | 2016 |
Cross-language neural dialog state tracker for large ontologies using hierarchical attention | Y Jang, J Ham, BJ Lee, KE Kim | IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (11 …, 2018 | 12 | 2018 |
PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules | Y Jang, J Lee, J Park, KH Lee, P Lison, KE Kim | Proceedings of the 2019 Conference on Empirical Methods in Natural Language …, 2019 | 11 | 2019 |
Constrained Bayesian Reinforcement Learning via Approximate Linear Programming | J Lee, Y Jang, P Poupart, KE Kim | IJCAI, 2088-2095, 2017 | 11 | 2017 |
Monte-Carlo planning and learning with language action value estimates | Y Jang, S Seo, J Lee, KE Kim | International Conference on Learning Representations, 2021 | 8 | 2021 |
LobsDICE: Offline Imitation Learning from Observation via Stationary Distribution Correction Estimation | GH Kim, J Lee, Y Jang, H Yang, KE Kim | Advances in Neural Information Processing Systems (NeurIPS), 2022 | 7 | 2022 |
Variational Inference for Sequential Data with Future Likelihood Estimates | GH Kim, Y Jang, H Yang, KE Kim | International Conference on Machine Learning, 5296-5305, 2020 | 4 | 2020 |
Prospector: Improving LLM Agents with Self-Asking and Trajectory Ranking | B Kim, Y Jang, L Logeswaran, GH Kim, YJ Kim, H Lee, M Lee | NeurIPS 2023 Foundation Models for Decision Making Workshop, 2023 | 2 | 2023 |
Trust Region Sequential Variational Inference | GH Kim, Y Jang, J Lee, W Jeon, H Yang, KE Kim | Asian Conference on Machine Learning, 1033-1048, 2019 | 2 | 2019 |
Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection | K Lee, D Hwang, S Park, Y Jang, M Lee | arXiv preprint arXiv:2403.14238, 2024 | | 2024 |
Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration | Y Jang, GH Kim, B Kim, YJ Kim, H Lee, M Lee | Forty-first International Conference on Machine Learning, 2024 | | 2024 |
SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations | Y Jang, GH Kim, J Lee, S Sohn, B Kim, H Lee, M Lee | Thirty-seventh Conference on Neural Information Processing Systems, 2023 | | 2023 |
Information-Theoretic State Space Model for Multi-View Reinforcement Learning | HJ Hwang, S Seo, Y Jang, S Kim, GH Kim, S Hong, KE Kim | International Conference on Machine Learning, 2023 | | 2023 |