Learning to cooperate via policy search L Peshkin, KE Kim, N Meuleau, LP Kaelbling arXiv preprint cs/0105032, 2001 | 394 | 2001 |
Solving very large weakly coupled Markov decision processes N Meuleau, M Hauskrecht, KE Kim, L Peshkin, LP Kaelbling, TL Dean, ... AAAI/IAAI 8, 2, 1998 | 292 | 1998 |
Learning finite-state controllers for partially observable environments N Meuleau, L Peshkin, KE Kim, LP Kaelbling arXiv preprint arXiv:1301.6721, 2013 | 267 | 2013 |
Solving POMDPs by searching the space of finite policies N Meuleau, KE Kim, LP Kaelbling, AR Cassandra arXiv preprint arXiv:1301.6720, 2013 | 255 | 2013 |
End-to-end neural pipeline for goal-oriented dialogue systems using GPT-2 D Ham, JG Lee, Y Jang, KE Kim Proceedings of the 58th annual meeting of the association for computational …, 2020 | 214 | 2020 |
Inverse reinforcement learning in partially observable environments JD Choi, KE Kim Journal of Machine Learning Research 12, 691-730, 2011 | 189 | 2011 |
Nonparametric Bayesian inverse reinforcement learning for multiple reward functions J Choi, KE Kim Advances in neural information processing systems 25, 2012 | 174 | 2012 |
An improved particle filter with a novel hybrid proposal distribution for quantitative analysis of gold immunochromatographic strips N Zeng, Z Wang, H Zhang, KE Kim, Y Li, X Liu IEEE Transactions on Nanotechnology 18, 819-829, 2019 | 158 | 2019 |
Map inference for bayesian inverse reinforcement learning J Choi, KE Kim Advances in neural information processing systems 24, 2011 | 111 | 2011 |
Hand grip pattern recognition for mobile user interfaces KE Kim, W Chang, SJ Cho, J Shim, H Lee, J Park, Y Lee, S Kim Proceedings of IAAI 21 (2), 1789, 2006 | 110 | 2006 |
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation J Lee, W Jeon, BJ Lee, J Pineau, KE Kim ICML, 2021 | 85 | 2021 |
Closing the gap: Improved bounds on optimal POMDP solutions P Poupart, KE Kim, D Kim Proceedings of the International Conference on Automated Planning and …, 2011 | 84 | 2011 |
Approximate linear programming for constrained partially observable Markov decision processes P Poupart, A Malhotra, P Pei, KE Kim, B Goh, M Bowling Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015 | 77 | 2015 |
Method and apparatus for inputting function of mobile terminal using user's grip posture while holding mobile terminal S Cho, HJ Lee, JA Park, W Chang, KE Kim US Patent 8,055,305, 2011 | 76 | 2011 |
Point-based value iteration for constrained POMDPs D Kim, J Lee, KE Kim, P Poupart IJCAI 11, 1968-1974, 2011 | 72 | 2011 |
Monte-Carlo tree search for constrained POMDPs J Lee, GH Kim, P Poupart, KE Kim Advances in Neural Information Processing Systems 31, 2018 | 69 | 2018 |
Demodice: Offline imitation learning with supplementary imperfect demonstrations GH Kim, S Seo, J Lee, W Jeon, HJ Hwang, H Yang, KE Kim International Conference on Learning Representations, 2021 | 68 | 2021 |
Exploration in gradient-based reinforcement learning N Meuleau, L Peshkin, KE Kim | 64 | 2001 |
Multi-view automatic lip-reading using neural network D Lee, J Lee, KE Kim Computer Vision–ACCV 2016 Workshops: ACCV 2016 International Workshops …, 2017 | 63 | 2017 |
Bayesian nonparametric feature construction for inverse reinforcement learning J Choi, KE Kim Twenty-Third International Joint Conference on Artificial Intelligence, 2013 | 58 | 2013 |