Dual graph attention networks for deep latent representation of multifaceted social effects in recommender systems Q Wu, H Zhang, X Gao, P He, P Weng, H Gao, G Chen The world wide web conference, 2091-2102, 2019 | 335 | 2019 |
Top-k selection based on adaptive sampling of noisy preferences R Busa-Fekete, B Szorenyi, P Weng, W Cheng, E Hüllermeier International Conference on Machine Learning, 1094-1102, 2013 | 93 | 2013 |
Analytics and machine learning in vehicle routing research R Bai, X Chen, ZL Chen, T Cui, S Gong, W He, X Jiang, H Jin, J Jin, ... International Journal of Production Research 61 (1), 4-30, 2023 | 92 | 2023 |
Learning fair policies in multi-objective (deep) reinforcement learning with average and discounted rewards U Siddique, P Weng, M Zimmer International Conference on Machine Learning, 8905-8915, 2020 | 85 | 2020 |
Teacher-student framework: a reinforcement learning approach M Zimmer, P Viappiani, P Weng AAMAS Workshop autonomous robots and multirobot systems, 2014 | 79 | 2014 |
Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm R Busa-Fekete, B Szörényi, P Weng, W Cheng, E Hüllermeier Machine learning 97, 327-351, 2014 | 69 | 2014 |
A survey on interpretable reinforcement learning C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao, W Liu Machine Learning, 1-44, 2024 | 64 | 2024 |
On finding compromise solutions in multiobjective Markov decision processes P Perny, P Weng ECAI 2010, 969-970, 2010 | 64 | 2010 |
Dual sequential prediction models linking sequential recommendation and information dissemination Q Wu, Y Gao, X Gao, P Weng, G Chen Proceedings of the 25th ACM SIGKDD international conference on knowledge …, 2019 | 61 | 2019 |
Optimization of probabilistic argumentation with Markov decision models E Hadoux, A Beynier, N Maudet, P Weng, A Hunter International Joint Conference on Artificial Intelligence, 2015 | 55 | 2015 |
Qualitative multi-armed bandits: A quantile-based approach B Szorenyi, R Busa-Fekete, P Weng, E Hüllermeier International Conference on Machine Learning, 1660-1668, 2015 | 55 | 2015 |
Learning fair policies in decentralized cooperative multi-agent reinforcement learning M Zimmer, C Glanois, U Siddique, P Weng International Conference on Machine Learning, 12967-12978, 2021 | 47 | 2021 |
Multi-objective bandits: Optimizing the generalized gini index R Busa-Fekete, B Szörényi, P Weng, S Mannor International Conference on Machine Learning, 625-634, 2017 | 44 | 2017 |
Decomposition methods for distributed optimal power flow: panorama and case studies of the DC model MH Amini, S Bahrami, F Kamyab, S Mishra, R Jaddivada, K Boroojeni, ... Classical and recent aspects of power system optimization, 137-155, 2018 | 43 | 2018 |
Interactive value iteration for markov decision processes with unknown rewards P Weng, B Zanuttini IJCAI'13-Twenty-Third international joint conference on Artificial …, 2013 | 42 | 2013 |
Algebraic Markov decision processes P Perny, O Spanjaard, P Weng 19th International Joint Conference on Artificial Intelligence, 1372-1377, 2005 | 42 | 2005 |
Invariant transform experience replay: Data augmentation for deep reinforcement learning Y Lin, J Huang, M Zimmer, Y Guan, J Rojas, P Weng IEEE Robotics and Automation Letters 5 (4), 6615-6622, 2020 | 40 | 2020 |
Sequential decision-making under non-stationary environments via sequential change-point detection E Hadoux, A Beynier, P Weng Learning over multiple contexts (LMCE), 2014 | 40 | 2014 |
Hierarchical electric vehicle charging aggregator strategy using Dantzig-Wolfe decomposition MH Amini, P McNamara, P Weng, O Karabasoglu, Y Xu IEEE Design & Test 35 (6), 25-36, 2017 | 36 | 2017 |
A survey of reinforcement learning from human feedback T Kaufmann, P Weng, V Bengs, E Hüllermeier arXiv preprint arXiv:2312.14925, 2023 | 35 | 2023 |