Max K-Armed Bandit: On the ExtremeHunter Algorithm and Beyond M Achab, S Clémençon, A Garivier, A Sabourin, C Vernade Machine Learning and Knowledge Discovery in Databases: European Conference …, 2017 | 16 | 2017 |
Weighted empirical risk minimization: Sample selection bias correction based on importance sampling M Achab, S Cl{\'e}men{\c{c}}on, C Tillier, R Vogel Proceedings of the International Conference on Machine Learning, Artificial …, 2020 | 15* | 2020 |
Profitable bandits M Achab, S Clémençon, A Garivier Asian Conference on Machine Learning, 694-709, 2018 | 10 | 2018 |
Ranking data with continuous labels through oriented recursive partitions S Clémençon, M Achab Advances in Neural Information Processing Systems, 4600-4608, 2017 | 9 | 2017 |
Ranking and risk-aware reinforcement learning| Theses. fr M Achab Institut polytechnique de Paris, 2020 | 6* | 2020 |
One-Step Distributional Reinforcement Learning M Achab, R Alami, YA Dahou Djilali, K Fedyanin, E Moulines Transactions on Machine Learning Research, 2023 | 3 | 2023 |
Distributional deep Q-learning with CVaR regression M Achab, R Alami, YAD Djilali, K Fedyanin, E Moulines, M Panov Deep Reinforcement Learning Workshop NeurIPS 2022, 2022 | 3 | 2022 |
Robustness and risk management via distributional dynamic programming M Achab, G Neu arXiv preprint arXiv:2112.15430, 2021 | 3 | 2021 |
Dimensionality Reduction and (Bucket) Ranking: a Mass Transportation Approach M Achab, A Korba, S Clémençon Algorithmic Learning Theory, 64-93, 2019 | 3 | 2019 |
Investigating Regularization of Self-Play Language Models R Alami, A Abubaker, M Achab, MEA Seddik, S Lahlou arXiv preprint arXiv:2404.04291, 2024 | 1 | 2024 |
A Nested Matrix-Tensor Model for Noisy Multi-view Clustering MEA Seddik, M Achab, H Goulart, M Debbah arXiv preprint arXiv:2305.19992, 2023 | 1 | 2023 |
A Risk-Averse Framework for Non-Stationary Stochastic Multi-Armed Bandits R Alami, M Mahfoud, M Achab MAB-KD Workshop ICDM 2023, 2023 | | 2023 |
Deep Reinforcement Learning Algorithms for Hybrid V2X Communication: A Benchmarking Study F Boukhalfa, R Alami, M Achab, E Moulines, M Bennis https://arxiv.org/abs/2310.03767, 2023 | | 2023 |
Beyond Log-Concavity: Theory and Algorithm for Sum-Log-Concave Optimization M Achab https://arxiv.org/abs/2309.15298, 2023 | | 2023 |
Checkered Regression M Achab TechRxiv preprint, 2022 | | 2022 |