Inductive pairwise ranking: going beyond the n log (n) barrier

A Saha, P Gaillard - International Conference on Machine …, 2022 - proceedings.mlr.press

We study the problem of $K$-armed dueling bandit for both stochastic and adversarial
environments, where the goal of the learner is to aggregate information through relative …

Cited by 23 · Related articles · All 2 versions

[PDF] mlr.press

Preference modeling with context-dependent salient features

A Bower, L Balzano - International Conference on Machine …, 2020 - proceedings.mlr.press

We consider the problem of estimating a ranking on a set of items from noisy pairwise
comparisons given item features. We address the fact that pairwise comparison data often …

Cited by 17 · Related articles · All 7 versions

[PDF] arxiv.org

Versatile dueling bandits: Best-of-both-world analyses for online learning from preferences

A Saha, P Gaillard - arXiv preprint arXiv:2202.06694, 2022 - arxiv.org

We study the problem of $K$-armed dueling bandit for both stochastic and adversarial
environments, where the goal of the learner is to aggregate information through relative …

Cited by 7 · Related articles · All 5 versions

[PDF] mlr.press

Fast and accurate ranking regression

I Yildiz, J Dy, D Erdogmus… - International …, 2020 - proceedings.mlr.press

We consider a ranking regression problem in which we use a dataset of ranked choices to
learn Plackett-Luce scores as functions of sample features. We solve the maximum …

Cited by 13 · Related articles · All 5 versions

[PDF] arxiv.org

Spectral ranking with covariates

SL Chau, M Cucuringu, D Sejdinovic - Joint European Conference on …, 2022 - Springer

We consider spectral approaches to the problem of ranking n players given their incomplete
and noisy pairwise comparisons, but revisit this classical problem in light of player covariate …

Cited by 12 · Related articles · All 6 versions

[PDF] mlr.press

Quadratic metric elicitation for fairness and beyond

G Hiranandani, J Mathur… - Uncertainty in …, 2022 - proceedings.mlr.press

Metric elicitation is a recent framework for eliciting classification performance metrics that
best reflect implicit user preferences based on the task and context. However, available …

Cited by 5 · Related articles · All 8 versions

[PDF] openreview.net

A graph theoretic approach for preference learning with feature information

A Saha, A Rajkumar - The 40th Conference on Uncertainty in …, 2024 - openreview.net

We consider the problem of ranking a set of $n$ items given a sample of their pairwise
preferences. It is well known from the classical results of sorting literature that without any …

[PDF] arxiv.org

CURATRON: Complete Robust Preference Data for Robust Alignment of Large Language Models

ST Nguyen, NU Naresh, T Tulabandhula - arXiv preprint arXiv:2403.02745, 2024 - arxiv.org

This paper addresses the challenges of aligning large language models (LLMs) with human
values via preference learning (PL), with a focus on the issues of incomplete and corrupted …

Cited by 1 · Related articles

[PDF] northeastern.edu

Sample complexity of rank regression using pairwise comparisons

B Kadıoğlu, P Tian, J Dy, D Erdoğmuş, S Ioannidis - Pattern Recognition, 2022 - Elsevier

We consider a rank regression setting, in which a dataset of N samples with features in $\mathbb{R}^d$
is ranked by an oracle via M pairwise comparisons. Specifically, there exists a latent total …

Cited by 2 · Related articles · All 4 versions

Ranking with features: Algorithm and a graph theoretic analysis

A Saha, A Rajkumar - arXiv preprint arXiv:1808.03857, 2018 - arxiv.org

We consider the problem of ranking a set of items from pairwise comparisons in the
presence of features associated with the items. Recent works have established that $ O …

Cited by 6 · Related articles · All 2 versions
