- 学术资源搜索

Combinatorial stochastic-greedy bandit

F Fourati, CJ Quinn, MS Alouini… - Proceedings of the AAAI …, 2024 - ojs.aaai.org

We propose a novel combinatorial stochastic-greedy bandit (SGB) algorithm for
combinatorial multi-armed bandit problems when no extra information other than the joint …

被引用次数：5 相关文章所有 7 个版本

[PDF] acm.org Full View

Stochastic Top K-Subset Bandits with Linear Space and Non-Linear Feedback with Applications to Social Influence Maximization

M Agarwal, V Aggarwal, AK Umrawal… - ACM/IMS Transactions on …, 2022 - dl.acm.org

There are numerous real-world problems where a user must make decisions under
uncertainty. For the problem of influence maximization on a social network, for example, the …

被引用次数：10 相关文章所有 2 个版本

[PDF] aaai.org

Dart: Adaptive accept reject algorithm for non-linear combinatorial bandits

M Agarwal, V Aggarwal, AK Umrawal… - Proceedings of the AAAI …, 2021 - ojs.aaai.org

We consider the bandit problem of selecting K out of N arms at each time step. The joint
reward can be a non-linear function of the rewards of the selected individual arms. The …

被引用次数：9 相关文章所有 5 个版本

[PDF] arxiv.org

A contextual combinatorial bandit approach to negotiation

Y Li, Z Mu, S Qi - arXiv preprint arXiv:2407.00567, 2024 - arxiv.org

Learning effective negotiation strategies poses two key challenges: the exploration-
exploitation dilemma and dealing with large action spaces. However, there is an absence of …

[PDF] arxiv.org

Stochastic submodular bandits with delayed composite anonymous bandit feedback

M Pedramfar, V Aggarwal - arXiv preprint arXiv:2303.13604, 2023 - arxiv.org

This paper investigates the problem of combinatorial multiarmed bandits with stochastic
submodular (in expectation) rewards and full-bandit delayed feedback, where the delayed …

被引用次数：1 相关文章所有 2 个版本

[PDF] wiley.com Full View

An online frequency allocation strategy for multi‐carrier radar against spot jammer

Z Shan, L Wang, Z Zhang, Y Liu - IET Radar, Sonar & …, 2024 - Wiley Online Library

Spot jamming poses a significant threat to radar detection due to its ability to rapidly
intercept radar signals and emit high‐power interference. A novel Cognitive Multi‐Carrier …

Online Influence Maximization: Concept and Algorithm

J Guo - arXiv preprint arXiv:2312.00099, 2023 - arxiv.org

In this survey, we offer an extensive overview of the Online Influence Maximization (IM)
problem by covering both theoretical aspects and practical applications. For the integrity of …

Stochastic Top- Subset Bandits with Linear Space and Non-Linear Feedback

M Agarwal, V Aggarwal, CJ Quinn… - Algorithmic Learning …, 2021 - proceedings.mlr.press

Many real-world problems like Social Influence Maximization face the dilemma of choosing
the best $ K $ out of $ N $ options at a given time instant. This setup can be modeled as a …

被引用次数：7 相关文章所有 9 个版本

[PDF] arxiv.org

Combinatorial Bandits for Maximum Value Reward Function under Max Value-Index Feedback

Y Wang, W Chen, M Vojnović - arXiv preprint arXiv:2305.16074, 2023 - arxiv.org

We consider a combinatorial multi-armed bandit problem for maximum value reward
function under maximum value and index feedback. This is a new feedback structure that …

被引用次数：3 相关文章所有 4 个版本

[PDF] purdue.edu

Machine Learning Algorithms for Influence Maximization on Social Networks

AK Umrawal - 2023 - hammer.purdue.edu

With an increasing number of users spending time on social media platforms and engaging
with family, friends, and influencers within communities of interest (such as in fashion …

高级搜索

QQ 群

Combinatorial stochastic-greedy bandit

Stochastic Top K-Subset Bandits with Linear Space and Non-Linear Feedback with Applications to Social Influence Maximization

Dart: Adaptive accept reject algorithm for non-linear combinatorial bandits

A contextual combinatorial bandit approach to negotiation

Stochastic submodular bandits with delayed composite anonymous bandit feedback

An online frequency allocation strategy for multi‐carrier radar against spot jammer

Online Influence Maximization: Concept and Algorithm

Stochastic Top- Subset Bandits with Linear Space and Non-Linear Feedback

Combinatorial Bandits for Maximum Value Reward Function under Max Value-Index Feedback

Machine Learning Algorithms for Influence Maximization on Social Networks

引用