Combinatorial stochastic-greedy bandit

F Fourati, CJ Quinn, MS Alouini… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
We propose a novel combinatorial stochastic-greedy bandit (SGB) algorithm for
combinatorial multi-armed bandit problems when no extra information other than the joint …

Stochastic Top K-Subset Bandits with Linear Space and Non-Linear Feedback with Applications to Social Influence Maximization

M Agarwal, V Aggarwal, AK Umrawal… - ACM/IMS Transactions on …, 2022 - dl.acm.org
There are numerous real-world problems where a user must make decisions under
uncertainty. For the problem of influence maximization on a social network, for example, the …

Dart: Adaptive accept reject algorithm for non-linear combinatorial bandits

M Agarwal, V Aggarwal, AK Umrawal… - Proceedings of the AAAI …, 2021 - ojs.aaai.org
We consider the bandit problem of selecting K out of N arms at each time step. The joint
reward can be a non-linear function of the rewards of the selected individual arms. The …

A contextual combinatorial bandit approach to negotiation

Y Li, Z Mu, S Qi - arXiv preprint arXiv:2407.00567, 2024 - arxiv.org
Learning effective negotiation strategies poses two key challenges: the exploration-
exploitation dilemma and dealing with large action spaces. However, there is an absence of …

Stochastic submodular bandits with delayed composite anonymous bandit feedback

M Pedramfar, V Aggarwal - arXiv preprint arXiv:2303.13604, 2023 - arxiv.org
This paper investigates the problem of combinatorial multiarmed bandits with stochastic
submodular (in expectation) rewards and full-bandit delayed feedback, where the delayed …

An online frequency allocation strategy for multi‐carrier radar against spot jammer

Z Shan, L Wang, Z Zhang, Y Liu - IET Radar, Sonar & …, 2024 - Wiley Online Library
Spot jamming poses a significant threat to radar detection due to its ability to rapidly
intercept radar signals and emit high‐power interference. A novel Cognitive Multi‐Carrier …

Online Influence Maximization: Concept and Algorithm

J Guo - arXiv preprint arXiv:2312.00099, 2023 - arxiv.org
In this survey, we offer an extensive overview of the Online Influence Maximization (IM)
problem by covering both theoretical aspects and practical applications. For the integrity of …

Stochastic Top- Subset Bandits with Linear Space and Non-Linear Feedback

M Agarwal, V Aggarwal, CJ Quinn… - Algorithmic Learning …, 2021 - proceedings.mlr.press
Many real-world problems like Social Influence Maximization face the dilemma of choosing
the best $ K $ out of $ N $ options at a given time instant. This setup can be modeled as a …

Combinatorial Bandits for Maximum Value Reward Function under Max Value-Index Feedback

Y Wang, W Chen, M Vojnović - arXiv preprint arXiv:2305.16074, 2023 - arxiv.org
We consider a combinatorial multi-armed bandit problem for maximum value reward
function under maximum value and index feedback. This is a new feedback structure that …

Machine Learning Algorithms for Influence Maximization on Social Networks

AK Umrawal - 2023 - hammer.purdue.edu
With an increasing number of users spending time on social media platforms and engaging
with family, friends, and influencers within communities of interest (such as in fashion …