Deep reinforcement learning for dynamic multichannel access in wireless networks

S Wang, H Liu, PH Gomes… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
We consider a dynamic multichannel access problem, where multiple correlated channels
follow an unknown joint Markov model and users select the channel to transmit data. The …

Decentralized learning for multiplayer multiarmed bandits

D Kalathil, N Nayyar, R Jain - IEEE Transactions on Information …, 2014 - ieeexplore.ieee.org
We consider the problem of distributed online learning with multiple players in multiarmed
bandit (MAB) models. Each player can pick among multiple arms. When a player picks an …

Deep reinforcement learning techniques in diversified domains: a survey

S Gupta, G Singal, D Garg - Archives of Computational Methods in …, 2021 - Springer
There have been tremendous improvements in deep learning and reinforcement learning
techniques. Automating learning and intelligence to the full extent remains a challenge. The …

Low-complexity, low-regret link rate selection in rapidly-varying wireless channels

H Gupta, A Eryilmaz, R Srikant - IEEE INFOCOM 2018-IEEE …, 2018 - ieeexplore.ieee.org
We consider the problem of transmitting at the optimal rate over a rapidly-varying wireless
channel with unknown statistics when the feedback about channel quality is very limited …

Online learning for multi-channel opportunistic access over unknown markovian channels

W Dai, Y Gai, B Krishnamachari - 2014 Eleventh Annual IEEE …, 2014 - ieeexplore.ieee.org
A fundamental theoretical problem in opportunistic spectrum access is the following: a single
secondary user must choose a channel to sense and access at each time, with the …

A Truthful Pricing-based Defending Strategy against Adversarial Attacks in Budgeted Combinatorial Multi-armed Bandits

H Wang, E Wang, Y Yang, B Yang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
We study defending strategies against adversarial attacks on Combinatorial Multi-Armed
Bandits (CMAB) algorithms. CMAB is an effective sequence decision making tool that has …

基于多臂赌博机模型的信道选择.

朱江, 陈红翠, 熊加毫 - Telecommunication Engineering, 2015 - search.ebscohost.com
(Chongqing Key Laboratory of Mobile Communications Technology, Chongqing University
of Posts and Telecommunications, Chongqing 400065, China) Abstract: In the opportunistic …

QoS-compliant sequential channel sensing for cognitive radios

T Shu, H Li - IEEE Journal on Selected Areas in …, 2014 - ieeexplore.ieee.org
In this paper, we study the quality-of-service (QoS) support for realtime traffic in cognitive
radio (CR) networks when spectrum availability and quality is not known a priori. A resource …

Minimizing File Transfer Time in Opportunistic Spectrum Access Model

J Hu, V Doshi - IEEE Transactions on Mobile Computing, 2022 - ieeexplore.ieee.org
We study the file transfer problem in opportunistic spectrum access (OSA) model, which has
been widely studied in throughput-oriented applications for max-throughput strategies and …

On a restless multi-armed bandit problem with non-identical arms

N Nayyar, Y Gai… - 2011 49th Annual Allerton …, 2011 - ieeexplore.ieee.org
We consider the following learning problem motivated by opportunistic spectrum access in
cognitive radio networks. There are N independent Gilbert-Elliott channels with possibly non …