Efficient Beam Alignment in Millimeter Wave Systems Using Contextual Bandits

Hashemi, Morteza; Sabharwal, Ashutosh; Köksal, C. Emre; Shroff, Ness B.

doi:10.1109/infocom.2018.8486279

Cited by 125 publications

(111 citation statements)

References 21 publications

(31 reference statements)

Supporting

Mentioning

111

Contrasting


“…where $b$ represents the optimal beam and $N^{\pi}_{b_i}(T)$ denotes the number of times that $b_i$ has been selected up to time slot $T$. Hence, maximizing the cumulative reward is equivalent to minimizing the expected cumulative regret within $T$ [6], which can be expressed as…”

Section: B Problem Formulation (mentioning)

confidence: 99%

Fast mmwave Beam Alignment via Correlated Bandit Learning

Cheng, Zhang, et al. 2019

IEEE Trans. Wireless Commun.

121


Beam alignment (BA) ensures that the transmitter and receiver beams are accurately aligned to establish a reliable communication link in millimeter-wave (mmwave) systems. Existing BA methods search the entire beam space to identify the optimal transmit-receive beam pair, which incurs significant BA latency, on the order of seconds in the worst case. In this paper, we develop a learning algorithm to reduce BA latency, namely the Hierarchical Beam Alignment (HBA) algorithm. We first formulate the BA problem as a stochastic multi-armed bandit problem with the objective of maximizing the cumulative received signal strength within a certain period. The proposed algorithm takes advantage of the correlation structure among beams, so that information from nearby beams is used to identify the optimal beam instead of searching the entire beam space. Furthermore, prior knowledge of the channel fluctuation is incorporated in the proposed algorithm to further accelerate the BA process. Theoretical analysis indicates that the proposed algorithm is asymptotically optimal. Extensive simulation results demonstrate that the proposed algorithm can identify the optimal beam with high probability and reduce the BA latency from hundreds of milliseconds to a few milliseconds in the multipath channel, compared with the existing BA method in IEEE 802.11ad.
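The regret objective mentioned in the citation statement above can be illustrated with a small sketch. It assumes the textbook definition of expected cumulative regret for a stochastic bandit (the number of pulls of each beam times its gap to the best mean reward); the exact expression used in the citing paper is elided in the quote and is not reproduced here. The function name `cumulative_regret` and the numbers are hypothetical.

```python
import numpy as np


def cumulative_regret(mean_rewards, pull_counts):
    """Expected cumulative regret under the standard stochastic-bandit definition.

    mean_rewards[i] is the (assumed known) mean reward of beam i, and
    pull_counts[i] is how many times beam i was selected up to time T.
    """
    mean_rewards = np.asarray(mean_rewards, dtype=float)
    pull_counts = np.asarray(pull_counts, dtype=float)
    # Gap of every beam to the best beam; the best beam has gap 0.
    gaps = mean_rewards.max() - mean_rewards
    return float(np.dot(pull_counts, gaps))


# Hypothetical example: 4 beams with normalized mean signal strength.
mu = [0.2, 0.9, 0.5, 0.4]
counts = [5, 80, 10, 5]  # selections per beam over T = 100 slots
regret = cumulative_regret(mu, counts)
```

Only selections of suboptimal beams contribute to the sum, which is why maximizing cumulative reward and minimizing this quantity are the same goal.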


“…This assumption is related to beam alignment problem, the reader may see Maschietti et al 16 and Hashemi et al 17 for more details about this issue.…”

Section: Figure (mentioning)

confidence: 99%

Pilot contamination mitigation based on antenna subset transmission for mmWave massive MIMO

Smaili, Djeddou, Azrar 2018

Int J Communication


This paper tackles the problem of pilot contamination (PC) in mmWave massive MIMO cellular systems. We propose an analog precoder based on the antenna subset transmission technique to mitigate PC. This technique makes the interfering signal noise-like, which helps estimate the channel coefficients more efficiently. The strategy is also low cost and introduces no additional complexity. The results show the effectiveness of the proposed precoder in mitigating PC.

KEYWORDS: antenna set transmission, massive MIMO, millimeter wave, pilot contamination

Int J Commun Syst. 2018;31:e3768. wileyonlinelibrary.com/journal/dac


“…A coded beam-alignment scheme is proposed in [11] to correct these errors, but with no consideration of feedback to improve beam-selection. A multi-armed bandit (MAB) formulation based on upper confidence bound (UCB) is proposed in [9], by selecting the beam based on the empirical SNR distribution. A hierarchical beam-alignment scheme based on posterior matching is proposed in [10]: therein, a first-best policy is formulated, which selects the most likely beam pair based on the posterior distribution on the AoA-AoD pair.…”

Section: Introduction (mentioning)

confidence: 99%
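As a rough illustration of a UCB-based bandit formulation like the one the statement above refers to, the following sketch runs plain UCB1 over a handful of beams. This is the generic UCB1 index (empirical mean plus a confidence bonus), not the specific method of [9]; the normalized SNR values, the Gaussian noise model, and the function names are assumptions made for the example.

```python
import numpy as np


def ucb1_select(sums, counts, t):
    """Pick the beam index to try at round t (1-based) using the UCB1 index."""
    counts = np.asarray(counts)
    untried = np.flatnonzero(counts == 0)
    if untried.size > 0:
        # Try every beam once before using the confidence bound.
        return int(untried[0])
    means = np.asarray(sums) / counts
    bonus = np.sqrt(2.0 * np.log(t) / counts)
    return int(np.argmax(means + bonus))


def run_ucb1(mean_snr, horizon, seed=0):
    """Simulate UCB1 on beams with hypothetical normalized mean SNR in [0, 1]."""
    rng = np.random.default_rng(seed)
    k = len(mean_snr)
    sums = np.zeros(k)
    counts = np.zeros(k, dtype=int)
    for t in range(1, horizon + 1):
        beam = ucb1_select(sums, counts, t)
        # Assumed observation model: noisy SNR clipped to [0, 1].
        reward = float(np.clip(rng.normal(mean_snr[beam], 0.1), 0.0, 1.0))
        sums[beam] += reward
        counts[beam] += 1
    return counts


counts = run_ucb1([0.2, 0.8, 0.4, 0.3], horizon=2000)
```

In this toy setting the selection counts concentrate on the beam with the highest mean SNR, while the confidence bonus keeps the other beams sampled occasionally.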

“…We derive lower and upper bounds to the value function, based on which we propose a heuristic policy which selects the beam pair with the second-best preference. We show numerically that this policy strikes a favorable trade-off between exploration and exploitation: instead of greedily choosing the beam corresponding to the most likely AoA-AoD pair (first-best [10]), it chooses the second most likely one, leading to better exploration; at the same time, it avoids wasting precious resources to scan unlikely beam pairs, leading to better exploitation than other MAB techniques, such as linear Thompson sampling (LTS) [12] and UCB [9]. The proposed second-best scheme is shown to outperform first-best [10], LTS-based [12] and UCB-based [9] schemes by up to 7%, 10% and 30% in alignment probability, respectively.…”

Section: Introduction (mentioning)

confidence: 99%

Second-Best Beam-Alignment via Bayesian Multi-Armed Bandits

Hussain, Michelusi 2019

2019 IEEE Global Communications Conference (GLOBECOM)


Millimeter-wave (mm-wave) systems rely on narrow beams to cope with the severe signal attenuation in the mm-wave frequency band. However, susceptibility to beam misalignment due to mobility or blockage requires the use of beam-alignment schemes, with huge cost in terms of overhead and use of system resources. In this paper, a beam-alignment scheme is proposed based on Bayesian multi-armed bandits, with the goal to maximize the alignment probability and the data-communication throughput. A Bayesian approach is proposed, by considering the state as a posterior distribution over angles of arrival (AoA) and of departure (AoD), given the history of feedback signaling and of beam pairs scanned by the base-station (BS) and the user-end (UE). A simplified sufficient statistic for optimal control is identified, in the form of preference of BS-UE beam pairs. By bounding a value function, the second-best preference policy is formulated, which strikes an optimal balance between exploration and exploitation by selecting the beam pair with the current second-best preference. Through Monte-Carlo simulation with analog beamforming, the superior performance of the second-best preference policy is demonstrated in comparison to existing schemes based on first-best preference, linear Thompson sampling, and upper confidence bounds, with up to 7%, 10% and 30% improvements in alignment probability, respectively.
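The selection step that distinguishes the first-best and second-best policies in the abstract above can be sketched as follows. The sketch assumes the preferences of BS-UE beam pairs are already available as a matrix; how the paper computes and updates those preferences is not covered here, and the function name `select_pair` and the example values are hypothetical.

```python
import numpy as np


def select_pair(preference, rank=1):
    """Return the (BS beam, UE beam) pair with the given preference rank.

    rank=0 picks the highest preference (first-best);
    rank=1 picks the next one (second-best).
    """
    preference = np.asarray(preference, dtype=float)
    # Flattened indices sorted by decreasing preference.
    order = np.argsort(preference, axis=None)[::-1]
    bs_beam, ue_beam = np.unravel_index(order[rank], preference.shape)
    return int(bs_beam), int(ue_beam)


# Hypothetical preferences: rows are BS beams, columns are UE beams.
pref = np.array([[0.05, 0.40, 0.10],
                 [0.25, 0.15, 0.05]])
first = select_pair(pref, rank=0)
second = select_pair(pref, rank=1)
```

Scanning the second-ranked pair rather than the top one is what the abstract credits with better exploration, while still avoiding low-preference pairs.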

