Machine Learning Approaches for Sharing Unlicensed Millimeter-Wave Bands in Heterogeneously Integrated Sensing and Communication Networks

Tang, Chunju; Liu, Yanping

doi:10.3390/electronics12204193

Open AccessArticle

Machine Learning Approaches for Sharing Unlicensed Millimeter-Wave Bands in Heterogeneously Integrated Sensing and Communication Networks

by

Chunju Tang

¹ and

Yanping Liu

^2,*

¹

School of Humanities, Guizhou University of Finance and Economics, Guiyang 550025, China

²

College of Big Data Statistics, Guizhou University of Finance and Economics, Guiyang 550025, China

^*

Author to whom correspondence should be addressed.

Electronics 2023, 12(20), 4193; https://doi.org/10.3390/electronics12204193

Submission received: 26 July 2023 / Revised: 21 September 2023 / Accepted: 29 September 2023 / Published: 10 October 2023

(This article belongs to the Section Microwave and Wireless Communications)

Download

Browse Figures

Versions Notes

Abstract

:

Due to the increasing demand of high data rate, spectrum scarcity is a key problem for providing unprecedented capacity in diversified applications for future wireless networks. Therefore, the efficiently shared use of unlicensed bands is one of the promising solutions for addressing the spectrum scarcity issue. We study decentralized machine learning approaches using the paradigm of integrated sensing and communication (ISAC) for the shared use of unlicensed millimeter-wave bands. We first present a 5G–WiFi fusion protocol stack for sharing unlicensed millimeter-wave bands, and then design an ISAC-based access protocol and an ISAC based coexistence protocol integrated with decentralized learning function to achieve the efficiently shared use of unlicensed bands. Using the coexistence protocol, we propose promising decentralized machine learning approaches to share unlicensed millimeter-wave bands. Finally, simulations are provided to verify the performance of the proposed scheme, where the results have shown that the proposed scheme greatly reduces the search space of the solution and effectively protects the communication performance of the WiFi system compared to traditional schemes, which indicates that simultaneous transmissions of 5G-U and WiFi at the 60 GHz band are feasible under the proposed scheme.

Keywords:

machine learning; unlicensed millimeter-wave bands; 5G–WiFi fusion protocol; coexistence protocol; heterogeneous networks

1. Introduction

With the proliferation of various smart devices, the traffic of wireless data services will increase by 10,000 times by the year 2030 [1,2,3]. In this context, spectrum scarcity is a key challenge to provide such an unprecedented capacity for massive wireless applications such as industrial internet of things [4,5,6,7]. In order to meet the surging traffic demand, it is generally believed that the capacities of future wireless networks can be enhanced based on three dimensions, i.e., improving spectrum efficiency, increasing cell density and expanding the spectrum. However, due to the limitations of physical layer technologies, the capacity improvement from the spectral efficiency enhancement is limited. Therefore, expanding the spectrum with the network densification is a more direct way to enhance the system capacity. Although licensed bands have good transmission characteristics, they are very crowded and thus cannot meet the demand of high rate and large bandwidth in future wireless communication networks. Consequently, unlicensed bands have been recently proposed to be used by cellular systems for improving the capacities of future wireless networks. Nevertheless, the cellular system that accesses the same unlicensed band may degrade the performance of existing systems, such as the WiFi system. Therefore, to make full use of unlicensed bands, the efficiently shared use of unlicensed bands by multiple systems under the premise of protecting the performance of existing systems is of critical importance to enhance the overall capacities of future wireless networks.

Recently, many research efforts have been made to investigate the shared use of unlicensed bands from both the academic and industry communities. To be specific, LTE-Advanced has been suggested for deployment in the unlicensed band of 5 GHz as supplemental downlink by Qualcomm and Ericsson, which is known as LTE-Unlicensed (LTE-U) [8]. Furthermore, since 2014, Huawei, NTT and DoCoMo have also been studying licensed-assisted access through LTE (LAA-LTE). By deploying pre-commercial multi-cell networks, all the above companies have demonstrated that LTE-U/LAA-LTE has better capacity and coverage performance at the unlicensed band of 5 GHz than WiFi alone. The analytical and experiment results in reference [9] have also shown that LTE-U users are able to manage interference effectively through coordinating with existing users, and thus can achieve harmonious coexistence with existing unlicensed users. In reference [10], the channel allocation problem between LTE-U and device-to-device users, for sharing both unlicensed and licensed bands, has been studied considering the co-channel interference, where the results have shown that the overall capacity can be largely improved when unlicensed bands are shared by multiple systems. In reference [11], a coexistence mechanism was proposed to manage the backoff of LTE-U and WiFi for sharing the 5 GHz band, which has a performance superior to the listen-before-talk and carrier sense adaptive transmission. In reference [12], a survey of spectrum sharing technologies in LTE was presented, where the television (TV) white-space band, 3.5 GHz and 5 GHz unlicensed bands were considered. In reference [13], the joint interference coordination mechanism and spectrum sharing protocol in unlicensed bands were designed, where the results indicate that they can enhance the throughput of LTE-U and satisfy the traffic needs of 802.11 users. In reference [14], a duty cycle adjustment approach was designed to dynamically configure transmission gaps to ensure better coexistence between LTE and WiFi, where the results have indicated that the proposed approach can improve energy efficiency and throughput. In reference [15], an investigation was conducted on Internet of Things technologies deployed within unlicensed spectrum and licensed cellular, such as shared spectrum management and interference management. In reference [16], a matching-based unlicensed and licensed bands allocation scheme was proposed to optimize the cellular users’ utilities as well as guarantee the rate constraints for both WiFi and cellular users. In reference [17], joint resource management and a user association problem were formulated to maximize the number of QoS-preferred users, where a listen-before-talk-based flexible coexistence framework was proposed. In reference [18], a blockchain-based approach was designed for achieving secure spectrum sharing for machine-to-machine and human-to-human coexistence in heterogeneous networks, where a comprehensive architecture overview of the proposed framework and its implementation procedures were provided. In reference [19], the LTE/WiFi spectrum sharing problem, to maximize the LTE throughput while protecting the WiFi system, was investigated, where the simulation results show that, even without signalling exchanges, the proposed scheme has a similar performance to that of the genie-aided exhaustive search algorithm. In reference [20], the integrated use of 28GHz and 60GHz bands over 5G Unlicensed (5G-U) was proposed, and strategies of spectrum management in 5G-U were also discussed, where the results show that the harmonized utilization of unlicensed and licensed bands can largely enhance the performance in both network-centric and device-centric metrics. In reference [21], a data-driven spectrum-sharing scheme among multiple operators was designed to enhance the spectrum utilization in millimeter-wave (mmWave) cellular networks. In reference [22], clustering-based spectrum-sharing schemes with lower overhead and complexity were proposed to improve the sum rate for a multi-operator mmWave cellular network.

However, most of previous works concentrate mainly on the shared use of unlicensed bands with lower frequency. As the unlicensed bands with lower frequency are becoming more crowded, there is an increasing need for utilizing unlicensed mmWave bands that have a potentially large bandwidth. Although existing works [21,22] investigated the mmWave spectrum sharing problem, they mainly focused on sharing the licensed mmWave bands between homogeneous systems. Therefore, we considered the spectrum sharing in V-Band (57–64 GHz) mainly for the following reasons. First, although the 28 GHz and E-Band (71–76 GHz and 81–86 GHz), that can be used for the 5G mmWave cellular networks, offer huge spectrum availability in the order of some GHz, they may be insufficient to support the ever-growing bandwidth-hungry scenarios, which require data rates on the order of up to tens of Gbit/s, such as massive drone surveillance, autonomous driving, Virtual Reality (VR), Augmented Reality (AR), Synchronized Reality (SR), and Mixed Reality (MR)[23]. Second, existing wireless systems like IEEE 802.11ad and IEEE 802.11ay have worked on V-Band, however, other communication systems may join in the future due to the unlicensed feature. Therefore, if future cellular systems want to utilize V-Band, efficient spectrum sharing is the best way to enhance the spectral efficiency and protect the performance of existing systems.

Despite the many advantages of the shared use of unlicensed mmWave bands, it is still facing some technical challenges. Firstly, due to the different medium access control (MAC) mechanisms, coexistent systems that share the same unlicensed band cannot implement information interaction, which makes it difficult to share the spectrum efficiently. Secondly, since coexistent systems are generally not able to acquire full knowledge of a wireless environment and complete information on the opposite actions, most existing schemes, relying on the assumptions that coexistent systems have global information of the wireless environment, will be infeasible in practice. Finally, with the increasing of network density, centralized solutions will bring significant signaling overheads and, thus, are less efficient despite their optimality. For these reasons, one key obstacle to achieving the efficiently shared use of unlicensed bands is how to design an efficient coexistence protocol and efficiently decentralized learning algorithms for spectrum sharing. Fortunately, the emergence of the paradigm of integrated sensing and communication (ISAC), that is regarded as an emerging technology towards 6G and next generation WLAN [24], provides potential solutions for coexistence and information exchange between heterogeneous systems. On the other hand, the decentralized machine learning theory provides a mathematical tool for studying scenarios with incomplete information in wireless environments, thereby avoiding the heavy signaling overhead brought on by the frequent exchange of global information [25,26,27]. Therefore, ISAC and decentralized learning approaches can be applied to the shared use of unlicensed bands for obtaining efficient solutions. The scope of this article is thus to study integrated ISAC and decentralized learning approaches for the shared use of unlicensed mmWave bands.

Motivated by these challenges, in this paper, different from existing works [21,22] that ignored mmWave spectrum sharing between heterogeneous systems, we study the unlicensed mmWave bands sharing problem in heterogeneous networks, to maximize the sum rate of all cellular user equipments (CUEs) as well as protecting the performance of each piece of WiFi user equipment (WUE). The contributions are summarized as follows.

We present a fusion MAC framework for 5G-U systems, based on which we design an ISAC-based access protocol and an ISAC-based coexistence protocol for CUEs and WUEs to efficiently share unlicensed mmWave bands.
We investigate the unlicensed mmWave bands sharing problem for maximizing the sum rate of CUEs, with consideration of the interference constraints of each WUE and the limitation of radio frequency (RF) links for each mmWave gNB in heterogeneous networks.
We formulate the considered problem as a potential game and prove the existence of the Nash equilibrium (NE) for the proposed game, and then propose five decentralized machine learning algorithms to obtain the NE of the proposed game.
We provide extensive simulations to evaluate the performances of the proposed scheme from various aspects, where the results have demonstrated the effectiveness of the proposed scheme, and have shown that the proposed scheme can enhance the spectrum utilization and reduce the search space for the considered problem, as well as effectively guarantee the performance of the existing system.

The rest of this article is arranged as follows. In Section 2, the access protocol, coexistence protocol, system model and problem formulation are presented. The proposed potential game is analyzed in Section 3. In Section 4, decentralized machine learning solutions to the shared use of unlicensed bands are designed. In Section 5, a performance evaluation is provided. The conclusions are drawn in Section 6.

2. Coexistence Protocol, System Model and Problem Formulation

2.1. Coexistence Protocol for Sharing Unlicensed Bands

There are a lot of unlicensed bands utilized for various applications, such as the very crowded 900 MHz and 2.4 GHz bands, 3.5 GHz, 5 GHz and 60 GHz bands. More specifically, the band at 900 MHz is currently utilized by cordless phones and ZigBee. The bands at 2.4 GHz and 5 GHz are proposed to be used by the LTE technology as a secondary carrier [9], which are currently also used for Bluetooth and WiFi. The 3.5 GHz band, which is mainly used for satellite and military communications, has been also proposed to be used for small cell deployment scenarios. The 60 GHz bands, which belong to mmWave bands, are well suited for short distance communications such as peer-to-peer communications, 5G cellular and Gbps WiFi (e.g., IEEE 802.11ad/802.11 ay). Since 900 MHz and 2.4 GHz bands are very crowded, the unlicensed bands can be more likely shared by the cellular system, and other communication systems are mainly 3.5 GHz, 5 GHz and 60 GHz bands. The characteristics of these unlicensed bands vary greatly, and are listed in Table 1. The major technologies for the shared use of unlicensed bands to improve utilization efficiency fall into four aspects, i.e., user association, channel allocation/selection, power control and transmission duration control. Without a loss of generality, we concentrate on the shared use of unlicensed mmWave bands by user association and power control, however, the proposed algorithms can be easily expanded to channel allocation/selection and transmission duration control.

Inspired by reference [13], we design a 5G–WiFi fusion protocol stack, as shown in Figure 1, which consists of a cellular part and a WiFi part where each part includes an independent RF unit. The 5G–WiFi fusion protocol stack has the following functions: (i) the WiFi part of the 5G–WiFi fusion protocol stack detects and decodes the physical layer (PHY) frame from WiFi systems in order to obtain the interference level from each CUE link to each WUE link; (ii) the WiFi part of the 5G-WiFi fusion protocol stack reports this interference information to the cellular part, and then the cellular part informs all CUEs of this inference information. Using the 5G–WiFi fusion protocol stack, we design an access protocol for CUEs and WUEs to share the unlicensed mmWave bands, as shown in Figure 2, where CUEs first use beams embedded in orthogonal sequences with fixed transmission power to conduct beamforming training, and, at the same time, the WiFi AP can sense and detect the interference level from each cellular link to each WiFi link. After that, the WiFi AP broadcasts the PHY frame, which includes the detected interference information. The WiFi part of the 5G–WiFi fusion protocol detects and decodes the PHY frame from the WiFi AP, then reports the result to the cellular part of the 5G–WiFi fusion protocol. The cellular part of the 5G gNB informs all CUEs of this inference information, based on which of the CUEs first estimates the channel gains from their available beam association directions to WiFi transmission links and then adjust their respective transmission strategy space to avoid strong inference to WiFi transmission links. After that, all CUEs perform decentralized machine learning to learn the joint user association and transmission power selection scheme, based on which all cellular links transmit data. Next, we propose a coexistence protocol with decentralized learning function for the shared use of unlicensed millimeter-wave bands, on which many learning algorithms can be run to obtain the solutions. In particular, motivated by reference [10], we design an integrated sensing- and communication-based coexistence protocol integrated with decentralized learning function for 5G-U and WiFi, coexisting as shown in Figure 3, where the full duty cycle includes sensing subframes (SSs), reserved WiFi subframes and sharing transmission subframes. The sensing subframe is used to find the cleanest channel to avoid collisions with on-going transmissions as well as to learn the optimal action for each transmission link, and the reserved WiFi subframe is used for the exclusive transmission of WiFi to protect the WiFi performance, while the sharing transmission subframe is designed for simultaneous transmissions of 5G-U and WiFi to improve the spectrum efficiency because of the short transmission range of mmWave communications. Compared to the traditional centralized scheduling scheme, the proposed scheme can not only be used for spectrum sharing in sub 6GHz bands, but also can be utilized for adaptively selecting the transmission strategy according to the changes of the network under highly dynamic interference environments.

2.2. System Model and Problem Formulation

Consider an uplink heterogeneous network including N cellular gNBs (i.e., 5G NodeB), L WiFi access points (APs), U CUEs and W WUEs, where the sets of gNBs, APs, CUEs and WUEs can be expressed as

N = {1, 2, \dots, N}

,

L = {1, 2, \dots, L}

,

U = {1, 2, \dots, U}

and

W = {1, 2, \dots, W}

, respectively. The channel gain from CUE u to gNB n, including the large-scale and small-scale fading, is expressed as

\begin{matrix} g_{u, n}^{c} = h_{u, n}^{2} 10^{- \frac{{\bar{L}}_{u, n}}{10}}, \end{matrix}

(1)

in which

{\bar{L}}_{u, n}

represents the average path loss between gNB n and CUE u in dB, and

h_{u, n}

is the channel gain of small-scale fading. In the same way, we can obtain the interference channel gains between CUEs and APs, and between WUEs and gNBs. For the blockage model of the link from CUE u to gNB n, the model in reference [28,29] is considered, and then the line-of-sight (LOS) communication probability for the transmission link with length

d_{u, n}

can be defined as

\begin{matrix} P r_{u, n}^{l o s} = \exp (- α d_{u, n}), \end{matrix}

(2)

where

d_{u, n}

represents the distance from CUE u to gNB n, and

α

is used to capture the density and size of obstacles. The path loss between CUE u and gNB n in dB at 60 GHz frequency for LOS and for non-line-of-sight (NLOS) can be given by [30]

L_{u, n} = \{\begin{matrix} 68 + 21.7 \log (d [m]) + η_{L O S}, for LOS link \\ 68 + 30.1 \log (d [m]) + η_{N L O S}, for NLOS link, \end{matrix}

(3)

where

η_{L O S} \in CN (0, 0.88)

and

η_{N L O S} \in CN (0, 1.55)

.

Then, the average path loss between gNB n and CUE u can be given by

\begin{matrix} {\bar{L}}_{u, n} = P r_{u, n}^{l o s} L_{u, n}^{l o s} + (1 - P r_{u, n}^{l o s}) L_{u, n}^{n l o s} . \end{matrix}

(4)

In this work, we use the sector antenna model in reference [31,32,33,34,35] to approximate the antenna gain. Let

ϕ_{u, n}^{t}

and

ϕ_{u, n}^{r}

be the beamwidths of the transmitter u and receiver n, respectively. The transmission gain between the transmitter u and receiver n can be expressed as

\begin{matrix} g_{u, n}^{t} (ϕ_{u, n}^{t}) = \{\begin{matrix} \frac{2 π - (2 π - ϕ_{u, n}^{t}) z}{ϕ_{u, n}^{t}}, & in the main lobe, \\ z, & in side lobes, \end{matrix} \end{matrix}

(5)

in which

0 \leq z < 1

denotes the gain of the side lobe. By replacing

ϕ_{u, n}^{t}

with

ϕ_{u, n}^{r}

, the reception gain

g_{u, n}^{r} (ϕ_{u, n}^{r})

between receiver n and transmitter u can be similarly obtained.

Then, the signal to interference and noise ratio (SINR) of the signal from CUE u to gNB n at the receiving end is given by

\begin{matrix} {SINR}_{u, n} = \frac{p_{u} g_{u, n}^{t} g_{u, n}^{c} g_{u, n}^{r}}{\sum_{k \in U ∖ u} p_{k} g_{k, n}^{t} g_{k, n}^{c} g_{k, n}^{r} + I_{u, n}^{w i f i - > c e l l u l a r} + B N_{0}}, \end{matrix}

(6)

where

I_{u, n}^{w i f i - > c e l l u l a r} = \sum_{w \in W} p_{w} g_{w, n}^{t} g_{w, n}^{c} g_{w, n}^{r}

represents the experienced interference of the link between CUE u and gNB n from the WiFi system. Note that here we assume that, during the uplink transmissions of CUEs, the WUEs also perform uplink transmissions for simplicity. However, our approach can be easily extended to the hybrid transmission scenario with both uplink and downlink transmissions for the WiFi network.

Then, the achievable rate of CUE u in the uplink transmission can be given by

\begin{matrix} R_{u} = B \sum_{n \in N} x_{u, n} \log_{2} (1 + {SINR}_{u, n}), \end{matrix}

(7)

where

x_{u, n}

is the binary selection variable, and

x_{u, n} = 1

if CUE

u

chooses gNB

n

as the saving base station, and otherwise

x_{u, n} = 0

.

In order to maximize the sum rate of all CUEs considering the interference constraints of all WUEs, the unlicensed mmWave bands sharing problem in heterogeneous networks can be modeled as

\begin{matrix} P 1 : max_{\begin{matrix} x, p \end{matrix}} \sum_{u \in U} R_{u} \\ s.t. & C 1 : \sum_{u \in U} x_{u, n} \leq N_{n}^{R F}, \forall n, \\ C 2 : \sum_{n \in N} x_{u, n} = 1, \forall u \in U_{1}, \\ C 3 : x_{u, n} = {0, 1}, \forall u \in U, n \in N, \\ C 4 : I_{w}^{c e l l u l a r - > w i f i} \leq I_{w}^{t h r}, \forall w \in W, \\ C 5 : p_{u} \in P_{u}, \forall u \in U, \end{matrix}

(8)

where

x = {x_{u, n}, u \in U, n \in N}

,

p = {p_{u}, u \in U}

represent the gNB selection policy profile and transmission power policy profile, respectively. Constraints C1, C2 and C3 indicate that each gNB can, at most, serve

N_{n}^{R F}

CUEs simultaneously because of the limited number of RF chains, and each CUE can select one gNB as its serving base station at each time, Constraint C4 ensures that the interference from all CUEs to each WUE must be less than a predetermined interference threshold for avoiding serious interference to each WUE. Constraint C5 provides transmission power limitation constraints, where the power of each CUE

u, \forall u \in U

could be selected from the predetermined set

P_{u}

.

3. Potential Game for Sharing Unlicensed Millimeter-Wave Bands

Since CUEs generally maximize their respective utilities in the decentralized learning environment, the learning process can be modeled as a noncooperative game

G = [U, {A_{u}}_{u \in U}, {U_{u}}_{u \in U}]

, where

U

,

A_{u}

and

U_{u}

denote the set of CUEs (i.e., players), the action set of CUE u and the utility of player u, respectively. As we mainly focus on jointly optimizing the gNB selection and transmission power allocation of CUEs for the coexisting of cellular and WiFi systems, the action of CUE u is the combination of available transmission power levels and available candidate gNBs, and can be denoted by

a_{u} = (x_{u, n}, p_{u}), \forall u \in U

, where n is one of the available gNBs for CUE u. However, the proposed learning framework is also applicable to optimize other radio resources, such as channel selection, transmission duration or their combinations. To protect the WiFi performance, the strategy space of each CUE should be constrained to a reasonable range. We assume that each CUE can obtain the interference experienced by each WUE from the cellular system according to the proposed 5G–WiFi fusion protocol stack and the designed access protocol for sharing the unlicensed mmWave bands. For simplicity, we distribute the interference threshold of each WUE to each CUE according to the interference proportion generated by each CUE. Therefore, if the CUEs in

M

share the unlicensed mmWave band with WiFi link w, to ensure that the tolerable interference of all 5G-U links to WUE w should be no more than

{\bar{I}}_{w}^{t h r}

, which denotes the tolerant interference of WiFi link w, the interference borne by each CUE

u \in M

can be expressed as

\begin{matrix} {\hat{I}}_{u}^{c e l l u l a r - > w i f i} \\ = & \frac{{\bar{I}}_{u - > w}^{c e l l u l a r - > w i f i}}{\sum_{u \in M} {\bar{I}}_{u - > w}^{c e l l u l a r - > w i f i}} ({\bar{I}}_{w}^{t h r} - I_{w}^{w i f i - > w i f i}) \\ = & \frac{{\bar{I}}_{u - > w}^{c e l l u l a r - > w i f i}}{I_{w}^{c e l l u l a r - > w i f i}} ({\bar{I}}_{w}^{t h r} - I_{w}^{w i f i - > w i f i}), \end{matrix}

(9)

where

{\bar{I}}_{u - > w}^{c e l l u l a r - > w i f i}

represents the average interference of all available beam association directions for CUE u to WUE w. Based on the estimated channel gains from its available beam association directions to WUE w, each CUE

u \in M

should exclude the strategies that will result in an interference larger than

{\hat{I}}_{u}^{c e l l u l a r - > w i f i}

. Of course, if there is more than one WiFi link sharing the unlicensed mmWave band with the CUEs in

M

, the above process can be repeated to further exclude the improper strategies for each CUE.

Following references [32,33], the utility of UE u is given by

\begin{matrix} {\tilde{U}}_{u} = U_{u} + \sum_{k \in U_{u}} U_{k}, \end{matrix}

(10)

with

\begin{matrix} U_{u} = R_{u} + η \sum_{n \in N} {x_{u, n} Δ_{n} Φ (N_{n}^{R F}, \sum_{u \in U} x_{u, n})} \end{matrix}

(11)

and

\begin{matrix} Δ_{n} = \sum_{u \in U} x_{u, n} - N_{n}^{R F}, \end{matrix}

(12)

where

U_{u}

denotes the neighboring CUEs set of CUE u, which can be defined based on interference distance similar to [33],

η

is the non-negative penalty factor and penalty function

Φ (x, y) = - 1

if

x < y

and 0 otherwise. The second term in (11) indicates that the CUE who selects a strategy that violates constraint C1 will be punished.

Theorem 1.

Game

G

is an exact potential game.

Proof.

Let

{\bar{U}}_{u}

be the set of players excluding player u and its neighbors, then we have

U = {u} \cup U_{u} \cup {\bar{U}}_{u}

. The potential function of game

G

can be constructed as

\begin{matrix} Ψ = \sum_{u \in U} U_{u} . \end{matrix}

(13)

Assuming any player u unilaterally changes the strategy from

a_{u}

to

a_{u}^{'}

, suppose that an arbitrary player u unilaterally changes its strategy from

a_{u}

to

a_{u}^{'}

, then the change in the potential function resulted in this unilateral change can be expressed in (14).

\begin{matrix} Ψ (a_{u}^{'}, a_{- u}) - Ψ (a_{u}, a_{- u}) \\ = & \sum_{u \in U} U_{u} (a_{u}^{'}, a_{- u}) - \sum_{u \in U} U_{u} (a_{u}, a_{- u}) \\ = & {U_{u} (a_{u}^{'}, a_{- u}) + \sum_{u \in U_{u}} U_{u} (a_{u}^{'}, a_{- u})} - {U_{u} (a_{u}, a_{- u}) + \sum_{u \in U_{u}} U_{u} (a_{u}, a_{- u})} \\ + \sum_{u \in {\bar{U}}_{u}} (U_{u} (a_{u}^{'}, a_{- u}) - U_{u} (a_{u}, a_{- u})) \\ \overset{(*)}{=} & {U_{u} (a_{u}^{'}, a_{- u}) + \sum_{u \in U_{u}} U_{u} (a_{u}^{'}, a_{- u})} - {U_{u} (a_{u}, a_{- u}) + \sum_{u \in U_{u}} U_{u} (a_{u}, a_{- u})} \\ = & {\tilde{U}}_{u} (a_{u}^{'}, a_{- u}) - {\tilde{U}}_{u} (a_{u}, a_{- u}) . \end{matrix}

(14)

in which

(*)

is due to the fact that the strategy of player u affects only the utilities of its neighboring CUEs, which results in

U_{u} (a_{u}^{'}, a_{- u}) - U_{u} (a_{u}, a_{- u}) = 0

for every

u \in {\bar{U}}_{u}

. Based on the definition in reference [36], game

G

is an exact potential game whose potential function is

Ψ

. Moreover, it has at least one pure strategy. The proof is completed. □

Theorem 2.

If all gNBs have more beams than the number of CUEs in the system and

η > \bar{η} = \sum_{u \in U} R_{u}

, the NE of game

G

must be feasible.

Proof.

Assumingthat the strategy profile

(a_{1}^{*}, \dots, a_{U}^{*})

is a pure policy NE of the game

G

that is not satisfied with constraint C1, there must be some CUEs associated with gNBs, which have a greater number of CUs than the supported beams. Let player

k (1 \leq k \leq U)

be one of these CUEs. According to the assumption of this theorem, player k must be able to select a new strategy

a_{k}^{'}

for associating with another gNB, which has a smaller number of associated CUEs than its supported beams.

\begin{matrix} {\tilde{U}}_{k} (a_{k}^{*}, a_{- k}^{*}) - {\tilde{U}}_{k} (a_{k}^{'}, a_{- k}^{*}) \\ = & \sum_{u \in U_{k} \cup k} R_{u} (a_{k}^{*}, a_{- k}^{*}) - \sum_{u \in U_{k} \cup k} R_{u} (a_{k}^{'}, a_{- k}^{*}) + [F (a_{k}^{*}, a_{- k}^{*}) - F (a_{k}^{'}, a_{- k}^{*})] \\ = & \sum_{u \in U_{k} \cup k} R_{u} (a_{k}^{*}, a_{- k}^{*}) - \sum_{u \in U_{k} \cup k} R_{u} (a_{k}^{'}, a_{- k}^{*}) - η < 0 . \end{matrix}

(15)

We define

F (a_{1}, \dots, a_{U}) = η \sum_{u \in U_{k} \cup k} \sum_{n \in N} {x_{u, n} Δ_{n} Φ (N_{n}^{R F}, \sum_{u \in U} x_{u, n})}

, then we have (15). According to Definition 1 in reference [32], (15) is contrary to the assumption that

(a_{1}^{*}, \dots, a_{U}^{*})

is a pure strategy NE of game

G

. Then, it is easily concluded that a strategy profile that is not satisfied with constraint C1 will definitely not be a pure strategy NE of game

G

. Accordingly, if the conditions of this theorem hold, the pure strategy NE of game

G

must be feasible. □

Theorem 3.

If the number of beams of all gNB is greater than the number of CUEs in the network and

\tilde{η} > \bar{η} = \sum_{u \in U} R_{u}

, game

G

has at least one feasible pure strategy NE.

Proof.

The proof process is the same as that of Theorem 2 in reference [32], and thus is omitted here. □

4. Decentralized Machine Learning Approaches for Sharing Unlicensed Millimeter-Wave Bands

In this section, we provide the details of decentralized machine learning approaches for sharing unlicensed millimeter-wave bands. In the learning process, each CUE acts as a player, and its action set is composed of the combinations of its available transmission power levels and candidate gNBs. For the ease of the following descriptions, we denote by

p_{u} = {p_{u 1}, \dots, p_{u K_{u}}}

the mixed strategy of CUE u with

\sum_{k = 1}^{K_{u}} p_{u k} = 1

, where

p_{u k}

represents the probability that CUE u chooses its k-th pure strategy (i.e., action)

a_{u k} \in A_{u}

and

K_{u}

is the number of actions of player u. Moreover, we denote by

a_{u}^{t}

and

p_{u}^{t} (a_{u k})

the action selected by player u at time t and the probability of choosing action

a_{u k}

, respectively.

4.1. Stochastic Learning for Sharing Unlicensed Millimeter-Wave Bands

A stochastic learning process involves a set of learning automata, each of which is considered to be a decision-maker who can learn the best strategy from a set of actions through repeated interactions with a random environment [37]. More specifically, for each action chosen by the learning automaton, the environment will evaluate the action, and send feedback to the automaton. According to the feedback, the automaton chooses the next action. With the learning process progressing, the automaton can learn the optimal action from the random environment asymptotically. As such, stochastic learning was widely used in discrete power control [37], channel selection [38] and relay selection [39]. During the stochastic learning process, each player u chooses its strategy according to a time-varying action choice probability distribution, where its action probability is updated according to the following rule:

\begin{matrix} \{\begin{matrix} p_{u}^{t + 1} (a_{u k} \neq a_{u}^{t}) = p_{u}^{t} (a_{u k}) - b r_{u}^{t} p_{u}^{t} (a_{u k}), \\ p_{u}^{t + 1} (a_{u k} = a_{u}^{t}) = p_{u}^{t} (a_{u k}) + b r_{u}^{t} (1 - p_{u}^{t} (a_{u k})), \end{matrix} \end{matrix}

(16)

where

b \in (0, 1)

represents step size, and

r_{u}^{t}

denotes the random payoff of player u choosing the k-th action at time t. To calculate the random payoff

r_{u}^{t} \in [0, 1)

of player u at time t, a normalization function

Ξ (x) : x \in R \to [0, 1)

is defined as

r_{u}^{t} = Ξ (U_{u}^{t})

. Based on the analysis above, the stochastic learning process for the shared use of the unlicensed mmWave bands can be described using Figure 4a, the details of which are summarized as follows: (1) Initialize the action probability

p_{u} = {p_{u 1}, \dots, p_{u K_{u}}}

with

p_{u 1} = 1 / K_{u}

; (2) Each player chooses an action according to its action probability vector

p_{u}

; (3) All players adhere to their respective actions in a decision period, and then estimate their respective received utilities; (4) All players update their mixed strategies according to (16), and if the algorithm converges then output the strategy of all players, otherwise go to step (2).

4.2. No-Regret Learning for Sharing Unlicensed Millimeter-Wave Bands

No-regret learning is also known as regret-matching, the stationary solution of which exhibits no regret [40]. Due to its ability to converge to a correlation equilibrium (CE), no-regret learning is widely used in resource allocation problems [41]. For each player in no-regret learning, the probability of choosing a strategy is proportional to the “regret” of not choosing another strategy. The results in reference [40] have shown that, if the probability distribution is selected according to their proposed approach, the no-regret algorithm will reach the CE. Next, we introduce how to obtain the the probability distribution for the joint gNB selection and power control design of the shared use of an unlicensed mmWave band. During the no-regret learning process, each player u will choose its strategy

a_{u k}

at each time t according to the probability distribution

p_{u}^{t} (a_{u k}), a_{u k} \in A_{u}

. For the time

t = 1

,

p_{u}^{t} (a_{u k}) = \frac{1}{A_{u}}

. At the time

t > 1

,

p_{u}^{t + 1} (a_{u k})

is updated according the following rule:

\begin{matrix} \{\begin{matrix} p_{u}^{t + 1} (a_{u k} \neq a_{u}^{t}) = \frac{1}{μ} R_{u}^{t} (a_{u}^{t}, a_{u k}), \\ p_{u}^{t + 1} (a_{u k} = a_{u}^{t}) = 1 - \sum_{a_{u k} \neq a_{u}^{t}} p_{u}^{t + 1} (a_{u k}), \end{matrix} \end{matrix}

(17)

where

μ

is a normalization factor,

a_{u}^{t}

denotes the previously chosen strategy at time t and

R_{u}^{t} (a_{u}^{t}, a_{u k}) = \max {Γ_{u}^{t} (a_{u}^{t}, a_{u k}), 0}

is the regret of choosing strategy

a_{u}^{t}

instead of

a_{u k} (a_{u k} \neq a_{u}^{t})

with

Γ_{u}^{t} (a_{u}^{t}, a_{u k}) = \frac{1}{t + 1} \sum_{τ = 1}^{t + 1} (U_{u}^{τ} (a_{u k}, a_{- u}^{τ}) - U_{u}^{τ} (a_{u}^{t}, a_{- u}^{τ}))

. Then, the no-regret learning process for the shared use of unlicensed bands is shown in Figure 4b, and the detailed steps are described as follows: (1) initialize the action probability of player u choosing action

a_{u k}

with

p_{u}^{t} (a_{u k}) = \frac{1}{A_{u}}

; (2) at iteration t, each player chooses an action based on its action probability vector

p_{u}^{t}

; (3) all players adhere to their respective actions in a decision period, and then estimate their respective received utilities and calculate the regrets of the chosen strategies; (4) all players update their mixed strategies via (17) and, if the algorithm converges, then output the strategy of all players, otherwise go to step (2).

4.3. Binary Log-Linear Learning for Sharing Unlicensed Millimeter-Wave Bands

Binary log-linear learning was first proposed in reference [42], which has been widely used for power control, channel selection and base station sleeping [43,44]. At each iteration t of binary log-linear learning, a random player u is selected to replace its current action

a_{u}^{t}

with an exploratory action

a_{u k} \in A_{u}

, while the actions of all other players are kept unchanged. Denote by

U_{u}^{t} (a_{u}^{t}, a_{- u}^{t})

and

{\bar{U}}_{u}^{t} (a_{u k}, a_{- u}^{t})

the utilities when player u chooses the current action

a_{u}^{t}

and the exploratory action

a_{u k}

, respectively. Then, the action probabilities of each player u are updated based on the following rule:

\begin{matrix} \{\begin{matrix} p_{u}^{t + 1} (a_{u}^{t + 1} = a_{u k}) = \frac{\exp {β {\bar{U}}_{u}^{t} (a_{u k}, a_{- u}^{t}) / C}}{Φ}, \\ p_{u}^{t + 1} (a_{u}^{t + 1} = a_{u}^{t}) = \frac{\exp {β U_{u}^{t} (a_{u}^{t}, a_{- u}^{t}) / C}}{Φ}, \end{matrix} \end{matrix}

(18)

with

\begin{matrix} Φ = \exp {β U_{u}^{t} (a_{u}^{t}, a_{- u}^{t})} + \exp {β {\bar{U}}_{u}^{t} (a_{u k}, a_{- u}^{t})}, \end{matrix}

(19)

in which

β

represents a learning parameter and C denotes a scaling parameter. The binary log-linear learning process for the shared use of the unlicensed band can be summarized using Figure 4c, and the details are described as follows: (1) initialize the action probability

p_{u} = {p_{u 1}, \dots, p_{u K_{u}}}

with

p_{u 1} = 1 / K_{u}

; (2) each player chooses an action based on the action probability vector

p_{u}

; (3) all players adhere to their respective actions in a decision period, and then estimate their respective received utilities; (4) all player update their mixed strategies via (18), and, if the algorithm converges, then output the strategy of all players, otherwise go to step (2).

4.4. Q-Learning for Sharing Unlicensed Millimeter-Wave Bands

Q-learning is a reinforcement learning technique and it can determine an optimal action policy with an unknown system transition model, which is widely used for spectrum allocation and channel selection [45]. In Q-learning, a decision-maker learns its policy according to the observed environment, and performs the action to optimize its utility function in an environment of uncertainty. Generally, Q-learning consists of three components, namely, state, action and reward. In this work, we focus on single-state Q-learning, each player’s action set is composed of the combinations of the available transmission power levels and the candidate gNBs, and the reward of an action is formulated as the predicted reward function of the data rate. The Q value when player u chooses its k-th action is updated based on the following rule:

\begin{matrix} Q_{u k}^{t + 1} = (1 - ζ^{t}) Q_{u k}^{t} + ζ^{t} r_{u k}^{t}, \end{matrix}

(20)

where

r_{u k} = U_{u} (a_{u k}, a_{- u})

, and

ζ^{t} \in [0, 1)

represents the learning rate, satisfying that

\sum_{t = 0}^{\infty} ζ^{t} = \infty

,

\sum_{t = 0}^{\infty} {(ζ^{t})}^{2} < \infty

. Then, the policy of player u is updated according to

\begin{matrix} p_{u}^{t + 1} (a_{u}^{t + 1} = a_{u k}) = \frac{\exp [Q_{u k}^{t + 1} / ϱ]}{\sum_{j = 1}^{K_{u}} \exp [Q_{u j}^{t + 1} / ϱ]}, \end{matrix}

(21)

where

ϱ

is used to control the tradeoff between exploitation and exploration. The Q-learning process for the shared use of the unlicensed band can be described using Figure 4d, and the details are described as follows: (1) initialize the action probability

p_{u} = {p_{u 1}, \dots, p_{u K_{u}}}

with

p_{u 1} = 1 / K_{u}

; (2) each player selects an action based on the action probability vector

p_{u}

; (3) all players adhere to their respective actions in a decision period, and then estimate their respective received utilities and calculate the Q values according to (20); (4) all players update the mixed strategies through (21), and, if the algorithm converges, then output the strategy of all players, otherwise go to step (2).

4.5. Multi-Armed Bandit Learning for Sharing Unlicensed Millimeter-Wave Bands

Multi-armed bandit learning is one of the classic reinforcement learning techniques which aims at maximizing the total rewards on the condition that some rewards will be obtained after each agent pulls an arm (action) [46,47]. For the multi-armed bandits with multiple agents, the agents affect each other, and thus the obtained reward of each agent is dependent on both its own actions and the joint strategy profile of other agents. Therefore, it is important for multi-agent scenarios to have some kind of equilibrium or stable state. So far, many algorithms, such as

ϵ

-greedy, upper confidence bound (UCB), Softmax, and Pursuit, have been proposed to solve various kinds of multi-armed bandit problems, where the UCB scheme is very suitable for the bandit problems with stochastic stationary characteristics. Therefore, the UCB scheme can be used for the multi-armed bandit learning problem of joint gNB selection and power control for the shared use of unlicensed bands. At each iteration in the UCB scheme, an upper-bound of the average reward for every arm k of each player u is estimated as follows:

\begin{matrix} I_{u k}^{t} = {\bar{U}}_{u k}^{t} + \sqrt{\frac{2 \ln (t)}{T_{u k}^{t - 1}}}, \end{matrix}

(22)

where

{\bar{U}}_{u k}^{t}

represents the average reward of arm k for player u at round t, and

T_{u k}^{t - 1}

denotes the number of rounds where arm k is chosen up to round

t - 1

. Then, the arm with the highest estimated bound is chosen according to the following rule:

\begin{matrix} a_{u}^{t} = \underset{a_{u k} \in A_{u}}{arg max} I_{u k}^{t} . \end{matrix}

(23)

The multi-armed bandits learning process for the shared use of the unlicensed band can be described using Figure 4e, and the details are described as follows: (1) initialize the parameters and play each arm once; (2) each player chooses the action with the highest estimated bound according to (23); (3) all players adhere to their respective actions in a decision period, and then estimate their respective average reward of each arm according to (22); (4) if the algorithm converges, then output the strategy of all players, otherwise go to step (2).

5. Computational Complexity Analysis

In this section, we analyze the complexities of the proposed algorithms. For the stochastic learning algorithm, its complexity can be expressed as

T_{i t e r 1} (O (C 1) + O (C 2) + O (C 3))

, where

C 1

denotes the complexity of estimating the number of interfering users,

C 2

represents the complexity during the process of updating the action selection vectors, which includes the operations of two vector sums and one scalar-vector product, and

C 3

represents the complexity during the procedure of user selection. For the binary log-linear learning algorithm, its complexity can be given as

T_{i t e r 1} (O (C 1) + O (C 2) + O (C 3))

, where

O (C 1)

is caused by the estimating the number of interfering users,

O (C 2)

is due to the updating action selection involving the operations of two exponents, one sum and two divisions, and

C 3

is a small constant which represents the complexity during the the procedures of user selection. For the Q learning algorithm, its complexity can be given by

O (\frac{n}{ϵ^{2}} l o g (n))

, where n represents the number of state–action pairs for achieving the

ϵ

-optimal solution with high probability. For the multi-armed bandit learning algorithm, in order to show the complexity of each algorithm more clearly, we provide the comparison of running time for each algorithm in Section 6.

6. Simulation Results and Analysis

We consider an uplink network with 5G-U and WiFi coexisting, which is composed of 4 gNBs, L CUEs, 2 APs and 2 WUEs, where CUEs and WUEs are distributed randomly in a circle with a radius of 100 m. Each gNB has six RF chains. We assume that all communication links perform uplink transmissions in sharing transmission subframes and share one mmWave band with bandwidth 1.5 GHz. The main simulation parameters are listed as follows. The available transmission power levels for each CUE is set to

{0.01, 0.05, 0.1, 0.15, 0.2}

watts and the power level of each WUE is fixed at 0.1 watts. The thermal noise density is set to −174 dBm/Hz, and the beamwidths of the WUEs, 5G-U base stations and APs are all set to 20°. Unless otherwise specified, the tolerant interference threshold of each WiFi link is set to −62 dBm. The parameters related to the learning algorithms are listed as follows:

b = 0.05

,

μ = 10^{10}

bps,

C = 10^{13}

bps,

β = 10

,

ϱ = 10^{10}

bps, the normalization function for stochastic learning algorithm

Ξ (x) = (x + γ) / γ

with

γ = 10^{14}

bps.

Figure 5 presents the sum rates varying with different numbers of CUEs under different learning algorithms. As can be seen from Figure 5a, binary log-linear learning has a better sum rate of the 5G-U system than other learning algorithms. The Q-learning algorithm has a sum rate of the 5G-U system that is slightly worse than that of binary log-linear learning, which, however, is slightly better than that of multi-armed bandit learning. The sum rate of no-regret learning is the worst of all, and stochastic learning is better than no-regret learning. This means that binary log-linear learning, Q-learning and multi-armed bandit learning can reach better NE of the spectrum sharing problem. From Figure 5b, it can be easily found that the better the achieved NE, the worse the sum rate of the WiFi system will be. Moreover, we can see that, with the help of constrained actions of each CUE, the transmissions of the 5G-U system have no serious effects on the WiFi system when the number of CUEs is small. Therefore, simultaneous transmissions of WiFi and 5G-U at the 60 GHz unlicensed band are feasible if the number of accessed CUEs is controlled reasonably. From Figure 5c, it can observed that the sum rate of the 5G-U system and WiFi system has the same trend as that of the sum rate of the 5G-U system, the reason for which is due to the fact that the sum rate of the 5G-U system is much larger than that of the WiFi system.

Figure 6 provides the sum rates varying with different beamwidths of CUEs under different learning techniques, where the number of CUEs is set to 16. We can find from Figure 6a that the sum rates of all learning techniques for the 5G-U system decrease with the beamwidth of CUEs. This is because, as the beamwith of CUEs increases, the directional transmission gain decreases. In Figure 6b, it can seen that the sum rates of all learning techniques for the 5G-U system decrease with the beamwidth of CUEs, the reason for which is that the interference with the WiFi system will become serious as the beamwidth of CUEs increases. Moreover, we also see that, when the beamwidth exceeds 40°, the sum rates of binary log-linear learning and Q-learning will decrease sharply. Therefore, the reasonable control of beamwidth of CUEs is also important for the shared use of 60 GHz unlicensed bands. From Figure 6c, it can be observed that the sum rates of the 5G-U system and WiFi system also have the same trend sd that of the sum rate in the 5G-U system.

Figure 7 depicts a comparison between the proposed scheme and the traditional scheme in terms of system sum rate, in which the number of CUEs is set to 16. It can observed that the sum rate performance of the WiFi system can be effectively protected under the proposed scheme, even at a lower tolerant interference threshold, while that of the traditional scheme is seriously deteriorated under the lower tolerant interference threshold, so that WUEs cannot access the channel.

Figure 8 plots a comparison of different learning techniques in terms of convergence speed, where the number of CUEs is set to 20. We can see that the multi-armed bandit learning converges faster that other learning techniques, binary log-linear learning has the second fastest convergence speed followed by stochastic learning, while Q-learning and no-regret learning have slower convergence speeds. It can be concluded that the binary log-linear learning, that has the best sum rate of 5G and WiFi systems and acceptable convergence speed, is well suited for the joint gNB selection and power control problem regarding the shared use of 60 GHz unlicensed bands.

7. Conclusions

The efficient shared use of unlicensed mmWave bands is one of the promising solutions to deal with the challenge of spectrum scarcity in future networks. In this work, we first provide a 5G–WiFi fusion protocol stack for sharing unlicensed mmWave bands, and then design an ISAC-based access protocol and an ISAC-based coexistence protocol, integrated with a decentralized learning framework, on which machine learning algorithms can be run in order to obtain the joint user association and power control strategy. We then provide the details of multiple machine learning algorithms for the shared use of unlicensed mmWave bands, including stochastic learning, no-regret learning, log-linear learning, Q-Learning and multi-armed bandit learning. Later, we provide simulations for the aforementioned learning algorithms from various aspects. The results have shown that the proposed scheme greatly reduces the search space of the solution and effectively protects the performance of the WiFi system compared to the traditional scheme; binary log-linear learning is suitable for the spectrum sharing problem at 60 GHz unlicensed bands, and the simultaneous transmissions of 5G-U and WiFi at the 60 GHz unlicensed band are feasible under the proposed scheme. Meanwhile, from the simulation results, some insights can be observed regarding how to choose the learning algorithms to achieve a balance between computational complexity and a good sum rate performance.

Author Contributions

Conceptualization, C.T. and Y.L.; Methodology, C.T. and Y.L.; Validation, C.T. and Y.L.; Formal analysis, C.T.; Investigation, Y.L.; Resources, C.T. and Y.L.; Writing—original draft preparation, C.T.; Writing—review and editing, C.T. and Y.L.; Supervision, Y.L.; Project administration, Y.L.; Funding acquisition, C.T. and Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Guizhou Provincial Basic Research Program under Grant ZK[2023]028, and in part by the Guizhou University of Finance and Economics Innovation Exploration and Academic Emerging Project under Grant 2022XSXMB14.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Xiao, M.; Mumtaz, S.; Huang, Y.; Dai, L.; Li, Y.; Matthaiou, M.; Karagiannidis, G.; Bjronson, E.; Yang, K.; Chih-Lin, I.; et al. Millimeter Wave Communications for Future Mobile Networks. IEEE J. Sel. Areas Commun. 2017, 35, 1909–1935. [Google Scholar]
Zhou, P.; Cheng, K.; Han, X.; Fang, X.; Fang, Y.; He, R.; Long, Y.; Liu, Y. IEEE 802.11ay based mmWave WLANs: Design Challenges and Solutions. IEEE Commun. Surv. Tutorials 2018, 20, 1654–1681. [Google Scholar] [CrossRef]
Liu, Y.; Fang, X.; Xiao, M. Joint Transmission Reception Point Selection and Resource Allocation for Energy-Efficient Millimeter-Wave Communications. IEEE Trans. Veh. Technol. 2021, 70, 412–428. [Google Scholar] [CrossRef]
Mumtaz, S.; Alsohaily, A.; Pang, Z.; Rayes, A.; Tsang, K.F.; Rodriguez, J. Massive Internet of Things for Industrial Applications: Addressing Wireless IIoT Connectivity Challenges and Ecosystem Fragmentation. IEEE Industrial Electronics Mag. 2017, 11, 28–33. [Google Scholar] [CrossRef]
Shayea, I.; Azmi, M.H.; Rahman, T.A.; Ergen, M.; Han, C.T.; Arsad, A. Spectrum Gap Analysis With Practical Solutions for Future Mobile Data Traffic Growth in Malaysia. IEEE Access 2019, 7, 24910–24933. [Google Scholar] [CrossRef]
Staple, G.; Werbach, K. The end of spectrum scarcity [spectrum allocation and utilization]. IEEE Spectrum 2004, 41, 48–52. [Google Scholar] [CrossRef]
Liu, M.; Xue, W.; Jia, P.; Makarov, S.B.; Li, B. Research on spectrum optimization technology for a wireless communication system. Symmetry 2020, 12, 34. [Google Scholar] [CrossRef]
Zhang, H.; Chu, X.; Guo, W.; Wang, S. Coexistence of Wi-Fi and Heterogeneous Small Cell Networks Sharing Unlicensed Spectrum. IEEE Commun. Mag. 2015, 53, 158–164. [Google Scholar] [CrossRef]
QUALCOMM. Making the Best Use of Unlicensed Spectrum for 1000x. Available online: https://www.qualcomm.com/media/documents/files/whitepaper-making-the-best-use-of-unlicensed-spectrum.pdf (accessed on 25 July 2023).
Zhang, H.; Liao, Y.; Song, L. D2D-U: Device-to-device communications in unlicensed bands for 5G system. IEEE Trans. Wireless Commun. 2017, 16, 3507–3519. [Google Scholar] [CrossRef]
Zhou, Z.; Mumtaz, S.; Huq, K.M.S.; Al-Dulaimi, A.; Chandra, K.; Rodriquez, J. Cloud miracles: Heterogeneous cloud RAN for fair coexistence of LTE-U and Wi-Fi in ultra dense 5G networks. IEEE Commun. Mag. 2018, 56, 64–71. [Google Scholar] [CrossRef]
Ye, Y.; Wu, D.; Shu, Z.; Qian, Y. Overview of LTE spectrum sharing technologies. IEEE Access 2016, 4, 8105–8115. [Google Scholar] [CrossRef]
Song, H.; Fang, X. A spectrum etiquette protocol and interference coordination for LTE in unlicensed bands (LTE-U). In Proceedings of the 2015 IEEE International Conference on Communication Workshop (ICCW), London, UK, 8–12 June 2015; pp. 2338–2343. [Google Scholar]
Sriyananda, M.G.S.; Parvez, I.; Güvene, I.; Bennis, M.; Sarwat, A.I. Multi-armed bandit for LTE-U and WiFi coexistence in unlicensed bands. In Proceedings of the 2016 IEEE Wireless Communications and Networking Conference, Doha, Qatar, 3–6 April 2016; pp. 1–6. [Google Scholar]
Zhang, L.; Liang, Y.; Xiao, M. Spectrum Sharing for Internet of Things: A Survey. IEEE Wireless Commun. 2019, 26, 132–139. [Google Scholar] [CrossRef]
Gao, Y.; Wu, Y.; Hu, H.; Chu, X.; Zhang, J. Licensed and Unlicensed Bands Allocation for Cellular Users: A Matching-Based Approach. IEEE Wireless Commun. Lett. 2019, 8, 969–972. [Google Scholar]
Tan, J.; Xiao, S.; Han, S.; Liang, Y.; Leung, V.C.M. QoS-Aware User Association and Resource Allocation in LAA-LTE/WiFi Coexistence Systems. IEEE Trans. Wireless Commun. 2019, 18, 2415–2430. [Google Scholar]
Zhou, Z.; Chen, X.; Zhang, Y.; Mumtaz, S. Blockchain-Empowered Secure Spectrum Sharing for 5G Heterogeneous Networks. IEEE Netw. 2020, 34, 24–31. [Google Scholar] [CrossRef]
Tan, J.; Zhang, L.; Liang, Y.; Niyato, D. Intelligent Sharing for LTE and WiFi Systems in Unlicensed Bands: A Deep Reinforcement Learning Approach. IEEE Trans. Commun. 2020, 68, 2793–2808. [Google Scholar] [CrossRef]
Lu, X.; Petrov, V.; Moltchanov, D.; Andreev, S.; Mahmoodi, T.; Dohler, M. 5G-U: Conceptualizing Integrated Utilization of Licensed and Unlicensed Spectrum for Future IoT. IEEE Commun. Mag. 2019, 57, 92–98. [Google Scholar]
Ghadikolaei, H.S.; Ghauch, H.; Fodor, G.; Skoglund, M.; Fischione, C. A Hybrid Model-based and Data-driven Approach to Spectrum Sharing in mmWave Cellular Networks. IEEE Trans. Cogn. Commun. Netw. 2020, 6, 1269–1282. [Google Scholar] [CrossRef]
Xie, N.; Ou-Yang, L.; Liu, A.X. Spectrum Sharing in mmWave Cellular Networks Using Clustering Algorithms. IEEE/ACM Trans. Netw. 2020, 28, 1378–1390. [Google Scholar] [CrossRef]
Lu, X.; Sopin, E.; Petrov, V.; Galinina, O.; Moltchanov, D.; Ageev, K.; Andreev, S.; Koucheryavy, Y.; Samouylov, K.; Dohler, M. Integrated Use of Licensed- and Unlicensed-Band mmWave Radio Technology in 5G and Beyond. IEEE Access 2019, 7, 24376–24391. [Google Scholar] [CrossRef]
Zhang, Z.; Xiao, Y.; Ma, Z.; Xiao, M.; Ding, Z.; Lei, X.; Karagiannidis, G.K.; Fan, P. 6G Wireless Networks: Vision, Requirements, Architecture, and Key Technologies. IEEE Veh. Technol. Mag. 2019, 14, 28–41. [Google Scholar]
Sun, Y.; Peng, M.; Zhou, Y.; Huang, Y.; Mao, S. Application of Machine Learning in Wireless Networks: Key Techniques and Open Issues. IEEE Commun. Surv. Tutorials 2019, 21, 3072–3108. [Google Scholar]
Luo, F.L. Machine Learning for Future Wireless Communications; Wiley: Hoboken, NJ, USA, 2020. [Google Scholar]
Liu, R.; Lee, M.; Yu, G.; Li, G.Y. User Association for Millimeter-Wave Networks: A Machine Learning Approach. IEEE Trans. Commun. 2020, 68, 4162–4174. [Google Scholar] [CrossRef]
Kulkarni, M.N.; Singh, S.; Andrews, J.G. Coverage and rate trends in dense urban mmWave cellular networks. In Proceedings of the IEEE Global Communications Conference (GLOBECOM), Austin, TX, USA, 8–12 December 2014. [Google Scholar]
Liu, Y.; Fang, X.; Xiao, M.; Cui, Y.; Xue, Q. Coordinated Multi-Beam Transmissions for Reliable Millimeter-Wave Communications with Independent and Correlated Blockages. IEEE Wireless Commun. Lett. 2023, 12, 1523–1527. [Google Scholar] [CrossRef]
Geng, S.; Kivinen, J.; Zhao, X.; Vainikainen, P. Millimeter-Wave Propagation Channel Characterization for Short-Range Wireless Communications. IEEE Trans. Veh. Technol. 2009, 58, 3–13. [Google Scholar] [CrossRef]
Shokri-Ghadikolaei, H.; Gkatzikis, L.; Fischione, C. Beam-searching and transmission scheduling in millimeter wave communications. In Proceedings of the 2015 IEEE International Conference on Communications (ICC), London, UK, 8–12 June 2015; pp. 1292–1297. [Google Scholar]
Liu, Y.; Fang, X.; Xiao, M. Discrete Power Control and Transmission Duration Allocation for Self-Backhauling Dense mmWave Cellular Networks. IEEE Trans. Commun. 2018, 66, 432–447. [Google Scholar] [CrossRef]
Liu, Y.; Fang, X.; Xiao, M.; Mumtaz, S. Decentralized beam pair delection in multi-beam millimeter-wave networks. IEEE Trans. Commun. 2018, 66, 2722–2737. [Google Scholar] [CrossRef]
Liu, Y.; Fang, X.; Xiao, M. Resource Management for Maximizing the Secure Sum Rate in Dense Millimeter-Wave Networks. IEEE Access 2020, 8, 158416–158431. [Google Scholar] [CrossRef]
Liu, Y.; Fang, X. Joint user association and resource allocation for self-backhaul ultra-dense networks. China Commun. 2016, 13, 1–10. [Google Scholar] [CrossRef]
Monderer, D.; Shapley, L.S. Potential games. Games Econom. Behav. 1999, 14, 124–143. [Google Scholar] [CrossRef]
Xing, Y.; Chandramouli, R. Stochastic Learning Solution for Distributed Discrete Power Control Game in Wireless Data Networks. IEEE/ACM Trans. Net. 2008, 16, 932–944. [Google Scholar] [CrossRef]
Xu, Y.; Wang, J.; Wu, Q.; Anpalagan, A.; Yao, Y. Opportunistic Spectrum Access in Unknown Dynamic Environment: A Game-Theoretic Stochastic Learning Solution. IEEE Trans. Wireless Commun. 2012, 11, 1380–1391. [Google Scholar] [CrossRef]
Zhong, W.; Chen, G.; Jin, S.; Wong, K. Relay Selection and Discrete Power Control for Cognitive Relay Networks via Potential Game. IEEE Trans. Signal Process. 2014, 62, 5411–5424. [Google Scholar]
Hart, S.; Mas-Colell, A. A simple adaptive procedure leading to correlated equilibrium. Econometrica 2000, 68, 1127–1150. [Google Scholar] [CrossRef]
Zheng, J.; Cai, Y.; Xu, Y.; Anpalagan, A. Distributed Channel Selection for Interference Mitigation in Dynamic Environment: A Game-Theoretic Stochastic Learning Solution. IEEE Trans. Veh. Technol. 2014, 63, 4757–4762. [Google Scholar] [CrossRef]
Marden, J.R.; Arslan, G.; Shamma, J.S. Connections between cooperative control and potential games illustrated on the consensus problem. In Proceedings of the 2007 European Control Conference (ECC), Kos, Greece, 2–5 July 2007; pp. 4604–4611. [Google Scholar]
Zhang, N.; Zhang, S.; Zheng, J.; Fang, X.; Mark, J.W.; Shen, X. QoE Driven Decentralized Spectrum Sharing in 5G Networks: Potential Game Approach. IEEE Trans. Veh. Technol. 2017, 66, 7797–7808. [Google Scholar] [CrossRef]
Zheng, J.; Cai, Y.; Chen, X.; Li, R.; Zhang, H. Optimal Base Station Sleeping in Green Cellular Networks: A Distributed Cooperative Framework Based on Game Theory. IEEE Trans. Wireless Commun. 2015, 14, 4391–4406. [Google Scholar] [CrossRef]
Wu, Y.; Hu, F.; Kumar, S.; Matyjas, J.D. Apprenticeship Learning based Spectrum Decision in Multi-Channel Wireless Mesh Networks with Multi-Beam Antennas. IEEE Trans. Mobile Computing 2017, 16, 314–325. [Google Scholar] [CrossRef]
Maghsudi, S.; Hossain, E. Distributed user association in energy harvesting dense small cell networks: A mean-field multi-armed bandit approach. IEEE Access 2017, 5, 3513–3523. [Google Scholar] [CrossRef]
Liu, Y.; Tang, C. Concurrent multi-beam transmissions for reliable communication in millimeter-wave networks. Comput. Ccmmun. 2022, 195, 281–291. [Google Scholar] [CrossRef]

Figure 1. 5G–WiFi fusion protocol stack.

Figure 2. Integrated sensing- and communication-based access protocol for 5G-U and WiFi systems sharing the unlicensed mmWave bands.

Figure 3. Integrated sensing- and communication-based coexistence protocol with decentralized machine learning.

Figure 4. Flow chart of decentralized machine learning algorithms.

Figure 5. Sum rates varying with the number of CUEs for different learning algorithms.

Figure 6. Sum rates varying with beamwidth for different learning algorithms.

Figure 7. Comparison of sum rates for the proposed scheme and traditional scheme.

Figure 8. Comparison of convergence speeds for different learning algorithms.

Table 1. Comparison of different unlicensed bands.

Unlicensed	Spectrum	Bandwidth	Degree of	Incumbent	Coexistence
Bands	Range		Congestion	Users	Mechanisms
900 MHz	890 ∼ 960 MHz	50 MHz	High	Cordless phones	Transmit power control
				IEEE 802.15.4	Channel allocation
				Zigbee	Transmission duration control
2.4 GHz	2.4∼ 2.5 GHz	83.5 MHz	Very high	IEEE 802.11 WLAN	Listen-before-talk
				Bluetooth	Emission mask
				Zigbee	Maximum transmit power
3.5 GHz	3.55∼3.7 GHz	150 MHz	Low	Naval Radar	Dynamic frequency selection
				Fixed Satellite systems	Transmit power control
5 GHz	5.15∼5.925 GHz	495 MHz	Low	Radiolocation	Listen-before-talk
				Earth Exploration	Dynamic frequency selection
				Space Research	Transmit power control
60 GHz	57∼64 GHz	7 GHz	Low	IEEE 802.11ad/ay	Listen-before-talk
					Spatial reuse
					Transmit power control

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Tang, C.; Liu, Y. Machine Learning Approaches for Sharing Unlicensed Millimeter-Wave Bands in Heterogeneously Integrated Sensing and Communication Networks. Electronics 2023, 12, 4193. https://doi.org/10.3390/electronics12204193

AMA Style

Tang C, Liu Y. Machine Learning Approaches for Sharing Unlicensed Millimeter-Wave Bands in Heterogeneously Integrated Sensing and Communication Networks. Electronics. 2023; 12(20):4193. https://doi.org/10.3390/electronics12204193

Chicago/Turabian Style

Tang, Chunju, and Yanping Liu. 2023. "Machine Learning Approaches for Sharing Unlicensed Millimeter-Wave Bands in Heterogeneously Integrated Sensing and Communication Networks" Electronics 12, no. 20: 4193. https://doi.org/10.3390/electronics12204193

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Machine Learning Approaches for Sharing Unlicensed Millimeter-Wave Bands in Heterogeneously Integrated Sensing and Communication Networks

Abstract

1. Introduction

2. Coexistence Protocol, System Model and Problem Formulation

2.1. Coexistence Protocol for Sharing Unlicensed Bands

2.2. System Model and Problem Formulation

3. Potential Game for Sharing Unlicensed Millimeter-Wave Bands

4. Decentralized Machine Learning Approaches for Sharing Unlicensed Millimeter-Wave Bands

4.1. Stochastic Learning for Sharing Unlicensed Millimeter-Wave Bands

4.2. No-Regret Learning for Sharing Unlicensed Millimeter-Wave Bands

4.3. Binary Log-Linear Learning for Sharing Unlicensed Millimeter-Wave Bands

4.4. Q-Learning for Sharing Unlicensed Millimeter-Wave Bands

4.5. Multi-Armed Bandit Learning for Sharing Unlicensed Millimeter-Wave Bands

5. Computational Complexity Analysis

6. Simulation Results and Analysis

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI