June 27, 2020
In this paper, we propose an interference exploitation symbol-level precoding (SLP) method for multi-level modulations via an in-block power allocation scheme to greatly reduce the signaling overhead. Existing SLP approaches require the symbol-level broadcast of the rescaling factor to the users for correct demodulation, which hinders the practical implementation of SLP. The proposed approach allows a block-level broadcast of the rescaling factor as done in traditional block-level precoding, greatly reducing the signaling overhead for SLP without sacrificing the performance. Our derivations further show that the proposed in-block power allocation enjoys an exact closed-form solution and thus does not increase the complexity at the base station (BS). In addition to the significant alleviation of the signaling overhead validated by the effective throughput result, numerical results demonstrate that the proposed power allocation approach also improves the error-rate performance of the existing SLP. Accordingly, the proposed approach enables the practical use of SLP in multi-level modulations.
MIMO, symbol-level precoding, constructive interference, multi-level modulations, signaling overhead.
Precoding design has been extensively studied for multi-user transmission in the field of multi-antenna communication systems [1]. By exploiting the channel state information (CSI), linear precoding methods have been studied in the literature [2], [3], which however do not exploit the information of the data symbols that is also available at the base station (BS) before transmission. Recently, interference exploitation symbol-level precoding (SLP) that employs the information of both the data symbols to be transmitted and the channel knowledge has emerged as a new precoding method [4], [5]. Compared to common block-level precoding where the precoding matrix is applied to a block of data symbols, SLP applies different precoding matrix to different data symbols or directly designs the precoded signals to be transmitted at the antenna port, which allows to observe interference that is inherent in multi-user transmission on a symbol-by-symbol basis and further exploit it instead of cancelling it, thus greatly enhancing the performance of multi-antenna communication systems. Due to such benefits, SLP has gained increasing research attention in recent years.
The study of interference exploitation precoding originates from the adaptation of traditional zero-forcing (ZF) and regularized ZF (RZF) precoding to closed-form SLP, where the concept of constructive interference (CI) and destructive interference (DI) is characterized [[6]][7]-[8]. More specifically, [6] retains the CI with a dynamic linear precoding scheme while fully removing the DI via ZF, while a more superior scheme is proposed in [7] that further rotates the phases of DI such that all the interfering signals are constructive to the users. As a step further, optimization-based CI precoding is studied in [9] for PSK signaling under the context of vector perturbation precoding, where CI in the form of symbol scaling is proposed. In [[10]][11]-[12], CI precoding for signal-to-interference-plus-noise ratio (SINR)-constrained power minimization problems with PSK signaling is studied based on phase rotation, where the superiority of non-strict phase rotation is presented over the strict phase rotation considered in [6], [7], and [13] has further extended the CI precoding to QAM modulations. A distance-preserving CI metric is introduced in [14], which considers the power-constrained SINR balancing problems. Importantly, [15] and [16] have further derived the optimal precoding structure for CI precoding with PSK signaling and QAM signaling respectively, which greatly reduces the computational costs of CI precoding. While the above works have shown to be effective in addressing the complexity issue of SLP, it should be noted that the above SLP solutions for power-constrained SINR balancing problems with QAM modulations have assumed that the rescaling factor is known to the users, which would however result in an excessive signaling overhead and hinder the practical implementation of CI precoding, since the rescaling factor needs to be broadcast to the users on a symbol level for these power-constrained SINR balancing SLP methods.
Therefore in this paper, we aim to reduce such excessive signaling overhead for SLP and propose a practical SLP scheme for multi-level modulations via an in-block power allocation. Specifically, as opposed to traditional SLP schemes that consider a uniform power allocation for each symbol duration within a transmission block, we propose to allocate the available transmit power of SLP dynamically for each symbol duration within a transmission block, based on which a two-stage optimization problem on the precoding matrix and the power allocation strategy is formulated. We prove via the Karush-Kuhn-Tucker (KKT) conditions that the proposed dynamic power allocation will lead to an identical rescaling factor within a transmission block if we firstly design the precoding matrix followed by the design of the power allocation strategy, which then allows the block-level broadcast of the rescaling factor to the users as usually done in traditional block-level precoding and greatly reduces the signaling overhead for SLP. Our derivations further reveal an exact closed-form solution for the proposed in-block power allocation strategy, leading to only negligible complexity increase at the BS. Numerical results have validated the significant signaling overhead reduction brought by the proposed in-block power allocation scheme without sacrificing the error-rate performance of SLP, which further boosts the practical deployment of SLP for multi-level modulations.
Notations: \(a\), \(\boldsymbol{a}\), and \(\boldsymbol{A}\) denote scalar, column vector and matrix, respectively. \({( \cdot )^\text{T}}\), \({( \cdot )^\text{H}}\), \({( \cdot )^{-1}}\) denote transposition, conjugate transposition and inverse, respectively. \({{\cal C}^{n \times n}}\) and \({{\cal R}^{n \times n}}\) represent the sets of \(n\times n\) complex- and real-valued matrices, respectively. \(\Re \left\{ \cdot \right\}\) and \(\Im \left\{ \cdot \right\}\) extract the real and imaginary part, \(\circ\) is the Hadamard product, \(\left\| \cdot \right\|_2\) represents the \(\ell_2\)-norm and \(\jmath\) is the imaginary unit.
We focus on a generic multi-user multiple-input single-output (MU-MISO) communication system in the downlink, where the BS with \(N_\text{T}\) transmit antennas communicates with a total number of \(K\) single-antenna users simultaneously in the same time-frequency resource, where \(K \le N_\text{T}\). Assume that a transmission block consists of \(M\) symbol durations, within which the wireless channel stays constant, and we can express the data symbol matrix as \[{\boldsymbol{S}}=\left[ { {\boldsymbol{s}}^{\left( {1}\right)}, {\boldsymbol{s}}^{\left( {2}\right)}, \cdots, {\boldsymbol{s}}^{\left( {M}\right)} } \right] \in {\cal C}^{K \times M}, \label{eq951}\tag{1}\] where \({\boldsymbol{s}}^{\left( {m}\right)}=\left[ {s_1^{\left( m \right)}, s_2^{\left( m \right)}, \cdots, s_K^{\left( m \right)}} \right]^\text{T} \in {\cal C}^{K \times 1}\) is the data symbol vector in the \(m\)-th symbol duration drawn from a nominal QAM constellation, which is a representative example for multi-level modulations. Accordingly, the received signal for user \(k\) in the \(m\)-th symbol duration can be expressed as \[y_k^{\left( {m}\right)}=\sqrt{p^{\left( {m}\right)}} \cdot {\boldsymbol{h}}_k^\text{T} {\boldsymbol{W}}^{\left( {m}\right)} {\boldsymbol{s}}^{\left( {m}\right)} + n_k^{\left( {m}\right)}, \label{eq952}\tag{2}\] where \({y}_k^{\left( {m}\right)}\) is the received signal for user \(k\) in the \(m\)-th symbol duration, \(p^{\left( {m}\right)}\) represents the allocated transmit power for the \(m\)-th symbol duration, \({\boldsymbol{h}}_k^\text{T} \in {\cal C}^{1 \times N_\text{T}}\) is the flat-fading Rayleigh channel between the BS and the users that is constant within the entire transmission block, \({\boldsymbol{W}}^{\left( {m}\right)} \in {\cal C}^{N_\text{T} \times K}\) denotes the precoding matrix, and \(n_k^{\left( {m}\right)}\) is the corresponding additive Gaussian noise at the users with zero mean and variance \(\sigma^2\). Since we focus on SLP and have included the allocated transmit power \(p^{\left( {m}\right)}\) in 2 , the symbol-level power constraint is enforced as \(\left \| \mathbf{W}^{\left ( m \right )} \mathbf{s}^{\left ( m \right )} \right \|_{2}^2 = 1\), and we have \(\sum_{m=1}^{M} p^{\left ( m \right )} \leq P_{\text{T}}\), where \(P_\text{T}\) represents the total available transmit power for the transmission block.
At the receiver side, \(y_k^{\left( {m}\right)}\) needs to be scaled for correct demodulation when multi-level modulations are employed, and the signals ready for demodulation can be expressed as \[r_k^{\left( {m}\right)}= f^{\left( {m}\right)} y_k^{\left( {m}\right)}= f^{\left( {m}\right)} \sqrt{p^{\left( {m}\right)}} \cdot {\boldsymbol{h}}_k^\text{T} {\boldsymbol{W}}^{\left( {m}\right)} {\boldsymbol{s}}^{\left( {m}\right)} + f^{\left( {m}\right)} n_k^{\left( {m}\right)}, \label{eq953}\tag{3}\] where \(f^{\left( {m}\right)}\) is the rescaling factor for the \(m\)-th symbol duration, also termed as the noise amplification factor [16]. Since \({\boldsymbol{W}}^{\left( {m}\right)}\) and \({\boldsymbol{s}}^{\left( {m}\right)}\) are known, \(f^{\left( {m}\right)}\) can be calculated at the transmitter side but is not known at the receiver side, and therefore needs to be broadcast to each user on a symbol level for SLP.
CI can be defined as the interference that pushes the received signal of interest further away from all of the decision boundaries [4], which effectively increases the distance between the received signal and the decision boundaries and further improves the performance of demodulation. As an illustrative example, we depict one quarter of a 16QAM constellation in Fig. 1, where the green shaded area represents the constructive region, within which the received signal receives CI, while the residual area in the QAM constellation is the destructive region, within which the corresponding signal receives DI. Based on the definition of CI, for QAM modulations CI can be exploited by the outermost constellation points belonging to type B, C and D in Fig. 1 (which will be explained later in Section III-A), while all the interference is seen as destructive for the inner constellation points.
In this section, we introduce the proposed two-stage optimization on the precoding \({\boldsymbol{W}}^{\left( {m}\right)}\) and power allocation \({p}^{\left( {m}\right)}\), where the optimization on \({\boldsymbol{W}}^{\left( {m}\right)}\) is firstly designed, based on which the optimization on \({p}^{\left( {m}\right)}\) is performed.
In traditional SLP approach, uniform transmit power is assumed for each symbol duration, i.e., \(p^{\left( m \right)}=\frac{P_\text{T}}{M}\), \(\forall m \in {\cal M}\), where \({\cal M} = \left\{ {1,2,\cdots,M} \right\}\). In this letter, we aim to optimize both the transmit power and the precoding matrix for each symbol duration such that the minimum CI effect within the considered transmission block is maximized, and such a two-stage optimization will return an identical rescaling factor within the transmission block, as will be mathematically shown in Section III-B. Following [16], we divide the constellation points for a QAM modulation into 4 types, as shown in Fig. 1. We follow the symbol-scaling CI metric in [16] and decompose each constellation point into \[s_k^{\left( m \right)}=s_k^{\cal A} + s_k^{\cal B}, \label{eq954}\tag{4}\] where \(s_k^{\cal A}=\Re \left\{ s_k^{\left( m \right)} \right\}\) and \(s_k^{\cal B} = \jmath \cdot \Im \left\{ s_k^{\left( m \right)} \right\}\). Following a similar principle, we decompose the noiseless received signal \({\boldsymbol{h}}_k^\text{T} {\boldsymbol{W}}^{\left( {m}\right)} {\boldsymbol{s}}^{\left( {m}\right)}\) into \[{\boldsymbol{h}}_k^\text{T} {\boldsymbol{W}}^{\left( {m}\right)} {\boldsymbol{s}}^{\left( {m}\right)}=\alpha_k^{\cal A} s_k^{\cal A} + \alpha_k^{\cal B}s_k^{\cal B}, \label{eq955}\tag{5}\] where \(\alpha_k^{\cal A}\) and \(\alpha_k^{\cal B}\) are two introduced real scalars that jointly determine the effect of interference on \(s_k^{\left( m \right)}\), and for simplicity of notation we have removed the index of symbol duration for \(\alpha_k^{\cal A}\), \(\alpha_k^{\cal B}\), \(s_k^{\cal A}\) and \(s_k^{\cal B}\).
Based on Fig. 1, we observe that for QAM modulations, all the interference for 1) the real and imaginary part of constellation point type ‘A’, 2) the real part of constellation point type ‘C’, and 3) the imaginary part of constellation point type ‘B’ is destructive. Therefore, to guarantee the correct demodulation of these constellation points after precoding, the relative position for constellation type ‘A’, the real part of constellation type ‘C’ and the imaginary part of constellation type ‘B’ should remain constant, i.e., the following conditions need to be satisfied: \[\alpha_p^{\cal A}=\alpha_p^{\cal B}, {\kern 3pt} \alpha_p^{\cal A}=\alpha_q^{\cal A}, {\kern 3pt} \alpha_p^{\cal B}=\alpha_j^{\cal B},\] if \(s_p\) belongs to constellation type ‘A’, \(s_q\) belongs to constellation type ‘C’, and \(s_j\) belongs to constellation type ‘B’. For notational simplicity, we further define a set \(\cal I\) that consists of the scaling factors for these inner constellation points, given by \[{\cal I}=\left\{ {\alpha_p^{\cal A}, \alpha_p^{\cal B}, \alpha_q^{\cal A}, \alpha_j^{\cal B}, \forall p,q,j} \right\}=\left\{ {\alpha_n^{\cal I}, \forall n} \right\},\] and we can obtain that each \(\alpha_n^{\cal I}\) should be identical for correct demodulation. Similarly, we can also define a set \(\cal O\) consisting of the outer constellation points that can exploit CI, i.e., 1) the real part of constellation point type ‘B’, 2) the imaginary part of constellation point type ‘C’, and 3) the real and imaginary part of constellation point type ‘D’, given by \[{\cal O}=\left\{ {\alpha_p^{\cal A}, \alpha_p^{\cal B}, \alpha_q^{\cal A}, \alpha_j^{\cal B}, \forall p,q,j} \right\}=\left\{ {\alpha_l^{\cal O}, \forall l} \right\},\] if \(s_p\) belongs to constellation type ‘D’, \(s_q\) belongs to constellation type ‘B’, and \(s_j\) belongs to constellation type ‘C’, and we obtain that to achieve CI, the value of the scaling factors in \(\cal O\) should be made not smaller than the value of the scaling factors in \(\cal I\).
Accordingly, the optimization problem on \({\boldsymbol{W}}^{\left( {m}\right)}\) to maximize the CI effect for the \(m\)-th symbol duration can then be formulated as [16] \[\begin{align} &\mathcal{P}_1: {\kern 3pt} \mathop{\max }\limits_{{\boldsymbol{W}}^{\left( {m}\right)}, t^{\left( {m}\right)}, \alpha_k^{\cal A}, \alpha_k^{\cal B}} {\kern 5pt} t^{\left( {m}\right)} \\ &{\kern 0pt} s. t. {\kern 10pt} {\boldsymbol{C}1:} {\kern 3pt} {\boldsymbol{h}}_k^\text{T} {\boldsymbol{W}}^{\left( {m}\right)} {\boldsymbol{s}}^{\left( {m}\right)}=\alpha_k^{\cal A} s_k^{\cal A} + \alpha_k^{\cal B}s_k^{\cal B}, {\kern 3pt} \forall k \in {\cal K}; \\ &{\kern 24pt} {\boldsymbol{C}2:} {\kern 3pt} t^{\left( {m}\right)} \le \alpha_l^{\cal O}, {\kern 3pt} \forall \alpha_l^{\cal O} \in {\cal O};\\ &{\kern 24pt} {\boldsymbol{C}3:} {\kern 3pt} t^{\left( {m}\right)} = \alpha_n^{\cal I}, {\kern 3pt} \forall \alpha_n^{\cal I} \in {\cal I};\\ &{\kern 24pt} {\boldsymbol{C}4:} {\kern 1pt} \left \| \mathbf{W}^{\left ( m \right )} \mathbf{s}^{\left ( m \right )} \right \|_{2}^2 = 1, \label{eq956} \end{align}\tag{6}\] where \({\cal K}=\left\{ {1,2,\cdots,K} \right\}\). Although \({\cal P}_1\) is not convex due to the strict equality power constraint in \({\boldsymbol{C}4}\), we note that \({\cal P}_1\) can be transformed into a convex problem by relaxing the equality constraint in C4 into inequality, given by \[\begin{align} &\mathcal{P}_2: {\kern 3pt} \mathop{\max }\limits_{{\boldsymbol{W}}^{\left( {m}\right)}, t^{\left( {m}\right)}, \alpha_k^{\cal A}, \alpha_k^{\cal B}} {\kern 5pt} t^{\left( {m}\right)} \\ &{\kern 0pt} s. t. {\kern 10pt} {\boldsymbol{C}1:} {\kern 3pt} {\boldsymbol{h}}_k^\text{T} {\boldsymbol{W}}^{\left( {m}\right)} {\boldsymbol{s}}^{\left( {m}\right)}=\alpha_k^{\cal A} s_k^{\cal A} + \alpha_k^{\cal B}s_k^{\cal B}, {\kern 3pt} \forall k \in {\cal K}; \\ &{\kern 24pt} {\boldsymbol{C}2:} {\kern 3pt} t^{\left( {m}\right)} \le \alpha_l^{\cal O}, {\kern 3pt} \forall \alpha_l^{\cal O} \in {\cal O};\\ &{\kern 24pt} {\boldsymbol{C}3:} {\kern 3pt} t^{\left( {m}\right)} = \alpha_n^{\cal I}, {\kern 3pt} \forall \alpha_n^{\cal I} \in {\cal I};\\ &{\kern 24pt} {\boldsymbol{C}4:} {\kern 1pt} \left \| \mathbf{W}^{\left ( m \right )} \mathbf{s}^{\left ( m \right )} \right \|_{2}^2 \le 1, \label{krzugoxq} \end{align}\tag{7}\] which shares the same optimal solution as \({\cal P}_1\) since the strict equality in \({\boldsymbol{C}4}\) is always achieved for the optimality of \({\cal P}_2\), as shown in [16]. Moreover, we observe that the constraint \({\boldsymbol{C}3}\) indicates that the nominal constellation is scaled by \(t^{\left( {m}\right)}\) due to the effect of the wireless channel, and together with 2 , the rescaling factor for the received signals in the \(m\)-th symbol duration can be obtained as: \[f^{\left( {m}\right)}=\frac{1}{t^{\left( {m}\right)}\sqrt{p^{\left( {m}\right)}}}. \label{eq957}\tag{8}\]
Given that the optimization on \({\boldsymbol{W}}^{\left( {m}\right)}\) in \({\cal P}_1\) has been well studied in [16], in the following we focus on the optimization on \(p^{\left( {m}\right)}\) for given \(t^{\left( {m}\right)}\). Accordingly, the optimization on \(p^{\left( {m}\right)}\) to maximize the minimum CI effect within a transmission block, which is equivalent to minimizing the maximum noise amplification effect \(f^{\left( {m}\right)}\), can be constructed as: \[\begin{align} &\mathcal{P}_3: {\kern 3pt} \mathop{\max } \limits_{p^{\left( {m}\right)}} \mathop{\min } \limits_{m} {\kern 3pt} t^{\left( {m}\right)}\sqrt{p^{\left( {m}\right)}} \\ &{\kern 0pt} s. t. {\kern 10pt} {\boldsymbol{C}1:} {\kern 3pt} \sum_{m=1}^{M} p^{\left ( m \right )} \leq P_{\text{T}}. \label{eq959} \end{align}\tag{9}\] By introducing an auxiliary variable \(u_m=\sqrt{p^{\left ( m \right )}}\), the above problem can be further expressed in a standard convex form: \[\begin{align} &\mathcal{P}_4: {\kern 3pt} \mathop{\min } \limits_{u_m} {\kern 3pt} -g \\ &{\kern 0pt} s. t. {\kern 10pt} {\boldsymbol{C}1:} {\kern 3pt} g - t^{\left( {m}\right)}{u_m} \le 0, {\kern 3pt} \forall m \in {\cal M} ; \\ &{\kern 23.5pt} {\boldsymbol{C}2:}\sum_{m=1}^{M} u_m^2 - P_{\text{T}} \le 0, \label{eq9510} \end{align}\tag{10}\] based on which the following proposition is obtained.
Proposition 1: When the optimality of \({\cal P}_4\) is achieved, we arrive at an identical rescaling factor for each symbol duration in the transmission block, i.e, \(f^{\left( {1}\right)}=f^{\left( {2}\right)}=\cdots=f^{\left( {M}\right)}\).
Proof: We prove this proposition via the KKT conditions, where the Lagrangian of \({\cal P}_4\) is formulated as \[\begin{align} &{\cal L} \left( {u_m ,g, \delta_m, \vartheta } \right)= -g+\sum_{m=1}^{M}\delta_m\left ( g-t^{\left ( m \right )} u_{m} \right ) \\ & {\kern 77pt} + \vartheta \left( {\sum_{m=1}^{M} u_m^2 - P_{\text{T}}} \right) \\ & {\kern 20pt} = \left( {{\boldsymbol{1}}^\text{T} {\boldsymbol{\delta}} -1} \right)g + \vartheta \cdot {\boldsymbol{u}}^\text{T} {\boldsymbol{u}} - \left( {{\boldsymbol{\delta}} \circ {\boldsymbol{t}}} \right)^\text{T} {\boldsymbol{u}} - \vartheta \cdot P_{\text{T}}. \end{align} \label{eq9511}\tag{11}\] In 11 , \({\boldsymbol{\delta}}=\left[ {\delta_1,\delta_2,\cdots,\delta_M} \right]^\text{T}\), \({\boldsymbol{u}}\) and \({\boldsymbol{t}}\) are similarly defined, and \({\boldsymbol{1}}=\left[ {1,1,\cdots,1} \right]^\text{T}\).
Based on 11 , the corresponding KKT conditions for \({\cal P}_4\) are given by
rCl = ( 1^ -1 ) = 0
= - ( ) = 0, m
_m( g-t^( m ) u_m ) = 0, m
( _m=1^M u_m^2 - P_ ) = 0
where \(\delta_m\), \(\forall m \in {\cal M}\) and \(\vartheta\) are the introduced Lagrange multipliers corresponding to the inequality constraints C1 and C2 in \({\cal P}_4\), respectively. By observing (12), firstly we obtain that \(\vartheta \ne 0\), since \(\vartheta = 0\) results in \(\delta_m=0\), \(\forall m \in {\cal M}\) based on [eq9512b], which contradicts with [eq9512a]. This means that the transmit power constraint is active when optimality is achieved, i.e., \(\sum_{m=1}^{M} u_m^2 = P_{\text{T}}\). Subsequently, according to [eq9512c] and given that \(\delta_m \ne 0\), \(\forall m \in {\cal M}\), we further obtain \(g-t^{\left ( m \right )} u_{m}=0\), \(\forall m \in {\cal M}\), which is equivalent to \[t^{\left( {1}\right)}{u_1}=t^{\left( {2}\right)}{u_2}=\cdots=t^{\left( {M}\right)}{u_M}, \label{eq9513}\tag{12}\] and with \(u_m=\sqrt{p^{\left ( m \right )}}\), we arrive at \[t^{\left( {1}\right)}\sqrt{p^{\left ( 1 \right )}}=t^{\left( {2}\right)}\sqrt{p^{\left ( 2 \right )}}=\cdots=t^{\left( {M}\right)}\sqrt{p^{\left ( M \right )}}, \label{eq9514}\tag{13}\] which based on 8 completes the proof. \(\blacksquare\)
Moreover, we can further derive the closed-form solution of the optimal transmit power value \(p^{\left ( m \right )}\), given by the following proposition.
Proposition 2: The value of the optimal transmit power \(p^{\left ( m \right )}\) can be obtained in a closed form as \[p^{\left ( m \right )}=\frac{\frac{1}{\left( t^{\left ( m \right )} \right)^2}}{\sum_{m=1}^{M} \frac{1}{\left( t^{\left ( m \right )} \right)^2} } \cdot P_\text{T}, \label{eq9515}\tag{14}\] and the value of the rescaling factor in the considered transmission block is given by \[f^{\left( {1}\right)}=f^{\left( {2}\right)}=\cdots=f^{\left( {M}\right)}=\sqrt{\frac{\sum_{m=1}^{M} \frac{1}{\left( t^{\left ( m \right )} \right)^2} }{P_\text{T}}}. \label{eq95f}\tag{15}\]
Proof: The closed-form solution can readily be obtained based on the results in Proposition 1 where \(\sum_{m=1}^{M} p^{\left ( m \right )} = P_{\text{T}}\) and \(t^{\left( {1}\right)}\sqrt{p^{\left ( 1 \right )}}=t^{\left( {2}\right)}\sqrt{p^{\left ( 2 \right )}}=\cdots=t^{\left( {M}\right)}\sqrt{p^{\left ( M \right )}}\), and the value of \(f^{\left( {m}\right)}\) can be obtained accordingly. \(\blacksquare\)
Based on Proposition 2 above, it is observed that the rescaling factor for each symbol duration in the considered transmission block is identical and therefore only needs to be fed to the receivers once in a transmission block, which greatly reduces the signaling overhead for SLP with multi-level constellations.
In this section, we evaluate the performance of SLP with the proposed in-block power allocation via Monte Carlo simulations. We consider a practical communication system where the BS employs \(B\) bits to broadcast the rescaling factor \(f\) to the users, which is subject to quantization errors due to the limited feed-forwarding, given by \[\hat{f}= f + \epsilon , \label{eq9516}\tag{16}\] where \(\hat{f}\) is the rescaling factor received at the users, and \(f\) is the ideal rescaling factor known at the BS as in 3 . \(\epsilon \sim {\cal N} \left( {0, \nu } \right)\) is the quantization error, whose variance is modeled according to [9], [17], [18] as \[\nu=\frac{f_\text{max}}{2^B}, \label{eq9517}\tag{17}\] where \(f_\text{max}\) is the maximum value of the rescaling factor with non-zero probability. Accordingly, we evaluate the performance of different precoding techniques via the effective throughput for \(\mathbb{M}\)-QAM, defined as throughput minus the total number of bits required for feed forwarding, given by \[\begin{align} T_\text{eff} &=\text{max} \left\{ { \left( {1-\text{BLER}} \right) \cdot \log _{2}\left ( \mathbb{M} \right ) \cdot K - N_\text{overhead}, 0 } \right\} \\ &= \text{max} \left\{ { \left( {1-P_\text{b}} \right)^{M\cdot \log _{2}\left ( \mathbb{M} \right )} \cdot \log _{2}\left ( \mathbb{M} \right ) \cdot K-N_\text{overhead}, 0 } \right\}, \end{align}\] where BLER is the block error rate, and \(P_\text{b}\) is the bit error rate (BER). \(N_\text{overhead}\) is the total number of bits for signaling overhead, and \(N_\text{overhead}=\frac{B}{M}\) for ZF, RZF, and SLP with in-block power allocation, while \(N_\text{overhead}=B\) for traditional SLP with uniform power allocation. Without loss of generality, we assume that the number of bits for broadcasting the rescaling factor to the users is \(B=5\) bits. The maximum available transmit power in a transmission block is \(P_\text{T}=1\), and the transmit SNR in each symbol duration is accordingly defined as \(\rho=\frac{1}{M \sigma^2}\). We compare our proposed scheme with traditional block-level ZF and RZF precoding via 5000 channel realizations, and both 16QAM and 64QAM modulations are considered in the simulations.



Figure 2: Numerical results for 16QAM and 64QAM, \(K=N_\text{T}=12\), \(f_\text{max}=1\), \(B=5\).
Fig. 2 validates the effectiveness of the proposed in-block power allocation in reducing the signaling overhead for both 16QAM and 64QAM, where we assume \(M=10\) symbols for ease of illustration. Compared to traditional SLP approach where the value of the rescaling factor varies for each symbol duration in the transmission block, which requires a symbol-level broadcast of these values to the users for correct demodulation, SLP with the proposed in-block power allocation scheme returns an identical rescaling factor within a transmission block, which then enables the block-level broadcast of the rescaling factors as done in traditional block-level precoding, validating the effectiveness of the proposed scheme.
Fig. 3 compares the BER of SLP with the proposed in-block power allocation with ZF precoding with practical \(M=200\) symbols, RZF precoding and SLP with uniform power allocation, for both 16QAM and 64QAM. Since we have considered quantization errors in broadcasting the rescaling factors to the users as in practical communication systems, the traditional SLP technique is observed to exhibit BER losses when the transmit SNR increases and becomes inferior to traditional ZF/RZF precoding. On the contrary, the SLP method with the proposed in-block power allocation has alleviated such performance degradation, and meanwhile still offers transmit SNR gains over traditional schemes.
Fig. 4 depicts the effective throughput for different precoding schemes to further highlight the significance of the proposed in-block power allocation strategy for SLP with practical \(M=200\) symbols. As can be observed, traditional SLP methods have shown the worst effective throughput performance due to the requirement of symbol-level broadcast of the rescaling factor, which results in an excessive signaling overhead. Meanwhile, we observe that the proposed scheme can greatly improve the effective throughput for SLP and achieves the highest throughput performance over ZF and RZF precoding. Both of the BER and throughput results above have exhibited the superiority and significance of the proposed scheme for existing SLP with multi-level modulations.
In this letter, we have designed an in-block power allocation scheme for SLP with multi-level modulations, which returns an identical rescaling factor for the entire transmission block. The proposed scheme can thus greatly reduce the signaling overhead of existing SLP schemes by reducing the frequency for the broadcast of the rescaling factor from symbol level to block level, achieving a similar signaling overhead to traditional block-level precoding methods without sacrificing the performance. By proposing a two-stage precoding and power allocation design, our derivations have shown that the in-block power allocation enjoys a closed-form solution, which is thus efficient to deploy. In addition, numerical results demonstrate that the proposed power allocation scheme exhibits significant performance improvements for SLP in terms of the effective throughput over existing SLP techniques without in-block power allocation.