Recent research has explored advanced designs for reconfigurable intelligent surfaces (RIS) with integrated sensing functions. One promising concept is the hybrid RIS (HRIS), which blends sensing and reflecting meta-atoms. This enables the HRIS to process signals, aiding in channel estimation (CE) and symbol detection tasks. This paper formulates novel semi-blind receivers for HRIS-aided wireless communications that enable joint symbol and channel estimation at the HRIS and the base station (BS). The proposed receivers exploit tensor coding at the transmit side while capitalizing on the multilinear structures of the received signals. We develop iterative and closed-form receiver algorithms for joint estimation of the uplink channels and symbols at both the HRIS and the BS. The proposed receivers offer symbol decoding capabilities to the HRIS and ensure ambiguity-free separate CE without requiring an a priori training stage. We also study identifiability conditions that guarantee unique joint channel and symbol recovery, and discuss the computational complexities and tradeoffs involved in the proposed semi-blind receivers. Our findings demonstrate the competitive performance of the proposed solutions at the HRIS and the BS and unveil distinct performance trends based on the possible combinations of HRIS-BS receiver pairs. Finally, extensive numerical results elucidate the interplay between power splitting, symbol recovery, and CE accuracy in HRIS-assisted communications. Such insights are pivotal for optimizing receiver design and enhancing system performance in future HRIS deployments.
Hybrid RIS, joint channel and symbol estimation, reconfigurable surfaces, semi-blind receivers, tensor modeling.
Recently, reconfigurable intelligent surface (RIS) has been envisioned as a key enabling technology for deploying future wireless networks, for example, the sixth generation (6G) [1]–[6]. RIS is a large array of passive reflecting elements mounted on a planar surface that can independently interact with the impinging electromagnetic waves by means of software-controllable phase shifts [3], [4], [7]–[9]. Several applications for RIS can be found in the literature, such as coverage for users located in dead zones and co-channel interference suppression for users located at the edges of cells [9], [10], improvement of the physical layer security [11], integration with unmanned aerial vehicles (UAVs) and other aerial platforms [12], simultaneous wireless information and power transfer (SWIPT) [9], and integrated sensing and communications (ISAC) [13], to mention a few.
In this context, accurate channel state information (CSI) is crucial in optimizing RIS-assisted systems [14], [15]. Its acquisition is both necessary and challenging for designing the RIS reflection coefficients as well as the precoder/beamformer at both the base station (BS) and user terminal (UT) [10], [16]–[18]. In general, channel estimation (CE) in multiple-input multiple-output (MIMO) RIS-assisted wireless communication systems faces two main challenges. The first is a notable increase in the required number of pilots compared to conventional systems, since the large number of RIS elements leads to a significantly larger number of channel coefficients [19]–[24]. The second is the inability to estimate the separate channels that make up the RIS-assisted link, namely the UT-RIS and RIS-BS links, since the passive RIS (PRIS) acts only as a signal reflector [20], [22]–[25]. Consequently, only the cascaded channel is estimated, and all the receiver processing is done at the BS or the UT. Solutions in the literature that address these challenges are usually sorted into two families of methods. The first incorporates novel algorithms that leverage the channel structure while preserving the original hardware properties of the PRIS. In contrast, the second modifies the hardware architecture to allow for additional signal processing capabilities at the RIS [19]. This paper relies on the second approach.
Given the passive nature of the RIS, most CE-related work focuses on cascaded CE, which is sufficient for applications like rate maximization and beamforming design. In contrast, a scaling ambiguity-free separate CE is preferred for applications like channel sounding, user localization, and mobility tracking, as highlighted in [28]. For instance, in mobility scenarios, [29] argues that separate CE facilitates channel tracking by identifying the behavior of the individual links under temporal variations. As pointed out therein, cascaded CE complicates tracking because changes may occur in the UT-RIS link, the RIS-BS link, or both. Moreover, in scenarios where the UT-RIS channel changes more rapidly than the RIS-BS one, the former must be estimated more often than the latter, which highlights the importance of recovering these channels individually rather than in combined form [16], [30]. In addition, some designs depend on the availability of the individual channels, such as those in [31], [32], and [33]. The importance of estimating the involved channels separately is reinforced in [22], [34]–[37].
From a hardware perspective, a notable study was carried out in [38]. Specifically, the authors in [38] proposed an RIS structure that sparsely replaces some passive reflecting elements with active sensors, each connected to a single receive RF-chain, thereby enabling baseband processing at the RIS controller. This receive RF-chain comprises a low-noise amplifier, a downconverter mixer (from RF to baseband), and an analog-to-digital converter [39]. These active elements merely sense the impinging signal without reflection. Adopting this hybrid architecture, the authors introduced a CE scheme based on compressed sensing and deep learning, achieving minimal pilot overhead and facilitating the CE process at the cost of increased hardware complexity and power consumption. Inspired by the pioneering research in [38], such a hybrid architecture has been comprehensively investigated in subsequent works, such as [23]–[25], [37] and references therein.
In contrast to RIS, another metasurface-based technology called dynamic metasurface antenna (DMA) has emerged to enable low-cost, extremely large antenna arrays [40]. Despite the differences between RIS and DMA operation, it is possible to envision a hybrid meta-atom capable of both reflecting and sensing, since their meta-atoms share similarities [3]. Motivated by DMAs, a novel metasurface was introduced in [41], where the physical structure of each meta-atom is modified to couple small portions of the incoming wave. Relying on this paradigm, [3] introduced the so-called hybrid RIS (HRIS) architecture, outlining its prospects and obstacles. These meta-atoms are integrated into sampling waveguides similarly to [41], and the sensed signals are forwarded via analog combining to RF-chains, whose outputs enable signal processing in the digital domain while the meta-atoms retain their reconfigurable reflection functionalities. The split between the reflected and sensed portions is dictated by the coupling level, which is controlled by changing either the substrate-integrated waveguide or the annular slot sizes [3]. Based on this architecture and leveraging transmitted pilots, the work [22] exploits the signal processing capabilities at the HRIS to estimate the UTs-HRIS channels from the sensed signal part. In contrast, the HRIS-BS channel is estimated at the BS from the reflected part. This is accomplished by exploiting a feedback control link (CL) between the HRIS and the BS, through which the BS acquires the UT-HRIS channel matrix estimated at the HRIS. This CL is also used to reconfigure the reflection patterns of the RIS [4], [9], [42]. In [22], error-free transmission over a high-throughput CL was considered. Detailed studies of different advanced/hybrid architectures are provided in [1] and [4].
Tensor decompositions have been successfully applied to model wireless communication systems [43], [44], including blind/semi-blind receivers [45] and space-time (ST)/space-time-frequency (STF) coding schemes [46], [47]. These works have highlighted the effectiveness of tensor decompositions and their powerful uniqueness properties to harness the multidimensional nature of received signals and channels for deriving receiver algorithms capable of operating semi-blindly under less restrictive requirements than competing (matrix-based) methods, while offering good performance/complexity tradeoffs. See [48]–[51] and references therein for an overview. Recent works have introduced tensor modeling to the context of passive RIS (PRIS)-aided communications to solve problems such as CE, semi-blind joint CE and symbol detection, and channel tracking [26], [27], [52]–[55]. Among these works, parallel factor (PARAFAC) decomposition, also known as canonical polyadic decomposition (CPD) [56], was applied to solve the CE problem in a PRIS approach [26], [27], [57], and in a hybrid semi-passive one [24], [58]. More recently, [55] proposed PARAFAC-based algorithms for CE that account for RIS imperfections arising from real-world effects. In particular, without requiring prior CE via training sequences, [53] and [54] introduced data-aided semi-blind CE methods for PRIS-aided communications using Khatri-Rao ST coding (KRSTC), integrating symbol detection and CE through closed-form and iterative receivers, respectively. In [54], the authors exploited the PARATUCK tensor decomposition [59], [60], while a generalized version was presented in [53]. In [61], an algebraic framework was derived to reduce the computational complexity of the iterative receiver proposed in [54]. Nonetheless, KRSTC restricts the number of streams to the number of transmitting antennas.
Different from the aforementioned works, which are mostly restricted to PRIS, where the cascaded CE problem is concentrated at the BS, this paper shows that the estimation of the individual channels and the transmitted symbols can be achieved jointly at both the HRIS and BS in a semi-blind fashion by resorting to tensor modeling. Moreover, in contrast to [22], which relies on pilot-assisted CE at the HRIS, our approach expands upon this by incorporating joint symbol and CE directly at the HRIS. This is achieved iteratively or in closed form using simple algorithms without the need for pilot training. As will be discussed later, empowering HRIS with symbol-decoding capabilities is useful in several scenarios. Part of this work has been presented in a conference paper [58], which was limited to a pair of closed-form receivers. This work goes beyond our previous study by i) developing a series of new semi-blind “HRIS-BS” receiver pairs; ii) presenting detailed derivations of the proposed algorithms; iii) delving into identifiability, uniqueness, and computational complexity of the proposed solutions, while discussing their tradeoffs, and iv) providing an extensive numerical performance evaluation.
The contributions of this paper are summarized as follows:
First, leveraging the HRIS architecture [3] and assuming a one-way structured time domain protocol, we formulate the received signals at both the HRIS and BS using a new tensor formalism that disentangles the received signal into effective channel tensors and coded signal tensors following PARAFAC and/or Tucker decompositions. Exploiting these tensor models allows the HRIS to jointly estimate the associated channel and decode the transmitted symbols in a semi-blind fashion. By transmitting data symbols in advance during the CE stage, our approach can improve data rate and reduce symbol decoding delay compared to pilot-only methods.
Second, capitalizing on the proposed tensor models, we derive semi-blind joint symbol and CE methods for HRIS-aided MIMO wireless communication systems. More specifically, we formulate iterative and closed-form receiver pairs split between HRIS and BS to solve the semi-blind CE problem effectively. The proposed receivers eliminate the need for training sequences and additional steps for scaling ambiguity removal on the estimated channels and symbols, while partially circumventing the path-loss effects induced by the cumulated UT-HRIS and HRIS-BS links.
Third, we study identifiability at both the HRIS and the BS, derive a set of conditions that ensure unique channel and symbol recovery, and discuss the computational complexities and trade-offs involved in the proposed semi-blind receivers.
Finally, extensive numerical results showcase the interplay between power splitting, symbol recovery, and CE accuracy in HRIS-assisted communications. Our findings demonstrate competitive performances among receivers at the HRIS and BS and uncover distinct performance trends based on the combinations of HRIS-BS receiver pairs. We also dive into a brief exploration of scenarios where a joint symbol and CE at the HRIS are useful.
The rest of the paper is organized as follows. Section 2 describes the system and signal models at the HRIS and the BS, including the transmission protocol and the main assumptions. Section 3 derives the corresponding tensor signal models and develops the core equations associated with the receiver design. The proposed semi-blind receivers for the HRIS and the BS are detailed in Sections 4 and 5. Sections 6, 7, and 8 discuss identifiability, uniqueness, and computational complexity, respectively. Section 9 presents the evaluated combinations of the “HRIS-BS” semi-blind receivers. Section 10 contains our numerical results, and Section 11 discusses the potential use cases benefiting from the proposed joint channel/symbol estimation at the HRIS. Finally, conclusions are drawn in Section 12.
We use lowercase \(a\), bold lowercase \(\mathit{\boldsymbol{a}}\), bold uppercase \({\mathbf{A}}\), and calligraphic \(\mathcal{A}\) to denote scalars, column vectors, matrices, and tensors, respectively. The \((i,j)\)-th element of \({\mathbf{A}}\) is denoted as \([{\mathbf{A}}]_{i,j}\). The transpose, conjugate, and Moore-Penrose pseudo-inverse of \({\mathbf{A}}\) are denoted as \({\mathbf{A}}^{{\rm T}}\), \({\mathbf{A}}^\ast\), and \({\mathbf{A}}^\dagger\), respectively. The operator \({\rm diag\!}\left\{ \mathit{\boldsymbol{a}} \right\}\) constructs a diagonal matrix from \(\mathit{\boldsymbol{a}}\). \(\left\lceil{a}\right\rceil\) is the smallest integer greater than or equal to \(a\), and the Frobenius norm is indicated by \(\left|\left|\cdot\right|\right|_{\rm F}\). The symbols \(\diamond\) and \(\otimes\) represent the Khatri-Rao and Kronecker matrix products, respectively. Given \({\mathbf{A}}\! \in\!\mathbb{C}^{I \times J}\), the vectorization operator, denoted as \({\rm vec}\left\{ {\mathbf{A}} \right\}\), stacks the columns of \({\mathbf{A}}\) into the vector \(\mathit{\boldsymbol{a}} \in \mathbb{C}^{JI \times 1}\). Conversely, the reverse operation, \(\mathrm{unvec}_{I \times J}(\mathit{\boldsymbol{a}})\), restores the matrix \({\mathbf{A}}\). A tensor \(\mathcal{A}\! \in \!\mathbb{C}^{I_1 \times I_2 \times \cdots \times I_P}\) is a multidimensional array of order \(P\). Unfolding is the procedure that reshapes a tensor into a matrix. For instance, a 3rd-order tensor can be matricized such that one mode varies along the rows and the other two along the columns. This is referred to as \(n\)-mode unfolding, \(n \in \{1,2,3\}\). 
The 1-mode, 2-mode, and 3-mode unfoldings of \(\mathcal{A} \in \mathbb{C}^{I_1 \times I_2 \times I_3}\) are respectively given by \[\begin{align} \tag{1}\left[ \mathcal{A} \right]_{(1)} &= [\mathcal{A}_{\cdot\cdot 1},\cdots,\mathcal{A}_{\cdot\cdot I_3}] \in \mathbb{C}^{I_1 \times I_3I_2},\\ \tag{2}\left[ \mathcal{A} \right]_{(2)} &= [\mathcal{A}_{\cdot\cdot 1}^{{\rm T}},\cdots,\mathcal{A}_{\cdot\cdot I_3}^{{\rm T}}] \in \mathbb{C}^{I_2 \times I_3I_1},\\ \tag{3}\left[ \mathcal{A} \right]_{(3)} &= [{\rm vec}\left\{ \mathcal{A}_{\cdot\cdot 1} \right\},\cdots,{\rm vec}\left\{ \mathcal{A}_{\cdot\cdot I_3} \right\}]^{{\rm T}}\in \mathbb{C}^{I_3 \times I_2I_1}, \end{align}\] where \(\mathcal{A}_{\cdot\cdot k} \in \mathbb{C}^{I_1 \times I_2}\) denotes the \(k\)-th frontal slice of \(\mathcal{A}\). In addition, \(\mathcal{I}_{3,P} \!\in \!\mathbb{R}^{P \times P \times P}\) is the 3rd-order identity tensor. Consider two \(P\)-th order tensors \(\mathcal{A} \!\in \!\mathbb{C}^{I_1 \times \cdots \times I_p \times \cdots \times I_P}\) and \(\mathcal{B} \!\in\! \mathbb{C}^{J_1 \times \cdots \times J_q \times \cdots \times J_P}\), such that \(I_P\! = \!J_P\) and \(I_p \!= \!J_q\). We define the mode-wise contraction operation as a contraction between slices of \(\mathcal{A}\) and \(\mathcal{B}\). For simplicity, we assume this operation affects the \(P\)-mode of such tensors, which gives \[\label{def:mw1}\mathcal{A} \begin{tikzpicture}[baseline=-0.6ex] \node[draw,circle,inner sep=0.5pt]{\scalebox{0.5}{P}}; \end{tikzpicture}_{p}^{q}\mathcal{B}\! \doteq \!\mathcal{C}\! \,\,\in\! \mathbb{C}^{\scriptscriptstyle I_1\! \times\! \cdots\! \times I_{p\!-\!1}\! \times \!I_{p\!+\!1}\! \times\! \cdots\! \times\! I_{P\!-\!1}\! \times\! J_1\! \times \!\cdots\! \times\! J_{q\!-\!1}\! \times \!J_{q\!+\!1} \!\times\! \cdots\! \times\! J_P},\tag{4}\] where the \(P\)-mode slice of the \((2P\!-\!3)\)-th order tensor \(\mathcal{C}\) results from the tensor contraction between the \(P\)-mode slices of \(\mathcal{A}\) and \(\mathcal{B}\), involving mode \(p\) of \(\mathcal{A}\) and mode \(q\) of \(\mathcal{B}\). 
For instance, the mode-wise contraction of two 3rd-order tensors \(\mathcal{A}\) and \(\mathcal{B}\) is accomplished by \[\label{def:mw2} \mathcal{C}_{\cdot\cdot j} = \left(\mathcal{A} \begin{tikzpicture}[baseline=-0.6ex] \node[draw,circle,inner sep=0.5pt]{\scalebox{0.5}{3}}; \end{tikzpicture}_{p}^{q}\mathcal{B}\right)_{\cdot\cdot j} = \mathcal{A}_{\cdot\cdot j} \bullet_p^q \mathcal{B}_{\cdot\cdot j}, \quad j=1,\cdots,J_P \; .\tag{5}\] Throughout this paper, we make use of the following identities: \[\label{prop:vec} {\rm vec}\left\{ {\mathbf{ABC}} \right\} = ({\mathbf{C}}^{{\rm T}}\otimes{\mathbf{A}}){\rm vec}\left\{ {\mathbf{B}} \right\};\tag{6}\] \[\label{prop:vecd} {\rm vec}\left\{ {\mathbf{ABC}} \right\} = \left({\mathbf{C}}^{{\rm T}}\diamond{\mathbf{A}}\right){\rm vecd}\left\{ {\mathbf{B}} \right\}, \quad \mathrm{for}\ {\mathbf{B}}\ \mathrm{diagonal};\tag{7}\] \[\label{prop:kronkron} {\mathbf{AB}} \otimes{\mathbf{CD}} = ({\mathbf{A}} \otimes{\mathbf{C}})({\mathbf{B}} \otimes{\mathbf{D}});\tag{8}\] \[\label{prop:diagab} {\rm diag\!}\left\{ \mathit{\boldsymbol{a}} \right\}\mathit{\boldsymbol{b}} = {\rm diag\!}\left\{ \mathit{\boldsymbol{b}} \right\}\mathit{\boldsymbol{a}}, \quad \mathrm{for}\ \mathit{\boldsymbol{a}},\mathit{\boldsymbol{b}} \in \mathbb{C}^{P \times 1};\tag{9}\] \[\label{prop:kronabvec} \mathit{\boldsymbol{a}} \otimes\mathit{\boldsymbol{b}} = {\rm vec}\left\{ \mathit{\boldsymbol{b}}\mathit{\boldsymbol{a}}^{{\rm T}} \right\} \in \mathbb{C}^{PQ \times 1},\tag{10}\] where \({\rm vecd}\left\{ {\mathbf{B}} \right\}\) in (7) stacks the diagonal entries of \({\mathbf{B}}\) into a column vector, and, in (10), \(\mathit{\boldsymbol{a}} \in \mathbb{C}^{P \times 1}\) and \(\mathit{\boldsymbol{b}} \in \mathbb{C}^{Q \times 1}\).
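The unfoldings (1)-(3) and the identities (6)-(10) can be checked numerically. The following is a minimal NumPy sketch, not part of the original derivation; the helper names `crandn`, `vec`, `khatri_rao`, and `unfold` are illustrative assumptions, and `vec` is taken to be column-major so that it agrees with (6) and (10).

```python
import numpy as np

rng = np.random.default_rng(0)


def crandn(*shape):
    """Complex Gaussian array (illustrative helper, not from the paper)."""
    return rng.standard_normal(shape) + 1j * rng.standard_normal(shape)


def vec(A):
    """Column-major vectorization: stack the columns of A."""
    return A.reshape(-1, order="F")


def khatri_rao(A, B):
    """Column-wise Kronecker product of A (I x P) and B (J x P)."""
    return np.einsum("ip,jp->ijp", A, B).reshape(A.shape[0] * B.shape[0], -1)


def unfold(T, mode):
    """n-mode unfolding of a 3rd-order tensor from its frontal slices T[:, :, k]."""
    slices = [T[:, :, k] for k in range(T.shape[2])]
    if mode == 1:
        return np.hstack(slices)                   # eq. (1): I1 x (I3*I2)
    if mode == 2:
        return np.hstack([S.T for S in slices])    # eq. (2): I2 x (I3*I1)
    if mode == 3:
        return np.stack([vec(S) for S in slices])  # eq. (3): I3 x (I2*I1)
    raise ValueError("mode must be 1, 2, or 3")


# Unfoldings of a random I1 x I2 x I3 tensor.
I1, I2, I3 = 2, 3, 4
A3 = crandn(I1, I2, I3)
unfold_shapes = [unfold(A3, n).shape for n in (1, 2, 3)]

# Random operands with compatible sizes for the identities.
A, B, C, D = crandn(3, 4), crandn(4, 5), crandn(5, 2), crandn(2, 3)
d, C2 = crandn(4), crandn(4, 5)   # d holds the diagonal of B in (7)
a, b, q = crandn(4), crandn(4), crandn(3)

checks = {
    "(6)": np.allclose(vec(A @ B @ C), np.kron(C.T, A) @ vec(B)),
    "(7)": np.allclose(vec(A @ np.diag(d) @ C2), khatri_rao(C2.T, A) @ d),
    "(8)": np.allclose(np.kron(A @ B, C @ D), np.kron(A, C) @ np.kron(B, D)),
    "(9)": np.allclose(np.diag(a) @ b, np.diag(b) @ a),
    "(10)": np.allclose(np.kron(a, q), vec(np.outer(q, a))),
}
print(unfold_shapes, checks)
```

A row-major `vec` would break (6) and (10), which is why the sketch fixes `order="F"` explicitly.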
Additionally, Tables 1 and 2 summarize the most essential acronyms and variable notations used in this paper, along with their meanings.
| Acronym | Definition | 
|---|---|
| RIS | Reconfigurable intelligent surface | 
| HRIS | Hybrid RIS | 
| PRIS | Passive RIS | 
| BS | Base station | 
| UT | User terminal | 
| CE | Channel estimation | 
| CL | Control link | 
| KRSTC | Khatri-Rao space-time coding | 
| TSTC | Tensor space-time coding | 
| KronF | Kronecker factorization | 
| KRF | Khatri-Rao factorization | 
| BALS | Bilinear alternating least-squares | 
| TALS | Trilinear alternating least-squares | 

| Notation | Definition | 
|---|---|
| \(L\) | number of UT antennas | 
| \(M\) | number of BS antennas | 
| \(N\) | number of HRIS elements | 
| \(N_c\) | number of HRIS RF-chains | 
| \(R\) | number of user data streams | 
| \(T\) | number of symbol periods | 
| \(K\) | number of sub-frames | 
| \(\boldsymbol{\Psi}\) | reflecting phase-shift matrix | 
| \(\boldsymbol{\Lambda}\) | coding matrix (KRSTC) | 
| \(\mathcal{W}\) | coding tensor (TSTC) | 
| \(\mathcal{T}_{\boldsymbol{\Phi}}\) | sensing phase-shift tensor | 
| \(\boldsymbol{\Phi}\) | sensing phase-shift matrix | 
| \({\mathbf{G}}\) | UT-HRIS channel matrix | 
| \({\mathbf{H}}\) | HRIS-BS channel matrix | 
| \(\boldsymbol{\Theta}\) | combined channel matrix | 
| \({\mathbf{X}}\) | symbol matrix | 
| \({\mathbf{Q}}\) | composite matrix comprising symbols and UT-HRIS channel | 
| \({\mathbf{Z}}\) | composite matrix comprising symbols and HRIS-BS channel | 
| \(\mathcal{Y}^\mathrm{RC}\) | detected signal tensor at the HRIS | 
| \(\mathcal{Y}^\mathrm{BS}\) | received signal tensor at the BS | 
We consider a single-user HRIS-assisted MIMO communication system where the multi-antenna UT and BS are equipped with \(L\) and \(M\) antennas, respectively. This work considers uplink communication. We suppose there is no direct link between the BS and UT due to blockages, so it is left out of the signal model. Hence, only non-line-of-sight (NLoS) transmission is considered. In addition, the HRIS controller is linked to the BS via a control feedback channel, which is assumed to be error-free. The UT communicates with the BS through the assistance of an HRIS comprising a metasurface of \(N\) meta-atom elements connected via analog combining to a digital controller through \(N_c\) RF-chains [3], [22], as depicted in Fig. 1. We use the power split parameter \(\rho_n(t)\) to represent the fraction of the signal reflected from the \(n\)-th HRIS meta-atom at the \(t\)-th time instant. Hence, \(1-\rho_n(t)\) denotes the sensed portion forwarded to the RF-chains. \(e^{j\psi_n(t)}\) is the controllable reflecting phase-shift of the \(n\)-th meta-atom at the \(t\)-th time instant, and \(e^{j\phi_{n_c,n}(t)}\) is the phase-shift applied to the wave that is captured by the \(n\)-th meta-atom element at the \(t\)-th time instant and forwarded to the \(n_c\)-th RF-chain. It models the joint effect of the adjustable frequency response of the meta-material element and the propagation inside the waveguide. We consider \(\psi_n(t),~\phi_{n_c,n}(t) \in [0,2\pi)\). As pointed out by [22], when the sensing elements are connected to multiple RF-chains, conventional phase-shifter networks are required to apply distinct phase-shifts to each chain. This reconfigurability reflects the external control over HRIS parameters (\(\rho\), \(\psi\), and \(\phi\)).
The design of the sensing phase-shifts depends on the degree of connectivity involving the sensing elements of the HRIS, and includes single-connected, partially-connected, or a more general fully-connected case. In this work, we consider a fully-connected HRIS architecture for generalization purposes. A single-connected one would be conceived by connecting each sensing element to an RF-chain, implying \(\boldsymbol{\Phi}_k\) having a diagonal construction [62]. On the other hand, if we have a partially connected architecture, each group of \(N_g\) elements would be connected to an RF-chain. Hence, the matrix \(\boldsymbol{\Phi}_k\) would have a block diagonal structure [63]. These two architectures are particular cases of the full-matrix \(\boldsymbol{\Phi}_k\) adopted in this paper.
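The three connectivity patterns can be pictured as sparsity masks applied to a full sensing matrix. The sketch below is only illustrative: the helpers `group_mask` and `sensing_matrix` are hypothetical names, the entries follow the form \(\sqrt{(1-\rho_n)/N_c}\,e^{j\phi_{n_c,n}}\) used for \(\boldsymbol{\Phi}_k\) in this section, and keeping the same \(1/\sqrt{N_c}\) normalization for the masked cases is an assumption made here for simplicity.

```python
import numpy as np


def group_mask(N, Ng):
    """0/1 mask in which each RF-chain sees one group of Ng adjacent elements."""
    if N % Ng:
        raise ValueError("N must be a multiple of Ng")
    return np.kron(np.eye(N // Ng), np.ones((1, Ng)))


def sensing_matrix(rho, phi, mask=None):
    """Phi with entries sqrt((1 - rho_n) / Nc) * exp(j * phi[nc, n]).

    An optional 0/1 mask zeroes the entries of unconnected elements
    (normalization under a mask is an assumption of this sketch).
    """
    Nc, _ = phi.shape
    Phi = np.sqrt((1.0 - rho)[None, :] / Nc) * np.exp(1j * phi)
    return Phi if mask is None else Phi * mask


rng = np.random.default_rng(0)
N, Ng = 8, 2
rho = np.full(N, 0.5)  # equal power split on every meta-atom (assumed value)

# Fully-connected: every RF-chain combines all N elements (full matrix).
Phi_full = sensing_matrix(rho, 2 * np.pi * rng.random((3, N)))
# Partially-connected: N / Ng chains, block-diagonal structure.
Phi_part = sensing_matrix(rho, 2 * np.pi * rng.random((N // Ng, N)), group_mask(N, Ng))
# Single-connected: one chain per element, diagonal structure.
Phi_single = sensing_matrix(rho, 2 * np.pi * rng.random((N, N)), group_mask(N, 1))
```

Setting `Ng = 1` recovers the diagonal case, which mirrors the statement that the single- and partially-connected designs are particular cases of the full matrix.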
A structured two-block time-domain transmission is adopted. Semi-blind CE occurs in the first block of \(T_s\) symbol periods, which comprises \(K\) sub-frames of \(T\) symbol periods each (i.e., \(T_s\!=\!KT\)), whereas the second block has \(T_d\) symbol periods dedicated to pure data transmission. Note that the first block occupies the same \(T_s\) symbol periods that are dedicated to CE in [22], in which only pilots are transmitted. The key difference is that, in our approach, data symbols are already transmitted during these \(T_s\) symbol periods, enhancing the data rate and reducing the overall symbol decoding delay. A quasi-static flat-fading channel with coherence time \(T_c\) is assumed, where the UT-HRIS and HRIS-BS channels remain constant during at least \(T_s\) symbol periods, with \(T_s \ll T_c\). Digital precoding/combiner design, RIS phase shift optimization, and signal processing in the second block fall outside the scope of this work.
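As a small bookkeeping illustration of the first block, the sketch below counts symbol periods and data symbols. It assumes, in line with the signal model of this section, that one \(R \times T\) symbol matrix is sent in the first block and re-encoded in each of the \(K\) sub-frames; the function name and the numerical values are hypothetical.

```python
def first_block_budget(R, T, K):
    """Return (Ts, data symbols sent in the first block, data symbols per period).

    Assumes an R x T symbol matrix that is re-encoded in each of the
    K sub-frames, so the first block lasts Ts = K * T symbol periods.
    """
    Ts = K * T
    data_symbols = R * T
    return Ts, data_symbols, data_symbols / Ts


# Example values chosen only for illustration.
Ts, n_sym, per_period = first_block_budget(R=2, T=10, K=4)
print(Ts, n_sym, per_period)
```

Under a pilot-only scheme of the same duration, the number of data symbols delivered during these \(T_s\) periods would be zero.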
 
 
Remark 1. We emphasize that the objective of this paper is not to compare passive and hybrid RIS architectures. Instead, it is assumed that, in certain scenarios, network engineers may opt to trade off the energy efficiency provided by passive RISs in favor of enabling additional functionalities. In such cases, the RIS is envisioned as an active network node, acting as an intermediate point capable of performing specific tasks and supporting more dynamic and autonomous network operations, which are not expected in a passive RIS architecture. For instance, the HRIS can act as a sensing receiver in an RIS-assisted ISAC scenario. Recall that in ISAC scenarios, BSs are allowed to act as dedicated sensing receivers, which may be integrated with or physically separated from the ISAC transmitters [64]. In addition, in RIS-assisted ISAC scenarios, due to its passive nature, the RIS typically requires a separate receiver unit to sense the environment and manage configuration control [13], [65]. In this context, [66] proposed a distributed ISAC architecture that enables cooperation among sensing nodes to understand their surrounding environment, allowing RISs with sensing capabilities (hybrid RISs) to replace sensing receiver nodes. Examples of such potential functionalities are discussed in Section 11.
Remark 2. In the context of energy consumption, we assume that the adopted HRIS is capable of switching between two operational modes: hybrid mode and passive mode. In hybrid mode, the HRIS’s active units related to the sensing process are turned on, i.e., adders, phase-shifters, and RF-chains (including low-noise amplifiers, downconverters, and analog-to-digital converters). When no sensing task is required, the HRIS can switch to passive mode, and all the active units are turned off. This means that the active components are activated only as needed, either to perform a channel estimation procedure or to let the HRIS extract relevant information from the decoded/estimated symbols. In this paper, we consider semi-blind joint symbol and channel estimation under hybrid mode operation.
Before transmission at the UT, the input symbols undergo a tensor ST coding scheme [67], which incorporates spatial multiplexing with spreading, enabling a linear combination of streams across both spatial and temporal dimensions. Consequently, our transmit signal model extends beyond that exploited in [54]. In this scheme, all the \(R\) independent streams at the \(t\)-th symbol period of the \(k\)-th sub-frame (\(k\!=\!1,\!\cdots\!,\!K\)) are split across \(L\) combiners. At the \(l\)-th combiner (\(l=1,\cdots,L\)), each independent symbol \(x_{r,t}\) (\(r\!=\!1,\!\cdots\!,\!R\)) is coded by \(w_{l,r,k}\). After that, the \(R\) coded signals are combined to yield \(s_{l,t,k} = (1/\sqrt{L})\textstyle \sum_{r=1}^R{w_{l,r,k}x_{r,t}}\), which is forwarded to the \(l\)-th transmitting antenna. We assume that the coefficients \(w_{l,r,t,k}\) remain constant within the \(k\)-th sub-frame and may vary from one sub-frame to another, which means \(w_{l,r,t,k} \!=\! w_{l,r,k}\), for \(t=1,\cdots,T\). Collecting the coded symbols forwarded to all \(L\) antennas, we have \(\mathit{\boldsymbol{s}}_{t,k}\! =\! {\mathbf{W}}_k\mathit{\boldsymbol{x}}_t \!\in\! \mathbb{C}^{L \times 1}\), in which \(\mathit{\boldsymbol{x}}_t \!\doteq\![x_{1,t},\!\cdots\!,x_{R,t}]^{{\rm T}}\!\in\! \mathbb{C}^{R \times 1}\) collects the symbols of all the \(R\) data streams at the \(t\)-th time instant, and \({\mathbf{W}}_k \in \mathbb{C}^{L \times R}\) is the coding matrix of the \(k\)-th sub-frame gathering all \(R\) inputs and \(L\) outputs, whose entries are \([{\mathbf{W}}_k]_{l,r} = (1/\sqrt{L})w_{l,r,k}\). The HRIS parameters are assumed to be reconfigured at the same rate as the coefficients \(w_{l,r,k}\), which leads to \(\rho_{n,t,k} = \rho_{n,k}\), \(\psi_{n,t,k} = \psi_{n,k}\) and \(\phi_{n_c,n,t,k} = \phi_{n_c,n,k}\), following [22].
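The coding step above can be sketched in a few lines of NumPy. This is an illustrative example rather than the paper's implementation: the unit-modulus random coefficients, the QPSK alphabet, and the dimensions are assumptions made only to produce a runnable check that the scalar form \(s_{l,t,k}\) and the matrix form \(\mathit{\boldsymbol{s}}_{t,k} = {\mathbf{W}}_k\mathit{\boldsymbol{x}}_t\) agree.

```python
import numpy as np

rng = np.random.default_rng(1)
L, R, T, K = 4, 2, 6, 3  # antennas, streams, symbol periods, sub-frames (example sizes)

# Coding coefficients w[l, r, k]; unit-modulus random phases are an assumption.
w = np.exp(2j * np.pi * rng.random((L, R, K)))

# Symbol matrix X (R x T) drawn from a unit-energy QPSK alphabet (assumption).
qpsk = np.array([1 + 1j, 1 - 1j, -1 + 1j, -1 - 1j]) / np.sqrt(2)
X = rng.choice(qpsk, size=(R, T))

# Scalar form: s[l, t, k] = (1 / sqrt(L)) * sum_r w[l, r, k] * x[r, t].
S = np.einsum("lrk,rt->ltk", w, X) / np.sqrt(L)

# Matrix form: s_{t,k} = W_k x_t with [W_k]_{l,r} = w[l, r, k] / sqrt(L).
W = w / np.sqrt(L)
matrix_form_ok = all(
    np.allclose(S[:, t, k], W[:, :, k] @ X[:, t])
    for t in range(T)
    for k in range(K)
)
print(S.shape, matrix_form_ok)
```

Because the coefficients do not depend on \(t\), the same \({\mathbf{W}}_k\) multiplies every column of \({\mathbf{X}}\) within a sub-frame.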
The portion of the signal transmitted by the \(L\) UT antennas, sensed by the \(N\) HRIS meta-atoms, and then forwarded to the \(N_c\) RF-chains via analog combining at the \(t\)-th time slot of the \(k\)-th sub-frame, is given by \(\mathit{\boldsymbol{y}}^{\mathrm{RC}}_{t,k} = \boldsymbol{\Phi}_k{\mathbf{G}}{\mathbf{W}}_k\mathit{\boldsymbol{x}}_t + \mathit{\boldsymbol{\nu}}^{\mathrm{RC}}_{t,k} \in \mathbb{C}^{N_c \times 1}\), where \(\mathit{\boldsymbol{\nu}}^{\mathrm{RC}}_{t,k}\) is the associated additive noise at the HRIS, \({\mathbf{G}} \!\in\! \mathbb{C}^{N \times L}\) is the UT-HRIS channel matrix, and \(\boldsymbol{\Phi}_k \in \mathbb{C}^{N_c \times N}\) is the sensing phase-shift matrix of the \(k\)-th sub-frame, which models the analog combining carried out by the HRIS [22], with \([\boldsymbol{\Phi}_k]_{n_c,n} \!= \!\sqrt{\!(1\!-\!\rho_{n,k})/N_c}e^{j\phi_{n_c,n,k}}\). Meanwhile, the received signal at the BS, corresponding to the portion of the signal reflected by the HRIS, is given by \(\mathit{\boldsymbol{y}}^{\mathrm{BS}}_{t,k} = {\mathbf{H}}{\rm diag\!}\left\{ \mathit{\boldsymbol{\psi}}_k \right\}{\mathbf{G}}{\mathbf{W}}_k\mathit{\boldsymbol{x}}_t + \mathit{\boldsymbol{\nu}}^{\mathrm{BS}}_{t,k}\in \mathbb{C}^{M \times 1}\), where \(\mathit{\boldsymbol{\nu}}^{\mathrm{BS}}_{t,k}\) represents the noise at the BS, \({\mathbf{H}} \!\in\! \mathbb{C}^{M \times N}\) is the HRIS-BS channel matrix, and \(\mathit{\boldsymbol{\psi}}_k \doteq [\sqrt{\rho_{1,k}}e^{j\psi_{1,k}},\!\cdots\!,\!\sqrt{\rho_{N,k}}e^{j\psi_{N,k}}]^{{\rm T}} \in \mathbb{C}^{N \times 1}\) is the reflecting phase-shift beam. 
After \(T\) time slots of the \(k\)-th sub-frame, we collect column-wise \(\mathit{\boldsymbol{y}}^{\mathrm{RC}}_{t,k}\) and \(\mathit{\boldsymbol{y}}^{\mathrm{BS}}_{t,k}\), into the matrices \({\mathbf{Y}}^{\mathrm{RC}}_k \doteq [\mathit{\boldsymbol{y}}^{\mathrm{RC}}_{1,k},\cdots,\mathit{\boldsymbol{y}}^{\mathrm{RC}}_{T,k}] \in \mathbb{C}^{N_c \times T}\) and \({\mathbf{Y}}^{\mathrm{BS}}_k \doteq [\mathit{\boldsymbol{y}}^\mathrm{BS}_{1,k},\cdots,\mathit{\boldsymbol{y}}^\mathrm{BS}_{T,k}] \in \mathbb{C}^{M \times T}\), respectively, to get \[\label{matyrckw} {\mathbf{Y}}^\mathrm{RC}_k = \boldsymbol{\Phi}_k{\mathbf{G}}{\mathbf{W}}_k{\mathbf{X}} + {\mathbf{V}}^\mathrm{RC}_k \in \mathbb{C}^{N_c \times T}\tag{11}\] and \[\label{matybskw} {\mathbf{Y}}^\mathrm{BS}_k = {\mathbf{H}}{\rm diag\!}\left\{ \mathit{\boldsymbol{\psi}}_k \right\}{\mathbf{G}}{\mathbf{W}}_k{\mathbf{X}} + {\mathbf{V}}^\mathrm{BS}_k \in \mathbb{C}^{M \times T},\tag{12}\] where \({\mathbf{X}} = [\mathit{\boldsymbol{x}}_1,\cdots,\mathit{\boldsymbol{x}}_T] \in \mathbb{C}^{R \times T}\) is the symbol matrix, which collects \(T\) symbol periods of the \(R\) data streams. Here, \({\mathbf{V}}^\mathrm{RC}_k\) and \({\mathbf{V}}^\mathrm{BS}_k\) stand for the noise matrices associated with sub-frame \(k\) at the HRIS and BS, respectively.
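To make (11) and (12) concrete, the following NumPy sketch generates the sub-frame matrices for randomly drawn channels. It is a toy simulation under stated assumptions, not the paper's setup: i.i.d. complex Gaussian channels and coding matrices, a fixed power split \(\rho_{n,k} = 0.5\), uniformly random phases, QPSK symbols, and the variable `noise_std` are all choices made here for illustration.

```python
import numpy as np

rng = np.random.default_rng(2)


def crandn(*shape):
    """Unit-variance complex Gaussian array (illustrative helper)."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)


L, M, N, Nc, R, T, K = 2, 4, 8, 3, 2, 5, 6  # example dimensions
noise_std = 0.0                              # set > 0 to add receiver noise

G = crandn(N, L)                   # UT-HRIS channel (assumed i.i.d. Gaussian)
H = crandn(M, N)                   # HRIS-BS channel (assumed i.i.d. Gaussian)
W = crandn(K, L, R) / np.sqrt(L)   # W[k] plays the role of W_k
qpsk = np.array([1 + 1j, 1 - 1j, -1 + 1j, -1 - 1j]) / np.sqrt(2)
X = rng.choice(qpsk, size=(R, T))  # symbol matrix

rho = np.full((K, N), 0.5)         # power split per element and sub-frame (assumed)
# Phi[k][nc, n] = sqrt((1 - rho[k, n]) / Nc) * exp(j * phi[k, nc, n])
Phi = np.sqrt((1 - rho)[:, None, :] / Nc) * np.exp(2j * np.pi * rng.random((K, Nc, N)))
# psi[k][n] = sqrt(rho[k, n]) * exp(j * psi_phase[k, n])
psi = np.sqrt(rho) * np.exp(2j * np.pi * rng.random((K, N)))

# Eq. (11): signal sensed at the HRIS in sub-frame k.
Y_rc = [Phi[k] @ G @ W[k] @ X + noise_std * crandn(Nc, T) for k in range(K)]
# Eq. (12): signal received at the BS in sub-frame k.
Y_bs = [H @ np.diag(psi[k]) @ G @ W[k] @ X + noise_std * crandn(M, T) for k in range(K)]

print(Y_rc[0].shape, Y_bs[0].shape)
```

Note that \({\mathbf{X}}\) is shared by all sub-frames, while \(\boldsymbol{\Phi}_k\), \(\mathit{\boldsymbol{\psi}}_k\), and \({\mathbf{W}}_k\) change with \(k\); this diversity across sub-frames is what the tensor models exploit.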
We also consider KRSTC for comparison purposes. Following [68], [54] and the adopted time protocol, each independent symbol \(x_{l,t}\) (\(l=1,\cdots,L\)) is coded by a coefficient \(\lambda_{l,k}\) (\(k=1,\cdots,K\)) to yield \(s_{l,t,k} = \lambda_{l,k}x_{l,t}\), which is then forwarded to the \(l\)-th transmitting antenna. Collecting the coded symbols for all \(L\) antennas, we have \(\mathit{\boldsymbol{s}}_{t,k} = {\rm diag\!}\left\{ \mathit{\boldsymbol{\lambda}}_k \right\}\mathit{\boldsymbol{x}}_t \in \mathbb{C}^{L \times 1}\), in which \(\mathit{\boldsymbol{x}}_t \doteq [x_{1,t},\cdots,x_{L,t}]^{{\rm T}}\in \mathbb{C}^{L \times 1}\) and \(\mathit{\boldsymbol{\lambda}}_k \doteq [\lambda_{1,k},\cdots,\lambda_{L,k}]^{{\rm T}}\in \mathbb{C}^{L \times 1}\) is the coding vector of the \(k\)-th sub-frame. In this case, (11) and (12) are, respectively, recast as \[\label{matyrckkr} {\mathbf{Y}}^\mathrm{RC}_k = \boldsymbol{\Phi}_k{\mathbf{G}}{\rm diag\!}\left\{ \mathit{\boldsymbol{\lambda}}_k \right\}{\mathbf{X}} + {\mathbf{V}}^\mathrm{RC}_k \in \mathbb{C}^{N_c \times T},\tag{13}\] \[\label{matybskkr} {\mathbf{Y}}^{\mathrm{BS}}_k = {\mathbf{H}}{\rm diag\!}\left\{ \mathit{\boldsymbol{\psi}}_k \right\}{\mathbf{G}}{\rm diag\!}\left\{ \mathit{\boldsymbol{\lambda}}_k \right\}{\mathbf{X}} + {\mathbf{V}}^\mathrm{BS}_k \in \mathbb{C}^{M \times T},\tag{14}\] where the symbol matrix is recast as \({\mathbf{X}} \in \mathbb{C}^{L \times T}\). Note that KRSTC is a special case of TSTC, where the coding matrix associated with the \(k\)-th sub-frame is diagonal. This implies \(R=L\) and the absence of signal combining/multiplexing at the transmitter. The signal model in (14), which follows the PARATUCK model, was considered in [54] for the PRIS case.
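The special-case relationship can be verified numerically in the noise-free setting. In the hedged sketch below, the matrix `Lam` holding \(\mathit{\boldsymbol{\lambda}}_k^{\rm T}\) in its \(k\)-th row is an assumed layout, and all entries are random placeholders. The last check illustrates a standard algebraic fact behind the scheme's name: stacking \({\mathbf{G}}{\rm diag}\{\mathit{\boldsymbol{\lambda}}_k\}\) over the \(K\) sub-frames yields a Khatri-Rao product.

```python
import numpy as np

rng = np.random.default_rng(3)


def crandn(*shape):
    """Complex Gaussian array (illustrative helper)."""
    return rng.standard_normal(shape) + 1j * rng.standard_normal(shape)


def khatri_rao(A, B):
    """Column-wise Kronecker product of A (I x P) and B (J x P)."""
    return np.einsum("ip,jp->ijp", A, B).reshape(A.shape[0] * B.shape[0], -1)


L, N, Nc, T, K = 3, 6, 2, 4, 5   # example sizes; KRSTC implies R = L
G = crandn(N, L)                 # UT-HRIS channel (random placeholder)
X = crandn(L, T)                 # symbol matrix with R = L rows
Lam = crandn(K, L)               # row k holds lambda_k^T (assumed layout)
Phi = crandn(K, Nc, N)           # placeholder sensing matrices

# Eq. (11) with the diagonal coding matrix W_k = diag(lambda_k) ...
Y_tstc = [Phi[k] @ G @ np.diag(Lam[k]) @ X for k in range(K)]
# ... equals eq. (13), where diag(lambda_k) simply scales the columns of G.
Y_krstc = [Phi[k] @ (G * Lam[k]) @ X for k in range(K)]
same_model = all(np.allclose(a, b) for a, b in zip(Y_tstc, Y_krstc))

# Stacking G diag(lambda_k) X over k gives (Lam Khatri-Rao G) X.
stacked = np.vstack([G @ np.diag(Lam[k]) @ X for k in range(K)])
kr_form = khatri_rao(Lam, G) @ X
print(same_model, np.allclose(stacked, kr_form))
```

With a diagonal code, each stream is tied to one antenna, which is consistent with the constraint \(R = L\) stated above.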
In the following, we recast the received signals using a tensor approach. We can collect \({\mathbf{Y}}^\mathrm{RC}_k\), the signal sensed at the HRIS associated with the \(k\)-th sub-frame defined in 11 , to form the 3rd-order sensed signal tensor at the HRIS \(\mathcal{Y}^\mathrm{RC} \doteq {\mathbf{Y}}^\mathrm{RC}_1 \sqcup_3 {\mathbf{Y}}^\mathrm{RC}_2 \sqcup_3 \cdots \sqcup_3 {\mathbf{Y}}^\mathrm{RC}_{K} \in \mathbb{C}^{N_c \times T \times K}\), where \(\sqcup_3\) indicates a concatenation along the third dimension, for \(k=1,\cdots,K\). This way, \({\mathbf{Y}}^\mathrm{RC}_k\) can be viewed as a frontal slice of \(\mathcal{Y}^\mathrm{RC}\), i.e., \(\mathcal{Y}^\mathrm{RC}_{\cdot\cdot k} \in \mathbb{C}^{N_c \times T}\). This matrix is obtained by fixing the third-mode index \(k\) and varying the indices of modes one and two. Likewise, we construct the 3rd-order received signal tensor at the BS \(\mathcal{Y}^\mathrm{BS} \doteq {\mathbf{Y}}^\mathrm{BS}_1 \sqcup_3 {\mathbf{Y}}^\mathrm{BS}_2 \sqcup_3 \cdots \sqcup_3 {\mathbf{Y}}^\mathrm{BS}_{K} \in \mathbb{C}^{M \times T \times K}\), for which \({\mathbf{Y}}^\mathrm{BS}_k\), defined in 12 , corresponds to the \(k\)-th frontal slice \(\mathcal{Y}^\mathrm{BS}_{\cdot\cdot k} \in \mathbb{C}^{M \times T}\). The scalar representations of \(\mathcal{Y}^\mathrm{RC}\) and \(\mathcal{Y}^\mathrm{BS}\) are, respectively, given by \[\label{scalaryrc} y_{n_c,t,k}^{\mathrm{RC}} \!=\!\!\sum\limits_{n=1}^N\!\sum\limits_{l=1}^L\!\sum\limits_{r=1}^R{\!\!\sqrt{\frac{1\!-\!\rho_{n,k}}{LN_c}}e^{j\phi_{n_c,n,k}}\!g_{n,l}w_{l,r,k}x_{r,t}} \!+\! \nu_{n_c,t,k}^{\mathrm{RC}}\tag{15}\] and \[\label{scalarybs} y_{m,t,k}^{\mathrm{BS}} \!=\!\! \sum\limits_{n=1}^N\!\sum\limits_{l=1}^L\!\sum\limits_{r=1}^R{\!\!\sqrt{\frac{\rho_{n,k}}{L}}h_{m,n}e^{j\psi_{n,k}}g_{n,l}w_{l,r,k}x_{r,t}} \!+\! \nu_{m,t,k}^{\mathrm{BS}}.\tag{16}\]
 
Indeed, these signals are three-way (3D) arrays having one spatial dimension (\(N_c\) for the HRIS or \(M\) for the BS) and two temporal dimensions (\(T\) and \(K\)). These tensor models are exploited later to derive the proposed HRIS-BS semi-blind receiver pairs.
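The construction of the sensed tensor can be checked numerically. The following is a minimal NumPy sketch, under assumed toy dimensions and with random complex matrices standing in for the channel, symbols, coding matrices, and sensing phase shifts (the power-splitting and normalization factors of 15 are taken to be absorbed into `Phi`). It stacks the noiseless frontal slices of 11 along the third mode and compares the result with the triple sum of 15. All variable names are illustrative and do not come from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
N_c, N, L, R, T, K = 2, 4, 3, 2, 5, 6  # hypothetical toy dimensions


def crandn(*shape):
    """Complex Gaussian array (illustrative stand-in for the model quantities)."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)


G = crandn(N, L)         # UT-HRIS channel
X = crandn(R, T)         # symbol matrix
Phi = crandn(K, N_c, N)  # Phi[k] stands for Phi_k (scaling factors absorbed)
W = crandn(K, L, R)      # W[k] stands for W_k

# Frontal slices from the noiseless part of Eq. (11), concatenated along mode 3.
Y_rc = np.stack([Phi[k] @ G @ W[k] @ X for k in range(K)], axis=2)

# Scalar form: triple sum over n, l, r, as in the noiseless part of Eq. (15).
Y_scalar = np.einsum("kcn,nl,klr,rt->ctk", Phi, G, W, X)

print(Y_rc.shape, np.allclose(Y_rc, Y_scalar))
```

The same pattern applies to the BS tensor by replacing `Phi[k]` with the product of the HRIS-BS channel and the diagonal reflection matrix of sub-frame `k`.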
In this section, we show how the received tensors at the HRIS and BS can be viewed as a combination of two tensors whose structures may follow PARAFAC or Tucker models in a decoupled perspective.
Regarding the TSTC scheme, the noiseless part of \(\mathcal{Y}^\mathrm{RC}\) can be viewed as a slice-wise product between the effective UT-HRIS channel tensor \(\mathcal{C}\) and the coded symbol tensor \(\mathcal{S}\), which can be written using the \(n\)-mode product notation [69] as \(\mathcal{C} \!=\! \mathcal{T}_{\boldsymbol{\Phi}} \times_1 {\mathbf{I}}_{N_c} \times_2 {\mathbf{G}}^{{\rm T}}\times_3 {\mathbf{I}}_K \!\in\! \mathbb{C}^{N_c \times L \times K}\) and \(\mathcal{S} \!=\! \mathcal{W} \times_1 {\mathbf{I}}_L \times_2 {\mathbf{X}}^{{\rm T}}\times_3 {\mathbf{I}}_K \!\in\! \mathbb{C}^{L \times T \times K}\), respectively. Here, \(\mathcal{T}_{\boldsymbol{\Phi}} \doteq \boldsymbol{\Phi}_1 \sqcup_3 \boldsymbol{\Phi}_2 \sqcup_3 \cdots \sqcup_3 \boldsymbol{\Phi}_K \in \mathbb{C}^{N_c \times N \times K}\) and \(\mathcal{W} \doteq {\mathbf{W}}_1 \sqcup_3 {\mathbf{W}}_2 \sqcup_3 \cdots \sqcup_3 {\mathbf{W}}_K \in \mathbb{C}^{L \times R \times K}\) are, respectively, the sensing phase shift tensor and the coding tensor. Using the \(K\)-mode slice contraction operator 4 , we have \[\begin{align} \label{tenyrc} \mathcal{Y}^\mathrm{RC} =& \: \mathcal{C} \begin{tikzpicture}[baseline=-0.6ex] \node[draw,circle,inner sep=0.5pt]{\scalebox{0.5}{3}}; \end{tikzpicture}_{2}^{1}\mathcal{S} + \mathcal{V}^\mathrm{RC}\nonumber\\ =& \left(\mathcal{T}_{\boldsymbol{\Phi}} \times_2 {\mathbf{G}}^{{\rm T}}\right)\begin{tikzpicture}[baseline=-0.6ex] \node[draw,circle,inner sep=0.5pt]{\scalebox{0.5}{3}}; \end{tikzpicture}_{2}^{1} \left(\mathcal{W} \times_2 {\mathbf{X}}^{{\rm T}}\right) + \mathcal{V}^\mathrm{RC}, \end{align}\tag{17}\] where \(\mathcal{V}^\mathrm{RC}\) represents the additive noise tensor at the HRIS. From this perspective, the sensed signal at the HRIS results from a mode-wise contraction of the tensors \(\mathcal{C}\) and \(\mathcal{S}\), both of which follow Tucker-(1,3) models [70]. 
We refer to the tensor model in 17 as a double Tucker model.
The noiseless part of the tensor \(\mathcal{Y}^\mathrm{BS}\) results from a mode-wise contraction between the cascaded UT-HRIS-BS channel tensor \(\mathcal{T}_{\boldsymbol{\Omega}}\) and the coded symbol tensor, where the former follows a PARAFAC model and can be written as \(\mathcal{T}_{\boldsymbol{\Omega}} = \mathcal{I}_{3,N} \times_1 {\mathbf{H}} \times_2 {\mathbf{G}}^{{\rm T}}\times_3 \boldsymbol{\Psi} \in \mathbb{C}^{M \times L \times K}\), where \(\boldsymbol{\Psi} \doteq [\mathit{\boldsymbol{\psi}}_1,\cdots,\mathit{\boldsymbol{\psi}}_K]^{{\rm T}}\in \mathbb{C}^{K \times N}\). Hence, \(\mathcal{Y}^\mathrm{BS}\) is given by \[\begin{align} \label{tenybs} \mathcal{Y}^\mathrm{BS} =&\: \mathcal{T}_{\boldsymbol{\Omega}}\begin{tikzpicture}[baseline=-0.6ex] \node[draw,circle,inner sep=0.5pt]{\scalebox{0.5}{3}}; \end{tikzpicture}_{2}^{1} \mathcal{S} + \mathcal{V}^\mathrm{BS}\nonumber\\ =&\! \left(\mathcal{I}_{3,N} \!\times_1 \!{\mathbf{H}} \!\times_2\! {\mathbf{G}}^{{\rm T}}\!\! \times_3\! \boldsymbol{\Psi}\right) \! \begin{tikzpicture}[baseline=-0.6ex] \node[draw,circle,inner sep=0.5pt]{\scalebox{0.5}{3}}; \end{tikzpicture}_{2}^{1}\! \left(\mathcal{W} \!\times_2 \!{\mathbf{X}}^{{\rm T}}\right) \!+\! \mathcal{V}^\mathrm{BS}\!\!, \end{align}\tag{18}\] where \(\mathcal{V}^\mathrm{BS}\) is the corresponding additive noise tensor at the BS. Since the signal tensor received at the BS corresponds to the 3-mode contraction between a PARAFAC model and a Tucker-(1,3) model, we refer to 18 as a PARAFAC-Tucker model. The received signal tensors at both the HRIS and BS are illustrated in Fig. 3.
The mode-wise contraction formalism, applied in 17 and 18 , makes it possible to decouple the tensor structures of the transmitted signals and their respective combined/effective channels, revealing their associated (PARAFAC/Tucker) tensor decompositions in a modularized fashion.
If KRSTC is used instead of TSTC, the received signal tensors \(\mathcal{Y}^\mathrm{RC}\) and \(\mathcal{Y}^\mathrm{BS}\) can be built the same way, assuming \(R=L\). The coded symbol tensor follows a PARAFAC model and is written as \(\bar{\mathcal{S}}\! =\! \mathcal{I}_{3,L} \!\times_1\! {\mathbf{I}}_L \!\times_2 \!{\mathbf{X}}^{{\rm T}}\!\times_3\! \boldsymbol{\Lambda} \!\in\! \mathbb{C}^{L \times T \times K}\), where \(\boldsymbol{\Lambda}\! \doteq\! [\mathit{\boldsymbol{\lambda}}_1,\!\cdots\!,\mathit{\boldsymbol{\lambda}}_K]^{{\rm T}}\!\in\! \mathbb{C}^{K \times L}\). In this case, the received signal tensor at the HRIS would be \(\mathcal{Y}^\mathrm{RC}=\left(\mathcal{T}_{\boldsymbol{\Phi}} \times_2 {\mathbf{G}}^{{\rm T}}\right) \! \begin{tikzpicture}[baseline=-0.6ex] \node[draw,circle,inner sep=0.5pt]{\scalebox{0.5}{3}}; \end{tikzpicture}_{2}^{1} \! \left(\mathcal{I}_{3, L} \times_2 {\mathbf{X}}^{{\rm T}}\times_3 \boldsymbol{\Lambda}\right) + \mathcal{V}^\mathrm{RC}\), corresponding to the Tucker-PARAFAC tensor structure. On the other hand, at the BS, the received signal tensor follows a PARATUCK-2 tensor structure, which is given by \(\mathcal{Y}^\mathrm{BS} \! = \! \left(\mathcal{I}_{3, N} \! \times_1 \! {\mathbf{H}} \! \times_2 \! {\mathbf{G}}^{{\rm T}}\! \times_3 \! \boldsymbol{\Psi}\right) \! \begin{tikzpicture}[baseline=-0.6ex] \node[draw,circle,inner sep=0.5pt]{\scalebox{0.5}{3}}; \end{tikzpicture}_{2}^{1} \!\left(\mathcal{I}_{3, L} \! \times_2 \! {\mathbf{X}}^{{\rm T}}\! \times_3 \! \boldsymbol{\Lambda}\right) + \mathcal{V}^\mathrm{BS}\). It is clear that these tensors are special cases of 17 and 18 .
In this section, we develop the proposed semi-blind receivers for joint channel and symbol estimation at the HRIS and BS by exploiting the tensor signals derived in the previous section.
Optimization problem for the HRIS: For the HRIS, consider the noisy sensed signal tensor \(\mathcal{Y}^\mathrm{RC}\). Our goal is to find estimates of the UT-HRIS channel \({\mathbf{G}}\) and the symbol matrix \({\mathbf{X}}\) by solving the following problem \[\label{optbsxgw} \min \limits_{{\mathbf{G}},{\mathbf{X}}} \Big\|\mathcal{Y}^\mathrm{RC} - \left(\mathcal{T}_{\boldsymbol{\Phi}} \times_2 {\mathbf{G}}^{{\rm T}}\right)\begin{tikzpicture}[baseline=-0.6ex] \node[draw,circle,inner sep=0.5pt]{\scalebox{0.5}{3}}; \end{tikzpicture}_{2}^{1} \left(\mathcal{W} \times_2 {\mathbf{X}}^{{\rm T}}\right)\Big\|^2_{\textrm{F}}.\tag{19}\] Starting from this cost function, we formulate two solutions that exploit different reshapings of the tensor signal structure. The first one resorts to an iterative alternating linear estimation scheme, while the second delivers closed-form estimates of the channel and symbols.
Using the definition 5 , we apply the \(\mathrm{vec}\{\cdot\}\) operator to the \(k\)-th frontal slice of \(\mathcal{Y}^\mathrm{RC}\) defined in 17 , i.e., \(\mathrm{vec}\{(\mathcal{C} \begin{tikzpicture}[baseline=-0.6ex] \node[draw,circle,inner sep=0.5pt]{\scalebox{0.5}{3}}; \end{tikzpicture}_{2}^{1}\mathcal{S})_{\cdot\cdot k} + \mathcal{V}^\mathrm{RC}_{\cdot\cdot k}\} = \mathrm{vec}\{{\mathbf{Y}}^\mathrm{RC}_k\}\) to define \(\mathit{\boldsymbol{y}}^\mathrm{RC}_k\), given by \[\mathit{\boldsymbol{y}}^\mathrm{RC}_k = ({\mathbf{X}}^{{\rm T}}\otimes{\mathbf{I}}_{N_c})({\mathbf{W}}_k^{{\rm T}}\otimes\boldsymbol{\Phi}_k)\mathit{\boldsymbol{g}} + \mathit{\boldsymbol{\nu}}_k^\mathrm{RC} \in \mathbb{C}^{TN_c \times 1},\] where \(\mathit{\boldsymbol{g}}\! \doteq \!{\rm vec}\left\{ {\mathbf{G}} \right\} \!\in\! \mathbb{C}^{LN \times 1}\), and \(\mathit{\boldsymbol{\nu}}_k^\mathrm{RC} \doteq {\rm vec}\left\{ {\mathbf{V}}^\mathrm{RC}_k \right\} \in \mathbb{C}^{TN_c \times 1}\). We define \(\mathit{\boldsymbol{y}}^\mathrm{RC}\! \doteq\! 
\bigl[(\mathit{\boldsymbol{y}}^\mathrm{RC}_1)^{{\rm T}},\cdots,(\mathit{\boldsymbol{y}}^\mathrm{RC}_K)^{{\rm T}}\bigr]^{{\rm T}}\!= \!\mathrm{vec}\bigl\{\bigl[\mathcal{Y}^\mathrm{RC}\bigr]_{(3)}^{{\rm T}}\bigr\} \!\in \!\mathbb{C}^{KTN_c \times 1}\) by stacking \(\mathit{\boldsymbol{y}}^\mathrm{RC}_k\) over the \(K\) sub-frames, to get \[\label{hriswvetyrc} \mathit{\boldsymbol{y}}^\mathrm{RC} = \bigl({\mathbf{I}}_{K} \otimes{\mathbf{X}}^{{\rm T}}\otimes{\mathbf{I}}_{N_c}\bigr){\mathbf{F}}_\mathrm{g}\mathit{\boldsymbol{g}} + \mathit{\boldsymbol{\nu}}^\mathrm{RC},\tag{20}\] where \({\mathbf{F}}_\mathrm{g} \doteq \bigl[{\mathbf{W}}_1 \otimes\boldsymbol{\Phi}_1^{{\rm T}},\cdots,{\mathbf{W}}_K \otimes\boldsymbol{\Phi}_K^{{\rm T}}\bigr]^{{\rm T}}\in \mathbb{C}^{KRN_c \times LN}\) contains the coding structure and the sensing phase shifts, which are known at the HRIS, and \(\mathit{\boldsymbol{\nu}}^\mathrm{RC} = \mathrm{vec}\bigl\{\bigl[\mathcal{V}^\mathrm{RC}\bigr]_{(3)}^{{\rm T}}\bigr\} \in \mathbb{C}^{KTN_c \times 1}\) is the corresponding noise term. A least-squares (LS) estimate of the UT-HRIS channel can be found by solving the problem \[\hat{\mathit{\boldsymbol{g}}} = \mathop{\mathrm{arg\,min}}_{\mathit{\boldsymbol{g}}} \Bigl\|\mathit{\boldsymbol{y}}^\mathrm{RC} - \bigl({\mathbf{I}}_{K} \otimes{\mathbf{X}}^{{\rm T}}\otimes{\mathbf{I}}_{N_c}\bigr){\mathbf{F}}_\mathrm{g}\mathit{\boldsymbol{g}}\Bigr\|_\mathrm{F}^2,\] whose analytical solution is given by \[\label{hriswgest} \hat{{\mathbf{G}}} = \mathrm{unvec}_{N \times L}\bigl\{\bigl[\bigl({\mathbf{I}}_{K} \otimes{\mathbf{X}}^{{\rm T}}\otimes{\mathbf{I}}_{N_c}\bigr){\mathbf{F}}_\mathrm{g}\bigr]^\dagger\mathit{\boldsymbol{y}}^\mathrm{RC}\bigr\}.\tag{21}\] Exploiting \(\bigl[\mathcal{Y}^\mathrm{RC}\bigr]_{(2)}\! \doteq \! 
\bigl[({\mathbf{Y}}^\mathrm{RC}_1)^{{\rm T}},\!\cdots\!,({\mathbf{Y}}^\mathrm{RC}_K)^{{\rm T}}\bigr] \!\in \!\mathbb{C}^{T \times KN_c}\), corresponding to the 2-mode unfolding of \(\mathcal{Y}^\mathrm{RC}\), we have \[\label{hriswyrcet} \bigl[\mathcal{Y}^\mathrm{RC}\bigr]_{(2)}^{{\rm T}}= {\mathbf{F}}_\mathrm{x}{\mathbf{X}} + \bigl[\mathcal{V}^\mathrm{RC}\bigr]_{(2)}^{{\rm T}}\in \mathbb{C}^{KN_c \times T},\tag{22}\] where \({\mathbf{F}}_\mathrm{x} \doteq \bigl[(\boldsymbol{\Phi}_1{\mathbf{G}}{\mathbf{W}}_1)^{{\rm T}},\cdots,(\boldsymbol{\Phi}_K{\mathbf{G}}{\mathbf{W}}_K)^{{\rm T}}\bigr]^{{\rm T}}\in \mathbb{C}^{KN_c \times R}\). The symbol matrix can be found by solving \[\hat{{\mathbf{X}}} = \mathop{\mathrm{arg\,min}}_{{\mathbf{X}}} \left\|\left[ \mathcal{Y}^\mathrm{RC} \right]_{(2)}^{{\rm T}}- {\mathbf{F}}_\mathrm{x}{\mathbf{X}}\right\|^2_{\textrm{F}},\] the solution of which is given by \[\label{hriswxest} \hat{{\mathbf{X}}} = {\mathbf{F}}_\mathrm{x}^\dagger\left[ \mathcal{Y}^\mathrm{RC} \right]_{(2)}^{{\rm T}}.\tag{23}\]
Note that 21 and 23 are jointly used to iteratively estimate the UT-HRIS channel and symbols via a bilinear alternating LS (BALS) algorithm, herein referred to as the HRIS-BALS receiver. The algorithm alternates between the estimates of \({\mathbf{G}}\) and \({\mathbf{X}}\), starting from a random initialization, until convergence is achieved. As discussed in previous works [56], [27], such a BALS procedure typically converges within a few iterations and provides unique estimates of the channel and symbol matrices up to trivial scaling ambiguities, as will be discussed later. The HRIS-BALS receiver is summarized in Algorithm 4.
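To illustrate the alternating updates, the following is a minimal NumPy sketch of a BALS-style loop built on 21 and 23. It is not a reproduction of Algorithm 4: the function name `hris_bals`, the fixed iteration count, the toy dimensions, and the noiseless random data are all assumptions made for illustration. The recorded relative fit error is non-increasing by construction, since each update is an exact LS minimizer of the same cost.

```python
import numpy as np


def hris_bals(Y, Phi, W, n_iter=30, seed=0):
    """Alternate the LS updates of Eqs. (21) and (23) (illustrative sketch).

    Y: (N_c, T, K) sensed tensor; Phi: (K, N_c, N); W: (K, L, R).
    Returns G_hat (N, L), X_hat (R, T), and the relative fit error per iteration.
    """
    K, N_c, N = Phi.shape
    _, L, R = W.shape
    T = Y.shape[1]
    rng = np.random.default_rng(seed)
    # F_g stacks the blocks W_k^T kron Phi_k, size (K R N_c) x (L N).
    F_g = np.vstack([np.kron(W[k].T, Phi[k]) for k in range(K)])
    # y stacks vec(Y_k) (column-major) over the K sub-frames.
    y = np.concatenate([Y[:, :, k].reshape(-1, order="F") for k in range(K)])
    # Transposed 2-mode unfolding: slices Y_k stacked row-wise, (K N_c) x T.
    Y2T = np.vstack([Y[:, :, k] for k in range(K)])
    X = rng.standard_normal((R, T)) + 1j * rng.standard_normal((R, T))
    errors = []
    for _ in range(n_iter):
        # Eq. (21): LS update of g = vec(G) for fixed X.
        A = np.kron(np.eye(K), np.kron(X.T, np.eye(N_c))) @ F_g
        g = np.linalg.lstsq(A, y, rcond=None)[0]
        G = g.reshape(N, L, order="F")
        # Eq. (23): LS update of X for fixed G.
        F_x = np.vstack([Phi[k] @ G @ W[k] for k in range(K)])
        X = np.linalg.lstsq(F_x, Y2T, rcond=None)[0]
        errors.append(np.linalg.norm(Y2T - F_x @ X) / np.linalg.norm(Y2T))
    return G, X, errors


# Toy, noiseless example with hypothetical dimensions.
rng = np.random.default_rng(1)
N_c, N, L, R, T, K = 3, 4, 2, 2, 6, 5
c = lambda *s: rng.standard_normal(s) + 1j * rng.standard_normal(s)
G0, X0, Phi, W = c(N, L), c(R, T), c(K, N_c, N), c(K, L, R)
Y = np.stack([Phi[k] @ G0 @ W[k] @ X0 for k in range(K)], axis=2)
G_hat, X_hat, errors = hris_bals(Y, Phi, W)
print(errors[0], errors[-1])
```

In practice one would replace the fixed iteration count with a stopping rule on the change of the fit error; that choice is left open here.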
 
 
Define \(\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC}_k \doteq \mathrm{vec}\{({\mathbf{Y}}^\mathrm{RC}_k)^{{\rm T}}\} \in \mathbb{C}^{N_cT \times 1}\), given by \[\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC}_k = (\boldsymbol{\Phi}_k \otimes{\mathbf{I}}_T){\mathbf{Q}}\mathit{\boldsymbol{w}}_k + \bar{\mathit{\boldsymbol{\nu}}}^\mathrm{RC}_k,\] where \(\mathit{\boldsymbol{w}}_k \!\doteq \!{\rm vec}\left\{ {\mathbf{W}}_k^{{\rm T}} \right\} \!\in \!\mathbb{C}^{LR \times 1}\), \(\bar{\mathit{\boldsymbol{\nu}}}^\mathrm{RC}_k \doteq \mathrm{vec}\{({\mathbf{V}}^\mathrm{RC}_k)^{{\rm T}}\}\), and \({\mathbf{Q}}\!\doteq\! {\mathbf{G}} \otimes{\mathbf{X}}^{{\rm T}}\!\in \!\mathbb{C}^{NT \times LR}\). Applying \(\mathrm{vec}\{\cdot\}\) again, we obtain \[\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC}_k = (\mathit{\boldsymbol{w}}_k^{{\rm T}}\otimes\boldsymbol{\Phi}_k \otimes{\mathbf{I}}_T)\mathit{\boldsymbol{q}} + \bar{\mathit{\boldsymbol{\nu}}}^\mathrm{RC}_k,\] where \(\mathit{\boldsymbol{q}} = {\rm vec}\left\{ {\mathbf{Q}} \right\} \in \mathbb{C}^{LRNT \times 1}\). 
Defining \(\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC} \doteq \bigl[(\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC}_1)^{{\rm T}},\cdots,(\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC}_K)^{{\rm T}}\bigr]^{{\rm T}}= \mathrm{vec}\bigl\{\bigl[\mathcal{Y}^\mathrm{RC}\bigr]_{(2)}\bigr\}\) by collecting the sensed signals during the \(K\) sub-frames, we get \[\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC} = ({\mathbf{F}}_\mathrm{xg} \otimes{\mathbf{I}}_T)\mathit{\boldsymbol{q}} + \bar{\mathit{\boldsymbol{\nu}}}^\mathrm{RC} \in \mathbb{C}^{KN_cT \times 1},\] where \(\bar{\mathit{\boldsymbol{\nu}}}^\mathrm{RC} \in \mathbb{C}^{KN_cT \times 1}\) is the related noise component, and \({\mathbf{F}}_\mathrm{xg} \in \mathbb{C}^{KN_c \times LRN}\) is \[\label{fxg} {\mathbf{F}}_\mathrm{xg} \doteq \bigl[\mathit{\boldsymbol{w}}_1 \otimes\boldsymbol{\Phi}_1^{{\rm T}},\cdots,\mathit{\boldsymbol{w}}_K \otimes\boldsymbol{\Phi}_K^{{\rm T}}\bigr]^{{\rm T}}.\tag{24}\] Let us consider the following problem \[\hat{\mathit{\boldsymbol{q}}} = \mathop{\mathrm{arg\,min}}_{\mathit{\boldsymbol{q}}} \left\|\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC} - ({\mathbf{F}}_\mathrm{xg} \otimes{\mathbf{I}}_T)\mathit{\boldsymbol{q}}\right\|^2,\] from which we can determine an LS estimate for the matrix \({\mathbf{Q}}\) through its corresponding solution \[\label{hriswxgest} \hat{{\mathbf{Q}}} = \mathrm{unvec}_{NT \times LR}\left\{({\mathbf{F}}_\mathrm{xg}^\dagger\otimes{\mathbf{I}}_T)\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC}\right\}.\tag{25}\] Upon obtaining \(\hat{{\mathbf{Q}}}\), the next step involves extracting estimates of \({\mathbf{X}}\) and \({\mathbf{G}}\) from it. Since the matrix \(\hat{{\mathbf{Q}}}\) follows a Kronecker construction, it can be interpreted as a block matrix, each sub-matrix of which is \(\hat{{\mathbf{Q}}}^{n,l} = g_{n,l}{\mathbf{X}}^{{\rm T}}\in \mathbb{C}^{T \times R}\). 
Therefore, we address the problem \[\label{eq:kronFcost} \min_{\scriptscriptstyle {\mathbf{X}},{\mathbf{G}}} \left\|\hat{{\mathbf{Q}}} - {\mathbf{G}} \otimes{\mathbf{X}}^{{\rm T}}\right\|_\mathrm{F}^2,\tag{26}\] which is solved by the so-called Kronecker Factorization (KronF) algorithm [71]. To this end, problem 26 is recast as a rank-1 matrix approximation problem \[\min_{\scriptscriptstyle {\mathbf{X}},{\mathbf{G}}} \bigl\|\bar{{\mathbf{Q}}} - \mathit{\boldsymbol{x}}\mathit{\boldsymbol{g}}^{{\rm T}}\bigr\|_\mathrm{F}^2,\] where \(\bar{{\mathbf{Q}}} \in \mathbb{C}^{RT \times LN}\) is a matrix rearrangement of the blocks contained in \(\hat{{\mathbf{Q}}}\), as follows \[\bar{{\mathbf{Q}}} = \bigl[g_{1,1}\mathit{\boldsymbol{x}},\ldots,g_{N,1}\mathit{\boldsymbol{x}},\ldots,g_{1,L}\mathit{\boldsymbol{x}},\ldots,g_{N,L}\mathit{\boldsymbol{x}}\bigr] = \mathit{\boldsymbol{x}}\mathit{\boldsymbol{g}}^{{\rm T}},\] where \(\mathit{\boldsymbol{x}} \doteq {\rm vec}\left\{ {\mathbf{X}}^{{\rm T}} \right\} \in \mathbb{C}^{RT \times 1}\), and \(g_{n,l}\mathit{\boldsymbol{x}} = \mathrm{vec}\bigl\{\hat{{\mathbf{Q}}}^{n,l}\bigr\} \in \mathbb{C}^{RT \times 1}\). From this problem, the estimates of \({\mathbf{X}}\) and \({\mathbf{G}}\) are obtained, up to a scaling factor, from the dominant left and right singular vectors of \(\bar{{\mathbf{Q}}}\), respectively. This procedure leads to the HRIS-KronF receiver, whose key steps are summarized in Algorithm 5, wherein we define the index sets \(\mathbb{S}_n\) and \(\mathbb{S}_l\) to indicate, respectively, the row and column ranges of the (\(n,l\))-th sub-matrix of \(\hat{{\mathbf{Q}}}\).
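The block rearrangement and rank-1 approximation described above can be sketched in a few lines of NumPy. This is an illustrative implementation of the generic Kronecker factorization idea, not a transcription of Algorithm 5; the function name `kronf`, the equal split of the singular value between the two factors, and the toy dimensions are assumptions. The factors are recovered only up to a common complex scalar.

```python
import numpy as np


def kronf(Q_hat, N, L, T, R):
    """Factor Q_hat ~ G kron X^T (size NT x LR) via a rank-1 SVD (sketch).

    Returns G_hat (N, L) and X_hat (R, T), defined up to a scalar ambiguity.
    """
    # Q4[n, t, l, r] is entry (t, r) of block (n, l), i.e. g_{n,l} * X^T[t, r].
    Q4 = Q_hat.reshape(N, T, L, R)
    # Rows follow vec(X^T) (index t + T*r); columns follow vec(G) (index n + N*l).
    Q_bar = Q4.transpose(3, 1, 2, 0).reshape(R * T, L * N)
    U, s, Vh = np.linalg.svd(Q_bar, full_matrices=False)
    x = np.sqrt(s[0]) * U[:, 0]    # estimate of vec(X^T)
    g = np.sqrt(s[0]) * Vh[0, :]   # estimate of vec(G); Vh already holds v^H
    G_hat = g.reshape(N, L, order="F")
    X_hat = x.reshape(T, R, order="F").T
    return G_hat, X_hat


# Toy, noiseless check with hypothetical dimensions.
rng = np.random.default_rng(3)
N, L, T, R = 3, 2, 4, 2
G = rng.standard_normal((N, L)) + 1j * rng.standard_normal((N, L))
X = rng.standard_normal((R, T)) + 1j * rng.standard_normal((R, T))
Q = np.kron(G, X.T)
G_hat, X_hat = kronf(Q, N, L, T, R)
alpha = G[0, 0] / G_hat[0, 0]  # scalar ambiguity, resolved here using the true G
print(np.allclose(alpha * G_hat, G), np.allclose(X_hat / alpha, X))
```

In a real receiver the scalar `alpha` is of course unknown; it would have to be fixed by side information such as a known symbol, which this sketch does not model.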
Remark 3. The HRIS does not need to estimate the full information contained in the symbol matrix. It may be of interest to only decode a subset of columns of \({\mathbf{X}}\), leaving the remaining subset to be decoded by the BS. As elucidated in Section 11, \({\mathbf{X}}\) can be partitioned into user data and control data submatrices during the transmission time structure. We consider that HRIS and BS fully estimate the symbol matrix for the convenience of exposition. Note also that even in a scenario where the HRIS does not need to perform data decoding, the proposed semi-blind receivers provide data-aided CE capabilities at the HRIS. In this scenario, data symbols intended for the BS are exploited at the HRIS to estimate the associated channel matrix, unlike existing methods, which accomplish this by using only pilot symbols.
Optimization problem for the BS: As for the HRIS, joint symbol and CE can be achieved at the BS by exploiting the tensor structure of the received signal \(\mathcal{Y}^\mathrm{BS}\) as well as the estimated UT-HRIS channel matrix obtained at the HRIS and conveyed via the CL. Recall that the BS knows the coding tensor and the reflection phase shifts. We consider the following LS tensor fitting problem \[\label{optbsxh} \min \limits_{{\mathbf{H}},{\mathbf{X}}} \left\|\mathcal{Y}^\mathrm{BS} \!\! - \!\left(\!\mathcal{I}_{3,N} \! \times_1 \! {\mathbf{H}} \! \times_2 \! \hat{{\mathbf{G}}}^{{\rm T}}\! \times_3 \! \boldsymbol{\Psi}\!\right) \!\begin{tikzpicture}[baseline=-0.6ex] \node[draw,circle,inner sep=0.5pt]{\scalebox{0.5}{3}}; \end{tikzpicture}_{2}^{1} \! \left(\mathcal{W} \! \times_2 \! {\mathbf{X}}^{{\rm T}}\right)\right\|^2_{\textrm{F}}\!\!\!.\tag{27}\] In what follows, we exploit the different reshapings of the received tensor \(\mathcal{Y}^\mathrm{BS}\) to derive the corresponding iterative and closed-form semi-blind receivers at the BS.
Taking into account the tensor \(\mathcal{Y}^\mathrm{BS}\), defined in 18 , we concatenate its frontal slices \((\mathcal{T}_{\boldsymbol{\Omega}} \begin{tikzpicture}[baseline=-0.6ex] \node[draw,circle,inner sep=0.5pt]{\scalebox{0.5}{3}}; \end{tikzpicture}_{2}^{1}\mathcal{S})_{\cdot\cdot k} + \mathcal{V}^\mathrm{BS}_{\cdot\cdot k} = {\mathbf{Y}}^\mathrm{BS}_k\) for \(k=1,2,\cdots,K\) to obtain its 1-mode and 2-mode unfoldings, given by \(\bigl[\mathcal{Y}^\mathrm{BS}\bigr]_{(1)}\! =\!\bigl[{\mathbf{Y}}^{\mathrm{BS}}_1,\cdots,{\mathbf{Y}}^{\mathrm{BS}}_K\bigr] \in \mathbb{C}^{M \times KT}\) and \(\bigl[\mathcal{Y}^\mathrm{BS}\bigr]_{(2)} = \bigl[({\mathbf{Y}}^\mathrm{BS}_1)^{{\rm T}},\cdots,({\mathbf{Y}}^\mathrm{BS}_K)^{{\rm T}}\bigr] \in \mathbb{C}^{T \times KM}\), respectively. These unfoldings can be expressed in compact form as \[\begin{align} &&\bigl[\mathcal{Y}^\mathrm{BS}\bigr]_{(1)} = {\mathbf{H}}{\mathbf{E}}_\mathrm{h}({\mathbf{I}}_K \otimes{\mathbf{X}}) + \bigl[\mathcal{V}^\mathrm{BS}\bigr]_{(1)},\tag{28}\\ &&\bigl[\mathcal{Y}^\mathrm{BS}\bigr]_{(2)}^{{\rm T}}= ({\mathbf{I}}_K \otimes{\mathbf{H}}){\mathbf{E}}_\mathrm{x}{\mathbf{X}} + \bigl[\mathcal{V}^\mathrm{BS}\bigr]_{(2)}^{{\rm T}},\tag{29} \end{align}\] where \({\mathbf{E}}_\mathrm{h} \in \mathbb{C}^{N \times KR}\) and \({\mathbf{E}}_\mathrm{x} \in \mathbb{C}^{KN \times R}\) are, respectively, defined as \[\begin{align} &&{\mathbf{E}}_\mathrm{h} \doteq \big[{\rm diag\!}\left\{ \mathit{\boldsymbol{\psi}}_1 \right\}\hat{{\mathbf{G}}}{\mathbf{W}}_1,\cdots,{\rm diag\!}\left\{ \mathit{\boldsymbol{\psi}}_K \right\}\hat{{\mathbf{G}}}{\mathbf{W}}_K\big],\\ &&{\mathbf{E}}_\mathrm{x}\!\doteq\!\bigl[{\mathbf{W}}_1^{{\rm T}}\hat{{\mathbf{G}}}^{{\rm T}}{\rm diag\!}\left\{ \mathit{\boldsymbol{\psi}}_1 \right\}\!,\!\cdots\!,{\mathbf{W}}_K^{{\rm T}}\hat{{\mathbf{G}}}^{{\rm T}}{\rm diag\!}\left\{ \mathit{\boldsymbol{\psi}}_K \right\}\bigr]^{{\rm T}}. \end{align}\] 
From 28 and 29 , the estimation of the HRIS-BS channel \({\mathbf{H}}\) and the symbol matrix \({\mathbf{X}}\) can be obtained by solving the following LS problems \[\hat{{\mathbf{H}}} = \mathop{\mathrm{arg\,min}}_{{\mathbf{H}}}\bigl|\bigl|\left[ \mathcal{Y}^{\mathrm{BS}} \right]_{(1)} - {\mathbf{H}}{\mathbf{E}}_\mathrm{h}({\mathbf{I}}_K \otimes{\mathbf{X}})\bigr|\bigr|_{\rm F}^2,\] \[\hat{{\mathbf{X}}} = \mathop{\mathrm{arg\,min}}_{{\mathbf{X}}}\bigl|\bigl|\left[ \mathcal{Y}^{\mathrm{BS}} \right]_{(2)}^{{\rm T}}- ({\mathbf{I}}_K \otimes{\mathbf{H}}){\mathbf{E}}_\mathrm{x}{\mathbf{X}}\bigr|\bigr|_{\rm F}^2,\] the solutions of which are respectively given by \[\label{bswhest} \hat{{\mathbf{H}}} = \left[ \mathcal{Y}^{\mathrm{BS}} \right]_{(1)}\bigl[{\mathbf{E}}_\mathrm{h}({\mathbf{I}}_K \otimes{\mathbf{X}})\bigr]^\dagger,\tag{30}\] \[\label{bswxest} \hat{{\mathbf{X}}} = \bigl[({\mathbf{I}}_K \otimes{\mathbf{H}}){\mathbf{E}}_\mathrm{x}\bigr]^\dagger\left[ \mathcal{Y}^{\mathrm{BS}} \right]_{(2)}^{{\rm T}}.\tag{31}\] Similarly to the HRIS side, the estimate of the HRIS-BS channel and transmitted symbols can be obtained by solving (30 ) and (31 ) iteratively using alternating least-squares. This algorithm is referred to as the BS-BALS receiver and is summarized in Algorithm 6.
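The unfoldings 28 and 29 and the two LS solutions 30 and 31 can be verified on synthetic data. The sketch below is a noiseless sanity check under assumed toy dimensions: random complex matrices stand in for the channels, coding matrices, and reflection coefficients, and the true `G` is used in place of the estimate conveyed by the HRIS. Each LS step is evaluated once with the other unknown held at its true value, so it checks the algebra of a single update rather than the convergence of the BS-BALS iterations.

```python
import numpy as np

rng = np.random.default_rng(2)
M, N, L, R, T, K = 3, 4, 2, 2, 5, 6  # hypothetical toy dimensions
c = lambda *s: rng.standard_normal(s) + 1j * rng.standard_normal(s)
H, G, X = c(M, N), c(N, L), c(R, T)
psi, W = c(K, N), c(K, L, R)  # psi[k] stands in for the reflection vector psi_k

# Noiseless frontal slices of Eq. (12).
Y = [H @ np.diag(psi[k]) @ G @ W[k] @ X for k in range(K)]
Y1 = np.hstack(Y)    # 1-mode unfolding, M x KT
Y2T = np.vstack(Y)   # transposed 2-mode unfolding, KM x T

blocks = [np.diag(psi[k]) @ G @ W[k] for k in range(K)]
E_h = np.hstack(blocks)  # N x KR
E_x = np.vstack(blocks)  # KN x R

# Eqs. (28) and (29), noiseless.
ok28 = np.allclose(Y1, H @ E_h @ np.kron(np.eye(K), X))
ok29 = np.allclose(Y2T, np.kron(np.eye(K), H) @ E_x @ X)

# Eqs. (30) and (31): one LS step each, with the other unknown held at its true value.
H_hat = Y1 @ np.linalg.pinv(E_h @ np.kron(np.eye(K), X))
X_hat = np.linalg.pinv(np.kron(np.eye(K), H) @ E_x) @ Y2T

print(ok28, ok29, np.allclose(H_hat, H), np.allclose(X_hat, X))
```

Wrapping the last two lines in a loop, with `X_hat` fed into the update of `H_hat` and vice versa, gives an alternating scheme in the spirit of the BS-BALS receiver.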
 
We now derive the expressions for the closed-form estimation of \({\mathbf{H}}\) and \({\mathbf{X}}\) at the BS. The procedure is analogous to that discussed on the HRIS side. First, by applying \(\mathrm{vec}\{\cdot\}\) to the \(k\)-th frontal slice of \(\mathcal{Y}^\mathrm{BS}\), we define \(\mathit{\boldsymbol{y}}^{\mathrm{BS}}_k\! \doteq \!\mathrm{vec}\{(\mathcal{T}_{\boldsymbol{\Omega}} \begin{tikzpicture}[baseline=-0.6ex] \node[draw,circle,inner sep=0.5pt]{\scalebox{0.5}{3}}; \end{tikzpicture}_{2}^{1}\mathcal{S})_{\cdot\cdot k} + \mathcal{V}^\mathrm{BS}_{\cdot\cdot k}\} \in \mathbb{C}^{TM \times 1}\), or \[\mathit{\boldsymbol{y}}^{\mathrm{BS}}_k = ({\mathbf{X}}^{{\rm T}}\otimes{\mathbf{H}})({\mathbf{W}}_k^{{\rm T}}\otimes{\rm diag\!}\left\{ \mathit{\boldsymbol{\psi}}_k \right\})\hat{\mathit{\boldsymbol{g}}} + \mathit{\boldsymbol{\nu}}_k^\mathrm{BS},\] where \(\mathit{\boldsymbol{\nu}}_k^\mathrm{BS} \doteq {\rm vec}\left\{ {\mathbf{V}}^\mathrm{BS}_k \right\} \in \mathbb{C}^{TM \times 1}\). Stacking column-wise the received signal vectors \(\mathit{\boldsymbol{y}}^\mathrm{BS}_k\), for \(k=1,2,\cdots,K\), we obtain the 3-mode unfolding of \(\mathcal{Y}^\mathrm{BS}\), given by \(\bigl[\mathcal{Y}^\mathrm{BS}\bigr]_{(3)} = \bigl[\mathit{\boldsymbol{y}}^{\mathrm{BS}}_1,\cdots,\mathit{\boldsymbol{y}}^{\mathrm{BS}}_K\bigr]^{{\rm T}}\in \mathbb{C}^{K \times TM}\). 
The transposed version of this unfolding can be expressed as \[\label{ybs3t} \bigl[\mathcal{Y}^\mathrm{BS}\bigr]_{(3)}^{{\rm T}}= \left({\mathbf{X}}^{{\rm T}}\otimes{\mathbf{H}}\right){\mathbf{E}}_\mathrm{xh}({\mathbf{I}}_K \otimes\hat{\mathit{\boldsymbol{g}}}) + \bigl[\mathcal{V}^\mathrm{BS}\bigr]_{(3)}^{{\rm T}},\tag{32}\] where \({\mathbf{E}}_\mathrm{xh} \in \mathbb{C}^{RN \times KLN}\) is defined as \[\label{omegaxhbscomb} {\mathbf{E}}_\mathrm{xh} \doteq \bigl[{\mathbf{W}}_1^{{\rm T}}\otimes{\rm diag\!}\left\{ \mathit{\boldsymbol{\psi}}_1 \right\},\cdots,{\mathbf{W}}_K^{{\rm T}}\otimes{\rm diag\!}\left\{ \mathit{\boldsymbol{\psi}}_K \right\}\bigr].\tag{33}\] Defining \({\mathbf{Z}} \doteq {\mathbf{X}}^{{\rm T}}\otimes{\mathbf{H}} \in \mathbb{C}^{TM \times RN}\), we first find its compound estimate by solving the following LS problem \[\hat{{\mathbf{Z}}} = \mathop{\mathrm{arg\,min}}_{{\mathbf{Z}}} \left\|\left[ \mathcal{Y}^{\mathrm{BS}} \right]_{(3)}^{{\rm T}}- {\mathbf{Z}}{\mathbf{E}}_\mathrm{xh}({\mathbf{I}}_K \otimes\hat{\mathit{\boldsymbol{g}}})\right\|^2_{\textrm{F}},\] the solution of which is given by \[\label{bswxhest} \hat{{\mathbf{Z}}} = \left[ \mathcal{Y}^{\mathrm{BS}} \right]_{(3)}^{{\rm T}}\bigl[{\mathbf{E}}_\mathrm{xh}({\mathbf{I}}_K \otimes\hat{\mathit{\boldsymbol{g}}})\bigr]^\dagger.\tag{34}\] From the estimate \(\hat{{\mathbf{Z}}}\), we can jointly find the individual estimates of \({\mathbf{X}}\) and \({\mathbf{H}}\) by solving the problem \[\label{kronFXh} \min_{\scriptscriptstyle {\mathbf{X}},{\mathbf{H}}} \left\|\hat{{\mathbf{Z}}} - {\mathbf{X}}^{{\rm T}}\otimes{\mathbf{H}}\right\|_\mathrm{F}^2,\tag{35}\] which is solved via the KronF algorithm [71]. 
In our context, this is accomplished by solving the following rank-1 matrix approximation problem \[\min_{\scriptscriptstyle {\mathbf{X}},{\mathbf{H}}} \bigl\|\bar{{\mathbf{Z}}} - \mathit{\boldsymbol{h}}\mathit{\boldsymbol{x}}^{{\rm T}}\bigr\|_{\mathrm{F}}^2,\] where \(\mathit{\boldsymbol{h}} \doteq {\rm vec}\left\{ {\mathbf{H}} \right\} \in \mathbb{C}^{MN \times 1}\) and \(\mathit{\boldsymbol{x}} \doteq {\rm vec}\left\{ {\mathbf{X}}^{{\rm T}} \right\} \in \mathbb{C}^{RT \times 1}\). The rank-1 matrix \(\bar{{\mathbf{Z}}} \in \mathbb{C}^{MN \times RT}\) is obtained by rearranging the matrix blocks of \(\hat{{\mathbf{Z}}}\) similarly to the method used for the HRIS-KronF receiver (see Section 4.2). The main steps of the BS-KronF receiver are summarized in Algorithm 7.
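The two stages of the closed-form BS receiver (the LS estimate 34 followed by the rank-1 factorization of 35) can be sketched as follows. This is an illustrative NumPy implementation rather than a transcription of Algorithm 7: the function name `bs_kronf`, the toy dimensions, the random stand-ins for the reflection vectors, and the use of the true channel vector in place of the estimate conveyed by the HRIS are assumptions. The sketch also assumes at least as many sub-frames as the number of rows of the matrix being right-inverted, so that the pseudo-inverse acts as a right inverse.

```python
import numpy as np


def bs_kronf(Y3T, W, psi, g_hat, M, N, T):
    """Closed-form estimates of H and X from the transposed 3-mode unfolding (sketch)."""
    K, L, R = W.shape
    # k-th column of E_xh (I_K kron g_hat): (W_k^T kron diag(psi_k)) g_hat, size RN.
    B = np.column_stack([np.kron(W[k].T, np.diag(psi[k])) @ g_hat for k in range(K)])
    Z_hat = Y3T @ np.linalg.pinv(B)  # Eq. (34), size TM x RN
    # Rearrange blocks so that Z_bar ~ vec(H) vec(X^T)^T, size MN x RT.
    Z_bar = Z_hat.reshape(T, M, R, N).transpose(3, 1, 2, 0).reshape(N * M, R * T)
    U, s, Vh = np.linalg.svd(Z_bar, full_matrices=False)
    h = np.sqrt(s[0]) * U[:, 0]
    x = np.sqrt(s[0]) * Vh[0, :]
    return h.reshape(M, N, order="F"), x.reshape(T, R, order="F").T


# Toy, noiseless example with hypothetical dimensions (K >= R*N).
rng = np.random.default_rng(4)
M, N, L, R, T, K = 2, 3, 3, 2, 4, 8
c = lambda *s: rng.standard_normal(s) + 1j * rng.standard_normal(s)
H, G, X, psi, W = c(M, N), c(N, L), c(R, T), c(K, N), c(K, L, R)
g = G.reshape(-1, order="F")  # vec(G), used here in place of the HRIS estimate
Y3T = np.column_stack(
    [(H @ np.diag(psi[k]) @ G @ W[k] @ X).reshape(-1, order="F") for k in range(K)]
)
H_hat, X_hat = bs_kronf(Y3T, W, psi, g, M, N, T)
print(np.allclose(np.kron(X_hat.T, H_hat), np.kron(X.T, H)))
```

The individual factors `H_hat` and `X_hat` match the true matrices only up to a reciprocal scalar, which is why the check compares their Kronecker product.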
 
In this section, we briefly present the semi-blind receivers for the KRSTC scheme, referring, when appropriate, to equations and algorithms from the TSTC case for the sake of brevity, since the algebraic steps to derive the semi-blind receivers for the KRSTC scheme are, in most cases, similar to those for the TSTC. In those cases, we replace \({\mathbf{W}}_k\) by \({\rm diag\!}\left\{ \mathit{\boldsymbol{\lambda}}_k \right\}\) in \({\mathbf{F}}_\mathrm{g}\), \({\mathbf{F}}_\mathrm{x}\), \({\mathbf{F}}_\mathrm{xg}\), \({\mathbf{E}}_\mathrm{h}\) and \({\mathbf{E}}_\mathrm{x}\), except in \({\mathbf{E}}_\mathrm{xh}\). In addition, \(R=L\) is assumed in \({\mathbf{X}}\).
The BALS receiver at the HRIS for KRSTC capitalizes on the BALS framework in Algorithm 4, by exploiting 21 and 23 while redefining \({\mathbf{F}}_\mathrm{g}\) and \({\mathbf{F}}_\mathrm{x}\) as \({\mathbf{F}}_\mathrm{g} \! \doteq \! \bigl[{\rm diag\!}\left\{ \mathit{\boldsymbol{\lambda}}_1 \right\} \otimes\boldsymbol{\Phi}_1^{{\rm T}},\!\cdots\!,{\rm diag\!}\left\{ \!\mathit{\boldsymbol{\lambda}}_K\! \right\} \otimes\boldsymbol{\Phi}_K^{{\rm T}}\bigr]^{{\rm T}}\!\! \in \! \mathbb{C}^{KLN_c \times LN}\) and \({\mathbf{F}}_\mathrm{x} \!\doteq\! \bigl[{\rm diag\!}\left\{ \!\mathit{\boldsymbol{\lambda}}_1\! \right\}\!{\mathbf{G}}^{{\rm T}}\boldsymbol{\Phi}_1^{{\rm T}},\!\cdots\!,{\rm diag\!}\left\{ \!\mathit{\boldsymbol{\lambda}}_K\! \right\}\!{\mathbf{G}}^{{\rm T}}\boldsymbol{\Phi}_K^{{\rm T}}\bigr]^{{\rm T}}\!\in \!\mathbb{C}^{KN_c \times L}\), respectively.
Closed-form estimates of \({\mathbf{G}}\) and \({\mathbf{X}}\) at the HRIS are obtained by following steps similar to those adopted in Section 4.2, with minor algebraic modifications. As a result, the previously Kronecker structured combined matrix \({\mathbf{Q}}\), estimated in 25 , becomes Khatri-Rao structured, i.e., \({\mathbf{Q}} \doteq {\mathbf{G}} \diamond{\mathbf{X}}^{{\rm T}}\in \mathbb{C}^{NT \times L}\). Therefore, we rewrite 25 to get \[\label{xgesthriskr} \hat{{\mathbf{Q}}} = \mathrm{unvec}_{NT \times L}\{({\mathbf{F}}_\mathrm{xg}^\dagger\otimes{\mathbf{I}}_T)\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC}\},\tag{36}\] where \({\mathbf{F}}_\mathrm{xg}\) is redefined as \({\mathbf{F}}_\mathrm{xg}\doteq\bigl[\mathit{\boldsymbol{\lambda}}_1 \otimes\boldsymbol{\Phi}_1^{{\rm T}},\cdots,\mathit{\boldsymbol{\lambda}}_K \otimes\boldsymbol{\Phi}_K^{{\rm T}}\bigr]^{{\rm T}}\in \mathbb{C}^{KN_c \times LN}\). Once \(\hat{{\mathbf{Q}}}\) is found, we consider the problem \[\label{minkrf} \min_{\scriptscriptstyle {\mathbf{X}},{\mathbf{G}}} \bigl\|\hat{{\mathbf{Q}}} - {\mathbf{G}} \diamond{\mathbf{X}}^{{\rm T}}\bigr\|_\mathrm{F}^2,\tag{37}\] for which the Khatri-Rao Factorization (KRF) algorithm [27], [61], [72] can be applied to decouple the estimates of \({\mathbf{X}}\) and \({\mathbf{G}}\). According to [72], this can be accomplished by reshaping each \(l\)-th column of \(\hat{{\mathbf{Q}}}\), defined as \(\hat{\mathit{\boldsymbol{x}}}_{{\mathrm{g}}_l}\), into a rank-1 matrix \({\mathbf{Q}}_l \in \mathbb{C}^{T \times N}\). Defining the \(l\)-th column of \({\mathbf{G}}\) and \({\mathbf{X}}^{{\rm T}}\) as, respectively, \(\mathit{\boldsymbol{g}}_l \in \mathbb{C}^{N \times 1}\) and \(\mathit{\boldsymbol{x}}_l \in \mathbb{C}^{T \times 1}\) and by considering the property 10 , \({\mathbf{Q}}_l\) can be further approximated by \(\mathit{\boldsymbol{x}}_l\mathit{\boldsymbol{g}}_l^{{\rm T}}\). 
Therefore, we can tackle the problem 37 and get closed-form estimates of \({\mathbf{G}}\) and \({\mathbf{X}}\) by solving \(L\) rank-1 matrix approximation subproblems, i.e., \[\label{ghestkrf} [\hat{{\mathbf{G}}},\hat{{\mathbf{X}}}] = \mathop{\mathrm{arg\,min}}_{\mathit{\boldsymbol{g}}_l,\mathit{\boldsymbol{x}}_l}\sum\limits_{l=1}^L\left|\left|{\mathbf{Q}}_l - \mathit{\boldsymbol{x}}_l\mathit{\boldsymbol{g}}_l^{{\rm T}}\right|\right|_{\rm F}^2,\tag{38}\] where each subproblem operates on the reshaping of the \(l\)-th column of \(\hat{{\mathbf{Q}}}\) into a rank-1 matrix \({\mathbf{Q}}_l \in \mathbb{C}^{T \times N}\). The \(l\)-th columns of \(\hat{{\mathbf{X}}}^{{\rm T}}\) and \(\hat{{\mathbf{G}}}\) are respectively found from the dominant left and right singular vectors of \({\mathbf{Q}}_l\). A relevant feature of the KRF method is that by distributing the \(L\) estimation steps across parallel processors, the processing delay can be reduced. The HRIS-KRF receiver is summarized in Algorithm 5.
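The column-wise rank-1 approximations of 38 can be sketched as follows. This is an illustrative NumPy version of the generic Khatri-Rao factorization idea; the function names `khatri_rao` and `krf`, the equal split of the singular value between the factors, and the toy dimensions are assumptions. Each pair of columns is recovered only up to its own reciprocal scaling, so the check compares the rebuilt Khatri-Rao product.

```python
import numpy as np


def khatri_rao(A, B):
    """Column-wise Kronecker product of A (I x L) and B (J x L)."""
    return np.column_stack([np.kron(A[:, l], B[:, l]) for l in range(A.shape[1])])


def krf(Q_hat, N, T):
    """Factor Q_hat ~ G khatri-rao X^T (size NT x L) via L rank-1 SVDs (sketch)."""
    L = Q_hat.shape[1]
    G_hat = np.zeros((N, L), dtype=complex)
    Xt_hat = np.zeros((T, L), dtype=complex)
    for l in range(L):
        # Column l equals g_l kron x_l; reshaped column-major it reads x_l g_l^T.
        Q_l = Q_hat[:, l].reshape(T, N, order="F")
        U, s, Vh = np.linalg.svd(Q_l, full_matrices=False)
        Xt_hat[:, l] = np.sqrt(s[0]) * U[:, 0]
        G_hat[:, l] = np.sqrt(s[0]) * Vh[0, :]  # Vh already holds v^H
    return G_hat, Xt_hat.T


# Toy, noiseless check with hypothetical dimensions (R = L under KRSTC).
rng = np.random.default_rng(5)
N, L, T = 4, 3, 5
G = rng.standard_normal((N, L)) + 1j * rng.standard_normal((N, L))
X = rng.standard_normal((L, T)) + 1j * rng.standard_normal((L, T))
Q = khatri_rao(G, X.T)
G_hat, X_hat = krf(Q, N, T)
print(np.allclose(khatri_rao(G_hat, X_hat.T), Q))
```

Since the loop iterations are independent, they could be dispatched to parallel workers, which is the property noted above for the KRF method.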
 
At the BS, the BALS receiver for KRSTC exploits the BALS framework in Algorithm 6 by rewriting 30 and 31 while redefining \({\mathbf{E}}_\mathrm{h} \in \mathbb{C}^{N \times KL}\) and \({\mathbf{E}}_\mathrm{x} \in \mathbb{C}^{KN \times L}\) as \({\mathbf{E}}_\mathrm{h}\!\doteq \!\!\bigl[{\rm diag\!}\left\{ \!\mathit{\boldsymbol{\psi}}_1\! \right\}\!\hat{{\mathbf{G}}}{\rm diag\!}\left\{ \!\mathit{\boldsymbol{\lambda}}_1\! \right\}\!,\!\cdots\!,\!{\rm diag\!}\left\{ \!\mathit{\boldsymbol{\psi}}_K\! \right\}\!\hat{{\mathbf{G}}}{\rm diag\!}\left\{ \!\mathit{\boldsymbol{\lambda}}_K\! \right\}\bigr]\) and \({\mathbf{E}}_\mathrm{x}\!\doteq \!\!\bigl[{\rm diag\!}\left\{ \!\mathit{\boldsymbol{\lambda}}_1\! \right\}\!\hat{{\mathbf{G}}}^{{\rm T}}\!{\rm diag\!}\left\{ \!\mathit{\boldsymbol{\psi}}_1\! \right\}\!,\!\cdots\!,\!{\rm diag\!}\left\{ \!\mathit{\boldsymbol{\lambda}}_K\! \right\}\!\hat{{\mathbf{G}}}^{{\rm T}}{\rm diag\!}\left\{ \!\mathit{\boldsymbol{\psi}}_K\! \right\}\bigr]^{{\rm T}}\!\), respectively.
The closed-form receiver following KRSTC can be derived by first redefining \({\mathbf{Z}} \doteq {\mathbf{X}}^{{\rm T}}\otimes{\mathbf{H}} \in \mathbb{C}^{TM \times LN}\) and applying the \(\mathrm{vec}\{\cdot\}\) operator to 14. Then, one can use the properties 7 and 9 to obtain \(\bigl[\mathcal{Y}^\mathrm{BS}\bigr]_{(3)}^{{\rm T}}= {\mathbf{Z}}{\rm diag\!}\left\{ \hat{\mathit{\boldsymbol{g}}} \right\}{\mathbf{E}}_\mathrm{xh} + \bigl[\mathcal{V}^\mathrm{BS}\bigr]_{(3)}^{{\rm T}}\), where \({\mathbf{E}}_\mathrm{xh}\) assumes the form \({\mathbf{E}}_\mathrm{xh} \doteq \boldsymbol{\Lambda}^{{\rm T}}\diamond\boldsymbol{\Psi}^{{\rm T}}\in \mathbb{C}^{LN \times K}\), constructed from the definitions of \(\boldsymbol{\Psi}\) and \(\boldsymbol{\Lambda}\) (in Section 3). An estimate of the composite matrix \({\mathbf{Z}}\) can be found by solving \[\hat{{\mathbf{Z}}} = \mathop{\mathrm{arg\,min}}_{{\mathbf{Z}}} \left\|\left[ \mathcal{Y}^{\mathrm{BS}} \right]_{(3)}^{{\rm T}}- {\mathbf{Z}}{\rm diag\!}\left\{ \hat{\mathit{\boldsymbol{g}}} \right\}{\mathbf{E}}_\mathrm{xh}\right\|^2_{\textrm{F}}.\] This way, we get \[\label{bsxhestkr} \hat{{\mathbf{Z}}} = \left[ \mathcal{Y}^\mathrm{BS} \right]_{(3)}^{{\rm T}}\bigl({\rm diag\!}\left\{ \hat{\mathit{\boldsymbol{g}}} \right\}{\mathbf{E}}_\mathrm{xh}\bigr)^\dagger\in \mathbb{C}^{TM \times LN}.\tag{39}\] Then, we replace 34 by 39 and invoke Algorithm 7 to estimate \({\mathbf{X}} \in \mathbb{C}^{L \times T}\) and \({\mathbf{H}} \in \mathbb{C}^{M \times N}\).
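The two stages of this closed-form receiver can be sketched numerically as follows. This is a hedged illustration, not a reproduction of Algorithm 7: the function names `ls_composite` and `kron_factorize` are hypothetical, \({\mathbf{E}}_\mathrm{xh}\) is taken as a generic input instead of being built from \(\boldsymbol{\Lambda}\) and \(\boldsymbol{\Psi}\), and the Kronecker factorization is implemented through the standard rank-1 rearrangement of a Kronecker product, which we assume matches the intended factorization step.

```python
import numpy as np

def ls_composite(Y, g_hat, E_xh):
    """Least-squares estimate of Z from Y = Z diag(g_hat) E_xh (+ noise), as in (39)."""
    A = np.diag(g_hat) @ E_xh            # LN x K, must have full row rank (K >= LN)
    return Y @ np.linalg.pinv(A)

def kron_factorize(Z, T, L, M, N):
    """Split Z ~ kron(X^T, H) into X (L x T) and H (M x N), up to a scalar."""
    # Z[t*M + m, l*N + n] = X^T[t, l] * H[m, n]. Regrouping the indices so that rows
    # run over (t, l) and columns over (m, n) gives a rank-1 matrix.
    R1 = Z.reshape(T, M, L, N).transpose(0, 2, 1, 3).reshape(T * L, M * N)
    U, s, Vh = np.linalg.svd(R1, full_matrices=False)
    Xt_hat = (np.sqrt(s[0]) * U[:, 0]).reshape(T, L)
    H_hat = (np.sqrt(s[0]) * Vh[0, :]).reshape(M, N)
    return Xt_hat.T, H_hat
```

In the noiseless case, `np.kron(X_hat.T, H_hat)` reproduces \({\mathbf{Z}}\) exactly, while the individual factors carry a mutually cancelling scalar, consistent with the ambiguity analysis of this section.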
Estimating \({\mathbf{X}}\), \({\mathbf{G}}\), and \({\mathbf{H}}\) at the HRIS and BS requires solving estimation steps that include right and/or left matrix inverses, and these steps should yield unique solutions. For the TSTC scheme, this amounts to satisfying the identifiability conditions of 21 and 23 for the iterative BALS receiver, and of 25 for the closed-form KronF receiver, at the HRIS. Similarly, at the BS, we need to ensure unique solutions of 30 and 31 for the BALS receiver, and of 34 for the KronF receiver. Analogous considerations should be made for the KRSTC scheme. For simplicity, we base our identifiability assessment on the assumption that \(\mathcal{T}_{\boldsymbol{\Phi}}\), \(\mathcal{W}\) and \(\boldsymbol{\Psi}\) are designed to ensure that the full-rank property is preserved in all \(K\) blocks that comprise the matrices to be left/right inverted in the TSTC scheme. For this purpose, we assume \(\boldsymbol{\Phi}_k\), \({\mathbf{W}}_k\), and \({\rm diag\!}\left\{ \mathit{\boldsymbol{\psi}}_k \right\}\) have full rank in 11 and 12, for \(k=1,\cdots,K\). Analogously, the same assumption is applied in the KRSTC for \(\mathcal{T}_{\boldsymbol{\Phi}}\), \(\boldsymbol{\Lambda}\) and \(\boldsymbol{\Psi}\), where \(\boldsymbol{\Phi}_k\), \({\rm diag\!}\left\{ \mathit{\boldsymbol{\lambda}}_k \right\}\) and \({\rm diag\!}\left\{ \mathit{\boldsymbol{\psi}}_k \right\}\) in 13 and 14 are also assumed to be full-rank. The design optimization of these phase shifts and coding coefficients is beyond the scope of this paper and is left for future work. Hereafter, we discuss the conditions for identifiability of the channel and symbol matrices and their implications for receiver design.
The uniqueness of \(\hat{{\mathbf{X}}}\) requires that \({\mathbf{F}}_\mathrm{x}\) (at the HRIS) and \(({\mathbf{I}}_K \otimes{\mathbf{H}}){\mathbf{E}}_\mathrm{x}\) (at the BS) are left-invertible in, respectively, 23 and 31 , which entails that both of them must have full column rank. To achieve this at the HRIS, the necessary conditions are \(KN_c \geq R\) for the TSTC and \(KN_c \geq L\) for the KRSTC. On the other hand, the necessary conditions are \(KM \geq R\) (TSTC) and \(KM \geq L\) (KRSTC) at the BS.
Proposition 1. Letting \(\kappa_\mathrm{g} = \mathrm{rank}({\mathbf{G}})\), \({\mathbf{F}}_\mathrm{x}\) has full column-rank if \(K\zeta_\mathrm{x} \geq R\), where \(\zeta_\mathrm{x}\) is the rank of \({\mathbf{F}}_{\mathrm{x}_k}\), i.e., the \(k\)-th block of \({\mathbf{F}}_\mathrm{x}\). In addition, we have \(\zeta_\mathrm{x} \leq \mathrm{min}\{N_c,\kappa_\mathrm{g},R\}\) for the TSTC. This inequality becomes \(\zeta_\mathrm{x} \leq \mathrm{min}\{N_c,\kappa_\mathrm{g}\}\) for the KRSTC, and \({\mathbf{F}}_\mathrm{x}\) would have full column-rank if \(K\zeta_\mathrm{x} \geq L\). Proof: see Appendix 12.1.
Proposition 2. Letting \(\kappa_\mathrm{h} \!=\! \mathrm{rank}({\mathbf{H}})\) and \(\xi_\mathrm{x} \!=\! \mathrm{rank}({\mathbf{H}}{\mathbf{E}}_{\mathrm{x}_k})\), the product \(({\mathbf{I}}_K \otimes{\mathbf{H}}){\mathbf{E}}_\mathrm{x}\) has full column-rank if \(K\xi_\mathrm{x} \geq R\), where \(\xi_\mathrm{x} \leq \mathrm{min}\{\kappa_\mathrm{h},\kappa_\mathrm{g},R\}\) for the TSTC. For the KRSTC, \(({\mathbf{I}}_K \otimes{\mathbf{H}}){\mathbf{E}}_\mathrm{x}\) would have full column-rank if \(K\xi_\mathrm{x} \geq L\), with \(\xi_\mathrm{x} \leq \mathrm{min}\{\kappa_\mathrm{h},\kappa_\mathrm{g}\}\). Proof: see Appendix 12.1.
At the HRIS, the necessary conditions become sufficient when \({\mathbf{G}}\) has full rank, which corresponds to rich-scattering propagation in the UT-HRIS channel, e.g., Rayleigh fading. In practical scenarios where \(N \gg L\), the full rank of \({\mathbf{G}}\) indicates that it has full column-rank, meaning \(\kappa_\mathrm{g} = L\). Otherwise, when poor scattering is considered, e.g., for millimeter wave or Terahertz communications, \({\mathbf{G}}\) may be rank-deficient (\(\kappa_\mathrm{g} < L\)). Note that a more restrictive condition occurs when \(\zeta_\mathrm{x} = 1\), and then we would have \(K \geq R\) and \(K \geq L\) for TSTC and KRSTC, respectively. This last condition corresponds to the sufficient condition that guarantees the unique solution of \(\hat{{\mathbf{X}}}\) in any scenario.
On the other hand, the necessary conditions to get \(\hat{{\mathbf{X}}}\) at the BS are sufficient when \(\hat{{\mathbf{G}}}\) and \({\mathbf{H}}\) have full rank, which implies a rich scattering scenario for both the UT-HRIS and HRIS-BS channels. In contrast, under a poor scattering scenario in one or both channels, \(\hat{{\mathbf{G}}}\) and/or \({\mathbf{H}}\) may be rank-deficient (in this case, \(\kappa_\mathrm{h} < \mathrm{min}\{M,N\}\)). Similar to the HRIS case, the sufficient conditions for any scenario are \(K \geq R\) and \(K \geq L\) for TSTC and KRSTC, respectively, since they also cover the case \(\xi_\mathrm{x} = 1\) in the blocks.
The uniqueness of \(\hat{{\mathbf{G}}}\) requires that \(\bar{{\mathbf{F}}}_\mathrm{g} = ({\mathbf{I}}_{K} \otimes{\mathbf{X}}^{{\rm T}}\otimes{\mathbf{I}}_{N_c}){\mathbf{F}}_\mathrm{g}\) in 21 has full column-rank to be left-invertible. For this purpose, it is necessary that \(KTN_c \geq LN\) for both transmission schemes.
Proposition 3. Letting \(\kappa_\mathrm{x} = \mathrm{rank}({\mathbf{X}}) \leq \mathrm{min}\{R,T\}\), \(\bar{{\mathbf{F}}}_\mathrm{g}\) has full column-rank if \(KN_c\kappa_\mathrm{x} \geq LN\). Proof: see Appendix 12.2.
If \({\mathbf{X}}\) has full row-rank, i.e., \(\kappa_\mathrm{x} = R\) (TSTC), or \(\kappa_\mathrm{x} = L\) (KRSTC), the necessary condition is sufficient. However, if \({\mathbf{X}}\) has full column-rank (\(\kappa_\mathrm{x} = T\)), the UT must compensate for this by transmitting additional sub-frames to ensure a rank equal to \(LN\) for \(\bar{{\mathbf{F}}}_\mathrm{g}\). Even so, designing \({\mathbf{F}}_\mathrm{g}\) as full column-rank, as assumed in Appendix 12.2, already induces such compensation.
Even though \({\mathbf{X}}\) can be assumed to have full rank, recall that \({\mathbf{G}}\) is estimated alternately with \({\mathbf{X}}\) in the iterative receiver BALS at the HRIS. Due to the random initialization of one of these matrices, the initial iterations may yield poorly conditioned solutions in very low SNR regimes, potentially resulting in a low rank for \(\hat{{\mathbf{X}}}\). In the worst case, more sub-frames must be transmitted to guarantee \(KN_c \geq LN\) if \(\kappa_\mathrm{x} = 1\).
Estimating \(\hat{{\mathbf{H}}}\) uniquely requires that \({\mathbf{E}}_\mathrm{h}({\mathbf{I}}_K \otimes{\mathbf{X}})\) in 30 has full row-rank, i.e., that it is right-invertible. The necessary condition for this is \(KT \geq N\).
Proposition 4. \({\mathbf{E}}_\mathrm{h}({\mathbf{I}}_K \otimes{\mathbf{X}})\) has full row-rank if \(K\xi_\mathrm{h} \geq N\), where \(\xi_\mathrm{h}\) is the rank of \({\mathbf{E}}_{\mathrm{h}_k}{\mathbf{X}}\). Proof: see Appendix 12.3.
Assuming \(\hat{{\mathbf{G}}}\) has full rank, the matrix \({\mathbf{E}}_\mathrm{h}\) would have full row rank if \(KR \geq N\) for TSTC and \(KL \geq N\) for KRSTC. Accordingly, the necessary condition above is sufficient if \({\mathbf{X}}\) has full row-rank. However, if \({\mathbf{X}}\) has full column-rank, the UT must transmit additional sub-frames to ensure that \({\mathbf{E}}_\mathrm{h}({\mathbf{I}}_K \otimes{\mathbf{X}})\) has full row-rank. Therefore, the sufficient condition is \(K\xi_\mathrm{h} \geq N\). The most restrictive scenario in this regard requires \(K \geq N\).
Estimating the combined matrix \({\mathbf{Q}}\) is mandatory before applying the Kronecker factorization in the TSTC scheme and the Khatri-Rao factorization in the KRSTC one, both of which are closed-form solutions for estimating \({\mathbf{X}}\) and \({\mathbf{G}}\). For \(\hat{{\mathbf{Q}}}\) to be unique, it is sufficient that \({\mathbf{F}}_\mathrm{xg}\) in 25 and 36 has full column-rank. This requires \(KN_c \geq RLN\) and \(KN_c \geq LN\) for TSTC and KRSTC, respectively. Indeed, estimating \(\hat{{\mathbf{Q}}}\) in the TSTC is \(R\) times more restrictive than in KRSTC. In contrast, KRSTC does not provide multiplexing of multiple streams at the transmitter. In addition, to recover \({\mathbf{X}}\) and \({\mathbf{G}}\) without scaling ambiguities under KRSTC, the UT must transmit \(L\) known symbols per sub-frame, unlike TSTC, which requires only 1 symbol, as we will see in the next section.
The estimate of the composite matrix \(\hat{{\mathbf{Z}}}\), required before applying the Kronecker factorization, is unique if \({\mathbf{E}}_\mathrm{xh}({\mathbf{I}}_K \otimes\hat{\mathit{\boldsymbol{g}}})\) in 34 has full row-rank, which implies \(K \geq RN\) for TSTC and \(K \geq LN\) for KRSTC. These solutions provide unique estimates of \({\mathbf{Z}}\) even in more challenging scenarios where \({\mathbf{G}}\) is rank-deficient. Note that the design requirements for executing those closed-form methods to estimate \({\mathbf{X}}\) and \({\mathbf{H}}\) are the most restrictive ones in terms of the number of sub-frames and, hence, time overhead.
The identifiability conditions required to satisfy each receiver (at both the HRIS and BS) are summarized in Table 3 for both TSTC and KRSTC schemes, which are presented in terms of the minimum number \(K\) of sub-frames necessary to ensure the estimation of the corresponding channel and symbol matrices. For BALS receivers, the required conditions to estimate both matrices must be satisfied simultaneously. To address this, simplified conditions meeting this requirement are provided in Table 3. In addition, Table 3 exhibits the computational complexity of each receiver, which will be covered in Section 8.
| Receiver | Entity | Coding | Condition \(K \geq \left\lceil{\cdot}\right\rceil\) | Complexity \(\mathcal{O}(\cdot)\) | 
|---|---|---|---|---|
| BALS | HRIS | TSTC | \((1/N_c)\mathrm{max}\{R,\!LN/T\}\) | \(KN_c(R^2\!+\!L^2N^2T)\) | 
| KronF | HRIS | TSTC | \(LRN/N_c\) | \(LRN(LRNKN_c\! +\! T)\) | 
| BALS | BS | TSTC | \(\mathrm{max}\{R/M,N/T\}\) | \(K(R^2M\!+\!N^2T)\) | 
| KronF | BS | TSTC | \(RN\) | \(RN(RNK\! +\! TM)\) | 
| BALS | HRIS | KRSTC | \((1/N_c)\mathrm{max}\{\!L,\!LN/T\}\) | \(L^2KN_c(1\! + \!N^2T)\) | 
| KRF | HRIS | KRSTC | \(LN/N_c\) | \(LN(LNKN_c\! +\! T)\) | 
| BALS | BS | KRSTC | \(\mathrm{max}\{L/M,N/T\}\) | \(K(L^2M\!+\!N^2T)\) | 
| KronF | BS | KRSTC | \(LN\) | \(LN(LNK \!+\! TM)\) | 
| H | BS | TSTC | \(N/T\) | \(KN^2T\) | 
| H | BS | KRSTC | \(N/T\) | \(KN^2T\) | 
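To make the conditions of Table 3 concrete, the short sketch below evaluates the minimum number of sub-frames \(K\) for each receiver. The helper name `min_subframes` is hypothetical and the parameter values in the usage note are example inputs; the expressions themselves are copied from the table.

```python
from math import ceil

def min_subframes(M, N, Nc, L, R, T):
    """Smallest integer K meeting each identifiability condition listed in Table 3.

    Keys are (receiver, entity, coding). Plain float division is used, which is
    adequate for the small integer parameters considered here.
    """
    return {
        ("BALS", "HRIS", "TSTC"): ceil(max(R, L * N / T) / Nc),
        ("KronF", "HRIS", "TSTC"): ceil(L * R * N / Nc),
        ("BALS", "BS", "TSTC"): ceil(max(R / M, N / T)),
        ("KronF", "BS", "TSTC"): R * N,
        ("BALS", "HRIS", "KRSTC"): ceil(max(L, L * N / T) / Nc),
        ("KRF", "HRIS", "KRSTC"): ceil(L * N / Nc),
        ("BALS", "BS", "KRSTC"): ceil(max(L / M, N / T)),
        ("KronF", "BS", "KRSTC"): L * N,
        ("H", "BS", "TSTC"): ceil(N / T),
        ("H", "BS", "KRSTC"): ceil(N / T),
    }
```

For instance, with \(M = 8\), \(N = 32\), \(N_c = 2\), \(L = R = 2\), and \(T = 4\), all BALS receivers need \(K \geq 8\), the HRIS-KRF receiver needs \(K \geq 32\), and the KronF receivers need \(K \geq 64\), which shows how much more demanding the closed-form solutions are in terms of sub-frames.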
Once the conditions outlined in Table 3 are met, the estimated matrices \(\hat{{\mathbf{G}}}\) and \(\hat{{\mathbf{X}}}\) (at the HRIS) and \(\hat{{\mathbf{H}}}\) and \(\hat{{\mathbf{X}}}\) (at the BS) share scaling ambiguities that mutually compensate each other. As mentioned in Section 3, the received signal at the HRIS following TSTC can be interpreted as a double-Tucker tensor structure with one of the factor matrices being \({\mathbf{I}}_{N_c}\). We can see in Section 4 that \({\mathbf{G}}\) and \({\mathbf{X}}\) interact with each other through a Kronecker product. In this way, we can study the uniqueness of the double-Tucker model with one known matrix through one of its unfoldings. Let us assume that \({\mathbf{G}}\) and \({\mathbf{X}}\) are linked to their estimates as \({\mathbf{X}} = {\mathbf{U}}_\mathrm{x}^{{\rm T}}\hat{{\mathbf{X}}}\) and \({\mathbf{G}} = \hat{{\mathbf{G}}}{\mathbf{U}}_\mathrm{g}\), where \({\mathbf{U}}_\mathrm{x} \in \mathbb{C}^{R \times R}\) and \({\mathbf{U}}_\mathrm{g} \in \mathbb{C}^{L \times L}\) are non-singular transformation matrices.
Replacing both matrices in the noiseless part of 11 yields \[\label{uniqdt1} {\mathbf{Y}}^\mathrm{RC}_k = \boldsymbol{\Phi}_k\hat{{\mathbf{G}}}{\mathbf{U}}_\mathrm{g}{\mathbf{W}}_k{\mathbf{U}}_\mathrm{x}^{{\rm T}}\hat{{\mathbf{X}}} + {\mathbf{V}}^\mathrm{RC}_k.\tag{40}\] After applying \({\rm vec}\left\{ \cdot \right\}\) to the transpose of 40 , we obtain \[\label{uniqdt2} \bar{\mathit{\boldsymbol{y}}}^\mathrm{RC}_k = (\boldsymbol{\Phi}_k \otimes{\mathbf{I}}_T)(\hat{{\mathbf{G}}}{\mathbf{U}}_\mathrm{g} \otimes\hat{{\mathbf{X}}}^{{\rm T}}{\mathbf{U}}_\mathrm{x})\mathit{\boldsymbol{w}}_k + \bar{\mathit{\boldsymbol{\nu}}}^\mathrm{RC}_k,\tag{41}\] and using the property 8 , we have \[\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC}_k = (\boldsymbol{\Phi}_k \otimes{\mathbf{I}}_T)(\hat{{\mathbf{G}}} \otimes\hat{{\mathbf{X}}}^{{\rm T}})({\mathbf{U}}_\mathrm{g} \otimes{\mathbf{U}}_\mathrm{x})\mathit{\boldsymbol{w}}_k + \bar{\mathit{\boldsymbol{\nu}}}^\mathrm{RC}_k.\] Applying \({\rm vec}\left\{ \cdot \right\}\) again, followed by stacking row-wise \(\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC}_k\) for \(k=1,\cdots,K\), we finally get \[\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC} = ({\mathbf{F}}_\mathrm{xg} \otimes{\mathbf{I}}_T)({\mathbf{U}}_\mathrm{g}^{{\rm T}}\otimes{\mathbf{U}}_\mathrm{x}^{{\rm T}}\otimes{\mathbf{I}}_{NT})\mathit{\boldsymbol{q}} + \bar{\mathit{\boldsymbol{\nu}}}^\mathrm{RC}.\] If \({\mathbf{F}}_\mathrm{xg}\) has full column-rank, we can do \[({\mathbf{F}}_\mathrm{xg}^\dagger\otimes{\mathbf{I}}_T)\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC} = ({\mathbf{F}}_\mathrm{xg}^\dagger{\mathbf{F}}_\mathrm{xg} \otimes{\mathbf{I}}_T)({\mathbf{U}}_\mathrm{g}^{{\rm T}}\otimes{\mathbf{U}}_\mathrm{x}^{{\rm T}}\otimes{\mathbf{I}}_{NT})\mathit{\boldsymbol{q}},\] which leads to \[{\mathbf{F}}_\mathrm{xg}^\dagger{\mathbf{F}}_\mathrm{xg} \otimes{\mathbf{I}}_T = {\mathbf{U}}_\mathrm{g}^{{\rm T}}\otimes{\mathbf{U}}_\mathrm{x}^{{\rm T}}\otimes{\mathbf{I}}_{NT} = {\mathbf{I}}_{LRNT}\] 
Therefore, \({\mathbf{U}}_\mathrm{g}^{{\rm T}}\otimes{\mathbf{U}}_\mathrm{x}^{{\rm T}}= {\mathbf{I}}_{LR}\). This holds only when \({\mathbf{U}}_\mathrm{x}\) and \({\mathbf{U}}_\mathrm{g}\) are scaled identity matrices, i.e., \({\mathbf{U}}_\mathrm{x} = \alpha{\mathbf{I}}_R\) and \({\mathbf{U}}_\mathrm{g} = (1/\alpha){\mathbf{I}}_L\). Hence, \(\hat{{\mathbf{X}}}\) and \(\hat{{\mathbf{G}}}\) are unique up to scaling factors that cancel each other.
On the other hand, \({\mathbf{G}}\) and \({\mathbf{X}}\) are coupled through a Khatri-Rao product in the KRSTC scheme. This is a consequence of the underlying Tucker-PARAFAC algebraic structure. Let us rewrite 41 as \[\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC}_k = (\boldsymbol{\Phi}_k \otimes{\mathbf{I}}_T)(\hat{{\mathbf{G}}}{\mathbf{U}}_\mathrm{g} \diamond\hat{{\mathbf{X}}}^{{\rm T}}{\mathbf{U}}_\mathrm{x})\mathit{\boldsymbol{\lambda}}_k + \bar{\mathit{\boldsymbol{\nu}}}^\mathrm{RC}_k,\] wherein the symbol matrix is recast as \({\mathbf{X}} \in \mathbb{C}^{L \times T}\). Using the property 7, we obtain \[\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC}_k = (\boldsymbol{\Phi}_k \otimes{\mathbf{I}}_T)(\hat{{\mathbf{G}}} \otimes\hat{{\mathbf{X}}}^{{\rm T}})({\mathbf{U}}_\mathrm{g} \diamond{\mathbf{U}}_\mathrm{x})\mathit{\boldsymbol{\lambda}}_k + \bar{\mathit{\boldsymbol{\nu}}}^\mathrm{RC}_k.\] Applying \({\rm vec}\left\{ \cdot \right\}\) and stacking \(\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC}_k\) row-wise for \(k=1,\cdots,K\), we have \[\bar{\mathit{\boldsymbol{y}}}^\mathrm{RC} = ({\mathbf{F}}_\mathrm{xg} \otimes{\mathbf{I}}_T)\bigl[({\mathbf{U}}_\mathrm{g} \diamond{\mathbf{U}}_\mathrm{x})^{{\rm T}}\otimes{\mathbf{I}}_{NT} \bigr]\mathrm{vec}\bigl\{\hat{{\mathbf{G}}} \otimes\hat{{\mathbf{X}}}^{{\rm T}}\bigr\} + \bar{\mathit{\boldsymbol{\nu}}}^\mathrm{RC}.\] Given \({\mathbf{A}} \in \mathbb{C}^{I \times P}\) and \({\mathbf{B}} \in \mathbb{C}^{J \times P}\), the Khatri-Rao product \({\mathbf{A}} \diamond{\mathbf{B}} \in \mathbb{C}^{IJ \times P}\) can be computed from the Kronecker one \({\mathbf{A}} \otimes{\mathbf{B}} \in \mathbb{C}^{IJ \times P^2}\) using a reduction matrix \(\boldsymbol{\Xi}\) [73] such that \({\mathbf{A}} \diamond{\mathbf{B}} = ({\mathbf{A}} \otimes{\mathbf{B}})\boldsymbol{\Xi}\).
By applying \({\rm vec}\left\{ \cdot \right\}\) to this and using the property 6, we obtain \(\mathrm{vec}\bigl\{{\mathbf{A}} \diamond{\mathbf{B}}\bigr\} = (\boldsymbol{\Xi}^{{\rm T}}\otimes{\mathbf{I}}_{IJ})\mathrm{vec}\bigl\{{\mathbf{A}} \otimes{\mathbf{B}}\bigr\}\). This way, \[\mathrm{vec}\bigl\{\hat{{\mathbf{G}}} \diamond\hat{{\mathbf{X}}}^{{\rm T}}\bigr\} = \bigl[({\mathbf{I}}_L \diamond{\mathbf{I}}_L)^{{\rm T}}\otimes{\mathbf{I}}_{NT} \bigr]\mathrm{vec}\bigl\{\hat{{\mathbf{G}}} \otimes\hat{{\mathbf{X}}}^{{\rm T}}\bigr\}.\] Note that, if \({\mathbf{U}}_\mathrm{g}\) and \({\mathbf{U}}_\mathrm{x}\) are diagonal matrices, then \({\mathbf{U}}_\mathrm{g} \diamond{\mathbf{U}}_\mathrm{x} = {\mathbf{I}}_L \diamond{\mathbf{U}}_\mathrm{g}{\mathbf{U}}_\mathrm{x} = {\mathbf{U}}_\mathrm{g}{\mathbf{U}}_\mathrm{x} \diamond{\mathbf{I}}_L\). Therefore, to recover the Khatri-Rao product from the Kronecker one, it is mandatory that \({\mathbf{U}}_\mathrm{g}{\mathbf{U}}_\mathrm{x} = {\mathbf{I}}_L\). This means that the estimates \(\hat{{\mathbf{G}}}\) and \(\hat{{\mathbf{X}}}\) are unique up to diagonal scaling matrices that mutually cancel.
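The reduction-matrix relations used in this argument can be checked numerically. The sketch below assumes the reduction matrix takes the form \(\boldsymbol{\Xi} = {\mathbf{I}}_P \diamond{\mathbf{I}}_P\), consistent with the factor \(({\mathbf{I}}_L \diamond{\mathbf{I}}_L)\) appearing above, and a column-major \(\mathrm{vec}\{\cdot\}\); the function names are hypothetical.

```python
import numpy as np

def khatri_rao(A, B):
    """Column-wise Kronecker product of A (I x P) and B (J x P)."""
    I, P = A.shape
    J, _ = B.shape
    return (A[:, None, :] * B[None, :, :]).reshape(I * J, P)

def reduction_matrix(P):
    """Xi = I_P (Khatri-Rao) I_P, of size P^2 x P; its p-th column is e_p kron e_p."""
    Xi = np.zeros((P * P, P))
    for p in range(P):
        Xi[p * P + p, p] = 1.0
    return Xi

def vec(A):
    """Column-major vectorization."""
    return A.reshape(-1, order="F")

rng = np.random.default_rng(0)
I, J, P = 3, 4, 2
A = rng.standard_normal((I, P)) + 1j * rng.standard_normal((I, P))
B = rng.standard_normal((J, P)) + 1j * rng.standard_normal((J, P))
Xi = reduction_matrix(P)

# A <> B = (A kron B) Xi
kr_from_kron = np.kron(A, B) @ Xi
# vec(A <> B) = (Xi^T kron I_{IJ}) vec(A kron B)
vec_identity = np.kron(Xi.T, np.eye(I * J)) @ vec(np.kron(A, B))

# Diagonal ambiguities collapse into a single diagonal factor U_g U_x.
U_g = np.diag(rng.standard_normal(P) + 1.5)
U_x = np.diag(rng.standard_normal(P) + 1.5)
diag_lhs = khatri_rao(U_g, U_x)
diag_rhs = khatri_rao(np.eye(P), U_g @ U_x)
```

Here `kr_from_kron` coincides with `khatri_rao(A, B)`, `vec_identity` with `vec(khatri_rao(A, B))`, and `diag_lhs` with `diag_rhs`, which is the property invoked to conclude that only the product \({\mathbf{U}}_\mathrm{g}{\mathbf{U}}_\mathrm{x}\) matters.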
Given that \({\mathbf{U}}_\mathrm{x} \in \mathbb{C}^{R \times R}\) and \({\mathbf{U}}_\mathrm{h} \in \mathbb{C}^{N \times N}\) are non-singular transformation matrices, \({\mathbf{X}}\) and \({\mathbf{H}}\) are related to their estimates as \({\mathbf{X}} = {\mathbf{U}}_\mathrm{x}^{{\rm T}}\hat{{\mathbf{X}}}\) and \({\mathbf{H}} = \hat{{\mathbf{H}}}{\mathbf{U}}_\mathrm{h}\), respectively, so that we can rewrite 32 as \[\bigl[\mathcal{Y}^\mathrm{BS}\bigr]_{(3)}^{{\rm T}}= (\hat{{\mathbf{X}}}^{{\rm T}}{\mathbf{U}}_\mathrm{x} \otimes\hat{{\mathbf{H}}}{\mathbf{U}}_\mathrm{h})\bar{{\mathbf{E}}}_\mathrm{xh} + \bigl[\mathcal{V}^\mathrm{BS}\bigr]_{(3)}^{{\rm T}},\] where \(\bar{{\mathbf{E}}}_\mathrm{xh} = {\mathbf{E}}_\mathrm{xh}({\mathbf{I}}_K \otimes\hat{\mathit{\boldsymbol{g}}})\). Using the property 8, we have \[\bigl[\mathcal{Y}^\mathrm{BS}\bigr]_{(3)}^{{\rm T}}= (\hat{{\mathbf{X}}}^{{\rm T}}\otimes\hat{{\mathbf{H}}})({\mathbf{U}}_\mathrm{x} \otimes{\mathbf{U}}_\mathrm{h})\bar{{\mathbf{E}}}_\mathrm{xh} + \bigl[\mathcal{V}^\mathrm{BS}\bigr]_{(3)}^{{\rm T}}.\] If \(\bar{{\mathbf{E}}}_\mathrm{xh}\) has full row-rank, \[\bigl[\mathcal{Y}^\mathrm{BS}\bigr]_{(3)}^{{\rm T}}\bar{{\mathbf{E}}}_\mathrm{xh}^\dagger= (\hat{{\mathbf{X}}}^{{\rm T}}\otimes\hat{{\mathbf{H}}})({\mathbf{U}}_\mathrm{x} \otimes{\mathbf{U}}_\mathrm{h})\bar{{\mathbf{E}}}_\mathrm{xh}\bar{{\mathbf{E}}}_\mathrm{xh}^\dagger.\] Then, \[{\mathbf{U}}_\mathrm{x} \otimes{\mathbf{U}}_\mathrm{h} = \bar{{\mathbf{E}}}_\mathrm{xh}\bar{{\mathbf{E}}}_\mathrm{xh}^\dagger= {\mathbf{I}}_{RN}.\] For \({\mathbf{U}}_\mathrm{x} \otimes{\mathbf{U}}_\mathrm{h} = {\mathbf{I}}_{RN}\) to hold, the only solutions are \({\mathbf{U}}_\mathrm{x} = \alpha{\mathbf{I}}_R\) and \({\mathbf{U}}_\mathrm{h} = (1/\alpha){\mathbf{I}}_N\). Therefore, \(\hat{{\mathbf{X}}}\) and \(\hat{{\mathbf{H}}}\) are unique up to scaling factors that cancel each other.
Note that the interaction between the HRIS-BS channel and symbol matrices \({\mathbf{H}}\) and \({\mathbf{X}}\) is governed by a Kronecker product for both TSTC and KRSTC. Therefore, this ambiguity pattern is valid for both transmission schemes at the BS.
On the HRIS side, for TSTC, scaling ambiguities can be mitigated by simply sending a single pilot embedded into the transmitted data. A simple choice is to set \({\mathbf{X}}_{1,1} = 1\). This knowledge allows us to determine \(\alpha\) to eliminate the scaling ambiguity through normalization. On the other hand, for KRSTC, computing \(\boldsymbol{\Delta}_\mathrm{x}\) implies the knowledge of one column of \({\mathbf{X}} \in \mathbb{C}^{L \times T}\) to eliminate the scaling ambiguities. In this case, the UT can send a pilot embedded in the first symbol period of each data stream. A straightforward option is to consider \({\mathbf{X}}_{\cdot,1} = [1,\cdots,1]^{{\rm T}}\). For both coding schemes, the scaling ambiguities affecting the estimated channel and symbol matrices at the BS are given by \(\hat{{\mathbf{X}}} = \beta{\mathbf{X}}\) and \(\hat{{\mathbf{H}}} = (1/\beta){\mathbf{H}}\), which can also be eliminated using the same procedure discussed for the HRIS side.
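The pilot-based normalization described above can be sketched as follows, assuming the ambiguity models of this section: a single scalar for TSTC (with \({\mathbf{X}}_{1,1} = 1\)) and a diagonal matrix for KRSTC (with \({\mathbf{X}}_{\cdot,1} = [1,\cdots,1]^{{\rm T}}\)). The function names are hypothetical, and in the noisy case the division only removes the ambiguity approximately.

```python
import numpy as np

def remove_scaling_tstc(G_hat, X_hat):
    """TSTC: X_hat = c X and G_hat = G / c, with the known pilot X[0, 0] = 1."""
    c = X_hat[0, 0]                      # the pilot entry reveals the scalar c
    return G_hat * c, X_hat / c

def remove_scaling_krstc(G_hat, X_hat):
    """KRSTC: X_hat = D X and G_hat = G D^{-1}, D diagonal, known X[:, 0] = 1."""
    d = X_hat[:, 0]                      # the pilot column reveals the diagonal of D
    return G_hat * d[None, :], X_hat / d[:, None]
```

The same routine applies at the BS by passing \(\hat{{\mathbf{H}}}\) in place of \(\hat{{\mathbf{G}}}\) in the TSTC-style function, since the ambiguity there is a single scalar \(\beta\) for both coding schemes.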
As far as computational complexity is concerned, let us first recall the complexity of the matrix inverse. We consider a complexity of \(\mathcal{O}(I^2J)\) to calculate the pseudo-inverse of a wide matrix \({\mathbf{A}} \in \mathbb{C}^{I \times J}\), where \(\mathrm{rank}\{{\mathbf{A}}\}=I\). For the iterative BALS algorithms, the complexity of each iteration is dominated by the two matrix inverses in 21 and 23 (for the HRIS-BALS receiver) and in 30 and 31 (for the BS-BALS receiver). The overall complexity is given by multiplying the complexity of a single iteration by the number of iterations to convergence. Moreover, note that the complexity of computing the truncated-SVD\(({\mathbf{A}})\) is assumed to be \(\mathcal{O}(IJ\mathrm{rank}\{{\mathbf{A}}\})\). In the particular case of the KronF algorithms, the complexity is given by that of the LS estimation step in the first stage, given by 25 for the HRIS-KronF receiver and by 34 for the BS-KronF receiver, followed by the complexity associated with computing a rank-1 matrix approximation step associated with the Kronecker factorization problems in 26 and 35 , respectively. Finally, the KRF algorithm (considered at the HRIS under the KRSTC scheme) involves solving 36 followed by \(L\) parallel rank-1 matrix approximation routines. Table 3 lists the complexity of all receivers discussed in this work, with the complexity of all BALS receivers provided per iteration.
| CL scenario | Feedback set | Bits fed back | Receiver pair | Coding scheme | Equations | Algorithms |
|---|---|---|---|---|---|---|
| 1 | \(\mathscr{C}=\{\hat{{\mathbf{G}}}\}\) | \(LN\eta\) | BALS-BALS | TSTC | 21, 23, 30, 31 | 4 & 6 |
| 1 | \(\mathscr{C}=\{\hat{{\mathbf{G}}}\}\) | \(LN\eta\) | BALS-KronF | TSTC | 21, 23, 34 | 4 & 7 |
| 1 | \(\mathscr{C}=\{\hat{{\mathbf{G}}}\}\) | \(LN\eta\) | KronF-BALS | TSTC | 25, 30, 31 | 5 & 6 |
| 1 | \(\mathscr{C}=\{\hat{{\mathbf{G}}}\}\) | \(LN\eta\) | KronF-KronF | TSTC | 25, 34 | 5 & 7 |
| 1 | \(\mathscr{C}=\{\hat{{\mathbf{G}}}\}\) | \(LN\eta\) | BALS-BALS | KRSTC | 21\(^\ast\), 23\(^\ast\), 30\(^\ast\), 31\(^\ast\) | 4\(^\ast\) & 6\(^\ast\) |
| 1 | \(\mathscr{C}=\{\hat{{\mathbf{G}}}\}\) | \(LN\eta\) | BALS-KronF | KRSTC | 21\(^\ast\), 23\(^\ast\), 39 | 4\(^\ast\) & 7\(^\ast\) |
| 1 | \(\mathscr{C}=\{\hat{{\mathbf{G}}}\}\) | \(LN\eta\) | KRF-BALS | KRSTC | 36, 30\(^\ast\), 31\(^\ast\) | 8 & 6\(^\ast\) |
| 1 | \(\mathscr{C}=\{\hat{{\mathbf{G}}}\}\) | \(LN\eta\) | KRF-KronF | KRSTC | 36, 39 | 8 & 7\(^\ast\) |
| 2 | \(\mathscr{C}=\{\hat{{\mathbf{G}}},\hat{{\mathbf{X}}}\}\) | \((RT\!-\!1)\mathrm{log}_2\varrho + LN\eta\) | BALS-H | TSTC | 21, 23, 30 | 4 & eq. 30 |
| 2 | \(\mathscr{C}=\{\hat{{\mathbf{G}}},\hat{{\mathbf{X}}}\}\) | \((RT\!-\!1)\mathrm{log}_2\varrho + LN\eta\) | KronF-H | TSTC | 25, 30 | 5 & eq. 30 |
| 2 | \(\mathscr{C}=\{\hat{{\mathbf{G}}},\hat{{\mathbf{X}}}\}\) | \(L(T\!-\!1)\mathrm{log}_2\varrho + LN\eta\) | BALS-H | KRSTC | 21\(^\ast\), 23\(^\ast\), 30\(^\ast\) | 4\(^\ast\) & eq. 30\(^\ast\) |
| 2 | \(\mathscr{C}=\{\hat{{\mathbf{G}}},\hat{{\mathbf{X}}}\}\) | \(L(T\!-\!1)\mathrm{log}_2\varrho + LN\eta\) | KRF-H | KRSTC | 36, 30\(^\ast\) | 8 & eq. 30\(^\ast\) |
\(^\ast\) Equations and algorithms from TSTC that can be reused for KRSTC with necessary adaptations.
At the HRIS, two matrices are estimated by following our semi-blind strategy: the symbol matrix, which carries useful data from the user terminal, and the UT-HRIS channel matrix. Depending on the system design and control overhead constraints, either or both of these matrices can be conveyed to the BS via the control link. Fig. 9 illustrates this process. This shared information defines the operational mode of the receiver at the BS to determine whether full, partial, or no prior knowledge is assumed during the processing of the received signals. The specific choice influences the strategy employed at the base station for joint channel and symbol estimation, ultimately affecting the receiver’s performance, design constraints, and complexity. Capitalizing on the methods adopted in both HRIS and BS, we can define semi-blind receiver pairs by simply associating the receivers used in both, whose formation depends on the information shared between the HRIS and the BS through the control channel. Henceforth, we refer to these receiver pairs by adopting labels in the form “HRIS-BS” pair. Defining the set of data conveyed from the HRIS to the BS through the CL as \(\mathscr{C}\), we can envision different operation modes at the BS.
 
The first possible scenario would be \(\mathscr{C} = \{\hat{{\mathbf{G}}},\hat{{\mathbf{X}}}\}\), i.e., both the estimated symbol and the UT-HRIS channel are reported to the BS. In this case, the BS would only need to calculate \(\hat{{\mathbf{H}}}\) by utilizing 30. We denote this simplified method by replacing “BS” with “H” in the “HRIS-BS” label. Accordingly, as the joint symbol and CE can be accomplished iteratively or in closed form, this scenario leads to the following possibilities of receiver pairs: BALS-H and KronF-H for TSTC, and BALS-H and KRF-H for KRSTC.
Discussion: Recall that the semi-blind receivers provide continuous-valued estimates of the symbols and channel coefficients, so these must be quantized before the entries of the estimated matrices are conveyed through the feedback control channel. Considering a fixed resolution to represent the actual estimated data symbols, depending on the adopted modulation, the feedback duration related to sending \(\hat{{\mathbf{X}}}\) via CL is proportional to \((RT-1)\mathrm{log}_2\varrho\) for TSTC, and \(L(T-1)\mathrm{log}_2\varrho\) for KRSTC, where \(\varrho\) is the cardinality of the constellation. These quantities already account for the number of known symbols used to remove scaling ambiguities. In addition, a level of resolution must be set to represent the estimated UT-HRIS channel coefficients, namely, the number of bits to quantize them. This way, the feedback duration related to sending \(\hat{{\mathbf{G}}}\) is proportional to \(LN\eta\), with \(\eta\) being the resolution of each \(\hat{{\mathbf{G}}}\) entry in bits. Indeed, this first scenario provides a lower computational cost at the BS, since the receiver processing is simpler than that of KronF and BALS, and may offer better estimation performance. Despite the potential benefits, this approach would increase the control link load. In the context of the proposed data-aided semi-blind approach, this scenario serves as a performance reference that will be considered in our numerical evaluations.
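The feedback loads discussed above can be evaluated with a few lines of code. The helper below is a sketch with a hypothetical name and signature; the resolution \(\eta = 8\) bits used in the usage note is an arbitrary illustrative value, not one taken from this paper.

```python
from math import log2

def feedback_bits(L, N, eta, coding=None, R=None, T=None, varrho=None):
    """Bits conveyed over the control link.

    coding=None   : only G_hat is fed back                  -> L*N*eta
    coding="TSTC" : G_hat and X_hat (R x T, one pilot)      -> + (R*T - 1)*log2(varrho)
    coding="KRSTC": G_hat and X_hat (L x T, L pilots)       -> + L*(T - 1)*log2(varrho)
    """
    bits = L * N * eta
    if coding == "TSTC":
        bits += (R * T - 1) * log2(varrho)
    elif coding == "KRSTC":
        bits += L * (T - 1) * log2(varrho)
    elif coding is not None:
        raise ValueError("coding must be None, 'TSTC' or 'KRSTC'")
    return bits
```

For example, with \(L = R = 2\), \(N = 32\), \(T = 4\), a 64-point constellation, and the assumed \(\eta = 8\), feeding back only \(\hat{{\mathbf{G}}}\) costs 512 bits, whereas adding \(\hat{{\mathbf{X}}}\) raises this to 554 bits for TSTC and 548 bits for KRSTC. In this small example the channel term \(LN\eta\) dominates, while the symbol term grows with \(T\).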
A second scenario corresponds to \(\mathscr{C} = \{\hat{{\mathbf{G}}}\}\). This was initially exploited in [22] under a pilot-aided approach for CE. In line with this control link usage, there are four possible combinations of semi-blind receiver pairs for each coding scheme (TSTC or KRSTC). In the TSTC scheme, the four receiver pairs are BALS-BALS, BALS-KronF, KronF-BALS, and KronF-KronF. In the KRSTC scheme, we have BALS-BALS, BALS-KronF, KRF-BALS, and KRF-KronF. These terminologies (and those of the first scenario) will be used in Section 10, where the numerical results of the different receiver pairs will be evaluated. By sending just one matrix to the BS, this scenario reduces the feedback overhead and the HRIS-to-BS latency, which can be an advantage in high-mobility or fast-fading environments. It also conserves network resources by limiting the use of the control channel to signaling exchange, thereby avoiding its occupation with data transmission.
For reference, we provide in Table 4 all the possible combinations of “HRIS-BS” estimation methods by considering the scenario of the CL usage, related equations, and algorithms. When considering all the receiver options available at both the HRIS and the BS, there is flexibility in selecting the receiver pair to be used for the joint estimation of the channel and symbols. For instance, one may opt for combinations that offer low computational complexity at the cost of being more restrictive regarding system design. Conversely, more costly schemes can provide greater design flexibility. Moreover, a balance between complexity and design constraints can be achieved by selecting mixed receiver pairs with iterative and closed-form solutions. For example, using KronF at the HRIS and BALS at the BS would provide low computational cost for the HRIS, while it would be more costly for the BS. In contrast, the design requirements would be more restrictive for the HRIS than the BS.
We adopt a distance-dependent path loss (PL) model, given by \(\mathrm{PL} = \mathrm{PL}_0(d/d_0)^{-\alpha}\), in which \(\mathrm{PL}_0 = -20\) dB is the path loss at the reference distance \(d_0 = 1\) m, \(d\) is the individual link distance, and \(\alpha\) denotes the path loss exponent. The UT-HRIS and HRIS-BS link distances are \(d_u = 40\) m and \(d_h = 10\) m, with path loss exponents \(\alpha_g =2.5\) and \(\alpha_h=2\), respectively. We assume the Rayleigh fading channel model, in which the UT-HRIS and HRIS-BS channels are taken from a zero-mean independent and identically distributed (i.i.d.) complex-valued Gaussian distribution with variances \(\gamma\) and \(\beta\), respectively, corresponding to the path losses of these links. To keep the analyses simple, both the HRIS and the BS have the same noise power level \(\sigma_v^2 = -90\) dBm. Given that the energy coupling level is dictated through meta-atom design, as highlighted in [3], we allocate the same coupling level to all meta-atoms, and we assume the \(\rho\) parameter is non-reconfigurable to simplify the assessment. We design the reflecting and sensing phase shifts as well as the coding (for both TSTC and KRSTC) according to Appendix 12.4. The symbol matrix \({\mathbf{X}}\) is based on a 64-QAM constellation. We evaluate joint symbol and CE accuracies employing the symbol error rate (SER) and the normalized mean square error (NMSE), respectively. Each result is an average over at least \(10^4\) independent Monte Carlo runs. Each run considers different realizations of the symbols, channels, and noise. To ensure a fair comparison between the proposed TSTC and KRSTC schemes, we set \(R = L\) and discard the entire first column of \(\hat{{\mathbf{X}}} \in \mathbb{C}^{R \times T}\) when calculating the SER (not only \(\hat{{\mathbf{X}}}_{1,1}\)). Unless otherwise stated, we assume the parameter set \(\{M,N,N_c,L,R,T,K\} = \{8,32,2,2,2,4,64\}\).
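The channel setup above can be sketched in a few lines. The snippet is illustrative only: the helper names are hypothetical, the path loss is evaluated in dB as \(\mathrm{PL}_0 - 10\alpha\log_{10}(d/d_0)\), which is the logarithmic form of the stated model, and the NMSE is taken as \(\|\hat{{\mathbf{A}}}-{\mathbf{A}}\|_{\rm F}^2/\|{\mathbf{A}}\|_{\rm F}^2\), the usual definition, assumed here since the formal one is not restated in this section.

```python
import numpy as np

def path_loss_db(d, exponent, pl0_db=-20.0, d0=1.0):
    """PL = PL0 * (d / d0)^(-exponent), expressed in dB."""
    return pl0_db - 10.0 * exponent * np.log10(d / d0)

def db_to_linear(x_db):
    return 10.0 ** (x_db / 10.0)

def rayleigh(shape, variance, rng):
    """i.i.d. zero-mean circular complex Gaussian entries with the given variance."""
    scale = np.sqrt(variance / 2.0)
    return scale * (rng.standard_normal(shape) + 1j * rng.standard_normal(shape))

def nmse(A_hat, A):
    """Normalized squared error for one realization (assumed Frobenius-norm definition)."""
    return np.linalg.norm(A_hat - A) ** 2 / np.linalg.norm(A) ** 2

rng = np.random.default_rng(0)
M, N, L = 8, 32, 2
gamma = db_to_linear(path_loss_db(40.0, 2.5))   # UT-HRIS link, about -60.05 dB
beta = db_to_linear(path_loss_db(10.0, 2.0))    # HRIS-BS link, -40 dB
G = rayleigh((N, L), gamma, rng)                # UT-HRIS channel
H = rayleigh((M, N), beta, rng)                 # HRIS-BS channel
```

With these distances and exponents, the UT-HRIS link is roughly 20 dB weaker than the HRIS-BS link, so the cascaded UT-HRIS-BS path seen by the BS is attenuated by about 100 dB before accounting for the power splitting.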
 
 
Firstly, we examine the trade-off between the data-aided semi-blind CE accuracy and the power splitting parameter \(\rho\), shown in Figs. 10 and 11. We provide NMSE and SER results by fixing transmit power values in dBm, denoted by \(P_t\), and changing \(\rho\). In particular, we retain the focus of this study on symbol estimation performance as the CE evaluation was previously reported in [22], which introduced a pilot-aided approach. To inspect symbol estimation at both HRIS and BS, we selected the scenario in which only the estimated channel matrix \(\hat{{\mathbf{G}}}\) is received at the BS from the CL (CL scenario 1), which implies using semi-blind receivers for joint symbol and CE at both ends (BS and HRIS). We chose the KronF and BALS receivers following the TSTC scheme for the HRIS and BS, respectively.
From the results, we can observe that when \(\rho\) approaches 0, the HRIS becomes a quasi “only detecting (not reflecting) RIS”, causing low SER values for the symbol estimation performed at the HRIS, while they approach 1 at the BS. As \(\rho\) increases, the symbol estimation performance is degraded at the HRIS due to the decreased sensing/detection capability. In contrast, the estimation accuracy at the BS side is enhanced as the reflected signal arrives at the BS with greater strength. Likewise, higher values of \(\rho\) imply an increase in the NMSE of \({\mathbf{G}}\) while decreasing the NMSE of \({\mathbf{H}}\), as predicted in [22]. Since the CE capability at the HRIS is less affected by path loss in our setup (the HRIS is closer to the UT than the BS), the estimation accuracy for the channel \({\mathbf{G}}\) remains much higher even when the sensing capability decreases (i.e., when \(\rho\) increases). Nevertheless, when \(\rho\) approaches 1 and the HRIS behaves closer to a “pure reflecting RIS”, the improvement in the estimates \(\hat{{\mathbf{H}}}\) at the BS stops, since the accuracy of \(\hat{{\mathbf{G}}}\) becomes compromised. In addition, from Fig. 10, we can note that for smaller values of \(\rho\), the SER performance at the HRIS is more sensitive to transmit power variations. The same happens with the SER performance at the BS for higher values of \(\rho\). On the other hand, Fig. 11 shows that the channel estimation performance is less sensitive to variations of \(P_t\) than the SER.
In the next experiments, all the results consider \(\rho=90\)%. Since the BS experiences higher path loss due to the cascaded (UT-HRIS-BS) link, this choice allocates more power to the reflected signal part. Despite the lower power allocated to the sensed signal part, the low path loss associated with the UT-HRIS link still ensures reliable symbol detection and CE at the HRIS.
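The rationale for this choice can be illustrated with a rough per-path budget. The sketch below is a simplification and not part of the signal model: it treats \(\rho\) as the fraction of incident energy that is reflected (so that \(1-\rho\) is sensed), reuses the path loss model of the simulation setup, and ignores array and combining gains, coding, and noise. The function names are ours.

```python
import math

def path_loss_db(d, alpha, pl0_db=-20.0, d0=1.0):
    """PL = PL_0 * (d / d_0)^(-alpha), evaluated in dB."""
    return pl0_db - 10.0 * alpha * math.log10(d / d0)

def split_gains_db(rho, pl_g_db, pl_h_db):
    """Average per-path gains (dB) of the sensed and reflected signal parts.

    Assumes 0 < rho < 1 is the reflected energy fraction (hypothetical
    simplification; array gains and noise are not modeled).
    """
    sensed = pl_g_db + 10.0 * math.log10(1.0 - rho)        # UT -> HRIS
    reflected = pl_g_db + pl_h_db + 10.0 * math.log10(rho)  # UT -> HRIS -> BS
    return sensed, reflected

pl_g_db = path_loss_db(40.0, 2.5)  # UT-HRIS link
pl_h_db = path_loss_db(10.0, 2.0)  # HRIS-BS link

sensed_db, reflected_db = split_gains_db(0.9, pl_g_db, pl_h_db)
print(f"sensed: {sensed_db:.2f} dB, reflected: {reflected_db:.2f} dB, "
      f"margin: {sensed_db - reflected_db:.2f} dB")
```

Even with only 10% of the energy sensed, this crude budget leaves the sensed part roughly 30 dB above a single cascaded path, which is in line with the reliable detection observed at the HRIS.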
In Fig. 12, we study the NMSE performances at the HRIS and the BS as a function of the transmit power. We depict the NMSE of the individual channels \({\mathbf{G}}\) (estimated at the HRIS) and \({\mathbf{H}}\) (estimated at the BS). On the other hand, Fig. 13 displays the SER results associated with the symbol detection at both HRIS and BS. In both figures, we compare the performances of the proposed semi-blind receivers as follows: i) we evaluate the results of the proposed receivers designed for the HRIS (see “NMSE of \({\mathbf{G}}\)” curves); ii) by selecting a receiver for the HRIS and another for the BS, we assess the performance of “HRIS-BS” receiver pairs (see “NMSE of \({\mathbf{H}}\)” curves). This is done for both coding schemes, TSTC and KRSTC. Let us first start with the performance on the HRIS side. As a reference for comparisons, in Fig. 12, we also plot the performance of the pilot-assisted case based on [22] using the same set of parameters\(^{6}\), wherein we apply a simple LS solution to estimate \({\mathbf{G}}\) at the HRIS and \({\mathbf{H}}\) at the BS.
 
 
As shown in Figs. 12 and 13, all semi-blind receivers operating at the HRIS exhibit the same NMSE and SER performances (for both TSTC and KRSTC), with the HRIS offering higher accuracy in symbol and CE compared to the BS due to its positioning. Such results align with those reported in [22] in the CE scope (see Fig. 11 therein). Moreover, the spatial diversity introduced by analog combining at the HRIS also contributes to improved performance. However, from Fig. 12, comparing the pilot-aided CE [22] with all the data-aided semi-blind receivers, we observe that the former can be regarded as a lower bound for the proposed methods in terms of CE performance. This is because estimating the symbol matrix in addition to the channel introduces an additional challenge compared to the baseline pilot-assisted method, which is restricted to only estimating the channel \({\mathbf{G}}\) with full knowledge of \({\mathbf{X}}\) (pilots, in this case). Nevertheless, as we will see later, this performance difference at the HRIS will not be significant for certain receiver combinations (“HRIS-BS” pairs), meaning that the estimation at the BS will not be substantially affected. Furthermore, the semi-blind approach allows the UT to transmit data symbols instead of only pilot sequences within the same time-division structure. Indeed, adding a joint symbol and CE functionality at the HRIS can unlock new potential for RIS-aided wireless communication systems, which will be further discussed in Section 11. Regarding symbol estimation, Fig. 13 shows that all the receivers performed competitively for both coding schemes, corroborating our numerical results shown previously. These results represent a remarkable milestone in symbol estimation utilizing the hybrid architecture proposed by [3] using only two RF-chains out of \(N=32\) HRIS elements.
Still considering Figs. 12 and 13, let us now focus on the BS performance by considering pairs of “HRIS-BS” receivers. Recall all the possibilities of “HRIS-BS” receiver pairs/combinations by referring back to Table 4. Here, we do not consider receiver pairs using closed-form schemes at the BS for CL scenario 2 (i.e., KronF for both coding schemes) since we found unsatisfactory results. The adoption of such closed-form receivers at the BS will be discussed later, at the end of this section. To assist us in the discussion, Figs. 15 (a) and 15 (b) display, respectively, the average runtime (in seconds) and the number of iterations required for convergence of the iterative algorithms as a function of \(P_t\) for all receiver pairs. Additionally, Fig. 15 shows the evolution of the computational complexity with respect to the number of HRIS elements.
Regarding the estimation of the channel \({\mathbf{H}}\) (HRIS-BS channel), whose results are also exhibited in Fig. 12, all receiver pairs arising from the CL scenario 2 and all pairs from the CL scenario 1 using iterative BALS at the BS performed similarly and demonstrated improved estimation accuracy. Although a difference in accuracy was noted between pilot-aided and data-aided approaches when estimating \({\mathbf{G}}\) at the HRIS, the NMSE curves related to estimating \({\mathbf{H}}\) at the BS closely resemble the baseline pilot-aided method, particularly for KronF-H, KRF-H, and BALS-H (in both coding schemes). The accurate estimation previously obtained at the HRIS effectively narrowed the performance gap at the BS between the pilot-aided and the aforementioned data-aided semi-blind methods. Note that scenarios in which \(\hat{{\mathbf{X}}}\) and \(\hat{{\mathbf{G}}}\) are jointly conveyed via the CL (CL scenario 2) result in solutions with lower computational complexities and less restrictive design requirements, reaching the best results in terms of joint symbol and CE. However, this method requires more feedback associated with the additional conveyance of \(\hat{{\mathbf{X}}}\) estimated at the HRIS, especially when it has larger dimensions. Hence, when the choice is to save on feedback, i.e., to send only the estimated matrix \(\hat{{\mathbf{G}}}\), the BS must jointly estimate the channel and symbols. By inspecting the NMSE results for the individual channel \({\mathbf{H}}\) in Fig. 12, applying iterative receivers at the BS (CL scenario 1) implies just a 2 dB gap in performance, which is an encouraging finding. Recall that BALS receivers have the distinguishing feature of refining the channel and symbol estimates at each iteration, based only on the estimated channel \(\hat{{\mathbf{G}}}\) reported by the HRIS. 
In contrast, their overall complexity depends on the SNR, since the number of iterations required for convergence increases for lower transmit power levels, as indicated in Fig. 15 (b).
 
In addition, we plot the NMSE of the combined channel in Fig. 14, evaluated at the BS by adopting the Khatri-Rao structured matrix \(\boldsymbol{\Theta} = {\mathbf{G}}^{{\rm T}}\diamond{\mathbf{H}} \in \mathbb{C}^{LM \times N}\). We observe that the performance of the combined CE follows similar patterns to those obtained from the estimations of \({\mathbf{H}}\). This emphasizes the similarity in performance between the two groups of methods regarding the estimation at the BS: a first group that employs BALS-based iterative semi-blind receivers (from CL scenario 1, to estimate \({\mathbf{H}}\) and \({\mathbf{X}}\)), and a second group that estimates only \({\mathbf{H}}\) (the data-aided semi-blind approach from CL scenario 2, and the pilot-aided one). In the context of the estimation of \(\boldsymbol{\Theta}\), Fig. 14 reveals a clearer separation between such groups, previously observed only for the estimation of \({\mathbf{H}}\) in Fig. 12. Note that the performance gap between pilot and data-aided strategies was practically eliminated when we considered the combined channel estimation \(\boldsymbol{\Theta}\). In addition to the proposed methods previously discussed, we include in Fig. 14 the iterative trilinear ALS (TALS) semi-blind receiver proposed in [54], representing the baseline method for data-aided semi-blind CE in the context of PRIS-assisted communications. Recall that in the HRIS approach using the KRSTC scheme, the received signal at the BS differs from that of the PRIS one adopted in [54] only by introducing the factor \(\rho\), causing the HRIS to reflect a fraction of the impinging wave instead of its totality (\(\rho = 1\)). Although a small performance gap was observed between groups 1 and 2, the methods in group 2 demonstrated similar performance to that of the baseline TALS, highlighting their effectiveness in solving the problem of joint channel and symbol estimation semi-blindly from a data-aided CE viewpoint.
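To make the combined-channel metric concrete, the following sketch builds \(\boldsymbol{\Theta} = {\mathbf{G}}^{{\rm T}}\diamond{\mathbf{H}}\) with NumPy and evaluates an NMSE. The NMSE definition used here (squared Frobenius error normalized by the squared Frobenius norm of the true matrix) is the usual one and is an assumption on our part, as are the helper names and the unit-variance channels. The example also shows why \(\boldsymbol{\Theta}\) is insensitive to the column-wise scaling ambiguities that affect decoupled estimates of \({\mathbf{G}}\) and \({\mathbf{H}}\).

```python
import numpy as np

def khatri_rao(A, B):
    """Column-wise Kronecker product: column n equals kron(A[:, n], B[:, n])."""
    assert A.shape[1] == B.shape[1], "A and B need the same number of columns"
    return (A[:, None, :] * B[None, :, :]).reshape(-1, A.shape[1])

def nmse(est, ref):
    """Assumed NMSE definition: ||est - ref||_F^2 / ||ref||_F^2."""
    return np.linalg.norm(est - ref) ** 2 / np.linalg.norm(ref) ** 2

rng = np.random.default_rng(0)
M, N, L = 8, 32, 2

# Unit-variance Rayleigh channels (path loss omitted for brevity)
G = (rng.standard_normal((N, L)) + 1j * rng.standard_normal((N, L))) / np.sqrt(2)
H = (rng.standard_normal((M, N)) + 1j * rng.standard_normal((M, N))) / np.sqrt(2)

Theta = khatri_rao(G.T, H)  # LM x N combined channel

# Hypothetical per-element scaling ambiguity: G^T -> G^T diag(delta),
# H -> H diag(delta)^{-1}; it cancels column by column in Theta.
delta = rng.uniform(0.5, 2.0, N) * np.exp(1j * rng.uniform(0, 2 * np.pi, N))
G_amb = delta[:, None] * G
H_amb = H / delta[None, :]
Theta_amb = khatri_rao(G_amb.T, H_amb)

print("NMSE of Theta:", nmse(Theta_amb, Theta))  # numerically zero
print("NMSE of G:    ", nmse(G_amb, G))          # clearly nonzero
```

This is only meant to illustrate the structure of the metric; it does not reproduce the receivers or the curves of Fig. 14.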
To assess symbol estimation performance at the BS, Fig. 13 depicts the SER results of the receiver pairs discussed earlier, and those are compared to the symbol estimation provided by the TALS receiver [54]. The results show that “HRIS-BS” receiver pairs executing BALS (for both TSTC and KRSTC) at the BS perform similarly to the PRIS case using TALS. These results support our findings in Fig. 14 for the combined CE related to the receivers of group 2 and the baseline TALS in the PRIS approach. It is essential to highlight that the HRIS absorbs 10% of the incident signal’s energy. This is significant, as the joint symbol and channel estimation remains nearly unaffected compared to the PRIS case when employing iterative BALS receivers. Moreover, it is worth noting that the proposed receivers in the hybrid approach offer a scaling ambiguity-free separate CE while decentralizing the CE task, which was previously performed only at the BS in the passive approach. However, one should consider the trade-off between hardware complexity and power consumption when opting for the HRIS architecture.
 
 

Figure 15: Number of FLOPs vs. number of HRIS elements. (a) Average runtime of the receiver pairs (CL scenario 1) vs. transmit power. (b) Iterations to convergence for iterative methods vs. transmit power.
 
 
To provide simulation results under a fair comparison, we have assigned the minimum number of sub-frames that simultaneously meets the identifiability conditions for all matrix inverses present in the considered estimators. This choice led to the joint symbol and CE results presented in Figs. 12 and 13. In this part, we show that the use of semi-blind closed-form KronF receivers at the BS can be leveraged at the cost of additional time overhead.
From Table 3, note that \(K=64\) sub-frames are more than enough to meet the identifiability conditions of the proposed iterative BALS receivers. In contrast, the KronF ones operate tightly at their minimum identifiability boundary (\(K=RN\) for TSTC and \(K=LN\) for KRSTC). To complement our discussion, we present additional simulation results by setting \(K=128\), i.e., beyond the minimum \(K\) required for all the closed-form receivers at the BS (recall Table 3). We show in Figs. 16 and 17 the NMSE of the HRIS-BS channel (channel \({\mathbf{H}}\)) and SER at the BS, respectively, for all the receiver pairs that apply closed-form KronF receivers at the BS, i.e., BALS-KronF and KRF-KronF for KRSTC, and BALS-KronF and KronF-KronF for TSTC. We compare these pairs with the BALS-BALS pairs for both TSTC and KRSTC (which apply an iterative receiver at the BS).
We can see that such an increase of \(K\) was not sufficient to bring improved performance at the BS when the Khatri-Rao coding scheme and KronF are exploited (BALS-KronF and KRF-KronF for KRSTC). Therefore, we do not recommend using such a closed-form receiver at the BS when adopting the KRSTC transmission scheme. In contrast, KronF, using the TSTC scheme, achieves a significant performance improvement at the BS since the BALS-KronF and KronF-KronF pairs performed similarly to the iterative ones and the baseline methods. Note that the benefits provided by the KronF receiver under the TSTC come with the transmission of more sub-frames. Such findings were initially reported in [58]. It is worth mentioning that under the same number of UT antennas, the TSTC scheme can allocate more streams than the number \(L\) of transmit antennas, which is an interesting setup not possible with the KRSTC scheme.
Concerning the choice of iterative or closed-form receivers at the BS for joint symbol and CE in the CL scenario 1, let us recall that BALS-based receivers can operate under more flexible system setups compared to the closed-form ones. Conversely, KronF performs only a single matrix inverse followed by a rank-1 matrix approximation step, which is much less complex than BALS (for both coding schemes). As mentioned earlier, in the low transmit power regime, the ALS procedure may require several iterations to converge, thereby increasing the overall computational cost. Hence, the runtime of receiver pairs associated with KronF at the BS in Fig. 15 (a) is relatively low compared to those of BALS-based ones. However, it is worth mentioning that although the computational complexity of KronF is lower than that of BALS, it implies more restrictive system setups, as demonstrated in Table 3.
This illustrates the interesting tradeoffs offered by the proposed semi-blind HRIS-BS receiver pairs involving performance, complexity, and operating conditions.
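To give a feel for why the closed-form route is cheap, the rank-one approximation step mentioned above can be sketched with a generic Kronecker factorization: a vector of the form \(\mathbf{a} \otimes \mathbf{b}\) is reshaped into a rank-one matrix and its dominant singular pair is extracted. This is a textbook building block and not the KronF receiver itself; the function name, dimensions, and noise level are hypothetical.

```python
import numpy as np

def kron_factorize(y, P, Q):
    """Estimate a (length P) and b (length Q) from y ~ kron(a, b).

    Reshaping y row-wise gives the P x Q matrix a b^T, whose best rank-one
    approximation is obtained from the dominant singular triplet.
    """
    Y = y.reshape(P, Q)
    U, s, Vh = np.linalg.svd(Y, full_matrices=False)
    a_hat = s[0] * U[:, 0]
    b_hat = Vh[0, :]
    return a_hat, b_hat

rng = np.random.default_rng(2)
P, Q = 4, 6
a = rng.standard_normal(P) + 1j * rng.standard_normal(P)
b = rng.standard_normal(Q) + 1j * rng.standard_normal(Q)
y = np.kron(a, b)

# Noiseless case: the factors are recovered up to one complex scalar
a_hat, b_hat = kron_factorize(y, P, Q)
c = a_hat[0] / a[0]

# Noisy case: a single SVD, no iterations
noise = 1e-3 * (rng.standard_normal(P * Q) + 1j * rng.standard_normal(P * Q))
a_n, b_n = kron_factorize(y + noise, P, Q)
rel_err = np.linalg.norm(np.kron(a_n, b_n) - y) / np.linalg.norm(y)
print("relative reconstruction error:", rel_err)
```

The factors are identified only up to a common scalar (here `c`), which is the kind of ambiguity that the receiver design has to resolve through known entries or side information.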
In the following, we discuss a few examples of application scenarios and use cases that can potentially benefit from a joint channel and symbol estimation at the HRIS. Relying on direct estimation, uplink sensing was considered in [74] in a perceptive mobile network [75] employing joint communication and sensing, which involves the detection of UTs and environmental characteristics between them and remote radio units (RRUs). Therein, symbols are embedded into a sensing matrix, and compressed sensing is applied to estimate delay, Doppler, and angular parameters. The semi-blind approach is suitable for the mentioned joint communication and radar sensing scenario since it dispenses with pilot-based training and jointly recovers symbols and channel estimates in a one-way time protocol employing simple receiver algorithms. Furthermore, multiple HRISs can be distributed to serve as decentralized uplink sensing points while alleviating the processing load at the BS. It is worth pointing out that recent works have considered estimating channel parameters at a hybrid RIS; see, for example, [21], [23], [76], [77].
In [78], a multi-antenna UT conveys extra bits to the RIS controller via a CL while sending data symbols to the BS via the UT-RIS channel during the uplink transmission. The RIS controller uses such extra bits to apply an over-the-air beamforming technique to improve the transmission. This relies on the prerequisite that there is a CL between the UTs and the RIS. However, maintaining a CL with each active UT can result in a scalability problem since the RIS controller may need to support simultaneous connections with many UTs. This is another suitable application for the proposed semi-blind HRIS processing since control signals can be embedded directly into the data and sent over the UT-RIS link, thereby reducing or eliminating the dependency on multiple CLs between the HRIS and the UTs. More specifically, the transmitted signals may contain a payload containing both data and control symbols, i.e., the symbol matrix \({\mathbf{X}}\) can be partitioned into \({\mathbf{X}} = [{\mathbf{X}}_c, {\mathbf{X}}_d]\), where \({\mathbf{X}}_c\) contains control symbols to fulfill the mentioned purpose. In contrast, \({\mathbf{X}}_d\) contains data symbols. This way, our semi-blind approach eliminates the need for CLs between UTs and the RIS since \({\mathbf{X}}\) (or part of it) can be found at the HRIS. Otherwise stated, leveraging the information contained in \({\mathbf{X}}_c\) allows the HRIS to decode control signals in a stand-alone fashion directly.
Another application that can potentially benefit from a joint symbol and channel estimation at the HRIS is vehicular communications. For instance, consider a scenario with multiple roadside RISs placed to serve high-mobility vehicles, as shown in [79]. In [79], assuming that the RIS-BS channel is static, the time-varying UT-RIS channel can be estimated/predicted at the RIS in a decentralized manner, i.e., without the assistance of the BS, minimizing the usage of the CL and avoiding feedback delays and outdated beamforming optimization as a consequence of high UT mobility. To this end, the RIS controller transmits the pilots sent by the UT (vehicle) to the BS during the uplink transmission protocol. Bringing this problem to our proposed semi-blind approach, we foresee new use cases allowing UTs to directly share valuable information with their serving HRIS (and the adjacent ones) by embedding control data such as position, speed, and handover commands into the symbol matrix, which can be decoded at each HRIS and exchanged between them in a decentralized way without requiring feedback exchanges with their serving BSs. For example, such control data can include speed/position [80], following a similar perspective to that used in active road safety [81] applications in vehicular networking.
Finally, we can also envisage a useful scenario where the BS sends control data to (re)configure the HRIS autonomously. In that case, the symbol matrix to be estimated/decoded at the HRIS contains control commands for HRIS configuration purposes or any other relevant network signaling information. This scenario corresponds to an over-the-air HRIS reconfiguration or standalone operation without using the control link.
This paper proposed semi-blind joint channel and symbol estimation solutions for a hybrid simultaneous reflecting and sensing RIS. Adopting a tensor modeling approach, we revealed the tensor structures of the transmitted signals and the received signals at the HRIS and BS as combinations of PARAFAC and Tucker models, from which novel semi-blind receiver pairs for combined HRIS-BS processing are derived. The proposed tensor-based receivers provide data-aided estimations of the involved channels at both the HRIS and the BS without an a priori pilot transmission stage, reducing the symbol decoding delay and improving the data rate. We derived both iterative and closed-form algorithms for joint channel and symbol estimation. We also studied identifiability conditions for guaranteed channel and symbol recovery for each semi-blind receiver pair, revealing the competitive performances of the proposed solutions in comparison with reference methods. Extensive simulation results showcased the performance trends and tradeoffs for the different HRIS-BS receiver pairs. Despite their higher computational complexity, receiver pairs using iterative BALS at the BS offer better estimation accuracy compared to the closed-form (KronF) ones, regardless of the receiver chosen at the HRIS. On the other hand, we have demonstrated that KronF can achieve performance comparable to that of iterative methods and further reduce computational complexity at the expense of additional overhead. Our discussion also illuminates the opportunities and use cases arising from empowering HRIS with symbol detection capability. Our numerical results also clarified the impacts of power splitting and tensor coding on channel estimation accuracy and symbol error rates for HRIS-assisted communications. These insights are pivotal for optimizing the system performance in future HRIS deployments. 
Perspectives include extending the proposed semi-blind receivers to multi-user scenarios and studying alternative tensor-based estimation algorithms.
For the HRIS, presuming the design of \(\mathcal{T}_{\boldsymbol{\Phi}}\) and \(\mathcal{W}\) ensures \(\mathrm{dim}[\mathtt{R}({\mathbf{F}}_{\mathrm{x}_1}) \cap \cdots \cap \mathtt{R}({\mathbf{F}}_{\mathrm{x}_K})]=0\), where \(\mathtt{R}({\mathbf{F}}_{\mathrm{x}_k})\) denotes the row-space of \({\mathbf{F}}_{\mathrm{x}_k}\), \({\mathbf{F}}_\mathrm{x}\) has full column-rank if the sum of the ranks of all blocks [82],[83] \({\mathbf{F}}_{\mathrm{x}_1},\!\cdots\!,{\mathbf{F}}_{\mathrm{x}_K}\) is no less than \(R\) when considering the TSTC scheme, i.e., \[\begin{align} \mathrm{rank}\bigl({\mathbf{F}}_\mathrm{x}\bigr) =& \mathrm{min}\{\textstyle \sum_{k=1}^K{\mathrm{rank}({\mathbf{F}}_{\mathrm{x}_k})},R\}\nonumber\\ =& \mathrm{min}\{K\zeta_\mathrm{x},R\},\label{rankfx} \end{align}\tag{42}\] which in turn requires \(K\zeta_\mathrm{x} \geq R\). Assuming \(N \gg N_c\), we can determine the upper bound of \(\zeta_\mathrm{x}\) by exploiting the well-known property \(\mathrm{rank}({\mathbf{ABC}}) \leq \mathrm{min}\{\mathrm{rank}({\mathbf{A}}),\mathrm{rank}({\mathbf{B}}),\mathrm{rank}({\mathbf{C}})\}\). In this way, \(\zeta_\mathrm{x} \leq \mathrm{min}\{\mathrm{rank}(\boldsymbol{\Phi}_k), \mathrm{rank}({\mathbf{G}}), \mathrm{rank}({\mathbf{W}}_k)\}\), or \(\zeta_\mathrm{x} \leq \mathrm{min}\{N_c,\kappa_\mathrm{g},\mathrm{min}\{L,R\}\}\). Since \(\kappa_\mathrm{g} \leq L\), the upper-bound inequality can be simplified by replacing \(\mathrm{min}\{L,R\}\) with the number of data streams that the UT encodes through TSTC, i.e., \[\label{lemmax1} \zeta_\mathrm{x} \leq \mathrm{min}\{N_c,\kappa_\mathrm{g},R\}.\tag{43}\] When KRSTC is employed, recall that \({\mathbf{W}}_k\) is replaced by \({\rm diag\!}\left\{ \mathit{\boldsymbol{\lambda}}_k \right\}\). 
This way, inequality (43) becomes \(\zeta_\mathrm{x} \leq \mathrm{min}\{N_c,\kappa_\mathrm{g}\}\), and following the same reasoning, (42) turns into \(\mathrm{rank}\bigl({\mathbf{F}}_\mathrm{x}\bigr) = \mathrm{min}\{K\zeta_\mathrm{x},L\}\), requiring \(K\zeta_\mathrm{x} \geq L\).
For the BS, assuming the design of \(\boldsymbol{\Psi}\) and \(\mathcal{W}\) guarantees \(\mathrm{dim}[\mathtt{R}({\mathbf{H}}{\mathbf{E}}_{\mathrm{x}_1}) \cap \cdots \cap \mathtt{R}({\mathbf{H}}{\mathbf{E}}_{\mathrm{x}_K})]=0\), for \(({\mathbf{I}}_K \otimes{\mathbf{H}}){\mathbf{E}}_\mathrm{x}\) to have full column rank, the sum of the ranks of all blocks must be at least \(R\) in the TSTC or, equivalently, \[\begin{align} \label{rankex} \mathrm{rank}\bigl(({\mathbf{I}}_K \otimes{\mathbf{H}}){\mathbf{E}}_\mathrm{x}\bigr) =& \mathrm{min}\{\textstyle \sum_{k=1}^K{\mathrm{rank}({\mathbf{H}}{\mathbf{E}}_{\mathrm{x}_k})},R\}\nonumber\\ =& \mathrm{min}\{K\xi_\mathrm{x},R\}. \end{align}\tag{44}\] Therefore, \(K\xi_\mathrm{x} \geq R\), and the upper bound of \(\xi_\mathrm{x}\) is obtained as \(\xi_\mathrm{x} \leq \mathrm{min}\{\mathrm{rank}({\mathbf{H}}), \mathrm{rank}({\rm diag\!}\left\{ \mathit{\boldsymbol{\psi}}_k \right\}),\mathrm{rank}({\mathbf{G}}),\) \(\mathrm{rank}({\mathbf{W}}_k)\}\), or simply \[\label{lemmax2} \xi_\mathrm{x} \leq \mathrm{min}\{\kappa_\mathrm{h},\kappa_\mathrm{g},R\}.\tag{45}\] Likewise, \(\mathrm{rank}\bigl(({\mathbf{I}}_K \otimes{\mathbf{H}}){\mathbf{E}}_\mathrm{x}\bigr) = \mathrm{min}\{K\xi_\mathrm{x},L\}\) for KRSTC, implying \(K\xi_\mathrm{x} \geq L\) and \(\xi_\mathrm{x} \leq \mathrm{min}\{\kappa_\mathrm{h},\kappa_\mathrm{g}\}\).
Using the property \(\mathrm{rank}({\mathbf{AB}}) \leq \mathrm{min}\{\mathrm{rank}({\mathbf{A}}),\mathrm{rank}({\mathbf{B}})\}\), we have \[\mathrm{rank}\bigl(\bar{{\mathbf{F}}}_\mathrm{g}\bigr) \leq \mathrm{min}\{\mathrm{rank}({\mathbf{I}}_{K} \otimes{\mathbf{X}}^{{\rm T}}\otimes{\mathbf{I}}_{N_c}),\mathrm{rank}({\mathbf{F}}_\mathrm{g})\}.\] Since \({\mathbf{F}}_\mathrm{g}\) depends only on \(\mathcal{T}_{\boldsymbol{\Phi}}\) and \(\mathcal{W}\) (or \({\mathbf{\Lambda}}\)), we assume it is designed to have full column rank, which implies \(KRN_c \geq LN\) for TSTC, and \(KN_c \geq N\) for KRSTC. Setting \(\mathrm{rank}({\mathbf{F}}_\mathrm{g}) = LN\) and applying the property \(\mathrm{rank}({\mathbf{A}} \otimes{\mathbf{B}}) = \mathrm{rank}({\mathbf{A}})\mathrm{rank}({\mathbf{B}})\), we obtain \[\label{lemmag1} \mathrm{rank}\bigl(\bar{{\mathbf{F}}}_\mathrm{g}\bigr) \leq \mathrm{min}\{KN_c\kappa_\mathrm{x},LN\}.\tag{46}\]
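The Kronecker rank property used in this step is easy to check numerically. The short sketch below verifies \(\mathrm{rank}({\mathbf{I}}_{K} \otimes{\mathbf{X}}^{{\rm T}}\otimes{\mathbf{I}}_{N_c}) = KN_c\kappa_\mathrm{x}\) for a random symbol matrix and then evaluates the bound in (46) for the default simulation parameters. The small dimensions and the random \({\mathbf{X}}\) are illustrative choices of ours, not the actual 64-QAM symbol matrix.

```python
import numpy as np

rng = np.random.default_rng(1)
K, N_c, R, T = 3, 2, 2, 4  # small illustrative sizes

# Random R x T matrix standing in for the symbol matrix X
X = rng.standard_normal((R, T)) + 1j * rng.standard_normal((R, T))
kappa_x = int(np.linalg.matrix_rank(X))

# I_K (x) X^T (x) I_{N_c}
big = np.kron(np.kron(np.eye(K), X.T), np.eye(N_c))
rank_big = int(np.linalg.matrix_rank(big))
print(rank_big, "==", K * N_c * kappa_x)

def rank_bound(K, N_c, kappa_x, L, N):
    """Right-hand side of (46): min{K N_c kappa_x, L N}."""
    return min(K * N_c * kappa_x, L * N)

# Default setup of the simulations: K = 64, N_c = 2, L = 2, N = 32
bound_default = rank_bound(K=64, N_c=2, kappa_x=2, L=2, N=32)
print("bound for default parameters:", bound_default)
```

For the default parameters, the bound equals \(LN = 64\), so (46) does not preclude full column rank when \(\kappa_\mathrm{x} = 2\).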
For both TSTC and KRSTC transmission schemes, we can express the rank of the \(k\)-th block of the column-wise stacking \({\mathbf{E}}_\mathrm{h}({\mathbf{I}}_K \otimes{\mathbf{X}})\) as \[\label{lemmah2} \mathrm{rank}({\mathbf{E}}_{\mathrm{h}_k}{\mathbf{X}}) = \xi_\mathrm{h} \leq \mathrm{min}\{\kappa_\mathrm{g},\kappa_\mathrm{x}\}.\tag{47}\] Following [82], [83], to ensure that \({\mathbf{E}}_\mathrm{h}({\mathbf{I}}_K \otimes{\mathbf{X}})\) has full row-rank, the sum of the ranks of all blocks \({\mathbf{E}}_{\mathrm{h}_k}{\mathbf{X}}\) must be no less than \(N\). On the assumption that the design of \(\mathcal{W}\) and \(\boldsymbol{\Psi}\) enforces \(\mathrm{dim}[\mathtt{C}({\mathbf{E}}_{\mathrm{h}_1}{\mathbf{X}}) \cap \cdots \cap \mathtt{C}({\mathbf{E}}_{\mathrm{h}_K}{\mathbf{X}})]=0\), where \(\mathtt{C}({\mathbf{E}}_{\mathrm{h}_k}{\mathbf{X}})\) denotes the column-space of \({\mathbf{E}}_{\mathrm{h}_k}{\mathbf{X}}\), we have \[\begin{align} \label{lemmah3} \mathrm{rank}\bigl({\mathbf{E}}_\mathrm{h}({\mathbf{I}}_K \otimes{\mathbf{X}})\bigr) =& \mathrm{min}\Bigl\{N, \textstyle \sum_{k=1}^K{\mathrm{rank}({\mathbf{E}}_{\mathrm{h}_k}{\mathbf{X}})}\Bigr\}\nonumber\\ =& \mathrm{min}\{N, K\xi_\mathrm{h}\}. \end{align}\tag{48}\] Therefore, \(K\xi_\mathrm{h} \geq N\).
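The conditions collected in this appendix translate into lower bounds on the number of sub-frames \(K\). As a worked example, the sketch below evaluates them under the assumption that every rank attains its upper bound in (43), (45), and (47), which we expect for generic full-rank channels and symbols but do not prove. The function and key names are ours, and the receiver-specific requirements of Table 3 (e.g., for KronF) are not included.

```python
def ceil_div(a, b):
    """Integer ceiling of a / b for positive integers."""
    return -(-a // b)

def min_subframes(N, N_c, L, R, kappa_g, kappa_h, kappa_x, scheme="TSTC"):
    """Smallest K satisfying each appendix condition, ranks at their bounds."""
    if scheme not in ("TSTC", "KRSTC"):
        raise ValueError("scheme must be 'TSTC' or 'KRSTC'")
    tstc = scheme == "TSTC"
    streams = R if tstc else L
    zeta_x = min(N_c, kappa_g, R) if tstc else min(N_c, kappa_g)
    xi_x = min(kappa_h, kappa_g, R) if tstc else min(kappa_h, kappa_g)
    xi_h = min(kappa_g, kappa_x)
    return {
        "symbols at HRIS: K*zeta_x >= streams": ceil_div(streams, zeta_x),
        "symbols at BS: K*xi_x >= streams": ceil_div(streams, xi_x),
        "F_g full column rank": (ceil_div(L * N, R * N_c) if tstc
                                 else ceil_div(N, N_c)),
        "channel H at BS: K*xi_h >= N": ceil_div(N, xi_h),
    }

# Default setup {M, N, N_c, L, R} = {8, 32, 2, 2, 2} with full-rank factors
params = dict(N=32, N_c=2, L=2, R=2, kappa_g=2, kappa_h=8, kappa_x=2)
for scheme in ("TSTC", "KRSTC"):
    bounds = min_subframes(scheme=scheme, **params)
    print(scheme, bounds, "-> K >=", max(bounds.values()))
```

For the default setup, both schemes yield \(K \geq 16\) from these conditions alone, which the adopted \(K = 64\) satisfies with margin.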
We jointly design the sensing and reflecting phase shifts by adapting a procedure proposed in [28] while designing the tensor coding separately. We adopt an index vector \(\tau_i^J \triangleq [(i-1)J+1,(i-1)J+2,\cdots,iJ] \in {\mathbb{Z}^\ast_{+}}^{J \times 1}\) for \(i=1,\cdots,I\) to denote the \(i\)-th block of an \(IJ\)-dimensional column vector, in which each block has length \(J\). Consider a \(KN_c\)-dimensional DFT matrix \({\mathbf{D}} = [\mathit{\boldsymbol{d}}_1,\cdots,\mathit{\boldsymbol{d}}_{KN_c}]\). By sampling \({\mathbf{D}}\), the 3-mode fibers of \(\mathcal{T}_{\boldsymbol{\Phi}}\) and the columns of \(\boldsymbol{\Psi}\) are filled by, respectively, \(\boldsymbol{\Phi}_{n_cn\cdot} = \mathit{\boldsymbol{d}}_n(\tau_{n_c}^K) \in \mathbb{C}^{K \times 1}\) and \(\boldsymbol{\Psi}_{\cdot n} = \mathit{\boldsymbol{d}}_{(n-1)N_c+1}(\tau_1^K) \in \mathbb{C}^{K \times 1}\), for \(n_c=1,\cdots,N_c\) and \(n=1,\cdots,N\), where the constraint \(KN_c \geq N\) is assumed. This yields respectively the following equivalent constructions for \(\mathcal{T}_{\boldsymbol{\Phi}}\) and \(\boldsymbol{\Psi}\), defined in Section 3: \[\bigl[\mathcal{T}_{\boldsymbol{\Phi}}\bigr]_{(3)} \!=\!\! \left[\begin{array}{*{7}{c@{}}} d_{1,1}&\!\cdots\!&d_{(N_c\!-\!1)K\!+\!1,1}&\cdots&d_{1,N}&\!\cdots\!&d_{(N_c\!-\!1)K\!+\!1,N}\\ \vdots&\!\cdots\!&\vdots&\!\cdots\!&\vdots&\!\cdots\!&\vdots\\ d_{K,1}&\!\cdots\!&d_{N_cK,1}&\cdots&d_{K,N}&\!\cdots\!&d_{N_cK,N} \end{array}\right]\!\!,\] \[\boldsymbol{\Psi} = \begin{bmatrix} d_{1,1}&\!\cdots\!&d_{1,(N-1)N_c+1}\\ \vdots&\!\cdots\!&\vdots\\ d_{K,1}&\!\cdots\!&d_{K,(N-1)N_c+1} \end{bmatrix}.\] To design the coding tensor \(\mathcal{W}\) (TSTC), we first construct the matrix \(\boldsymbol{\Upsilon} \in \mathbb{R}^{K \times RL}\) by truncating a \(K\)-dimensional Hadamard matrix to its first \(RL\) columns, such that \(\boldsymbol{\Upsilon} = \left[ \mathcal{W} \right]_{(3)}\). 
Then, we get \(\mathcal{W}\) by tensorizing \(\boldsymbol{\Upsilon}\) or, simply, by setting \({\mathbf{W}}_k = \mathrm{unvec}_{L \times R}\{\boldsymbol{\Upsilon}_{k\cdot}^{{\rm T}}\}\), \(k=1,\cdots,K\). For the KRSTC scheme, the coding matrix \(\boldsymbol{\Lambda}\) is designed as a truncated Hadamard matrix, where \(K \geq L\). This design can avoid the generalized inverses mentioned in Section 8 by replacing them with matrix multiplications through simplified expressions. For KRSTC, this is achieved when \(K \geq LN\), and for TSTC, when \(K \geq RLN\). Herein, however, we prioritize choosing the minimum number of sub-frames required to ensure joint symbol and CE uniqueness for all semi-blind receivers.
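The construction described in this appendix can be sketched in a few lines of NumPy. Several details below are assumptions on our part rather than statements of the original design: the DFT matrix is taken unnormalized with a negative exponent, the Hadamard matrix is generated by the Sylvester construction (which requires \(K\) to be a power of two), \(\mathrm{unvec}\) is taken as column-major, and the array layouts `Phi[n_c, n, k]` and `W[k]` are storage choices made here for illustration.

```python
import numpy as np

def sylvester_hadamard(K):
    """K x K Hadamard matrix via the Sylvester construction (K a power of 2)."""
    assert K >= 1 and (K & (K - 1)) == 0, "K must be a power of two"
    Hm = np.array([[1.0]])
    while Hm.shape[0] < K:
        Hm = np.block([[Hm, Hm], [Hm, -Hm]])
    return Hm

N, N_c, L, R, K = 32, 2, 2, 2, 64

# KN_c-dimensional DFT matrix D (sign and scaling conventions assumed)
P = K * N_c
idx = np.arange(P)
D = np.exp(-2j * np.pi * np.outer(idx, idx) / P)

# Sensing phase shifts: fiber Phi[n_c, n, :] is the n_c-th length-K block
# of the n-th column of D
Phi = np.empty((N_c, N, K), dtype=complex)
for nc in range(N_c):
    for n in range(N):
        Phi[nc, n, :] = D[nc * K:(nc + 1) * K, n]

# Reflecting phase shifts: first block of columns 1, N_c+1, ..., (N-1)N_c+1
Psi = D[:K, 0:N * N_c:N_c]

# TSTC coding tensor: truncate a Hadamard matrix, then unvec each row
Upsilon = sylvester_hadamard(K)[:, :R * L]
W = np.stack([Upsilon[k].reshape(L, R, order="F") for k in range(K)])

# KRSTC coding matrix: truncated Hadamard matrix with K >= L
Lam = sylvester_hadamard(K)[:, :L]

print(Phi.shape, Psi.shape, W.shape, Lam.shape)
```

Every entry of `Phi` and `Psi` has unit modulus, as expected for phase shifts, and the columns of `Upsilon` are mutually orthogonal, which is the property that lets the generalized inverses simplify into matrix multiplications when \(K\) is large enough.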
This work is partially supported by the National Institute of Science and Technology (INCT-Signals), sponsored by Brazil’s National Council for Scientific and Technological Development (CNPq) under grant 406517/2022-3, and CAPES/Brazil. A. L. F. de Almeida is partially supported by CNPq under grant 312491/2020-4.↩︎
The cascaded (RIS-assisted, concatenated, composite, combined, or compound) channel comprises the joint effect of both UT-RIS and RIS-BS channels. Sometimes, the cascaded CE is achieved through its decoupled version, whose estimated matrices are affected by scaling ambiguities, as shown in [26] and [27]. This necessitates complementary techniques to acquire the knowledge required for scaling removal.↩︎
This architecture has been referred to in the literature using different terminologies, such as hybrid semi-passive RIS, sensing RIS, receiving RIS, or simply hybrid RIS. Such an architecture should not be confused with the one considered in this paper, as will be clear later.↩︎
Although we adopt a single user with \(L\) antennas, the expressions derived in this work can be adapted for a scenario with \(L\) single-antenna users.↩︎
Although the primary focus of this paper is on uplink communication from the UT to BS, the results obtained can also be applied to downlink communication in the opposite direction by leveraging uplink-downlink channel reciprocity and simply reversing the roles of the transmitter and receiver.↩︎
We adapted the signal model of [22] to the single-user case. In this case, we design \({\mathbf{X}}\) as a truncated discrete Fourier transform (DFT) matrix. Since HRIS optimization is out of the scope of our work, and to keep the fair comparison, we do not leverage the optimization procedure proposed in [22].↩︎