Article

Strong Secrecy on a Class of Degraded Broadcast Channels Using Polar Codes

by Jaume Del Olmo Alos * and Javier Rodríguez Fonollosa
Departament de Teoria del Senyal i Comunicacions (TSC), Universitat Politècnica de Catalunya, Barcelona 08034, Spain
* Author to whom correspondence should be addressed.
Submission received: 15 May 2018 / Revised: 8 June 2018 / Accepted: 12 June 2018 / Published: 15 June 2018

Abstract: Asymptotically secrecy-capacity-achieving polar coding schemes are proposed for the memoryless degraded broadcast channel under different reliability and secrecy requirements: layered decoding or layered secrecy. In these settings, the transmitter wishes to send multiple messages to a set of legitimate receivers while keeping them masked from a set of eavesdroppers. The layered decoding structure requires receivers with better channel quality to reliably decode more messages, while the layered secrecy structure requires eavesdroppers with worse channel quality to be kept ignorant of more messages. Practical constructions for the proposed polar coding schemes are discussed and their performance evaluated by means of simulations.

1. Introduction

Information-theoretic security over noisy channels was introduced by Wyner in [1], which characterized the (secrecy-)capacity of the degraded wiretap channel. Later, Csiszár and Körner in [2] generalized Wyner's results to the general wiretap channel. In these settings, one transmitter wishes to reliably send one message to a legitimate receiver while keeping it secret from an eavesdropper, where secrecy is defined in terms of a fully quantifiable information-theoretic measure. One of these measures is the information leakage, defined as the mutual information $I(W;Z^n)$ between a uniformly-distributed random message $W$ and the channel observations $Z^n$ at the eavesdropper, $n$ being the number of uses of the channel. Based on this measure, the most common secrecy conditions required to be satisfied by channel codes are weak secrecy, which requires $\lim_{n\to\infty}\frac{1}{n}I(W;Z^n)=0$, and strong secrecy, which requires $\lim_{n\to\infty}I(W;Z^n)=0$. Although the second notion of security is stronger, surprisingly, both secrecy conditions result in the same secrecy-capacity region [3].
In the last decade, information-theoretic security has been extended to a large variety of contexts, and this paper focuses on two different classes of discrete memoryless Degraded Broadcast Channels (DBC) surveyed in [4]: (a) with Non-Layered Decoding and Layered Secrecy (DBC-NLD-LS) and (b) with Layered Decoding and Non-Layered Secrecy (DBC-LD-NLS). In these models, the transmitter wishes to send a set of messages through the DBC, and each message must be reliably decoded by a particular set of receivers and kept masked from a particular set of eavesdroppers. The degradedness condition of the channel implies that the individual channels can be ordered based on the quality of their received signals. The layered decoding structure requires receivers with better channel quality to reliably decode more messages, while the layered secrecy structure requires eavesdroppers with worse channel quality to be kept ignorant of more messages.
The capacity region of these models was first characterized in [4,5,6]. However, the achievable schemes used by these works rely on random coding arguments that are nonconstructive in practice. In this sense, the purpose of this paper is to provide coding schemes based on polar codes, which were originally proposed by Arikan [7] to achieve the capacity of binary-input, symmetric, point-to-point channels under Successive Cancellation (SC) decoding. Capacity-achieving polar codes for the binary symmetric degraded wiretap channel were introduced in [8,9], satisfying the weak and the strong secrecy condition, respectively. Recently, polar coding has been extended to the general wiretap channel in [10,11,12,13]. Indeed, [12,13] generalize their results by providing polar coding schemes for the broadcast channel with confidential messages, and [11] also proposes polar coding strategies to achieve the best-known inner bounds on the secrecy-capacity region of some multi-user settings.
Although recent literature has proven the existence of different secrecy-capacity achieving polar coding schemes for multi-user scenarios (for instance, see [11,12,13,14,15,16,17,18]), polar codes for the two models on which this paper is focused have, as far as we know, not been analyzed yet. As mentioned in [4], these settings capture practical scenarios in wireless systems, in which channels can be ordered based on the quality of the received signals (for example, Gaussian channels are degraded). Hence, the ultimate goal of this work is not only to prove the existence of two asymptotic secrecy-capacity achieving polar coding schemes for these models under the strong secrecy condition, but also to discuss their practical construction and evaluate their performance for a finite blocklength by means of simulations.

1.1. Relation to Prior Work

A good overview of the similarities and differences between the polar codes proposed in [10,11,12,13] for the general wiretap channel can be found in [13] (Figure 1). The polar coding schemes proposed in this paper are based mainly on those introduced by [13] because of the following reasons:
  • To provide strong secrecy. Despite both weak and strong secrecy conditions resulting in the same secrecy-capacity region, the weak secrecy requirement in practical applications can result in important system vulnerabilities [19] (Section 3.3).
  • To provide polar coding schemes that are implementable in practice. Notice in [13] (Figure 1) that the coding scheme presented in [10] relies on a construction for which no efficient code is presently known. Moreover, the polar coding scheme in [12] relies on the existence, through averaging, of certain deterministic mappings for the encoding/decoding process.
As in [13], our polar coding schemes are totally explicit. However, to provide strong secrecy and reliability simultaneously, the transmitter and the legitimate receivers need to share a secret key of negligible size in terms of rate, and the distribution induced by the encoder must be close in terms of statistical distance to the original one considered for the code construction. Moreover, we adapt the deterministic SC encoder of [20] to our channel models, and we show that it can perform well in practice. As concluded in [20], this deterministic SC encoder will avoid the need to draw large sequences according to specific distributions at the encoder, which can be useful in communication systems requiring low complexity at the transmitter.
In [13] (Remark 3), the authors highlight the connection between polar code constructions and random binning proofs that allows them to apply their designs to different problems in network information theory. Nevertheless, in our polar coding schemes, the chaining construction used in [13] is not needed because of the degradedness condition of the channels, and consequently, we can introduce small changes in the design in order to make our proposed coding schemes more practical. In this sense, we assume that a source of common randomness is accessible to all parties, which allows the transmitter to send secret information in just one block of size n by only using a secret key with negligible size in terms of rate. Despite this common randomness being available to the eavesdroppers, no information will be leaked about the messages. Moreover, if we consider a communication system requiring transmissions over several blocks of size n, the same realization of this source of common randomness can be used at each block without compromising the strong secrecy condition.

1.2. Overview of Novel Contributions

The main novelties of this paper can be summarized as follows:
  • Scenario. This paper focuses on two different models of the DBC with an arbitrary number of legitimate receivers and an arbitrary number of eavesdroppers for which polar codes have not yet been proposed. These two models arise very commonly in wireless communications.
  • Existence of the polar coding schemes. We prove the existence for sufficiently large n of two secrecy-capacity achieving polar coding schemes under the strong secrecy condition.
  • Practical implementation. We provide polar codes that are implementable in real communication systems, and we discuss further how to construct them in practice. Although the construction of polar codes has been covered in a large number of references (for instance, see [21,22,23]), as far as we know they focus only on polar code constructions under reliability constraints.
  • Performance evaluation. Simulation results are provided in order to evaluate the reliability and secrecy performance of the polar coding schemes. The performance is evaluated according to different design parameters of the practical code construction. As far as we know, this paper is the first to evaluate the secrecy performance in terms of strong secrecy, which is done by upper-bounding the information leakage at the eavesdroppers.

1.3. Notation

Throughout this paper, let $[n] \triangleq \{1,\dots,n\}$ for $n\in\mathbb{Z}^+$, and let $a^n$ denote a row vector $(a(1),\dots,a(n))$. We write $a^{1:j}$ for $j\in[n]$ to denote the subvector $(a(1),\dots,a(j))$. Let $\mathcal{A}\subseteq[n]$; then we write $a[\mathcal{A}]$ to denote the sequence $\{a(j)\}_{j\in\mathcal{A}}$, and we use $\mathcal{A}^C$ to denote the set complement with respect to the universal set $[n]$, that is, $\mathcal{A}^C \triangleq [n]\setminus\mathcal{A}$. If $\mathcal{A}$ denotes an event, then $\mathcal{A}^C$ also denotes its complement. We use $\ln$ to denote the natural logarithm, whereas $\log$ denotes the logarithm base two. Let $X$ be a random variable taking values in $\mathcal{X}$, and let $q_X$ and $p_X$ be two different distributions with support $\mathcal{X}$; then $D(q_X,p_X)$ and $\mathbb{V}(q_X,p_X)$ denote the Kullback-Leibler divergence and the total variation distance, respectively. Finally, $h_2(p)$ denotes the binary entropy function, i.e., $h_2(p) \triangleq -p\log p - (1-p)\log(1-p)$, and we define the indicator function $\mathbb{1}\{u\}$ such that it equals one if the predicate $u$ is true and zero otherwise.

1.4. Organization

The remainder of this paper is organized as follows. In Section 2, the channel models DBC-NLD-LS and DBC-LD-NLS are introduced formally, and their secrecy-capacity regions are characterized. In Section 3, the fundamental theorems of polar codes are revisited. In Section 4 and Section 5, two polar coding schemes are proposed for the DBC-NLD-LS and DBC-LD-NLS, respectively, and we prove that both are asymptotically secrecy-capacity achieving. In Section 6, practical polar code constructions are discussed for both models, and the performance of the polar codes is evaluated by means of simulations. Finally, the concluding remarks are presented in Section 7.

2. System Model and Secrecy-Capacity Region

Formally, a DBC $(\mathcal{X}, p_{Y_K\cdots Y_1Z_M\cdots Z_1|X}, \mathcal{Y}_K\times\cdots\times\mathcal{Y}_1\times\mathcal{Z}_M\times\cdots\times\mathcal{Z}_1)$ with $K$ legitimate receivers and $M$ eavesdroppers is characterized by the probability transition function $p_{Y_K\cdots Y_1Z_M\cdots Z_1|X}$, where $X\in\mathcal{X}$ denotes the channel input, $Y_k\in\mathcal{Y}_k$ denotes the channel output corresponding to the legitimate receiver $k\in[1,K]$ and $Z_m\in\mathcal{Z}_m$ denotes the channel output corresponding to the eavesdropper $m\in[1,M]$. The broadcast channel is assumed to gradually degrade in such a way that each legitimate receiver has a better channel than any eavesdropper, that is,
$$X \to Y_K \to \cdots \to Y_1 \to Z_M \to \cdots \to Z_1 \tag{1}$$
forms a Markov chain. Although we consider physical degradation, the polar coding schemes proposed in this paper are also suitable for stochastically degraded channels (see Remark 2).

2.1. Degraded Broadcast Channel with Non-Layered Decoding and Layered Secrecy

In this model (see Figure 1), the transmitter wishes to send $M$ messages $\{W_m\}_{m=1}^M$ to the $K$ legitimate receivers. The non-layered decoding structure requires the legitimate receiver $k\in[1,K]$ to reliably decode all $M$ messages, and the layered secrecy structure requires the eavesdropper $m\in[1,M]$ to be kept ignorant of the messages $\{W_i\}_{i=m}^M$. Consider a $(2^{nR_1},\dots,2^{nR_M},n)$ code for the DBC-NLD-LS, where $W_m\in[2^{nR_m}]$ for any $m\in[1,M]$. The reliability condition to be satisfied by this code is measured in terms of the average probability of error at each legitimate receiver and is given by
$$\lim_{n\to\infty}\mathbb{P}\big[(\hat{W}_1,\dots,\hat{W}_M)\neq(W_1,\dots,W_M)\big]=0, \quad \text{for any legitimate receiver } k\in[1,K]. \tag{2}$$
On the other hand, the strong secrecy condition to be satisfied by the code is measured in terms of the information leakage at each eavesdropper and is given by
$$\lim_{n\to\infty}I(W_m,W_{m+1},\dots,W_M;Z_m^n)=0, \quad \text{for the eavesdropper } m\in[1,M]. \tag{3}$$
A tuple of rates $(R_1,\dots,R_M)\in\mathbb{R}_+^M$ is achievable for the DBC-NLD-LS if there exists a sequence of $(2^{nR_1},\dots,2^{nR_M},n)$ codes satisfying Equations (2) and (3).
Proposition 1
(Adapted from [4,5]). The achievable region of the DBC-NLD-LS is the union of all $M$-tuples of rates $(R_1,\dots,R_M)\in\mathbb{R}_+^M$ satisfying the following inequalities,
$$\sum_{i=m}^{M} R_i \le I(X;Y_1) - I(X;Z_m), \quad m=1,\dots,M,$$
where the union is taken over all distributions $p_X$.
The proof for the case of only one legitimate receiver in the context of the fading wiretap channel is provided in [5], where the information-theoretic achievable scheme is based on embedded coding, stochastic encoding and rate sharing. Due to the degradedness condition of Equation (1), by applying the data processing inequality and Fano's inequality, an achievable scheme ensuring the reliability condition in Equation (2) for the legitimate Receiver 1 will satisfy it for any legitimate receiver $k\in[2,K]$.
Corollary 1.
The achievable subregion of the DBC-NLD-LS without considering rate sharing is an $M$-orthotope defined by the closure of all $M$-tuples of rates $(R_1,\dots,R_M)\in\mathbb{R}_+^M$ satisfying:
$$R_m \le I(X;Z_{m+1}) - I(X;Z_m), \quad m=1,\dots,M-1,$$
$$R_M \le I(X;Y_1) - I(X;Z_M).$$

2.2. Degraded Broadcast Channel with Layered Decoding and Non-Layered Secrecy

In this model (see Figure 2), the transmitter wishes to send $K$ messages $\{W_\ell\}_{\ell=1}^K$ to the $K$ legitimate receivers. The layered decoding structure requires the legitimate receiver $k\in[1,K]$ to reliably decode the messages $\{W_\ell\}_{\ell=1}^k$, and the non-layered secrecy structure requires the eavesdropper $m\in[1,M]$ to be kept ignorant of all $K$ messages. Consider a $(2^{nR_1},\dots,2^{nR_K},n)$ code for the DBC-LD-NLS, where $W_\ell\in[2^{nR_\ell}]$ for any $\ell\in[1,K]$. The reliability condition to be satisfied by this code is
$$\lim_{n\to\infty}\mathbb{P}\big[(\hat{W}_1,\dots,\hat{W}_{k-1},\hat{W}_k)\neq(W_1,\dots,W_{k-1},W_k)\big]=0, \quad \text{for the legitimate receiver } k\in[1,K], \tag{4}$$
and the strong secrecy condition is given by
$$\lim_{n\to\infty}I(W_1,\dots,W_K;Z_m^n)=0, \quad \text{for any eavesdropper } m\in[1,M]. \tag{5}$$
A tuple of rates $(R_1,\dots,R_K)\in\mathbb{R}_+^K$ is achievable for the DBC-LD-NLS if there exists a sequence of $(2^{nR_1},\dots,2^{nR_K},n)$ codes satisfying Equations (4) and (5).
Proposition 2
(Adapted from [4,6]). The achievable region of the DBC-LD-NLS is the union of all $K$-tuples of rates $(R_1,\dots,R_K)\in\mathbb{R}_+^K$ satisfying the following inequalities,
$$\sum_{\ell=1}^{k} R_\ell \le \sum_{\ell=1}^{k} I(V_\ell;Y_\ell|V_{\ell-1}) - I(V_k;Z_M), \quad k=1,\dots,K,$$
where $V_0 \triangleq \emptyset$ and $V_K \triangleq X$, and the union is taken over all distributions $p_{V_1\cdots V_K}$ such that $V_1 \to V_2 \to \cdots \to V_K$ forms a Markov chain.
The proof for the case of only one eavesdropper is provided in [6], where the information-theoretic achievable scheme is based on superposition coding, stochastic encoding and rate sharing. Due to the degradedness condition of Equation (1), note that any achievable scheme ensuring the strong secrecy condition in Equation (5) for the eavesdropper $M$ will also satisfy it for any eavesdropper $m\in[1,M-1]$.
Corollary 2.
The achievable subregion of the DBC-LD-NLS without considering rate sharing is a $K$-orthotope defined by the closure of all $K$-tuples of rates $(R_1,\dots,R_K)\in\mathbb{R}_+^K$ satisfying:
$$R_\ell \le I(V_\ell;Y_\ell|V_{\ell-1}) - I(V_\ell;Z_M|V_{\ell-1}), \quad \ell=1,\dots,K.$$

3. Review of Polar Codes

Let $(\mathcal{X}\times\mathcal{Y}, p_{XY})$ be a Discrete Memoryless Source (DMS), where $\mathcal{X}\triangleq\{0,1\}$ (see Endnote [24], which refers to References [25,26]) and $Y\in\mathcal{Y}$. The polar transform over the $n$-sequence $X^n$, $n$ being any power of two, is defined as $U^n \triangleq X^nG_n$, where $G_n \triangleq \left[\begin{smallmatrix}1 & 0\\ 1 & 1\end{smallmatrix}\right]^{\otimes\log n}$ is the source polarization matrix [27]. Since $G_n = G_n^{-1}$, then $X^n = U^nG_n$.
The polarization theorem for source coding with side information [27] (Theorem 1) states that the polar transform extracts the randomness of $X^n$ in the sense that, as $n\to\infty$, the set of indices $j\in[n]$ can be divided practically into two disjoint sets, namely $\mathcal{H}_{X|Y}^{(n)}$ and $\mathcal{L}_{X|Y}^{(n)}$, such that $U(j)$ for $j\in\mathcal{H}_{X|Y}^{(n)}$ is practically independent of $(U^{1:j-1},Y^n)$ and uniformly distributed, i.e., $H(U(j)|U^{1:j-1},Y^n)\to 1$, and $U(j)$ for $j\in\mathcal{L}_{X|Y}^{(n)}$ is almost determined by $(U^{1:j-1},Y^n)$, i.e., $H(U(j)|U^{1:j-1},Y^n)\to 0$. Formally, let
$$\mathcal{H}_{X|Y}^{(n)} \triangleq \big\{j\in[n]: H(U(j)|U^{1:j-1},Y^n) \ge 1-\delta_n\big\}, \qquad \mathcal{L}_{X|Y}^{(n)} \triangleq \big\{j\in[n]: H(U(j)|U^{1:j-1},Y^n) \le \delta_n\big\},$$
where $\delta_n \triangleq 2^{-n^\beta}$ for some $\beta\in(0,\frac{1}{2})$. Then, by [27] (Theorem 1), we have $\lim_{n\to\infty}\frac{1}{n}|\mathcal{H}_{X|Y}^{(n)}| = H(X|Y)$ and $\lim_{n\to\infty}\frac{1}{n}|\mathcal{L}_{X|Y}^{(n)}| = 1-H(X|Y)$, which imply that $\lim_{n\to\infty}\frac{1}{n}|(\mathcal{H}_{X|Y}^{(n)})^C\cap(\mathcal{L}_{X|Y}^{(n)})^C| = 0$, i.e., the number of elements that have not been polarized is asymptotically negligible in terms of rate. Furthermore, [27] (Theorem 2) states that, given $U[(\mathcal{L}_{X|Y}^{(n)})^C]$ and $Y^n$, the sequence $U[\mathcal{L}_{X|Y}^{(n)}]$ can be reconstructed using SC decoding with error probability in $O(n\delta_n)$. Alternatively, the previous sets can be defined based on the Bhattacharyya parameters $\{Z(U(j)|U^{1:j-1},Y^n)\}_{j=1}^n$ because both parameters polarize simultaneously [27] (Proposition 2). It is worth mentioning that both the entropy terms and the Bhattacharyya parameters required to define these sets can be obtained deterministically from $p_{XY}$ and the algebraic properties of $G_n$ [21,22,23].
Similarly to $\mathcal{H}_{X|Y}^{(n)}$ and $\mathcal{L}_{X|Y}^{(n)}$, the sets $\mathcal{H}_X^{(n)}$ and $\mathcal{L}_X^{(n)}$ can be defined by considering that the observations $Y^n$ are absent. A discrete memoryless channel $(\mathcal{X}, p_{Y|X}, \mathcal{Y})$ with some arbitrary $p_X$ can be seen as a DMS $(\mathcal{X}\times\mathcal{Y}, p_Xp_{Y|X})$. In channel polar coding, first, we define $\mathcal{H}_{X|Y}^{(n)}$, $\mathcal{L}_{X|Y}^{(n)}$, $\mathcal{H}_X^{(n)}$ and $\mathcal{L}_X^{(n)}$ from the target distribution $p_Xp_{Y|X}$ (polar construction). Then, based on the previous sets, the encoder somehow constructs $\tilde{U}^n$ and applies the inverse polar transform $\tilde{X}^n = \tilde{U}^nG_n$, with distribution $\tilde{q}_{X^n}$ (since the polar-based encoder will construct random variables that must approach the target distribution of the DMS, throughout this paper we use a tilde above the random variables to emphasize this purpose). Afterwards, the transmitter sends $\tilde{X}^n$ over the channel, which induces $\tilde{Y}^n \sim \tilde{q}_{Y^n}$. If $\mathbb{V}(\tilde{q}_{X^nY^n}, p_{X^nY^n}) \to 0$, then the receiver can reliably reconstruct $\tilde{U}[\mathcal{L}_{X|Y}^{(n)}]$ from $\tilde{Y}^n$ and $\tilde{U}[(\mathcal{L}_{X|Y}^{(n)})^C]$ by using SC decoding [28].
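Since $G_n$ is the $\log n$-fold Kronecker power of a $2\times 2$ kernel, the transform $X^n = U^nG_n$ can be computed in $O(n\log n)$ binary operations with a butterfly structure. The following Python sketch is only an illustration of this transform (it assumes the natural index ordering, without any bit-reversal permutation, which would merely relabel the indices) and checks the self-inverse property $G_n = G_n^{-1}$ noted above.

```python
import numpy as np

def polar_transform(u: np.ndarray) -> np.ndarray:
    """Apply x = u G_n over GF(2), with G_n a Kronecker power of the 2x2
    polarization kernel. Since G_n is its own inverse, the same function
    maps x back to u."""
    x = u.copy() % 2
    n = len(x)
    assert n & (n - 1) == 0, "blocklength must be a power of two"
    step = 1
    while step < n:
        for i in range(0, n, 2 * step):
            # butterfly: XOR each sub-block into the one preceding it
            x[i:i + step] = (x[i:i + step] + x[i + step:i + 2 * step]) % 2
        step *= 2
    return x

u = np.random.randint(0, 2, 8)
x = polar_transform(u)
assert np.array_equal(polar_transform(x), u)  # self-inverse: G_n = G_n^{-1}
```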
To conclude this part, the following lemma provides a useful property of polar codes for the DBC.
Lemma 1
(Subset property, adapted from [14] (Lemma 4)). Let $(X,Y_2,Y_1)$ be random variables such that $X \to Y_2 \to Y_1$ forms a Markov chain. Then, the following property holds for the polar transform $U^n = X^nG_n$,
$$H\big(U(j)\big|U^{1:j-1}\big) \ge H\big(U(j)\big|U^{1:j-1},Y_1^n\big) \ge H\big(U(j)\big|U^{1:j-1},Y_2^n\big) \quad \forall j\in[n],$$
which implies $\mathcal{L}_X^{(n)} \subseteq \mathcal{L}_{X|Y_1}^{(n)} \subseteq \mathcal{L}_{X|Y_2}^{(n)}$ and $\mathcal{H}_{X|Y_2}^{(n)} \subseteq \mathcal{H}_{X|Y_1}^{(n)} \subseteq \mathcal{H}_X^{(n)}$.
Remark 1.
The subset property also holds if the sets are defined based on the Bhattacharyya parameters because, under the previous Markov chain condition, $Z(U(j)|U^{1:j-1}) \ge Z(U(j)|U^{1:j-1},Y_1^n) \ge Z(U(j)|U^{1:j-1},Y_2^n)$.
Remark 2.
According to [14] (Lemma 4), the subset property also holds if the channels are stochastically degraded. Therefore, since the construction of the polar codes proposed in the following sections relies essentially on Lemma 1, the polar coding schemes are suitable for both physically- and stochastically-degraded channels.

4. Polar Coding Scheme For the DBC-NLD-LS

The polar coding scheme provided in this section is designed to achieve the supremum of the achievable rates given in Corollary 1 (secrecy-capacity without rate sharing). Thus, consider the DMS $(\mathcal{X}\times\mathcal{Y}_K\times\cdots\times\mathcal{Y}_1\times\mathcal{Z}_M\times\cdots\times\mathcal{Z}_1, p_{XY_K\cdots Y_1Z_M\cdots Z_1})$ that represents the input and output random variables involved in the achievable subregion of Corollary 1, where $\mathcal{X} = \{0,1\}$. Let $(X^n, Y_K^n,\dots,Y_1^n, Z_M^n,\dots,Z_1^n)$ be an i.i.d. $n$-sequence of this source. We define the polar transform $U^n \triangleq X^nG_n$, whose distribution is $p_{U^n}(u^n) = p_{X^n}(u^nG_n)$ (due to the invertibility of $G_n$), and we write
$$p_{U^n}(u^n) \triangleq \prod_{j=1}^{n} p_{U(j)|U^{1:j-1}}\big(u(j)\big|u^{1:j-1}\big). \tag{6}$$

4.1. Polar Code Construction

Let $\delta_n \triangleq 2^{-n^\beta}$, where $\beta\in(0,\frac{1}{2})$. Based on $p_{XY_K\cdots Y_1Z_M\cdots Z_1}$, we define:
$$\mathcal{H}_X^{(n)} \triangleq \big\{j\in[n]: H(U(j)|U^{1:j-1}) \ge 1-\delta_n\big\}, \tag{7}$$
$$\mathcal{L}_X^{(n)} \triangleq \big\{j\in[n]: H(U(j)|U^{1:j-1}) \le \delta_n\big\}, \tag{8}$$
$$\mathcal{L}_{X|Y_k}^{(n)} \triangleq \big\{j\in[n]: H(U(j)|U^{1:j-1},Y_k^n) \le \delta_n\big\}, \quad k=1,\dots,K, \tag{9}$$
$$\mathcal{H}_{X|Y_k}^{(n)} \triangleq \big\{j\in[n]: H(U(j)|U^{1:j-1},Y_k^n) \ge 1-\delta_n\big\}, \quad k=1,\dots,K, \tag{10}$$
$$\mathcal{H}_{X|Z_m}^{(n)} \triangleq \big\{j\in[n]: H(U(j)|U^{1:j-1},Z_m^n) \ge 1-\delta_n\big\}, \quad m=1,\dots,M. \tag{11}$$
Then, based on the previous sets, we define the following partition of the universal set [ n ] ,
$$\mathcal{I}_M^{(n)} \triangleq \mathcal{H}_{X|Z_M}^{(n)} \cap \big(\mathcal{H}_{X|Y_1}^{(n)}\big)^C, \tag{12}$$
$$\mathcal{I}_m^{(n)} \triangleq \mathcal{H}_{X|Z_m}^{(n)} \cap \big(\mathcal{H}_{X|Z_{m+1}}^{(n)}\big)^C, \quad m=1,\dots,M-1, \tag{13}$$
$$\mathcal{F}^{(n)} \triangleq \mathcal{H}_{X|Y_1}^{(n)}, \tag{14}$$
$$\mathcal{C}^{(n)} \triangleq \mathcal{H}_X^{(n)} \cap \big(\mathcal{H}_{X|Z_1}^{(n)}\big)^C, \tag{15}$$
$$\mathcal{T}^{(n)} \triangleq \big(\mathcal{H}_X^{(n)}\big)^C, \tag{16}$$
which is graphically represented in Figure 3. Roughly speaking, in order to ensure reliability and strong secrecy, the distribution of $\tilde{U}^n$ after the encoding process must be close in terms of statistical distance to the distribution given in Equation (6) corresponding to the original DMS. Hence, the elements $U(j)$ such that $j\in\mathcal{H}_X^{(n)}$ will be suitable for storing uniformly-distributed random sequences. On the other hand, $U[\mathcal{T}^{(n)}]$ will not, and the elements $U(j)$ such that $j\in\mathcal{T}^{(n)}$ will be constructed somehow from $U^{1:j-1}$ and the distribution $p_{U(j)|U^{1:j-1}}$. The set $\mathcal{I}_m^{(n)}$ ($m\in[1,M]$) belongs to $\mathcal{H}_{X|Z_m}^{(n)}$, and by Lemma 1, we have $\mathcal{H}_{X|Z_m}^{(n)} \subseteq \mathcal{H}_{X|Z_{m'}}^{(n)}$ for any $m' < m$. Thus, $U[\mathcal{I}_m^{(n)}]$ will be suitable for storing information to be secured from Eavesdroppers 1–$m$. Since $\mathcal{C}^{(n)} \subseteq (\mathcal{H}_{X|Z_m}^{(n)})^C$ for any $m\in[1,M]$, the sequence $U[\mathcal{C}^{(n)}]$ cannot contain information to be secured from any eavesdropper, and it will be used to store the local randomness [8] required to confuse the eavesdroppers (the local randomness in polar codes plays the same role as the stochastic encoding used in [1,2]). According to [27] (Theorem 2), the legitimate Receiver 1 will be able to reliably infer $U[\mathcal{L}_{X|Y_1}^{(n)}]$ given $Y_1^n$ and $U[(\mathcal{L}_{X|Y_1}^{(n)})^C]$. Hence, if the polar coding scheme somehow makes the entries $U(j)$ such that $j$ belongs to $\mathcal{F}^{(n)}$ or to $(\mathcal{H}_{X|Y_1}^{(n)})^C\cap(\mathcal{L}_{X|Y_1}^{(n)})^C$ (hatched areas in Figure 3) available to the legitimate Receiver 1, this receiver will be able to reliably infer the entire sequence $U^n$. In this sense, $U[\mathcal{F}^{(n)}]$ will be used to store the uniformly-distributed random sequence provided by a source of common randomness that will be available to all parties. Since $\mathcal{F}^{(n)} \subseteq \mathcal{H}_{X|Z_m}^{(n)}$ for any $m\in[1,M]$, the eavesdroppers' knowledge of $U[\mathcal{F}^{(n)}]$ will not compromise the strong secrecy condition. On the other hand, $U[(\mathcal{H}_{X|Y_1}^{(n)})^C\cap(\mathcal{L}_{X|Y_1}^{(n)})^C]$ will contain secret information or elements that cannot be known directly by all the eavesdroppers. Therefore, the transmitter somehow will secretly send it to the legitimate receivers. Nevertheless, as will be seen, this additional transmission will incur an asymptotically negligible rate penalty. Finally, by Lemma 1, we have $(\mathcal{L}_{X|Y_k}^{(n)})^C \subseteq (\mathcal{L}_{X|Y_1}^{(n)})^C$ for any $k>1$. Hence, given $U[(\mathcal{L}_{X|Y_1}^{(n)})^C]$, all the legitimate receivers will be able to reliably infer the entire sequence $U^n$ from their own channel observations.
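As an illustration of how the partition in Equations (12)–(16) can be assembled once the polarized sets are known, consider the following sketch. It assumes the high-entropy sets have already been computed (from entropy terms or Bhattacharyya parameters, as discussed in Section 6) and are given as boolean membership masks; the mask names are ours, not the paper's notation.

```python
import numpy as np

def partition_nld_ls(H_X, H_X_Y1, H_X_Z):
    """Partition of [n] in Eqs. (12)-(16). Arguments are length-n boolean
    masks: H_X for H_X^(n), H_X_Y1 for H_{X|Y1}^(n), and H_X_Z[m-1] for
    H_{X|Zm}^(n), ordered from the worst eavesdropper (m=1) to the best (m=M).
    By Lemma 1 the masks are nested, so the classes below are disjoint."""
    M = len(H_X_Z)
    parts = {"T": ~H_X,                       # Eq. (16): not almost uniform
             "C": H_X & ~H_X_Z[0],            # Eq. (15): local randomness
             "F": H_X_Y1}                     # Eq. (14): common randomness
    for m in range(1, M):                     # Eq. (13): I_m, m = 1, ..., M-1
        parts[f"I_{m}"] = H_X_Z[m - 1] & ~H_X_Z[m]
    parts[f"I_{M}"] = H_X_Z[M - 1] & ~H_X_Y1  # Eq. (12)
    return parts
```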
Remark 3.
The goal of the polar code construction is to obtain the entropy terms $\{H(U(j)|U^{1:j-1})\}_{j=1}^n$, $\{H(U(j)|U^{1:j-1},Y_1^n)\}_{j=1}^n$ and $\{H(U(j)|U^{1:j-1},Z_m^n)\}_{j=1}^n$ for all $m\in[1,M]$ required to define the sets in Equations (7)–(11) and, consequently, to obtain the partition of $[n]$ given in Equations (12)–(16). In Section 6, we discuss further how to construct polar codes under both reliability and secrecy constraints.

4.2. Polar Encoding

The polarization-based encoder aims to construct the sequence $\tilde{U}^n$ and, consequently, $\tilde{X}^n = \tilde{U}^nG_n$. Let $W_m$ for all $m\in[1,M]$ and $C$ be uniformly-distributed random vectors of size $|\mathcal{I}_m^{(n)}|$ and $|\mathcal{C}^{(n)}|$, respectively, where $C$ represents the local randomness required to confuse the eavesdroppers, and recall that $W_m$ represents the message $m$ that is intended for all legitimate receivers. Let $F$ be a given uniformly-distributed random $|\mathcal{F}^{(n)}|$-sequence, which represents the source of common randomness that is available to all parties. The encoder constructs the sequence $\tilde{u}^n$ as follows. Consider the realizations $w_m$ for all $m\in[1,M]$, $c$ and $f$, whose elements have been indexed by the sets of indices $\mathcal{I}_m^{(n)}$, $\mathcal{C}^{(n)}$ and $\mathcal{F}^{(n)}$, respectively. The encoder draws $\tilde{u}^n$ from the distribution
$$\tilde{q}_{U(j)|U^{1:j-1}}\big(\tilde{u}(j)\big|\tilde{u}^{1:j-1}\big) \triangleq \begin{cases} \mathbb{1}\{\tilde{u}(j)=w_m(j)\} & \text{if } j\in\mathcal{I}_m^{(n)},\ m=1,\dots,M, \\ \mathbb{1}\{\tilde{u}(j)=c(j)\} & \text{if } j\in\mathcal{C}^{(n)}, \\ \mathbb{1}\{\tilde{u}(j)=f(j)\} & \text{if } j\in\mathcal{F}^{(n)}, \\ p_{U(j)|U^{1:j-1}}\big(\tilde{u}(j)\big|\tilde{u}^{1:j-1}\big) & \text{if } j\in(\mathcal{H}_X^{(n)})^C\cap(\mathcal{L}_X^{(n)})^C, \\ \mathbb{1}\{\tilde{u}(j)=\xi^{(j)}(\tilde{u}^{1:j-1})\} & \text{if } j\in\mathcal{L}_X^{(n)}, \end{cases} \tag{17}$$
where
$$\xi^{(j)}(\tilde{u}^{1:j-1}) \triangleq \arg\max_{u\in\mathcal{X}} p_{U(j)|U^{1:j-1}}\big(u\big|\tilde{u}^{1:j-1}\big), \tag{18}$$
$p_{U(j)|U^{1:j-1}}$ being the distribution induced by the original DMS. Note that $\mathcal{T}^{(n)} = \big((\mathcal{H}_X^{(n)})^C\cap(\mathcal{L}_X^{(n)})^C\big)\cup\mathcal{L}_X^{(n)}$, and according to Equation (17), $\tilde{U}[\mathcal{L}_X^{(n)}]$ is constructed deterministically by adapting the SC encoding algorithm in [20], while $\tilde{U}[(\mathcal{H}_X^{(n)})^C\cap(\mathcal{L}_X^{(n)})^C]$ is constructed randomly. By [27] (Theorem 1), the amount of randomness required for SC encoding will be asymptotically negligible in terms of rate. Then, the encoder computes $\tilde{X}^n = \tilde{U}^nG_n$ and transmits it over the DBC, inducing $(\tilde{Y}_K^n,\dots,\tilde{Y}_1^n,\tilde{Z}_M^n,\dots,\tilde{Z}_1^n)$.
Finally, besides the sequence X ˜ n , the encoder outputs the following additional secret sequence,
$$\Phi \triangleq \tilde{U}\big[(\mathcal{H}_{X|Y_1}^{(n)})^C\cap(\mathcal{L}_{X|Y_1}^{(n)})^C\big]. \tag{19}$$
This sequence Φ must be additionally transmitted to all legitimate receivers keeping it masked from the eavesdroppers. To do so, the transmitter can perform a modulo-two addition between Φ and a uniformly-distributed secret key that is privately shared with the legitimate receivers and somehow additionally send it to them. Nevertheless, by [27] (Theorem 1), we know that this additional transmission is asymptotically negligible in terms of rate.
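The masking of $\Phi$ with the shared secret key is a one-time pad; a minimal sketch of this step (hypothetical helper, assuming both arguments are binary arrays of equal length):

```python
def mask_phi(phi: np.ndarray, key: np.ndarray) -> np.ndarray:
    """Modulo-two addition of Phi with a uniformly-distributed secret key;
    the legitimate receivers recover Phi by XORing with the same key."""
    return (phi + key) % 2
```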
Remark 4.
The additional secret sequence $\Phi$ can be divided into two parts: $\tilde{U}[\mathcal{H}_X^{(n)}\cap(\mathcal{H}_{X|Y_1}^{(n)})^C\cap(\mathcal{L}_{X|Y_1}^{(n)})^C]$, which will be uniformly distributed according to Equation (17), and the remaining part, which will not. The transmitter could make the uniformly-distributed part available to the legitimate receivers by using a chaining structure such as the one presented in [9]. However, such a scheme requires the transmission to take place over several blocks of size $n$. Moreover, it requires a large memory capacity at either the transmitter or the legitimate receivers, which can make the polar coding scheme impractical in communication systems.

4.3. Polar Decoding

Before the decoding process, consider that the realization of the source of common randomness F is available to all parties and the sequence Φ has been successfully received by the legitimate receivers.
The legitimate receiver $k\in[1,K]$ forms an estimate $\hat{U}^n$ of the sequence $\tilde{U}^n$ as follows. Given that $\Phi$ and $F$ are available, notice that it knows $\tilde{U}[(\mathcal{L}_{X|Y_1}^{(n)})^C]$. Moreover, by Lemma 1, $(\mathcal{L}_{X|Y_k}^{(n)})^C \subseteq (\mathcal{L}_{X|Y_1}^{(n)})^C$ for any $k>1$. Thus, the $k$-th legitimate receiver performs SC decoding for source coding with side information [27] to reconstruct $\tilde{U}^n$ from $\tilde{U}[(\mathcal{L}_{X|Y_1}^{(n)})^C]$ and its channel output observations $\tilde{Y}_k^n$. In Section 4.5.3, we show formally that the reliability condition in Equation (2) is satisfied at each legitimate receiver $k\in[1,K]$.
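A sketch of the corresponding SC decoder, mirroring the encoder above; p_next_obs(prefix, j) is again a hypothetical helper, here returning $p_{U(j)|U^{1:j-1},Y_k^n}(1\,|\,\cdot)$ evaluated from the receiver's channel observations:

```python
def sc_decode(n, known_mask, known_bits, p_next_obs):
    """SC decoding with side information [27]: indices in (L_{X|Y1}^(n))^C
    are known from Phi and F (known_mask/known_bits); the rest are inferred."""
    u_hat = np.zeros(n, dtype=int)
    for j in range(n):
        if known_mask[j]:
            u_hat[j] = known_bits[j]           # supplied by Phi and F
        else:
            u_hat[j] = int(p_next_obs(u_hat[:j], j) > 0.5)
    return u_hat
```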

4.4. Information Leakage

Besides the observations $\tilde{Z}_m^n$, the eavesdropper $m\in[1,M]$ has access to the common randomness $F = \tilde{U}[\mathcal{F}^{(n)}]$. Thus, the information about the messages $\{W_i\}_{i=m}^M$ leaked to this eavesdropper is
$$I\big(W_m,\dots,W_M; F,\tilde{Z}_m^n\big) = I\Big(\tilde{U}\Big[\textstyle\bigcup_{i=m}^M\mathcal{I}_i^{(n)}\Big]; \tilde{U}[\mathcal{F}^{(n)}],\tilde{Z}_m^n\Big). \tag{20}$$
In Section 4.5.4, we prove that $(W_m,W_{m+1},\dots,W_M)$ is asymptotically statistically independent of $(F,\tilde{Z}_m^n)$.

4.5. Performance of the Polar Coding Scheme

The analysis of the polar coding scheme described previously leads to the following theorem.
Theorem 1.
Consider an arbitrary DBC $(\mathcal{X}, p_{Y_K\cdots Y_1Z_M\cdots Z_1|X}, \mathcal{Y}_K\times\cdots\times\mathcal{Y}_1\times\mathcal{Z}_M\times\cdots\times\mathcal{Z}_1)$ such that $\mathcal{X}\triangleq\{0,1\}$ and $p_{Y_K\cdots Y_1Z_M\cdots Z_1|X}$ satisfies the Markov chain condition $X\to Y_K\to\cdots\to Y_1\to Z_M\to\cdots\to Z_1$. The polar coding scheme described in Section 4.1, Section 4.2, Section 4.3 and Section 4.4 achieves any rate tuple of the region defined in Corollary 1, satisfying the reliability and strong secrecy conditions given in Equations (2) and (3), respectively.
Corollary 3.
Since $\tilde{U}[\mathcal{I}_m^{(n)}]$ for some $m\in[1,M]$ can contain information to be secured from Eavesdroppers 1–$m$, the polar coding scheme described in Section 4.1, Section 4.2, Section 4.3 and Section 4.4 can achieve the entire region considering rate sharing of Proposition 1 by storing part of any message $W_{m'}$ such that $m' < m$ into $\tilde{U}[\mathcal{I}_m^{(n)}]$ instead of part of $W_m$.
Corollary 4.
If we consider a communication scenario requiring transmissions over several blocks of size n, the same realization of the source of common randomness F that is known by all parties could be used at each block, and the reliability and the strong secrecy conditions would still be ensured.
The proof of Theorem 1 follows in four steps with similar reasoning as in [13] and is provided in Section 4.5.1, Section 4.5.2, Section 4.5.3 and Section 4.5.4. The proof of Corollary 3 is immediate, and the proof of Corollary 4 is provided in Section 4.5.5.

4.5.1. Transmission Rates

In this step, we prove that the polar coding scheme approaches the corner point of the subregion defined in Corollary 1. For any $m\in[1,M-1]$, the rate $R_m$ corresponding to the message $W_m$ satisfies
$$\lim_{n\to\infty} R_m = \lim_{n\to\infty}\frac{1}{n}\big|\mathcal{I}_m^{(n)}\big| \overset{(a)}{=} \lim_{n\to\infty}\frac{1}{n}\Big|\mathcal{H}_{X|Z_m}^{(n)}\cap\big(\mathcal{H}_{X|Z_{m+1}}^{(n)}\big)^C\Big| \overset{(b)}{=} \lim_{n\to\infty}\frac{1}{n}\Big(\big|\mathcal{H}_{X|Z_m}^{(n)}\big| - \big|\mathcal{H}_{X|Z_{m+1}}^{(n)}\big|\Big) \overset{(c)}{=} I(X;Z_{m+1}) - I(X;Z_m),$$
where $(a)$ follows from the definition of the set $\mathcal{I}_m^{(n)}$ in Equation (13), $(b)$ holds because, by Lemma 1, $\mathcal{H}_{X|Z_{m+1}}^{(n)} \subseteq \mathcal{H}_{X|Z_m}^{(n)}$, and $(c)$ follows from [27] (Theorem 1). Similarly, according to Equation (12), we obtain
$$\lim_{n\to\infty} R_M = \lim_{n\to\infty}\frac{1}{n}\big|\mathcal{I}_M^{(n)}\big| = \lim_{n\to\infty}\frac{1}{n}\Big|\mathcal{H}_{X|Z_M}^{(n)}\cap\big(\mathcal{H}_{X|Y_1}^{(n)}\big)^C\Big| = I(X;Y_1) - I(X;Z_M).$$

4.5.2. Distribution of the DMS after the Polar Encoding

Let $\tilde{q}_{U^n}$ be the distribution of $\tilde{U}^n$ after the encoding in Section 4.2. The following lemma shows that $\tilde{q}_{U^n}$ and the distribution $p_{U^n}$ in Equation (6) of the original DMS are nearly statistically indistinguishable for sufficiently large $n$ and, consequently, so are the overall distributions $\tilde{q}_{X^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n}$ and $p_{X^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n}$.
Lemma 2.
Let $\delta_n = 2^{-n^\beta}$ for some $\beta\in(0,\frac{1}{2})$. Then,
$$\mathbb{V}\big(\tilde{q}_{U^n}, p_{U^n}\big) \le \delta_{\mathrm{nld\text{-}ls}}^{(n)}, \qquad \mathbb{V}\big(\tilde{q}_{X^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n}, p_{X^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n}\big) = \mathbb{V}\big(\tilde{q}_{U^n}, p_{U^n}\big) \le \delta_{\mathrm{nld\text{-}ls}}^{(n)},$$
where $\delta_{\mathrm{nld\text{-}ls}}^{(n)} \triangleq n\sqrt{4n\delta_n\ln 2}\,\big(2n - \log(2n\delta_n\ln 2)\big) + \delta_n + \sqrt{2n\delta_n\ln 2}$.
Proof. 
See Appendix A, setting L = 1 . □
Remark 5.
The first term of $\delta_{\mathrm{nld\text{-}ls}}^{(n)}$ bounds the impact on the total variation distance of using the deterministic SC encoding in Equation (18) for the entries $\tilde{U}[\mathcal{L}_X^{(n)}]$, while the second term bounds the impact of storing uniformly-distributed random sequences (messages, local randomness and common randomness) into the entries $\tilde{U}[\mathcal{H}_X^{(n)}]$.
As will be seen in the following subsections, an encoding process satisfying Lemma 2 is crucial for the reliability and the secrecy performance of the polar code.

4.5.3. Reliability Performance

Consider the probability of incorrectly decoding all messages $\{W_m\}_{m=1}^M$ at the legitimate receiver $k\in[1,K]$. Let $\tilde{q}_{X^nY_k^n}$ and $p_{X^nY_k^n}$ be the marginal distributions of $\tilde{q}_{X^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n}$ and $p_{X^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n}$, respectively. Consider an optimal coupling [29] (Proposition 4.7) between $\tilde{q}_{X^nY_k^n}$ and $p_{X^nY_k^n}$ such that
$$\mathbb{P}\big[\mathcal{E}_{X^nY_k^n}\big] = \mathbb{V}\big(\tilde{q}_{X^nY_k^n}, p_{X^nY_k^n}\big),$$
where $\mathcal{E}_{X^nY_k^n} \triangleq \{(\tilde{X}^n,\tilde{Y}_k^n)\neq(X^n,Y_k^n)\}$ or, equivalently, $\mathcal{E}_{X^nY_k^n} \triangleq \{(\tilde{U}^n,\tilde{Y}_k^n)\neq(U^n,Y_k^n)\}$ because of the invertibility of $G_n$. Thus, for the legitimate receiver $k\in[1,K]$, we obtain
$$\begin{aligned} \mathbb{P}\big[(\hat{W}_1,\dots,\hat{W}_M)\neq(W_1,\dots,W_M)\big] &\le \mathbb{P}\big[\hat{U}^n\neq\tilde{U}^n\big] \\ &= \mathbb{P}\big[\hat{U}^n\neq\tilde{U}^n\big|\mathcal{E}_{X^nY_k^n}^C\big]\,\mathbb{P}\big[\mathcal{E}_{X^nY_k^n}^C\big] + \mathbb{P}\big[\hat{U}^n\neq\tilde{U}^n\big|\mathcal{E}_{X^nY_k^n}\big]\,\mathbb{P}\big[\mathcal{E}_{X^nY_k^n}\big] \\ &\le \mathbb{P}\big[\hat{U}^n\neq\tilde{U}^n\big|\mathcal{E}_{X^nY_k^n}^C\big] + \mathbb{P}\big[\mathcal{E}_{X^nY_k^n}\big] \\ &\overset{(a)}{\le} \sum_{j\in\mathcal{L}_{X|Y_1}^{(n)}} Z\big(U(j)\big|U^{1:j-1},Y_k^n\big) + \mathbb{P}\big[\mathcal{E}_{X^nY_k^n}\big] \\ &\overset{(b)}{\le} n\sqrt{\delta_n} + \mathbb{P}\big[\mathcal{E}_{X^nY_k^n}\big] \overset{(c)}{\le} n\sqrt{\delta_n} + \delta_{\mathrm{nld\text{-}ls}}^{(n)}, \end{aligned}\tag{21}$$
where $(a)$ holds by [27] (Theorem 2) because $\tilde{U}[(\mathcal{L}_{X|Y_1}^{(n)})^C]$ is available to all receivers, $(b)$ holds by Lemma 1, that is, $Z(U(j)|U^{1:j-1},Y_k^n) \le Z(U(j)|U^{1:j-1},Y_1^n)$ for any $k>1$, and by the definition of $\mathcal{L}_{X|Y_1}^{(n)}$ in Equation (9) and [27] (Proposition 2), that is, $Z(U(j)|U^{1:j-1},Y_1^n) \le \big(H(U(j)|U^{1:j-1},Y_1^n)\big)^{1/2}$, and $(c)$ holds by the optimal coupling and Lemma 2 because $\mathbb{V}(\tilde{q}_{X^nY_k^n}, p_{X^nY_k^n}) \le \mathbb{V}(\tilde{q}_{X^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n}, p_{X^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n})$. Therefore, the polar coding scheme satisfies the reliability condition given in Equation (2).

4.5.4. Secrecy Performance

Consider the information leakage at the eavesdropper $m\in[1,M]$ given in Equation (20). We obtain:
$$\begin{aligned} I\big(W_m,\dots,W_M;F,\tilde{Z}_m^n\big) &= H\Big(\tilde{U}\Big[\textstyle\bigcup_{i=m}^M\mathcal{I}_i^{(n)}\Big]\Big) + H\Big(\tilde{U}[\mathcal{F}^{(n)}]\Big|\tilde{Z}_m^n\Big) - H\Big(\tilde{U}\Big[\Big(\textstyle\bigcup_{i=m}^M\mathcal{I}_i^{(n)}\Big)\cup\mathcal{F}^{(n)}\Big]\Big|\tilde{Z}_m^n\Big) \\ &\le \sum_{i=m}^M\big|\mathcal{I}_i^{(n)}\big| + \big|\mathcal{F}^{(n)}\big| - H\Big(\tilde{U}\Big[\Big(\textstyle\bigcup_{i=m}^M\mathcal{I}_i^{(n)}\Big)\cup\mathcal{F}^{(n)}\Big]\Big|\tilde{Z}_m^n\Big). \end{aligned}\tag{22}$$
Now, we provide a lower-bound for the conditional entropy term of Equation (22). First, for large enough n,
$$\begin{aligned} \Big|H\big(\tilde{U}[\,\cdot\,]\big|\tilde{Z}_m^n\big) - H\big(U[\,\cdot\,]\big|Z_m^n\big)\Big| &\overset{(a)}{\le} \big|H(\tilde{Z}_m^n) - H(Z_m^n)\big| + \Big|H\big(\tilde{U}[\,\cdot\,],\tilde{Z}_m^n\big) - H\big(U[\,\cdot\,],Z_m^n\big)\Big| \\ &\overset{(b)}{\le} \mathbb{V}\big(\tilde{q}_{Z_m^n},p_{Z_m^n}\big)\log\frac{2^n}{\mathbb{V}\big(\tilde{q}_{Z_m^n},p_{Z_m^n}\big)} + \mathbb{V}_{U,Z}\log\frac{2^{\,n+|(\cup_{i=m}^M\mathcal{I}_i^{(n)})\cup\mathcal{F}^{(n)}|}}{\mathbb{V}_{U,Z}} \\ &\overset{(c)}{\le} 3n\delta_{\mathrm{nld\text{-}ls}}^{(n)} - 2\delta_{\mathrm{nld\text{-}ls}}^{(n)}\log\delta_{\mathrm{nld\text{-}ls}}^{(n)}, \end{aligned}\tag{23}$$
where $[\,\cdot\,]$ abbreviates $(\bigcup_{i=m}^M\mathcal{I}_i^{(n)})\cup\mathcal{F}^{(n)}$ and $\mathbb{V}_{U,Z} \triangleq \mathbb{V}(\tilde{q}_{U[\cdot]Z_m^n}, p_{U[\cdot]Z_m^n})$. Here, $(a)$ holds by the chain rule of entropy and the triangle inequality, $(b)$ holds by [30] (Lemma 2.9) and $(c)$ holds because the function $x\mapsto x\log x$ is decreasing for $x>0$ small enough and by Lemma 2, because $\mathbb{V}(\tilde{q}_{Z_m^n},p_{Z_m^n}) \le \mathbb{V}(\tilde{q}_{X^nY_K^n\cdots Z_1^n}, p_{X^nY_K^n\cdots Z_1^n})$ and, by the invertibility of $G_n$, $\mathbb{V}_{U,Z} \le \mathbb{V}(\tilde{q}_{X^nY_K^n\cdots Z_1^n}, p_{X^nY_K^n\cdots Z_1^n})$. Hence, we have
$$\begin{aligned} H\big(\tilde{U}[\,\cdot\,]\big|\tilde{Z}_m^n\big) &\ge H\big(U[\,\cdot\,]\big|Z_m^n\big) - \Big(3n\delta_{\mathrm{nld\text{-}ls}}^{(n)} - 2\delta_{\mathrm{nld\text{-}ls}}^{(n)}\log\delta_{\mathrm{nld\text{-}ls}}^{(n)}\Big) \\ &\overset{(a)}{\ge} \sum_{j\in(\cup_{i=m}^M\mathcal{I}_i^{(n)})\cup\mathcal{F}^{(n)}} H\big(U(j)\big|U^{1:j-1},Z_m^n\big) - \Big(3n\delta_{\mathrm{nld\text{-}ls}}^{(n)} - 2\delta_{\mathrm{nld\text{-}ls}}^{(n)}\log\delta_{\mathrm{nld\text{-}ls}}^{(n)}\Big) \\ &\overset{(b)}{\ge} \Big(\sum_{i=m}^M\big|\mathcal{I}_i^{(n)}\big| + \big|\mathcal{F}^{(n)}\big|\Big)(1-\delta_n) - \Big(3n\delta_{\mathrm{nld\text{-}ls}}^{(n)} - 2\delta_{\mathrm{nld\text{-}ls}}^{(n)}\log\delta_{\mathrm{nld\text{-}ls}}^{(n)}\Big), \end{aligned}\tag{24}$$
where $(a)$ holds because conditioning does not increase entropy and $(b)$ holds because, according to Equations (12)–(14) and Lemma 1, $(\bigcup_{i=m}^M\mathcal{I}_i^{(n)})\cup\mathcal{F}^{(n)} \subseteq \mathcal{H}_{X|Z_m}^{(n)}$, as well as by the definition of $\mathcal{H}_{X|Z_m}^{(n)}$ in Equation (11).
Finally, by substituting Equation (24) into Equation (22), for $n$ sufficiently large, we obtain
$$I\big(W_m,\dots,W_M;F,\tilde{Z}_m^n\big) \le n\delta_n + 3n\delta_{\mathrm{nld\text{-}ls}}^{(n)} - 2\delta_{\mathrm{nld\text{-}ls}}^{(n)}\log\delta_{\mathrm{nld\text{-}ls}}^{(n)}. \tag{25}$$
Hence, the polar code satisfies the strong secrecy condition in Equation (3), and the proof of Theorem 1 is concluded.

4.5.5. Reuse of the Source of Common Randomness

Consider that the transmission takes place over $B$ blocks of size $n$. We use the subscript $b\in[1,B]$ between parentheses to denote random variables associated with the block $b$. From Lemma 2, we have $\mathbb{V}(\tilde{q}_{U_{(b)}^n}, p_{U^n}) \le \delta_{\mathrm{nld\text{-}ls}}^{(n)}$ for any $b\in[1,B]$ because we use the same encoding of Equation (17) at each block. Hence, by the union bound, the polar code satisfies the reliability condition given in Equation (2) because
$$\mathbb{P}\Big[\bigcup_{b=1}^B\big\{\hat{U}_{(b)}^n\neq\tilde{U}_{(b)}^n\big\}\Big] \le \sum_{b=1}^B\mathbb{P}\big[\hat{U}_{(b)}^n\neq\tilde{U}_{(b)}^n\big] \le B\big(n\sqrt{\delta_n} + \delta_{\mathrm{nld\text{-}ls}}^{(n)}\big),$$
where the last inequality follows from the fact that, since $F$ and $\Phi_{(b)}$ are perfectly known, $\mathbb{P}[\hat{U}_{(b)}^n\neq\tilde{U}_{(b)}^n]$ only depends on the decoding at block $b$ and, consequently, can be bounded as in Equation (21).
With a slight abuse of notation, let $W_{m:M,(b_1:b_2)}$, where $1\le b_1\le b_2\le B$, denote the messages $\{(W_{m,(b)},\dots,W_{M,(b)})\}_{b=b_1}^{b_2}$. It remains to show that $W_{m:M,(1:B)}$ is asymptotically statistically independent of $(F,\tilde{Z}_{m,(1:B)}^n)$. Since $F$ is reused at each block, we have to consider the dependencies between the random variables of different blocks that are involved in the secrecy analysis. According to these dependencies, which are represented in the Bayesian graph of Figure 4, we obtain:
$$I\big(W_{m:M,(1:B)};\tilde{Z}_{m,(1:B)}^n,F\big) \overset{(a)}{=} I\big(W_{m:M,(1:B)};\tilde{Z}_{m,(1:B)}^n\big|F\big) = \sum_{b=0}^{B-1} I\big(W_{m:M,(1:B)};\tilde{Z}_{m,(b+1)}^n\big|F,\tilde{Z}_{m,(1:b)}^n\big) \overset{(b)}{\le} B\Big(n\delta_n + 3n\delta_{\mathrm{nld\text{-}ls}}^{(n)} - 2\delta_{\mathrm{nld\text{-}ls}}^{(n)}\log\delta_{\mathrm{nld\text{-}ls}}^{(n)}\Big),$$
where $(a)$ follows from the independence between $W_{m:M,(1:B)}$ and $F$, and $(b)$ holds because
$$\begin{aligned} I\big(W_{m:M,(1:B)};\tilde{Z}_{m,(b+1)}^n\big|F,\tilde{Z}_{m,(1:b)}^n\big) &= I\big(W_{m:M,(1:b+1)};\tilde{Z}_{m,(b+1)}^n\big|F,\tilde{Z}_{m,(1:b)}^n\big) + I\big(W_{m:M,(b+2:B)};\tilde{Z}_{m,(b+1)}^n\big|F,\tilde{Z}_{m,(1:b)}^n,W_{m:M,(1:b+1)}\big) \\ &\le I\big(W_{m:M,(1:b+1)},F,\tilde{Z}_{m,(1:b)}^n;\tilde{Z}_{m,(b+1)}^n\big) + I\big(W_{m:M,(b+2:B)};\tilde{Z}_{m,(1:b+1)}^n,F,W_{m:M,(1:b+1)}\big) \\ &\overset{(a)}{=} I\big(W_{m:M,(1:b+1)},F,\tilde{Z}_{m,(1:b)}^n;\tilde{Z}_{m,(b+1)}^n\big) \\ &= I\big(W_{m:M,(b+1)},F;\tilde{Z}_{m,(b+1)}^n\big) + I\big(W_{m:M,(1:b)},\tilde{Z}_{m,(1:b)}^n;\tilde{Z}_{m,(b+1)}^n\big|W_{m:M,(b+1)},F\big) \\ &\overset{(b)}{\le} n\delta_n + 3n\delta_{\mathrm{nld\text{-}ls}}^{(n)} - 2\delta_{\mathrm{nld\text{-}ls}}^{(n)}\log\delta_{\mathrm{nld\text{-}ls}}^{(n)} + I\big(W_{m:M,(1:b)},\tilde{Z}_{m,(1:b)}^n;W_{m:M,(b+1)},\tilde{Z}_{m,(b+1)}^n\big|F\big) \\ &\overset{(c)}{=} n\delta_n + 3n\delta_{\mathrm{nld\text{-}ls}}^{(n)} - 2\delta_{\mathrm{nld\text{-}ls}}^{(n)}\log\delta_{\mathrm{nld\text{-}ls}}^{(n)}, \end{aligned}$$
where $(a)$ holds because the messages at blocks $b+2$ to $B$ are independent of $F$ and of all the random variables of the previous blocks, $(b)$ follows from Equation (25) and $(c)$ holds by applying d-separation [31] over the graph of Figure 4 because $(W_{m:M,(1:b)},\tilde{Z}_{m,(1:b)}^n) \leftarrow F \rightarrow (W_{m:M,(b+1)},\tilde{Z}_{m,(b+1)}^n)$ forms a common cause and, consequently, $(W_{m:M,(1:b)},\tilde{Z}_{m,(1:b)}^n)$ and $(W_{m:M,(b+1)},\tilde{Z}_{m,(b+1)}^n)$ are independent given $F$.

5. Polar Coding Scheme for the DBC-LD-NLS

The polar coding scheme provided in this section is designed to achieve the supremum of the achievable rates given in Corollary 2 (secrecy-capacity without rate sharing). In this model, there are $K$ input random variables $\{V_\ell\}_{\ell=1}^K$ (where $V_K \triangleq X$), each one corresponding to a different superposition layer. Consider the DMS $(\mathcal{V}_1\times\cdots\times\mathcal{V}_K\times\mathcal{Y}_K\times\cdots\times\mathcal{Y}_1\times\mathcal{Z}_M\times\cdots\times\mathcal{Z}_1, p_{V_1\cdots V_KY_K\cdots Y_1Z_M\cdots Z_1})$ that represents the input and output random variables involved in the achievable subregion of Corollary 2, where $\mathcal{V}_\ell = \{0,1\}$ for any $\ell\in[1,K]$. Let $(V_1^n,\dots,V_K^n,Y_K^n,\dots,Y_1^n,Z_M^n,\dots,Z_1^n)$ be an i.i.d. $n$-sequence of this source. Then, we define the $K$ polar transforms $U_\ell^n \triangleq V_\ell^nG_n$, where $\ell\in[1,K]$. Since $V_1^n \to V_2^n \to \cdots \to V_K^n$ and, consequently, $U_1^n \to U_2^n \to \cdots \to U_K^n$ (by the invertibility of $G_n$) form Markov chains, the joint distribution of $(U_1^n,\dots,U_K^n)$ satisfies
$$p_{U_1^n\cdots U_K^n}\big(u_1^n,\dots,u_K^n\big) = \prod_{\ell=1}^K\prod_{j=1}^n p_{U_\ell(j)|U_\ell^{1:j-1}V_{\ell-1}^n}\big(u_\ell(j)\big|u_\ell^{1:j-1},u_{\ell-1}^nG_n\big).$$

5.1. Polar Code Construction

Based on $p_{V_1\cdots V_KY_K\cdots Y_1Z_M\cdots Z_1}$, the construction is carried out similarly at each superposition layer. Consider the polar construction at layer $\ell\in[1,K]$. Let $\delta_n \triangleq 2^{-n^\beta}$, where $\beta\in(0,\frac{1}{2})$. For the polar transform $U_\ell^n = V_\ell^nG_n$ associated with the $\ell$-th layer, we define the sets:
$$\mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)} \triangleq \big\{j\in[n]: H(U_\ell(j)|U_\ell^{1:j-1},V_{\ell-1}^n) \ge 1-\delta_n\big\}, \tag{27}$$
$$\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)} \triangleq \big\{j\in[n]: H(U_\ell(j)|U_\ell^{1:j-1},V_{\ell-1}^n) \le \delta_n\big\}, \tag{28}$$
$$\mathcal{L}_{V_\ell|V_{\ell-1}Y_k}^{(n)} \triangleq \big\{j\in[n]: H(U_\ell(j)|U_\ell^{1:j-1},V_{\ell-1}^n,Y_k^n) \le \delta_n\big\}, \quad k=\ell,\dots,K, \tag{29}$$
$$\mathcal{H}_{V_\ell|V_{\ell-1}Y_k}^{(n)} \triangleq \big\{j\in[n]: H(U_\ell(j)|U_\ell^{1:j-1},V_{\ell-1}^n,Y_k^n) \ge 1-\delta_n\big\}, \quad k=\ell,\dots,K, \tag{30}$$
$$\mathcal{H}_{V_\ell|V_{\ell-1}Z_m}^{(n)} \triangleq \big\{j\in[n]: H(U_\ell(j)|U_\ell^{1:j-1},V_{\ell-1}^n,Z_m^n) \ge 1-\delta_n\big\}, \quad m=1,\dots,M, \tag{31}$$
where we recall that $V_0 \triangleq \emptyset$ when $\ell=1$ and $V_K \triangleq X$ when $\ell=K$. At each layer $\ell\in[1,K]$, based on these previous sets, we define the following partition of the universal set $[n]$,
$$\mathcal{I}_\ell^{(n)} \triangleq \mathcal{H}_{V_\ell|V_{\ell-1}Z_M}^{(n)} \cap \big(\mathcal{H}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)}\big)^C, \tag{32}$$
$$\mathcal{F}_\ell^{(n)} \triangleq \mathcal{H}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)}, \tag{33}$$
$$\mathcal{C}_\ell^{(n)} \triangleq \mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)} \cap \big(\mathcal{H}_{V_\ell|V_{\ell-1}Z_M}^{(n)}\big)^C, \tag{34}$$
$$\mathcal{T}_\ell^{(n)} \triangleq \big(\mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)}\big)^C, \tag{35}$$
which is graphically represented in Figure 5. The way we define this partition at the $\ell$-th layer follows similar reasoning as the one used to define the partition in Section 4.1 for the DBC-NLD-LS. In this sense, $U_\ell[\mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)}]$ will be suitable for storing uniformly-distributed random sequences. Otherwise, $U_\ell[\mathcal{T}_\ell^{(n)}]$ will not, and $U_\ell(j)$ such that $j\in\mathcal{T}_\ell^{(n)}$ will be constructed somehow from $(U_\ell^{1:j-1},V_{\ell-1}^n)$ and the distribution $p_{U_\ell(j)|U_\ell^{1:j-1}V_{\ell-1}^n}$. Now, $U_\ell[\mathcal{I}_\ell^{(n)}]$ will be suitable for storing information to be secured from all eavesdroppers because $\mathcal{I}_\ell^{(n)}$ belongs to $\mathcal{H}_{V_\ell|V_{\ell-1}Z_M}^{(n)}$, and by Lemma 1, $\mathcal{H}_{V_\ell|V_{\ell-1}Z_M}^{(n)} \subseteq \mathcal{H}_{V_\ell|V_{\ell-1}Z_m}^{(n)}$ for any $m\in[1,M-1]$. Since $\mathcal{C}_\ell^{(n)} \subseteq (\mathcal{H}_{V_\ell|V_{\ell-1}Z_M}^{(n)})^C$, $U_\ell[\mathcal{C}_\ell^{(n)}]$ will be used to store the local randomness required to confuse all eavesdroppers about the secret information carried in this layer. According to [27] (Theorem 2), the legitimate receiver $k\in[1,K]$ will be able to reliably infer $U_\ell[\mathcal{L}_{V_\ell|V_{\ell-1}Y_k}^{(n)}]$ given $Y_k^n$ and $U_\ell[(\mathcal{L}_{V_\ell|V_{\ell-1}Y_k}^{(n)})^C]$. By Lemma 1, we have $(\mathcal{L}_{V_\ell|V_{\ell-1}Y_k}^{(n)})^C \subseteq (\mathcal{L}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)})^C$ for any $\ell<k$. Therefore, given $U_\ell[(\mathcal{L}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)})^C]$, the legitimate receivers $\ell$–$K$ will be able to reliably reconstruct $U_\ell^n$ from their own channel observations. In this sense, $U_\ell[\mathcal{F}_\ell^{(n)}]$ will be used to store the random sequence provided by the source of common randomness. Since $\mathcal{F}_\ell^{(n)} \subseteq \mathcal{H}_{V_\ell|V_{\ell-1}Z_M}^{(n)}$, the strong secrecy condition will not be compromised. On the other hand, $U_\ell[(\mathcal{H}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)})^C\cap(\mathcal{L}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)})^C]$ (hatched areas in Figure 5) will contain secret information or elements that cannot be known directly by the eavesdroppers. Therefore, the transmitter somehow will make those elements available to the legitimate receivers $\ell$–$K$, keeping them masked from all eavesdroppers, by incurring an asymptotically negligible rate penalty.
As mentioned in Remark 3, the goal of the polar construction is to obtain the entropy terms associated with the sets in Equations (27)–(31) and then define the partition of $[n]$ given in Equations (32)–(35).

5.2. Polar Encoding

The superposition-based polar encoder will consist of $K$ encoding blocks operating sequentially at each superposition layer, the block at layer $\ell\in[1,K]$ being responsible for the construction of $\tilde{U}_\ell^n$. In order to construct $\tilde{U}_\ell^n$ for some $\ell\in[2,K]$, the encoder block needs $\tilde{V}_{\ell-1}^n = \tilde{U}_{\ell-1}^nG_n$, which has been constructed previously by the encoding block operating at the $(\ell-1)$-th layer.
Consider the encoding procedure at layer $\ell\in[1,K]$. Let $W_\ell$ and $C_\ell$ be uniformly-distributed random vectors of size $|\mathcal{I}_\ell^{(n)}|$ and $|\mathcal{C}_\ell^{(n)}|$, respectively, where $W_\ell$ represents the message intended for receivers $\ell$–$K$ and $C_\ell$ the local randomness required at the $\ell$-th layer to confuse all eavesdroppers about this message. Let $F_\ell$ be a given uniformly-distributed random $|\mathcal{F}_\ell^{(n)}|$-sequence, which represents the source of common randomness that is available to all parties. The $\ell$-th encoding block constructs the sequence $\tilde{u}_\ell^n$ as follows. Given the realizations $w_\ell$, $c_\ell$ and $f_\ell$, whose elements have been indexed by the sets of indices $\mathcal{I}_\ell^{(n)}$, $\mathcal{C}_\ell^{(n)}$ and $\mathcal{F}_\ell^{(n)}$, respectively, and given $\tilde{v}_{\ell-1}^n = \tilde{u}_{\ell-1}^nG_n$ provided by the previous encoding block (recall that $\tilde{v}_0^n \triangleq \emptyset$ at the first layer), the $\ell$-th encoding block draws $\tilde{u}_\ell^n$ from
$$\tilde{q}_{U_\ell(j)|U_\ell^{1:j-1}V_{\ell-1}^n}\big(\tilde{u}_\ell(j)\big|\tilde{u}_\ell^{1:j-1},\tilde{v}_{\ell-1}^n\big) \triangleq \begin{cases} \mathbb{1}\{\tilde{u}_\ell(j)=w_\ell(j)\} & \text{if } j\in\mathcal{I}_\ell^{(n)}, \\ \mathbb{1}\{\tilde{u}_\ell(j)=c_\ell(j)\} & \text{if } j\in\mathcal{C}_\ell^{(n)}, \\ \mathbb{1}\{\tilde{u}_\ell(j)=f_\ell(j)\} & \text{if } j\in\mathcal{F}_\ell^{(n)}, \\ p_{U_\ell(j)|U_\ell^{1:j-1}V_{\ell-1}^n}\big(\tilde{u}_\ell(j)\big|\tilde{u}_\ell^{1:j-1},\tilde{v}_{\ell-1}^n\big) & \text{if } j\in(\mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)})^C\cap(\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)})^C, \\ \mathbb{1}\{\tilde{u}_\ell(j)=\xi_\ell^{(j)}(\tilde{u}_\ell^{1:j-1},\tilde{v}_{\ell-1}^n)\} & \text{if } j\in\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}, \end{cases} \tag{37}$$
where
$$\xi_\ell^{(j)}\big(\tilde{u}_\ell^{1:j-1},\tilde{v}_{\ell-1}^n\big) \triangleq \arg\max_{u\in\mathcal{V}_\ell} p_{U_\ell(j)|U_\ell^{1:j-1}V_{\ell-1}^n}\big(u\big|\tilde{u}_\ell^{1:j-1},\tilde{v}_{\ell-1}^n\big), \tag{38}$$
$p_{U_\ell(j)|U_\ell^{1:j-1}V_{\ell-1}^n}$ being the distribution induced by the original DMS. Notice that $\mathcal{T}_\ell^{(n)} = \big((\mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)})^C\cap(\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)})^C\big)\cup\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}$, and similarly to the previous model, $\tilde{U}_\ell[\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}]$ is constructed in a deterministic way by adapting the SC encoding algorithm in [20], while $\tilde{U}_\ell[(\mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)})^C\cap(\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)})^C]$ is constructed randomly. By [27] (Theorem 1), the rate of the amount of randomness required for SC encoding will be asymptotically negligible. After constructing $\tilde{U}_\ell^n$, the $\ell$-th encoding block computes the sequence $\tilde{V}_\ell^n = \tilde{U}_\ell^nG_n$ and delivers it to the next encoding block. If $\ell=K$, then $\tilde{V}_K^n \triangleq \tilde{X}^n$, and the encoder transmits it over the DBC, which induces the channel outputs $(\tilde{Y}_K^n,\dots,\tilde{Y}_1^n,\tilde{Z}_M^n,\dots,\tilde{Z}_1^n)$.
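The sequential layer structure can be pictured by chaining the sc_encode and polar_transform sketches given earlier; p_next_layer is once more a hypothetical stand-in for the SC evaluation of $p_{U_\ell(j)|U_\ell^{1:j-1}V_{\ell-1}^n}$, so this is only an illustration of the data flow, not the authors' implementation.

```python
def superposition_encode(n, K, cls, bits, p_next_layer):
    """Sequential encoding of Section 5.2: layer l draws u_l^n as in Eq. (37),
    conditioning on v_{l-1}^n = u_{l-1}^n G_n from the previous layer.
    `cls[l]` and `bits[l]` are the per-layer index labels and stored bits."""
    v_prev = np.zeros(n, dtype=int)   # stands in for V_0 = empty at layer 1
    for l in range(K):
        u = sc_encode(n, cls[l], bits[l],
                      lambda prefix, j: p_next_layer(l, prefix, v_prev, j))
        v_prev = polar_transform(u)   # V_l^n = U_l^n G_n, input to layer l+1
    return v_prev                     # V_K^n = X^n, transmitted over the DBC
```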
Finally, besides the sequence $\tilde{X}^n$, the encoder outputs the following additional secret sequences,
$$\Phi_\ell \triangleq \tilde{U}_\ell\big[(\mathcal{H}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)})^C\cap(\mathcal{L}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)})^C\big], \quad \ell=1,\dots,K.$$
The sequence $\Phi_\ell$ corresponding to the layer $\ell\in[1,K]$ must be additionally transmitted to the legitimate receivers $\ell$–$K$, keeping it masked from the eavesdroppers. To do so, the transmitter can perform a modulo-two addition between $\{\Phi_\ell\}_{\ell=1}^K$ and a uniformly-distributed secret key privately shared with the legitimate receivers and somehow additionally send it to them. If $K\ll n$, by [27] (Theorem 1), we have that the overall rate required to transmit these additional secret sequences is asymptotically negligible, i.e., $\lim_{n\to\infty}\sum_{\ell=1}^K\frac{|\Phi_\ell|}{n} = 0$. As for the previous model, the uniformly-distributed part of any $\Phi_\ell$ could be made available to the corresponding legitimate receivers by using a chaining structure as in [9]. However, this approach would present the same disadvantages as those mentioned in Remark 4.

5.3. Polar Decoding

Consider that the realizations of $\{F_\ell\}_{\ell=1}^K$ are available to all parties, and the sequences $\{\Phi_\ell\}_{\ell=1}^K$ have been successfully received by the corresponding legitimate receivers before the decoding process.
Consider the decoding at the legitimate receiver $k\in[1,K]$. This receiver forms the estimates $\{\hat{U}_\ell^n\}_{\ell=1}^k$ of the sequences $\{\tilde{U}_\ell^n\}_{\ell=1}^k$ in a successive manner, from $\hat{U}_1^n$ to $\hat{U}_k^n$, and the procedure to estimate $\tilde{U}_\ell^n$ for some $\ell\in[1,k]$ is as follows. First, given that $\Phi_\ell$ and $F_\ell$ are available, the receiver knows $\tilde{U}_\ell[(\mathcal{L}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)})^C]$. Moreover, by Lemma 1, $(\mathcal{L}_{V_\ell|V_{\ell-1}Y_k}^{(n)})^C \subseteq (\mathcal{L}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)})^C$ for any $\ell<k$. Thus, given $\tilde{U}_\ell[(\mathcal{L}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)})^C]$, the $k$-th legitimate receiver performs SC decoding for source coding with side information [27] to construct $\hat{U}_\ell[\mathcal{L}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)}]$ from $\tilde{Y}_k^n$ and from $\hat{V}_{\ell-1}^n = \hat{U}_{\ell-1}^nG_n$ estimated previously. In Section 5.5.3, we show formally that the polar coding scheme satisfies the reliability condition in Equation (4).

5.4. Information Leakage

Besides the observations $\tilde{Z}_m^n$, the eavesdropper $m\in[1,M]$ has access to the common randomness $\{F_\ell\}_{\ell=1}^K$. Therefore, the information about all messages leaked to the $m$-th eavesdropper is
$$I\big(W_1,\dots,W_K;F_1,\dots,F_K,\tilde{Z}_m^n\big) = I\big(\tilde{U}_1[\mathcal{I}_1^{(n)}],\dots,\tilde{U}_K[\mathcal{I}_K^{(n)}];\tilde{U}_1[\mathcal{F}_1^{(n)}],\dots,\tilde{U}_K[\mathcal{F}_K^{(n)}],\tilde{Z}_m^n\big). \tag{39}$$
In Section 5.5.4, we prove that $(W_1,\dots,W_K)$ is asymptotically statistically independent of $(F_1,\dots,F_K,\tilde{Z}_m^n)$.

5.5. Performance of the Polar Coding Scheme

The analysis of the polar coding scheme leads to the following theorem.
Theorem 2.
Consider an arbitrary DBC $(\mathcal{X}, p_{Y_K\cdots Y_1Z_M\cdots Z_1|X}, \mathcal{Y}_K\times\cdots\times\mathcal{Y}_1\times\mathcal{Z}_M\times\cdots\times\mathcal{Z}_1)$ such that $\mathcal{X}\triangleq\{0,1\}$ and $p_{Y_K\cdots Y_1Z_M\cdots Z_1|X}$ satisfies the Markov chain condition $X\to Y_K\to\cdots\to Y_1\to Z_M\to\cdots\to Z_1$. The polar coding scheme described in Section 5.1, Section 5.2, Section 5.3 and Section 5.4 achieves any rate tuple of the achievable region defined in Corollary 2, satisfying the reliability and strong secrecy conditions in Equations (4) and (5), respectively.
Corollary 5.
Since $\tilde{U}_\ell[\mathcal{I}_\ell^{(n)}]$ for some $\ell\in[1,K]$ can contain any information to be reliably decoded by the legitimate receivers $\ell$–$K$, the coding scheme in Section 5.1, Section 5.2, Section 5.3 and Section 5.4 can achieve the entire region considering the rate sharing of Proposition 2 by storing part of any message $W_{\ell'}$ such that $\ell' > \ell$ into $\tilde{U}_\ell[\mathcal{I}_\ell^{(n)}]$ instead of part of $W_\ell$.
Corollary 6.
If we consider a communication scenario requiring transmissions over several blocks of size $n$, the same realization of the source of common randomness $(F_1,\dots,F_K)$ that is known by all parties could be used at each block, and the reliability and the strong secrecy conditions would still be ensured.
As in Theorem 1, the proof of Theorem 2 follows in four steps and is provided in Section 5.5.1, Section 5.5.2, Section 5.5.3 and Section 5.5.4. The proof of Corollary 5 is immediate. The proof of Corollary 6 is omitted because it follows similar reasoning as in Corollary 4. Although this model involves different superposition layers, the dependencies between the random variables at different blocks have the same structure as those graphically represented in Figure 4.

5.5.1. Transmission Rates

We prove that the polar coding scheme approaches the corner point of the subregion defined in Corollary 2. For any $\ell\in[1,K]$, the transmission rate $R_\ell$ corresponding to the message $W_\ell$ satisfies
$$\lim_{n\to\infty} R_\ell = \lim_{n\to\infty}\frac{1}{n}\big|\mathcal{I}_\ell^{(n)}\big| \overset{(a)}{=} \lim_{n\to\infty}\frac{1}{n}\Big|\mathcal{H}_{V_\ell|V_{\ell-1}Z_M}^{(n)}\cap\big(\mathcal{H}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)}\big)^C\Big| \overset{(b)}{=} \lim_{n\to\infty}\frac{1}{n}\big|\mathcal{H}_{V_\ell|V_{\ell-1}Z_M}^{(n)}\big| - \lim_{n\to\infty}\frac{1}{n}\big|\mathcal{H}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)}\big| \overset{(c)}{=} I(V_\ell;Y_\ell|V_{\ell-1}) - I(V_\ell;Z_M|V_{\ell-1}),$$
where $(a)$ follows from the definition of the set $\mathcal{I}_\ell^{(n)}$ in Equation (32), $(b)$ holds because, by Lemma 1, $\mathcal{H}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)} \subseteq \mathcal{H}_{V_\ell|V_{\ell-1}Z_M}^{(n)}$, and $(c)$ holds by [27] (Theorem 1).

5.5.2. Distribution of the DMS after the Polar Encoding

Let $\tilde{q}_{U_1^n\cdots U_K^n}$ be the distribution of $(\tilde{U}_1^n,\dots,\tilde{U}_K^n)$ after the encoding in Section 5.2. The following lemma shows that $\tilde{q}_{U_1^n\cdots U_K^n}$ and the distribution $p_{U_1^n\cdots U_K^n}$ of the DMS are nearly statistically indistinguishable for sufficiently large $n$ and, consequently, so are the overall distributions $\tilde{q}_{V_1^n\cdots V_K^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n}$ and $p_{V_1^n\cdots V_K^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n}$.
Lemma 3.
Let $\delta_n = 2^{-n^\beta}$ for some $\beta\in(0,\frac{1}{2})$. Then,
$$\mathbb{V}\big(\tilde{q}_{U_1^n\cdots U_K^n}, p_{U_1^n\cdots U_K^n}\big) \le \delta_{\mathrm{ld\text{-}nls}}^{(n)}, \qquad \mathbb{V}\big(\tilde{q}_{V_1^n\cdots V_K^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n}, p_{V_1^n\cdots V_K^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n}\big) = \mathbb{V}\big(\tilde{q}_{U_1^n\cdots U_K^n}, p_{U_1^n\cdots U_K^n}\big) \le \delta_{\mathrm{ld\text{-}nls}}^{(n)},$$
where $\delta_{\mathrm{ld\text{-}nls}}^{(n)} \triangleq Kn\sqrt{4n\delta_n\ln 2}\,\big(2n - \log(2n\delta_n\ln 2)\big) + \delta_n + K\sqrt{2n\delta_n\ln 2}$.
Proof. 
See Appendix A setting L = K . □
Remark 6.
The first term of $\delta_{\mathrm{ld\text{-}nls}}^{(n)}$ bounds the impact on the total variation distance of using the deterministic SC encoding in Equation (37) for $\tilde{U}_\ell[\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}]$ at each layer $\ell\in[1,K]$. The second term bounds the impact of storing uniformly-distributed random sequences that are independent of $\tilde{V}_{\ell-1}^n$ into $\tilde{U}_\ell[\mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)}]$.

5.5.3. Reliability Performance

Consider the probability of incorrectly decoding $\{W_\ell\}_{\ell=1}^k$ at the legitimate receiver $k\in[1,K]$. Let $\tilde{q}_{V_\ell^nY_k^n}$ and $p_{V_\ell^nY_k^n}$ for any $\ell\le k$ be marginals of $\tilde{q}_{V_1^n\cdots V_K^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n}$ and $p_{V_1^n\cdots V_K^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n}$, respectively. Consider an optimal coupling [29] (Proposition 4.7) between $\tilde{q}_{V_\ell^nY_k^n}$ and $p_{V_\ell^nY_k^n}$ such that
$$\mathbb{P}\big[\mathcal{E}_{V_\ell^nY_k^n}\big] = \mathbb{V}\big(\tilde{q}_{V_\ell^nY_k^n}, p_{V_\ell^nY_k^n}\big),$$
where $\mathcal{E}_{V_\ell^nY_k^n} \triangleq \{(\tilde{V}_\ell^n,\tilde{Y}_k^n)\neq(V_\ell^n,Y_k^n)\}$ or, equivalently, $\mathcal{E}_{V_\ell^nY_k^n} \triangleq \{(\tilde{U}_\ell^n,\tilde{Y}_k^n)\neq(U_\ell^n,Y_k^n)\}$ due to the invertibility of $G_n$. Furthermore, for all $\ell\in[1,k]$, we define the error events $\mathcal{E}_{\hat{V}_\ell^n} \triangleq \{\hat{V}_\ell^n\neq\tilde{V}_\ell^n\}$ or, equivalently, $\mathcal{E}_{\hat{V}_\ell^n} \triangleq \{\hat{U}_\ell^n\neq\tilde{U}_\ell^n\}$; and we define $\mathcal{E}_{\hat{V}_0^n} \triangleq \emptyset$. Hence, for any $\ell\in[1,k]$, the average probability of incorrectly decoding the message $W_\ell$ at the $k$-th receiver can be upper-bounded as
$$\begin{aligned} \mathbb{P}\big[\hat{W}_\ell\neq W_\ell\big] &\le \mathbb{P}\big[\hat{U}_\ell^n\neq\tilde{U}_\ell^n\big] \\ &= \mathbb{P}\big[\hat{U}_\ell^n\neq\tilde{U}_\ell^n\big|\mathcal{E}_{V_\ell^nY_k^n}^C\cap\mathcal{E}_{\hat{V}_{\ell-1}^n}^C\big]\,\mathbb{P}\big[\mathcal{E}_{V_\ell^nY_k^n}^C\cap\mathcal{E}_{\hat{V}_{\ell-1}^n}^C\big] + \mathbb{P}\big[\hat{U}_\ell^n\neq\tilde{U}_\ell^n\big|\mathcal{E}_{V_\ell^nY_k^n}\cup\mathcal{E}_{\hat{V}_{\ell-1}^n}\big]\,\mathbb{P}\big[\mathcal{E}_{V_\ell^nY_k^n}\cup\mathcal{E}_{\hat{V}_{\ell-1}^n}\big] \\ &\le \mathbb{P}\big[\hat{U}_\ell^n\neq\tilde{U}_\ell^n\big|\mathcal{E}_{V_\ell^nY_k^n}^C\cap\mathcal{E}_{\hat{V}_{\ell-1}^n}^C\big] + \mathbb{P}\big[\mathcal{E}_{V_\ell^nY_k^n}\big] + \mathbb{P}\big[\mathcal{E}_{\hat{V}_{\ell-1}^n}\big] \\ &\overset{(a)}{\le} \sum_{j\in\mathcal{L}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)}} Z\big(U_\ell(j)\big|U_\ell^{1:j-1},V_{\ell-1}^n,Y_k^n\big) + \mathbb{P}\big[\mathcal{E}_{V_\ell^nY_k^n}\big] + \mathbb{P}\big[\mathcal{E}_{\hat{V}_{\ell-1}^n}\big] \\ &\overset{(b)}{\le} n\sqrt{\delta_n} + \mathbb{P}\big[\mathcal{E}_{V_\ell^nY_k^n}\big] + \mathbb{P}\big[\mathcal{E}_{\hat{V}_{\ell-1}^n}\big] \overset{(c)}{\le} n\sqrt{\delta_n} + \delta_{\mathrm{ld\text{-}nls}}^{(n)} + \mathbb{P}\big[\mathcal{E}_{\hat{V}_{\ell-1}^n}\big], \end{aligned}$$
where $(a)$ holds by [27] (Theorem 2) because $\tilde{U}_\ell[(\mathcal{L}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)})^C]$ for any $\ell\le k$ is available to the $k$-th receiver, $(b)$ holds by Lemma 1, by the definition of the set $\mathcal{L}_{V_\ell|V_{\ell-1}Y_k}^{(n)}$ in Equation (29) and by applying [27] (Proposition 2), and $(c)$ holds by the optimal coupling and Lemma 3 because $\mathbb{V}(\tilde{q}_{V_\ell^nY_k^n}, p_{V_\ell^nY_k^n}) \le \mathbb{V}(\tilde{q}_{V_1^n\cdots V_K^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n}, p_{V_1^n\cdots V_K^nY_K^n\cdots Y_1^nZ_M^n\cdots Z_1^n})$. Thus, by induction, we obtain
$$\mathbb{P}\big[(\hat{W}_1,\dots,\hat{W}_k)\neq(W_1,\dots,W_k)\big] \le \sum_{\ell=1}^k \mathbb{P}\big[\hat{U}_\ell^n\neq\tilde{U}_\ell^n\big] \le \frac{k(k+1)}{2}\big(n\sqrt{\delta_n} + \delta_{\mathrm{ld\text{-}nls}}^{(n)}\big).$$
Consequently, if $K\ll n$, the polar coding scheme satisfies the reliability condition in Equation (4).

5.5.4. Secrecy Performance

Consider the leakage at the eavesdropper $m\in[1,M]$ given in Equation (39). As in Equation (22), we obtain:
$$I\big(W_1,\dots,W_K;F_1,\dots,F_K,\tilde{Z}_m^n\big) \le \sum_{\ell=1}^K\big|\mathcal{I}_\ell^{(n)}\cup\mathcal{F}_\ell^{(n)}\big| - H\big(\tilde{U}_1[\mathcal{I}_1^{(n)}\cup\mathcal{F}_1^{(n)}],\dots,\tilde{U}_K[\mathcal{I}_K^{(n)}\cup\mathcal{F}_K^{(n)}]\big|\tilde{Z}_m^n\big). \tag{43}$$
Following similar reasoning as in Equation (23), for $n$ large enough, we have
$$\begin{aligned} \Big|H\big(\tilde{U}_1[\mathcal{I}_1^{(n)}\cup\mathcal{F}_1^{(n)}],\dots,\tilde{U}_K[\mathcal{I}_K^{(n)}\cup\mathcal{F}_K^{(n)}]\big|\tilde{Z}_m^n\big) - H\big(U_1[\mathcal{I}_1^{(n)}\cup\mathcal{F}_1^{(n)}],\dots,U_K[\mathcal{I}_K^{(n)}\cup\mathcal{F}_K^{(n)}]\big|Z_m^n\big)\Big| &\overset{(a)}{\le} \mathbb{V}\big(\tilde{q}_{Z_m^n},p_{Z_m^n}\big)\log\frac{2^n}{\mathbb{V}\big(\tilde{q}_{Z_m^n},p_{Z_m^n}\big)} + \mathbb{V}_{U,Z}\log\frac{2^{\,n+\sum_{\ell=1}^K|\mathcal{I}_\ell^{(n)}\cup\mathcal{F}_\ell^{(n)}|}}{\mathbb{V}_{U,Z}} \\ &\overset{(b)}{\le} (K+2)\,n\,\delta_{\mathrm{ld\text{-}nls}}^{(n)} - 2\delta_{\mathrm{ld\text{-}nls}}^{(n)}\log\delta_{\mathrm{ld\text{-}nls}}^{(n)}, \end{aligned}\tag{44}$$
where $(a)$ holds by defining $\mathbb{V}_{U,Z} \triangleq \mathbb{V}\big(\tilde{q}_{U_1[\mathcal{I}_1^{(n)}\cup\mathcal{F}_1^{(n)}]\cdots U_K[\mathcal{I}_K^{(n)}\cup\mathcal{F}_K^{(n)}]Z_m^n}, p_{U_1[\mathcal{I}_1^{(n)}\cup\mathcal{F}_1^{(n)}]\cdots U_K[\mathcal{I}_K^{(n)}\cup\mathcal{F}_K^{(n)}]Z_m^n}\big)$ and by [30] (Lemma 2.9), and $(b)$ follows from Lemma 3 by using similar reasoning as in Equation (23) and because the function $x\mapsto x\log x$ is decreasing for $x>0$ small enough. Hence, we obtain
$$\begin{aligned} H\big(\tilde{U}_1[\mathcal{I}_1^{(n)}\cup\mathcal{F}_1^{(n)}],\dots,\tilde{U}_K[\mathcal{I}_K^{(n)}\cup\mathcal{F}_K^{(n)}]\big|\tilde{Z}_m^n\big) &\ge H\big(U_1[\mathcal{I}_1^{(n)}\cup\mathcal{F}_1^{(n)}],\dots,U_K[\mathcal{I}_K^{(n)}\cup\mathcal{F}_K^{(n)}]\big|Z_m^n\big) - \Big((K+2)n\delta_{\mathrm{ld\text{-}nls}}^{(n)} - 2\delta_{\mathrm{ld\text{-}nls}}^{(n)}\log\delta_{\mathrm{ld\text{-}nls}}^{(n)}\Big) \\ &\overset{(a)}{\ge} \sum_{\ell=1}^K\sum_{j\in\mathcal{I}_\ell^{(n)}\cup\mathcal{F}_\ell^{(n)}} H\big(U_\ell(j)\big|U_\ell^{1:j-1},V_{\ell-1}^n,Z_m^n\big) - \Big((K+2)n\delta_{\mathrm{ld\text{-}nls}}^{(n)} - 2\delta_{\mathrm{ld\text{-}nls}}^{(n)}\log\delta_{\mathrm{ld\text{-}nls}}^{(n)}\Big) \\ &\overset{(b)}{\ge} \sum_{\ell=1}^K\big|\mathcal{I}_\ell^{(n)}\cup\mathcal{F}_\ell^{(n)}\big|(1-2\delta_n) - \Big((K+2)n\delta_{\mathrm{ld\text{-}nls}}^{(n)} - 2\delta_{\mathrm{ld\text{-}nls}}^{(n)}\log\delta_{\mathrm{ld\text{-}nls}}^{(n)}\Big), \end{aligned}\tag{45}$$
where $(a)$ holds because conditioning does not increase entropy, because $U_1^n \to \cdots \to U_{K-1}^n \to U_K^n$ forms a Markov chain, and by the invertibility of $G_n$, and $(b)$ holds because, according to Equations (32) and (33), $\mathcal{I}_\ell^{(n)}\cup\mathcal{F}_\ell^{(n)} \subseteq \mathcal{H}_{V_\ell|V_{\ell-1}Z_M}^{(n)}$ for all $\ell\in[1,K]$, because, by Lemma 1, we have $\mathcal{H}_{V_\ell|V_{\ell-1}Z_M}^{(n)} \subseteq \mathcal{H}_{V_\ell|V_{\ell-1}Z_m}^{(n)}$ for any $m\in[1,M-1]$, and by the definition of the set $\mathcal{H}_{V_\ell|V_{\ell-1}Z_m}^{(n)}$ given in Equation (31).
Finally, by substituting Equation (45) into Equation (43), we obtain
$$I\big(W_1,\dots,W_K;F_1,\dots,F_K,\tilde{Z}_m^n\big) \le n\delta_n + (K+2)n\delta_{\mathrm{ld\text{-}nls}}^{(n)} - 2\delta_{\mathrm{ld\text{-}nls}}^{(n)}\log\delta_{\mathrm{ld\text{-}nls}}^{(n)}.$$
Hence, if $K\ll n$, the polar code satisfies the strong secrecy condition in Equation (5), and the proof is concluded.

6. Polar Construction and Performance Evaluation

In this section, we discuss how to construct the polar codes proposed in Section 4 and Section 5 for the DBC-NLD-LS and the DBC-LD-NLS, respectively. Moreover, we evaluate the reliability and the secrecy performance of both polar coding schemes according to the different parameters involved in the polar code construction. Although the construction of polar codes has been covered in a large number of references (see, for instance, [21,22,23]), these references focus only on polar codes under reliability constraints.
For the DBC-NLD-LS, we consider the Binary Erasure Broadcast Channel (BE-BC), where each individual channel of the DBC is a Binary Erasure Channel (BEC). For this model, we propose a construction of the polar code that is based on the Bhattacharyya parameters instead of the corresponding entropy terms. The reason is that, for the BE-BC, the Bhattacharyya parameters associated with the sets in Equations (7)–(11) can be computed exactly [7] (Proposition 5). Then, we evaluate the reliability and the secrecy performance of the code, and we focus on how different parameters involved in the proposed polar code construction impact its performance.
On the other hand, for the DBC-LD-NLS, we consider the Binary Symmetric Broadcast Channel (BS-BC), where each individual channel is a Binary Symmetric Channel (BSC). From [7] (Proposition 5), we know that the method that computes the exact values of the Bhattacharyya parameters for a BEC only provides an upper-bound on the Bhattacharyya parameters of a BSC. Although this method can be useful to construct polar codes under reliability constraints [21,22,23], it fails when the code must guarantee some secrecy condition based on the information leakage. Indeed, in order to upper-bound the information leakage in Equation (39), according to Equation (45), we need a lower-bound on the entropy terms (or Bhattacharyya parameters). Hence, for this model, we focus on proposing a new polar code construction that is based directly on the entropy terms associated with the sets in Equations (27)–(31).
Throughout this section, as in [7], we say that a channel or a conditional distribution $p_{Y|X}(y|x)$ with $x \in \mathcal{X} \triangleq \{0,1\}$ and $y \in \mathcal{Y} \triangleq \{0, \ldots, |\mathcal{Y}|-1\}$ is symmetric if the columns of the probability transition matrix
$$P_{Y|X} \triangleq \begin{pmatrix} p_{Y|X}(0|0) & \cdots & p_{Y|X}\big(|\mathcal{Y}|-1 \,\big|\, 0\big) \\ p_{Y|X}(0|1) & \cdots & p_{Y|X}\big(|\mathcal{Y}|-1 \,\big|\, 1\big) \end{pmatrix}$$
can be grouped into sub-matrices such that, for each sub-matrix, each row is a permutation of each other row and each column is a permutation of each other column. Therefore, the individual channels of both the BE-BC and the BS-BC are symmetric.
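For instance, a BEC with erasure probability $\epsilon$ satisfies this definition: grouping the columns of its transition matrix into the non-erased outputs and the erasure output yields sub-matrices in which every row is a permutation of every other row and every column is a permutation of every other column:
$$P_{Y|X} = \begin{pmatrix} 1-\epsilon & \epsilon & 0 \\ 0 & \epsilon & 1-\epsilon \end{pmatrix} \longrightarrow \underbrace{\begin{pmatrix} 1-\epsilon & 0 \\ 0 & 1-\epsilon \end{pmatrix}}_{y \in \{0,1\}}, \qquad \underbrace{\begin{pmatrix} \epsilon \\ \epsilon \end{pmatrix}}_{y = E}.$$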
Due to the symmetry of the BE-BC, we will see that the distribution induced by the encoding described in Section 4.2 for the DBC-NLD-LS matches exactly the optimal distribution of the original DMS used in the polar code construction. Consequently, the performance of the polar code will depend only on the parameters involved in the construction. On the other hand, despite the symmetry of the BS-BC, due to its superposition-based structure, the encoding described in Section 5.2 for the DBC-LD-NLS only approaches the target distribution asymptotically. Hence, this encoding will impact the reliability and secrecy performance of the polar code when we consider a finite blocklength.

6.1. DBC-NLD-LS

For this model, we consider a BE-BC with two legitimate receivers ($K = 2$) and two eavesdroppers ($M = 2$). Therefore, each individual channel is a BEC with $\mathcal{X} \triangleq \{0,1\}$ and $\mathcal{Y}_k = \mathcal{Z}_m \triangleq \{0, 1, E\}$, $E$ being the erasure symbol and $k, m \in \{1,2\}$. The individual channels are defined simply by their erasure probability, which is denoted by $\epsilon_{Y_k}$ for the corresponding legitimate receiver $k$ ($\mathbb{P}[Y_k = E] = \epsilon_{Y_k}$) and by $\epsilon_{Z_m}$ for the eavesdropper $m$ ($\mathbb{P}[Z_m = E] = \epsilon_{Z_m}$). Due to the degradedness condition of the broadcast channel given in Equation (1), we have $\epsilon_{Y_2} < \epsilon_{Y_1} < \epsilon_{Z_2} < \epsilon_{Z_1}$. By properly applying [19] (Proposition 3.2), it is easy to show that the secrecy-capacity achieving distribution $p_X$ for this model is uniform, i.e., $p_X(x) = \frac{1}{2}$ for all $x \in \{0,1\}$. For the simulations, we consider a BE-BC such that $\epsilon_{Y_2} = 0.01$, $\epsilon_{Y_1} = 0.04$, $\epsilon_{Z_2} = 0.2$ and $\epsilon_{Z_1} = 0.35$. According to Corollary 1 and since $p_X$ is uniform, the maximum achievable rates without considering rate sharing are $R_1 = 0.15$ and $R_2 = 0.16$.
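These values follow directly from the erasure probabilities. As a quick sanity check, here is a minimal sketch, assuming that, for uniform inputs, the rate expressions of Corollary 1 specialize to $R_1 = I(X;Z_2) - I(X;Z_1)$ and $R_2 = I(X;Y_1) - I(X;Z_2)$, with $I(X;Y) = 1 - \epsilon$ for a BEC:

```matlab
% Sanity check of the BE-BC rates (hypothetical sketch; variable names are ours).
epsY1 = 0.04; epsZ2 = 0.20; epsZ1 = 0.35;
R1 = epsZ1 - epsZ2   % = I(X;Z2) - I(X;Z1) = 0.15
R2 = epsZ2 - epsY1   % = I(X;Y1) - I(X;Z2) = 0.16
```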

6.1.1. Practical Polar Code Construction

Given the blocklength $n$ and the distribution $p_{X Y_2 Y_1 Z_2 Z_1} = p_X\, p_{Y_2 Y_1 Z_2 Z_1|X}$, the goal of the polar code construction is to obtain the partition of the universal set $[n]$ defined in Equations (12)–(16) and graphically represented in Figure 3. Hence, we first need to define the required sets of Equations (7)–(11), which means computing the entropy terms $\{H(U^{(j)}|U^{1:j-1})\}_{j=1}^n$, $\{H(U^{(j)}|U^{1:j-1}, Y_1^n)\}_{j=1}^n$ and $\{H(U^{(j)}|U^{1:j-1}, Z_m^n)\}_{j=1}^n$ for $m \in \{1,2\}$ associated with the polar transform $U^n = X^n G_n$. Alternatively, as mentioned in Section 3, we can define the sets in Equations (7)–(11) from the corresponding Bhattacharyya parameters. Indeed, since each individual channel is a BEC, by [7] (Proposition 5), we can compute with very low complexity the exact values of $\{Z(U^{(j)}|U^{1:j-1})\}_{j=1}^n$, $\{Z(U^{(j)}|U^{1:j-1}, Y_1^n)\}_{j=1}^n$ and $\{Z(U^{(j)}|U^{1:j-1}, Z_m^n)\}_{j=1}^n$ for $m \in \{1,2\}$. To do so, we use the recursive algorithm [22] (PCC-0) adapted to the BEC, which, for instance, obtains $\{Z(U^{(j)}|U^{1:j-1}, Y_1^n)\}_{j=1}^n$ from the initial value $Z(X|Y_1) = \epsilon_{Y_1}$ (the entire MATLAB code used for this section is provided as Supplementary Material; see Endnote [32]). Regarding $\{Z(U^{(j)}|U^{1:j-1})\}_{j=1}^n$, since $p_X$ is uniform, it is clear that $Z(U^{(j)}|U^{1:j-1}) = H(U^{(j)}|U^{1:j-1}) = 1$ for all $j \in [n]$, which means $\mathcal{H}_X^{(n)} = [n]$. Consequently, the set $\mathcal{T}^{(n)} = \emptyset$, and, according to Equation (17), neither random nor deterministic SC encoding will be needed.
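The BEC recursion itself takes only a few lines. The following is a minimal sketch (the function name is ours, not from the Supplementary Material; the index ordering corresponds to one fixed bit-ordering convention for $G_n$, so a bit-reversal permutation may be needed to match a particular implementation):

```matlab
function z = bec_bhattacharyya(n, eps0)
% Exact Bhattacharyya parameters {Z(U(j)|U^{1:j-1}, Y^n)}_{j=1}^n for a BEC
% with erasure probability eps0 (cf. [7] (Proposition 5)); n is a power of two.
z = eps0;
for level = 1:log2(n)
    zminus = 2*z - z.^2;                  % "minus" (degraded) transform
    zplus  = z.^2;                        % "plus" (upgraded) transform
    z = reshape([zminus; zplus], 1, []);  % interleave the two branches
end
end
```

For instance, bec_bhattacharyya(n, 0.04) would return the parameters associated with the legitimate Receiver 1 in our setting.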
In order to compare the performance of the polar coding scheme according to different parameters and to provide more flexibility in the design, instead of using only $\delta_n$ to define the sets in Equations (7)–(11), we introduce the pair $(\delta_n^{(r)}, \delta_n^{(s)})$, where $\delta_n^{(r)} \triangleq 2^{-n^{\beta^{(r)}}}$ and $\delta_n^{(s)} \triangleq 2^{-n^{\beta^{(s)}}}$ for some $\beta^{(r)}, \beta^{(s)} \in (0, \frac{1}{2})$. Let $\bar{R}_1 \in [0, R_1]$ and $\bar{R}_2 \in [0, R_2]$ denote the target rates that the polar coding scheme must approach. We obtain the partition defined in Equations (12)–(16) as follows. First, we define $\big(\mathcal{H}_{X|Y_1}^{(n)}\big)^C \triangleq \big\{j \in [n] : H(U^{(j)}|U^{1:j-1}, Y_1^n) \leq 1 - \delta_n^{(s)}\big\}$, where one can notice that we have used $\delta_n^{(s)}$. Then, we choose $\mathcal{I}_2^{(n)}$ by taking the $n\bar{R}_2$ indices $j \in \big(\mathcal{H}_{X|Y_1}^{(n)}\big)^C$ that correspond to the highest Bhattacharyya parameters $\{Z(U^{(j)}|U^{1:j-1}, Z_2^n)\}_{j=1}^n$ for Eavesdropper 2. Second, we choose $\mathcal{I}_1^{(n)}$ by taking the $n\bar{R}_1$ indices $j \in \big(\mathcal{H}_{X|Y_1}^{(n)}\big)^C \setminus \mathcal{I}_2^{(n)}$ that correspond to the highest Bhattacharyya parameters $\{Z(U^{(j)}|U^{1:j-1}, Z_1^n)\}_{j=1}^n$ for Eavesdropper 1. Finally, we obtain $\mathcal{C}^{(n)} = \big(\mathcal{H}_{X|Y_1}^{(n)}\big)^C \setminus \big(\mathcal{I}_1^{(n)} \cup \mathcal{I}_2^{(n)}\big)$ and $\mathcal{F}^{(n)} = \mathcal{H}_{X|Y_1}^{(n)}$. Furthermore, in order to evaluate the reliability performance of the code, we define $\mathcal{L}_{X|Y_1}^{(n)} \triangleq \big\{j \in [n] : H(U^{(j)}|U^{1:j-1}, Y_1^n) \leq \delta_n^{(r)}\big\}$, where one can notice that we have used $\delta_n^{(r)}$. Since the additional secret sequence $\Phi$ corresponds to those entries belonging to $\big(\mathcal{H}_{X|Y_1}^{(n)}\big)^C \cap \big(\mathcal{L}_{X|Y_1}^{(n)}\big)^C$, its length will depend on $(\delta_n^{(r)}, \delta_n^{(s)})$. According to the polar code construction proposed in this section, notice that $\delta_n^{(s)}$ must be small enough to guarantee that $\big|\big(\mathcal{H}_{X|Y_1}^{(n)}\big)^C\big| \geq n(\bar{R}_1 + \bar{R}_2)$.
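In code, this construction reduces to thresholding, sorting and set operations. A minimal sketch, assuming zY1, zZ1 and zZ2 are the outputs of bec_bhattacharyya above for the corresponding channels (for a BEC, the entropy and Bhattacharyya terms coincide, so thresholding zY1 is equivalent to thresholding the entropies; all names are ours):

```matlab
% Hypothetical sketch of the partition in Section 6.1.1.
HC = find(zY1 <= 1 - delta_s);           % candidate set (H_{X|Y1}^(n))^C
[~, o2] = sort(zZ2(HC), 'descend');
I2 = HC(o2(1:round(n*R2bar)));           % best-masked entries against Eve 2
rest = setdiff(HC, I2);
[~, o1] = sort(zZ1(rest), 'descend');
I1 = rest(o1(1:round(n*R1bar)));         % best-masked entries against Eve 1
C  = setdiff(rest, I1);                  % remaining candidates C^(n)
F  = setdiff(1:n, HC);                   % frozen set F^(n) = H_{X|Y1}^(n)
L  = find(zY1 <= delta_r);               % reliably decodable set L_{X|Y1}^(n)
Phi = setdiff(HC, L);                    % entries of the secret sequence
```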

6.1.2. Performance Evaluation

First, notice that the encoding of Section 4.2 induces a distribution $\tilde{q}_{X^n Y_2^n Y_1^n Z_2^n Z_1^n} = p_{X^n Y_2^n Y_1^n Z_2^n Z_1^n}$ because $\mathcal{T}^{(n)} = \emptyset$ (we do not use SC encoding), and the encoder stores uniformly-distributed sequences into the entries $U^{(j)}$ that satisfy $H(U^{(j)}|U^{1:j-1}) = 1$ for all $j \in \mathcal{H}_X^{(n)} = [n]$. Hence, $\mathcal{V}\big(\tilde{q}_{X^n Y_2^n Y_1^n Z_2^n Z_1^n}, p_{X^n Y_2^n Y_1^n Z_2^n Z_1^n}\big) = 0$, and the performance will only depend on the code construction.
To evaluate the reliability performance, we obtain an upper-bound P b ub ( 1 ) on the average bit error probability at the legitimate Receiver 1. Since V ( q ˜ X n Y 2 n Y 1 n Z 2 n Z 1 n , p X n Y 2 n Y 1 n Z 2 n Z 1 n ) = 0 , from Equation (21), we have:
$$P_b^{\mathrm{ub}}(1) \triangleq \frac{1}{\big|\mathcal{L}_{X|Y_1}^{(n)}\big|} \sum_{j \in \mathcal{L}_{X|Y_1}^{(n)}} Z\big(U^{(j)} \,\big|\, U^{1:j-1}, Y_1^n\big).$$
Due to the degradedness condition of the BE-BC and, consequently, by Lemma 1, the average bit error probability at the legitimate Receiver 2 will always be less than the one at the legitimate Receiver 1. Since the legitimate receivers must estimate the entries belonging to $\mathcal{L}_{X|Y_1}^{(n)}$ regardless of $\big(\mathcal{H}_{X|Y_1}^{(n)}\big)^C$ and the target rates $(\bar{R}_1, \bar{R}_2)$, the reliability performance only depends on the pair $(n, \delta_n^{(r)})$.
In order to evaluate the secrecy performance, we compute upper-bounds on the information leakages $I(W_1, W_2; F, \tilde{Z}_1^n)$ and $I(W_2; F, \tilde{Z}_2^n)$. Since $\mathcal{V}\big(\tilde{q}_{X^n Y_2^n Y_1^n Z_2^n Z_1^n}, p_{X^n Y_2^n Y_1^n Z_2^n Z_1^n}\big) = 0$, from Equations (22) and (24), we obtain:
$$I^{\mathrm{ub}}(W_1, W_2; F, \tilde{Z}_1^n) \triangleq \sum_{i=1}^{2} \big|\mathcal{I}_i^{(n)}\big| + \big|\mathcal{F}^{(n)}\big| - \sum_{j \in \mathcal{I}_1^{(n)} \cup \mathcal{I}_2^{(n)} \cup \mathcal{F}^{(n)}} Z\big(U^{(j)} \,\big|\, U^{1:j-1}, Z_1^n\big)^2,$$
$$I^{\mathrm{ub}}(W_2; F, \tilde{Z}_2^n) \triangleq \big|\mathcal{I}_2^{(n)}\big| + \big|\mathcal{F}^{(n)}\big| - \sum_{j \in \mathcal{I}_2^{(n)} \cup \mathcal{F}^{(n)}} Z\big(U^{(j)} \,\big|\, U^{1:j-1}, Z_2^n\big)^2,$$
where we have used [27] (Proposition 2) to express the information leakage in terms of the Bhattacharyya parameters, since $H(U^{(j)}|U^{1:j-1}, Z_m^n) \geq Z(U^{(j)}|U^{1:j-1}, Z_m^n)^2$. According to the proposed polar code construction, the secrecy performance will depend on $(n, \delta_n^{(s)})$ and the rates $(\bar{R}_1, \bar{R}_2)$, but not on $\delta_n^{(r)}$.
Additionally, we evaluate the rate of the additional sequence Φ simply by computing:
$$\frac{1}{n}|\Phi| = \frac{1}{n}\Big|\big(\mathcal{H}_{X|Y_1}^{(n)}\big)^C \cap \big(\mathcal{L}_{X|Y_1}^{(n)}\big)^C\Big|,$$
which will depend on the triple ( n , δ n ( r ) , δ n ( s ) ) , but not on ( R 1 , R 2 ) .
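With the sets from the sketch above, evaluating Equations (47)–(50) reduces to a few vector operations (again hypothetical variable names):

```matlab
Pb1_ub  = mean(zY1(L));                                   % Eq. (47)
I1_ub   = numel([I1, I2, F]) - sum(zZ1([I1, I2, F]).^2);  % Eq. (48)
I2_ub   = numel([I2, F])     - sum(zZ2([I2, F]).^2);      % Eq. (49)
ratePhi = numel(Phi) / n;                                 % Eq. (50)
```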
Let $\rho_R$ be the normalized target rate at which the polar coding scheme operates, that is, $\rho_R \triangleq \bar{R}_1/R_1 = \bar{R}_2/R_2$. In Figure 6A,B, we evaluate the upper-bounds on the information leakage defined in Equations (48) and (49), respectively, as a function of the blocklength $n$ for different values of $\rho_R$. To do so, we set $\beta^{(r)} = 0.16$ and $\beta^{(s)} = 0.30$, which defines a particular pair $(\delta_n^{(r)}, \delta_n^{(s)})$ for each value of $n$ (recall that $\delta_n^{(r)}$ does not impact the secrecy performance of the polar code). As we proved in Section 4.5.4, for large enough $n$, the secrecy performance improves as $n$ increases. Moreover, to achieve a particular secrecy performance level, the polar code requires a larger blocklength $n$ as the rates approach the capacity. This happens because, given $(n, \delta_n^{(s)})$ and, consequently, $\big(\mathcal{H}_{X|Y_1}^{(n)}\big)^C$, the parameter $\rho_R$ only determines the number of indices that will belong to $\mathcal{I}_1^{(n)} \cup \mathcal{I}_2^{(n)} \subseteq \big(\mathcal{H}_{X|Y_1}^{(n)}\big)^C$. Since, by construction, we take those indices corresponding to the highest Bhattacharyya parameters associated with the eavesdroppers, taking more elements always increases the corresponding leakage. For rates approaching the capacity and small values of $n$, notice that the secrecy performance actually worsens as $n$ increases (for instance, for $\rho_R = 0.94$, the information leakage increases from $n = 2^9$ to $n = 2^{12}$). This behavior is mainly explained by the fact that the elements of $U^n$ have not polarized enough for small values of $n$. Consequently, for a given value of $\beta^{(s)}$, not all the Bhattacharyya parameters associated with the eavesdroppers corresponding to the sets $\mathcal{I}_1^{(n)}$ and $\mathcal{I}_2^{(n)}$ are sufficiently close to one. Since, for a given $\rho_R$, the cardinality of $\mathcal{I}_1^{(n)}$ and $\mathcal{I}_2^{(n)}$ increases with $n$, the information leakage can increase with $n$ when $n$ is not large enough. Moreover, since operating at lower rates means taking fewer indices in $\mathcal{I}_1^{(n)}$ and $\mathcal{I}_2^{(n)}$, but taking those that are closest to one, this behavior appears only for large values of $\rho_R$.
The impact of $\delta_n^{(s)}$ on the secrecy performance is represented graphically in Figure 7A,B, where the former plots the upper-bound defined in Equation (48) and the latter the upper-bound in Equation (49), as a function of the blocklength $n$ for different values of $\beta^{(s)}$. Now, we set $\beta^{(r)} = 0.16$ and $\rho_R = 0.90$. As can be seen in Figure 7, the secrecy performance improves as the value of $\beta^{(s)}$ increases (or, equivalently, as $\delta_n^{(s)}$ decreases). This behavior is expected because $\delta_n^{(s)}$ defines the value of the highest Bhattacharyya parameter $Z(U^{(j)}|U^{1:j-1}, Y_1^n)$ that can belong to $\big(\mathcal{H}_{X|Y_1}^{(n)}\big)^C$, that is, the set containing the possible candidates for $\mathcal{I}_1^{(n)} \cup \mathcal{I}_2^{(n)}$. Since the polar construction chooses the indices that will belong to $\mathcal{I}_1^{(n)}$ and $\mathcal{I}_2^{(n)}$ by taking the ones corresponding to the highest Bhattacharyya parameters associated with the eavesdroppers, and since, by Lemma 1, $Z(U^{(j)}|U^{1:j-1}, Z_1^n) \geq Z(U^{(j)}|U^{1:j-1}, Z_2^n) \geq Z(U^{(j)}|U^{1:j-1}, Y_1^n)$ for any $j \in [n]$, the sums in Equations (48) and (49) over the indices $j \in \mathcal{I}_1^{(n)} \cup \mathcal{I}_2^{(n)}$ will be larger as $\beta^{(s)}$ increases (as $\delta_n^{(s)}$ decreases), while their cardinality remains the same for a given $\rho_R$. Furthermore, notice that $\delta_n^{(s)}$ also defines $\mathcal{F}^{(n)} = \mathcal{H}_{X|Y_1}^{(n)} = \big\{j \in [n] : Z(U^{(j)}|U^{1:j-1}, Y_1^n) > 1 - \delta_n^{(s)}\big\}$. Thus, the larger the value of $\beta^{(s)}$ (the lower $\delta_n^{(s)}$), the smaller the cardinality of $\mathcal{F}^{(n)}$ and the higher the Bhattacharyya parameters associated with the eavesdroppers that belong to this set.
Figure 8 plots the upper-bound on the average bit error probability at the legitimate Receiver 1 defined in Equation (47) as a function of the blocklength $n$ for different values of $\beta^{(r)}$ (which defines a particular $\delta_n^{(r)}$ for each $n$). For this figure, we set $\beta^{(s)} = 0.30$ and $\rho_R = 0.90$. As can be seen in Figure 8, the higher the value of $\beta^{(r)}$ (the smaller the value of $\delta_n^{(r)}$), the better the reliability performance of the polar code. This is because $\delta_n^{(r)}$ defines the highest Bhattacharyya parameter associated with the legitimate Receiver 1 whose corresponding index can belong to the set $\mathcal{L}_{X|Y_1}^{(n)}$ (recall that this set contains the indices of those entries that the legitimate receivers have to estimate). Hence, it is clear that the upper-bound in Equation (47) decreases as $\delta_n^{(r)}$ decreases (as $\beta^{(r)}$ increases). Moreover, as we proved in Section 4.5.3, the reliability performance always improves as $n$ increases.
Finally, how the values of the pair $(\beta^{(r)}, \beta^{(s)})$, or, equivalently, the values of $(\delta_n^{(r)}, \delta_n^{(s)})$, impact the rate of the additional secret sequence $\Phi$ given in Equation (50) is represented graphically in Figure 9. In Figure 9A, we set $\rho_R = 0.90$ and $\beta^{(r)} = 0.16$, and we represent the rate of $\Phi$ as a function of the blocklength $n$ for different values of $\beta^{(s)}$. Conversely, in Figure 9B, we evaluate the rate of $\Phi$ as a function of $n$ for different values of $\beta^{(r)}$ when $\rho_R = 0.90$ and $\beta^{(s)} = 0.30$. As mentioned in Section 4.2, this rate becomes negligible for sufficiently large $n$. Moreover, according to the polar code construction proposed previously, for a fixed $n$, the cardinality of the set $\big(\mathcal{H}_{X|Y_1}^{(n)}\big)^C \cap \big(\mathcal{L}_{X|Y_1}^{(n)}\big)^C$ is higher for larger values of $(\beta^{(r)}, \beta^{(s)})$, or, equivalently, for smaller values of $(\delta_n^{(r)}, \delta_n^{(s)})$. Therefore, as can be seen in Figure 9, higher values of $(\beta^{(r)}, \beta^{(s)})$ also mean a higher rate for the additional secret sequence.
In conclusion, Figure 6, Figure 7, Figure 8 and Figure 9 show that, for a particular value of the blocklength $n$, there is a trade-off between the reliability or the secrecy performance of the polar code and the length of the additional secret sequence $\Phi$, which can be controlled by the value of $\beta^{(r)}$ or $\beta^{(s)}$, respectively, in the polar code construction. Moreover, for sufficiently large $n$, the performance of the polar coding scheme always improves as $n$ increases. Indeed, these figures show that we can transmit at rates very close to the capacity while providing good reliability and secrecy performance levels.

6.2. DBC-LD-NLS

For this model, we consider a BS-BC with two legitimate receivers ($K = 2$) and two eavesdroppers ($M = 2$). Hence, each individual channel is a BSC with $\mathcal{X} = \mathcal{Y}_k = \mathcal{Z}_m = \{0,1\}$, $k, m \in \{1,2\}$. The individual channels are defined simply by their crossover probability, which is denoted by $\alpha_{Y_k}$ for the corresponding legitimate receiver $k$ ($\mathbb{P}[Y_k = 0|X = 1] = \mathbb{P}[Y_k = 1|X = 0] = \alpha_{Y_k}$) and by $\alpha_{Z_m}$ for the corresponding eavesdropper $m$ ($\mathbb{P}[Z_m = 0|X = 1] = \mathbb{P}[Z_m = 1|X = 0] = \alpha_{Z_m}$). Due to the degradedness condition of the broadcast channel given in Equation (1), we have $\alpha_{Y_2} < \alpha_{Y_1} < \alpha_{Z_2} < \alpha_{Z_1}$. Due to the symmetry of the channel, it is easy to prove, by using similar reasoning as in [33] (Ex. 15.6.5) and by properly applying [19] (Proposition 3.2), that the secrecy-capacity achieving distribution $p_{VX}$ satisfies $p_V(v) = p_X(x) = \frac{1}{2}$ for all $v, x \in \{0,1\}$, and, consequently, $p_{X|V}$ is symmetric. Thus, the distribution $p_{X|V}$ can be characterized simply by the crossover probability $\alpha_{X|V} \triangleq p_{X|V}(0|1) = p_{X|V}(1|0)$, where $\alpha_{X|V} \in [0, \frac{1}{2}]$. Indeed, the overall rate in Proposition 2 is maximized when $\alpha_{X|V} = \frac{1}{2}$, which implies that $R_1 = 0$. Then, by taking $\alpha_{X|V} < \frac{1}{2}$, we can transfer part of the rate associated with the message $W_2$ to the rate $R_1$, with $R_2 = 0$ and $R_1$ maximal when $\alpha_{X|V} = 0$. For the simulations, we consider a BS-BC with $\alpha_{Y_2} = 0.01$, $\alpha_{Y_1} = 0.04$, $\alpha_{Z_2} = 0.2$ and $\alpha_{Z_1} = 0.35$. We set $\alpha_{X|V} = 0.1084$, which corresponds to the distribution that maximizes $\ln(R_1) + \ln(R_2)$ for this particular channel (proportional fair allocation). Thus, according to Corollary 2, the maximum achievable rates are $R_1 = 0.2507$ and $R_2 = 0.3254$.
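These two values can be reproduced with a few lines, assuming that the rate expressions of Corollary 2 specialize to $R_1 = I(V;Y_1) - I(V;Z_2)$ and $R_2 = I(X;Y_2|V) - I(X;Z_2|V)$, and using that a cascade of BSCs with crossover probabilities $a$ and $b$ is a BSC with crossover probability $a \star b \triangleq a(1-b) + b(1-a)$:

```matlab
% Sanity check of the BS-BC rates (hypothetical sketch; variable names are ours).
h2 = @(p) -p.*log2(p) - (1-p).*log2(1-p);   % binary entropy function
cc = @(a,b) a.*(1-b) + b.*(1-a);            % crossover of cascaded BSCs
aY2 = 0.01; aY1 = 0.04; aZ2 = 0.20; aXV = 0.1084;
R1 = (1 - h2(cc(aXV, aY1))) - (1 - h2(cc(aXV, aZ2)))               % ~0.2507
R2 = (h2(cc(aXV, aY2)) - h2(aY2)) - (h2(cc(aXV, aZ2)) - h2(aZ2))   % ~0.3254
```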

6.2.1. Practical Polar Code Construction

Given the blocklength $n$ and the distribution $p_{V X Y_2 Y_1 Z_2 Z_1} = p_{VX}\, p_{Y_2 Y_1 Z_2 Z_1|X}$, the goal of the polar code construction is to obtain the partition of the universal set $[n]$ defined in Equations (32)–(35) and graphically represented in Figure 5. Hence, we first need to define the sets in Equations (27)–(31), which means computing the entropy terms $\{H(U_1^{(j)}|U_1^{1:j-1})\}_{j=1}^n$, $\{H(U_1^{(j)}|U_1^{1:j-1}, Y_1^n)\}_{j=1}^n$ and $\{H(U_1^{(j)}|U_1^{1:j-1}, Z_2^n)\}_{j=1}^n$ associated with the polar transform $U_1^n = V^n G_n$ for the first superposition layer, and $\{H(U_2^{(j)}|U_2^{1:j-1}, V^n)\}_{j=1}^n$, $\{H(U_2^{(j)}|U_2^{1:j-1}, V^n, Y_2^n)\}_{j=1}^n$ and $\{H(U_2^{(j)}|U_2^{1:j-1}, V^n, Z_2^n)\}_{j=1}^n$ associated with the polar transform $U_2^n = X^n G_n$ for the second layer. In the following, we propose an adaptation of the Monte Carlo method [22] (PCC-1), which is based on the butterfly algorithm described in [7] for SC decoding, to directly estimate these entropy terms.
Monte Carlo method to estimate the entropy terms. First, consider the entropy terms associated with the first layer. As for the previous model, since $p_V(v) = \frac{1}{2}$, we have $H(U_1^{(j)}|U_1^{1:j-1}) = 1$ for all $j \in [n]$. In order to compute $\{H(U_1^{(j)}|U_1^{1:j-1}, Y_k^n)\}_{j=1}^n$ and $\{H(U_1^{(j)}|U_1^{1:j-1}, Z_m^n)\}_{j=1}^n$ for some $k, m \in \{1,2\}$, we run the Monte Carlo simulation as follows. First, due to the symmetry of the channel and the symmetry of $p_{X|V}$, as in [22] (PCC-1), we can set $v^n = u_1^n = 0^n$ at each iteration. For the realization $\tau \in [1, N_\tau]$, $N_\tau$ being the number of realizations, we randomly generate $y_k^n(\tau)$ and $z_m^n(\tau)$ from $p_{Y_k^n|V^n}$ and $p_{Z_m^n|V^n}$, respectively (by abuse of notation, we use $(\tau)$ in any sequence $a^n(\tau)$ to emphasize that it is generated at the iteration $\tau \in [1, N_\tau]$). Next, we obtain the log-likelihood ratios $\{L_{Y_k|V}^{(\tau)}(j)\}_{j=1}^n$ and $\{L_{Z_m|V}^{(\tau)}(j)\}_{j=1}^n$ by using the algorithm [22] (PCC-1). For instance, consider $\{L_{Y_k|V}^{(\tau)}(j)\}_{j=1}^n$. From the initial values $\{p_{Y_k|V}(y_k^{(j)}(\tau)|0) / p_{Y_k|V}(y_k^{(j)}(\tau)|1)\}_{j=1}^n$, the algorithm recursively computes:
$$L_{Y_k|V}^{(\tau)}(j) \triangleq \ln \frac{p_{Y_k^n U_1^{1:j-1}|U_1^{(j)}}\big(y_k^n(\tau), 0^{j-1} \,\big|\, 0\big)}{p_{Y_k^n U_1^{1:j-1}|U_1^{(j)}}\big(y_k^n(\tau), 0^{j-1} \,\big|\, 1\big)} \overset{(a)}{=} \ln \frac{p_{U_1^{(j)}|U_1^{1:j-1} Y_k^n}\big(0 \,\big|\, 0^{j-1}, y_k^n(\tau)\big)}{1 - p_{U_1^{(j)}|U_1^{1:j-1} Y_k^n}\big(0 \,\big|\, 0^{j-1}, y_k^n(\tau)\big)},$$
for all j [ n ] , where ( a ) follows from the fact that p U 1 ( j ) ( 0 ) = p U 1 ( j ) ( 1 ) = 1 2 because H ( U 1 ( j ) | U 1 1 : j 1 ) = 1 for all j [ n ] . Hence, we can obtain p U 1 ( j ) | U 1 1 : j 1 Y k n ( 0 | 0 j 1 , y k n ( τ ) ) from L Y k | V ( τ ) ( j ) , and since:
$$H\big(U_1^{(j)} \,\big|\, U_1^{1:j-1}, Y_k^n\big) = \mathbb{E}_{U_1^{1:j-1} Y_k^n}\Big[h_2\Big(p_{U_1^{(j)}|U_1^{1:j-1} Y_k^n}\big(0 \,\big|\, U_1^{1:j-1}, Y_k^n\big)\Big)\Big],$$
after N τ realizations, we can estimate H ( U 1 ( j ) | U 1 1 : j 1 , Y k n ) by computing the empirical mean, that is,
$$H\big(U_1^{(j)} \,\big|\, U_1^{1:j-1}, Y_k^n\big) \approx \frac{1}{N_\tau} \sum_{\tau=1}^{N_\tau} h_2\Big(p_{U_1^{(j)}|U_1^{1:j-1} Y_k^n}\big(0 \,\big|\, 0^{j-1}, y_k^n(\tau)\big)\Big).$$
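Before turning to the second layer, the following is a minimal sketch of this estimator for the first layer (the variable names and the helper scllr are ours, not from the Supplementary Material). By the Markov chain $V - X - Y_k$, the channel $V \to Y_k$ is a BSC with crossover probability $\alpha_{X|V} \star \alpha_{Y_k}$, where $a \star b \triangleq a(1-b) + b(1-a)$; the index ordering produced by scllr corresponds to one fixed bit-ordering convention for $G_n$, so a bit-reversal permutation may be needed to match other implementations:

```matlab
% Monte Carlo estimation of {H(U1(j)|U1^{1:j-1}, Yk^n)} (hypothetical sketch).
h2    = @(p) -p.*log2(p) - (1-p).*log2(1-p);   % binary entropy function
alpha = 0.1084*(1-0.04) + 0.04*(1-0.1084);     % crossover of V -> Y1
Hest  = zeros(1, n);
for tau = 1:Ntau
    y    = rand(1, n) < alpha;                 % BSC noise under v^n = 0^n
    Lch  = (1 - 2*y) * log((1 - alpha)/alpha); % channel LLRs
    p0   = 1 ./ (1 + exp(-scllr(Lch)));        % P[U1(j) = 0 | 0^{j-1}, y^n]
    p0   = min(max(p0, 1e-12), 1 - 1e-12);     % clip to avoid log2(0)
    Hest = Hest + h2(p0);                      % accumulate h2 terms
end
Hest = Hest / Ntau;                            % empirical mean

function d = scllr(L)
% Decision LLRs under the all-zero input: boxplus ("minus") and sum
% ("plus", with zero partial sums) butterfly updates, as in [7,22].
m = numel(L);
if m == 1, d = L; return; end
a = L(1:2:end); b = L(2:2:end);                % adjacent channel pairs
f = 2*atanh(tanh(a/2).*tanh(b/2));             % check-node (minus) update
g = a + b;                                     % variable-node (plus) update
d = [scllr(f), scllr(g)];
end
```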
Now, consider the Monte Carlo method to estimate $\{H(U_2^{(j)}|U_2^{1:j-1}, V^n)\}_{j=1}^n$, $\{H(U_2^{(j)}|U_2^{1:j-1}, V^n, Y_k^n)\}_{j=1}^n$ and $\{H(U_2^{(j)}|U_2^{1:j-1}, V^n, Z_m^n)\}_{j=1}^n$ for any $k, m \in \{1,2\}$ associated with the second layer. To obtain $\{H(U_2^{(j)}|U_2^{1:j-1}, V^n)\}_{j=1}^n$, we can see $X$ and $V$ as the input and output random variables, respectively, of a symmetric channel with distribution $p_{V|X}$. Now, although $p_X$ is uniform and, consequently, $H(U_2^{(j)}|U_2^{1:j-1}) = 1$ for all $j \in [n]$, notice that $\mathcal{H}_{X|V}^{(n)} \neq [n]$ and $\mathcal{T}_2^{(n)} \neq \emptyset$ because $\mathcal{H}_{X|V}^{(n)}$ and its complementary set depend on $p_{X|V}$. On the other hand, to obtain $\{H(U_2^{(j)}|U_2^{1:j-1}, V^n, Y_k^n)\}_{j=1}^n$ or $\{H(U_2^{(j)}|U_2^{1:j-1}, V^n, Z_m^n)\}_{j=1}^n$, we can see $(V, Y_k)$ or $(V, Z_m)$ as the output of a symmetric channel with distribution $p_{V Y_k|X}$ or $p_{V Z_m|X}$, respectively, where notice that $p_{V Y_k|X} = p_{V|X}\, p_{Y_k|X}$ and $p_{V Z_m|X} = p_{V|X}\, p_{Z_m|X}$ because $V - X - Y_k - Z_m$ forms a Markov chain. Hence, due to the symmetry of the previous distributions, we can set $x^n = u_2^n = 0^n$ at each iteration. Then, for the realization $\tau \in [1, N_\tau]$, we draw $v^n(\tau)$, $y_k^n(\tau)$ and $z_m^n(\tau)$ from the distributions $p_{V^n|X^n}$, $p_{Y_k^n|X^n}$ and $p_{Z_m^n|X^n}$, respectively. Next, we obtain the log-likelihood ratios $\{L_{V|X}^{(\tau)}(j)\}_{j=1}^n$, $\{L_{V Y_k|X}^{(\tau)}(j)\}_{j=1}^n$ and $\{L_{V Z_m|X}^{(\tau)}(j)\}_{j=1}^n$ by using [22] (PCC-1). Since $H(U_2^{(j)}|U_2^{1:j-1}) = 1$ for all $j \in [n]$, we have $p_{U_2^{(j)}}(u) = \frac{1}{2}$ for all $u \in \{0,1\}$, and we can compute $p_{U_2^{(j)}|U_2^{1:j-1} V^n}(0|0^{j-1}, v^n(\tau))$, $p_{U_2^{(j)}|U_2^{1:j-1} V^n Y_k^n}(0|0^{j-1}, v^n(\tau), y_k^n(\tau))$ and $p_{U_2^{(j)}|U_2^{1:j-1} V^n Z_m^n}(0|0^{j-1}, v^n(\tau), z_m^n(\tau))$ from the corresponding log-likelihood ratios. Finally, after $N_\tau$ realizations, we can estimate the corresponding entropy terms by computing the empirical mean.
Partition of the universal set $[n]$. In order to provide more flexibility in the design, we now introduce $(\delta_n^{(1,r)}, \delta_n^{(1,s)})$ for the first layer, where $\delta_n^{(1,r)} \triangleq 2^{-n^{\beta^{(1,r)}}}$ and $\delta_n^{(1,s)} \triangleq 2^{-n^{\beta^{(1,s)}}}$ for some $\beta^{(1,r)}, \beta^{(1,s)} \in (0, \frac{1}{2})$. For the second layer, we introduce $(\delta_n^{(2,r)}, \delta_n^{(2,s)})$ and $(\delta_n^{(2,L)}, \delta_n^{(2,H)})$, where $\delta_n^{(2,r)} \triangleq 2^{-n^{\beta^{(2,r)}}}$, $\delta_n^{(2,s)} \triangleq 2^{-n^{\beta^{(2,s)}}}$, $\delta_n^{(2,L)} \triangleq 2^{-n^{\beta^{(2,L)}}}$ and $\delta_n^{(2,H)} \triangleq 2^{-n^{\beta^{(2,H)}}}$ for some $\beta^{(2,r)}, \beta^{(2,s)}, \beta^{(2,L)}, \beta^{(2,H)} \in (0, \frac{1}{2})$.
Consider the partition of $[n]$ for the first layer ($\ell = 1$ in Equations (32)–(35)). As mentioned previously, since $p_V(v) = \frac{1}{2}$, we have $\mathcal{H}_V^{(n)} = [n]$ and $\mathcal{T}_1^{(n)} = \emptyset$. Let $\bar{R}_1 \in [0, R_1]$ denote the target rate corresponding to the message $W_1$ that the polar coding scheme must approach. We obtain the partition in Equations (32)–(35) as follows. First, we define $\big(\mathcal{H}_{V|Y_1}^{(n)}\big)^C \triangleq \big\{j \in [n] : H(U_1^{(j)}|U_1^{1:j-1}, Y_1^n) \leq 1 - \delta_n^{(1,s)}\big\}$. Then, we choose $\mathcal{I}_1^{(n)}$ by taking the $n\bar{R}_1$ indices $j \in \big(\mathcal{H}_{V|Y_1}^{(n)}\big)^C$ that correspond to the highest entropy terms $\{H(U_1^{(j)}|U_1^{1:j-1}, Z_2^n)\}_{j=1}^n$ associated with Eavesdropper 2. Notice that $\delta_n^{(1,s)}$ must guarantee $\big|\big(\mathcal{H}_{V|Y_1}^{(n)}\big)^C\big| \geq n\bar{R}_1$. Finally, we obtain $\mathcal{C}_1^{(n)} = \big(\mathcal{H}_{V|Y_1}^{(n)}\big)^C \setminus \mathcal{I}_1^{(n)}$ and $\mathcal{F}_1^{(n)} = \mathcal{H}_{V|Y_1}^{(n)}$. Furthermore, in order to evaluate the reliability performance, we define $\mathcal{L}_{V|Y_1}^{(n)} \triangleq \big\{j \in [n] : H(U_1^{(j)}|U_1^{1:j-1}, Y_1^n) \leq \delta_n^{(1,r)}\big\}$.
Consider the partition of $[n]$ for the second layer ($\ell = 2$ in Equations (32)–(35)). Since $\mathcal{H}_{X|V}^{(n)} \neq [n]$ and $\mathcal{T}_2^{(n)} \neq \emptyset$, we define $\mathcal{H}_{X|V}^{(n)} \triangleq \big\{j \in [n] : H(U_2^{(j)}|U_2^{1:j-1}, V^n) \geq 1 - \delta_n^{(2,H)}\big\}$ and $\mathcal{L}_{X|V}^{(n)} \triangleq \big\{j \in [n] : H(U_2^{(j)}|U_2^{1:j-1}, V^n) \leq \delta_n^{(2,L)}\big\}$, where we have used $\delta_n^{(2,H)}$ and $\delta_n^{(2,L)}$, respectively. Let $\bar{R}_2 \in [0, R_2]$ denote the target rate corresponding to $W_2$. We define $\big(\mathcal{H}_{X|VY_2}^{(n)}\big)^C \triangleq \big\{j \in \mathcal{H}_{X|V}^{(n)} : H(U_2^{(j)}|U_2^{1:j-1}, V^n, Y_2^n) \leq 1 - \delta_n^{(2,s)}\big\}$. Then, we choose $\mathcal{I}_2^{(n)}$ by taking the $n\bar{R}_2$ indices $j \in \big(\mathcal{H}_{X|VY_2}^{(n)}\big)^C$ that correspond to the highest entropy terms $\{H(U_2^{(j)}|U_2^{1:j-1}, V^n, Z_2^n)\}_{j=1}^n$ associated with Eavesdropper 2. Thus, notice that $\delta_n^{(2,H)}$ and $\delta_n^{(2,s)}$ must guarantee $\big|\mathcal{H}_{X|V}^{(n)}\big| \geq \big|\big(\mathcal{H}_{X|VY_2}^{(n)}\big)^C\big| \geq n\bar{R}_2$. Then, we obtain $\mathcal{C}_2^{(n)} = \big(\mathcal{H}_{X|VY_2}^{(n)}\big)^C \setminus \mathcal{I}_2^{(n)}$ and $\mathcal{F}_2^{(n)} = \mathcal{H}_{X|VY_2}^{(n)}$. Finally, in order to evaluate the reliability performance, we define $\mathcal{L}_{X|VY_2}^{(n)} \triangleq \big\{j \in [n] : H(U_2^{(j)}|U_2^{1:j-1}, V^n, Y_2^n) \leq \delta_n^{(2,r)}\big\}$.
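As with the previous model, both partitions reduce to thresholding and sorting. A minimal sketch, assuming estimated entropy vectors H1_Y1 and H1_Z2 for the first layer and H2_V, H2_VY2 and H2_VZ2 for the second (all names are hypothetical):

```matlab
% Layer 1 partition (hypothetical sketch).
HC1 = find(H1_Y1 <= 1 - d1s);            % (H_{V|Y1}^(n))^C
[~, o1] = sort(H1_Z2(HC1), 'descend');
I1  = HC1(o1(1:round(n*R1bar)));         % best-masked entries against Eve 2
F1  = setdiff(1:n, HC1);                 % F_1^(n) = H_{V|Y1}^(n)
L1  = find(H1_Y1 <= d1r);                % L_{V|Y1}^(n)
% Layer 2 partition (hypothetical sketch).
HXV = find(H2_V >= 1 - d2H);             % H_{X|V}^(n)
LXV = find(H2_V <= d2L);                 % L_{X|V}^(n)
HC2 = HXV(H2_VY2(HXV) <= 1 - d2s);       % (H_{X|VY2}^(n))^C
[~, o2] = sort(H2_VZ2(HC2), 'descend');
I2  = HC2(o2(1:round(n*R2bar)));         % best-masked entries against Eve 2
F2  = setdiff(1:n, HC2);                 % F_2^(n) = H_{X|VY2}^(n)
L2  = find(H2_VY2 <= d2r);               % L_{X|VY2}^(n)
```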

6.2.2. Performance Evaluation

First, notice that the encoding at the first layer induces a distribution $\tilde{q}_{V^n} = p_{V^n}$. For the second layer, the entries $U_2\big[\mathcal{H}_{X|V}^{(n)}\big]$ of the original DMS are only almost independent of $V^n$, because $H(U_2^{(j)}|U_2^{1:j-1}, V^n) \geq 1 - \delta_n^{(2,H)}$ for $j \in \mathcal{H}_{X|V}^{(n)}$. Nevertheless, the encoding constructs $\tilde{U}_2\big[\mathcal{H}_{X|V}^{(n)}\big]$ by storing uniformly-distributed sequences that are totally independent of $V^n$. On the other hand, since $\mathcal{L}_{X|V}^{(n)} \subseteq \mathcal{T}_2^{(n)}$, the encoder uses the deterministic SC encoding in Equation (37) to construct $\tilde{U}_2\big[\mathcal{L}_{X|V}^{(n)}\big]$. Therefore, according to Lemma 3 and Remark 6, we will have $\mathcal{V}\big(\tilde{q}_{V^n X^n Y_2^n Y_1^n Z_2^n Z_1^n}, p_{V^n X^n Y_2^n Y_1^n Z_2^n Z_1^n}\big) > 0$ for finite $n$. Since, as seen in Section 5.5, this total variation distance impacts the performance, we first obtain an upper-bound $d_{\mathrm{TV}}^{\mathrm{ub}}$ on $\mathcal{V}\big(\tilde{q}_{V^n X^n Y_2^n Y_1^n Z_2^n Z_1^n}, p_{V^n X^n Y_2^n Y_1^n Z_2^n Z_1^n}\big)$, which is defined as:
$$d_{\mathrm{TV}}^{\mathrm{ub}} \triangleq d_{\mathrm{TV}}^{\mathrm{ub}}(L) + d_{\mathrm{TV}}^{\mathrm{ub}}(H),$$
where $d_{\mathrm{TV}}^{\mathrm{ub}}(L)$ measures the impact of using the deterministic SC encoding in Equation (37) for the entries $\tilde{U}_2\big[\mathcal{L}_{X|V}^{(n)}\big]$, and $d_{\mathrm{TV}}^{\mathrm{ub}}(H)$ is the contribution to the total variation distance of storing uniformly-distributed random sequences into $\tilde{U}_2\big[\mathcal{H}_{X|V}^{(n)}\big]$ that are totally independent of $V^n$.
Consider d TV ub ( L ) , which corresponds to the analytic bound found in Lemma A2. For the simulations, we can use the Monte Carlo method to directly estimate Equation (A4) by computing the empirical mean,
$$d_{\mathrm{TV}}^{\mathrm{ub}}(L) \triangleq \frac{1}{N_\tau} \sum_{\tau=1}^{N_\tau} \sum_{j \in \mathcal{L}_{X|V}^{(n)}} \Big(1 - p_{U_2^{(j)}|U_2^{1:j-1} V^n}\big(u_2^{(j)\star} \,\big|\, \check{u}_2^{1:j-1}(\tau), \check{v}^n(\tau)\big)\Big),$$
where $(\check{v}^n(\tau), \check{u}_2^n(\tau))$ must be drawn at each iteration $\tau \in [1, N_\tau]$ according to Equation (A2), $\mathcal{L}_{X|V}^{(n)}$ has been obtained previously in the polar code construction and, according to Equation (A4), $u_2^{(j)\star} \triangleq \arg\max_{u \in \{0,1\}} p_{U_2^{(j)}|U_2^{1:j-1} V^n}\big(u \,\big|\, \check{u}_2^{1:j-1}(\tau), \check{v}^n(\tau)\big)$. Due to the symmetry of $p_{V|X}$, the probabilities $p_{U_2^{(j)}|U_2^{1:j-1} V^n}$ can be obtained with low complexity by using the butterfly algorithm described in [7].
Consider now d TV ub ( H ) , which corresponds to the analytic bound found in Lemma A1. We can compute exactly the Kullback-Leibler divergence as in Equation (A3) by using the corresponding entropy terms obtained in the polar code construction. Thus, by applying Pinsker’s inequality, we have:
$$d_{\mathrm{TV}}^{\mathrm{ub}}(H) \triangleq \bigg(2 \ln 2 \sum_{j \in \mathcal{H}_{X|V}^{(n)}} \Big(1 - H\big(U_2^{(j)} \,\big|\, U_2^{1:j-1}, V^n\big)\Big)\bigg)^{1/2}.$$
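Given the estimated entropy terms, Equation (52) is a one-liner (hypothetical names, following the sketch above):

```matlab
dTV_H = sqrt(2*log(2) * sum(1 - H2_V(HXV)));   % Eq. (52), via Pinsker
```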
According to the polar code construction, $\big|\mathcal{L}_{X|V}^{(n)}\big|$ and $\big|\mathcal{H}_{X|V}^{(n)}\big|$ depend only on the values of $\delta_n^{(2,L)}$ and $\delta_n^{(2,H)}$, respectively, for a particular $n$. Hence, the value of $d_{\mathrm{TV}}^{\mathrm{ub}}$ can be controlled by adjusting $(\beta^{(2,L)}, \beta^{(2,H)})$. It is clear that higher values of $(\beta^{(2,L)}, \beta^{(2,H)})$ mean lower cardinalities of the sets $\mathcal{L}_{X|V}^{(n)}$ and $\mathcal{H}_{X|V}^{(n)}$ and, consequently, a lower $d_{\mathrm{TV}}^{\mathrm{ub}}$. However, $\big|\big(\mathcal{H}_{X|V}^{(n)}\big)^C \cap \big(\mathcal{L}_{X|V}^{(n)}\big)^C\big|$ increases with $(\beta^{(2,L)}, \beta^{(2,H)})$, and the encoder in Equation (36) requires more randomness to form $\tilde{U}_2\big[\big(\mathcal{H}_{X|V}^{(n)}\big)^C \cap \big(\mathcal{L}_{X|V}^{(n)}\big)^C\big]$.
To evaluate the reliability performance, we obtain the upper-bounds P b ub ( 1 ) and P b ub ( 2 ) on the average bit error probability at Receivers 1 and 2, respectively. From Equations (41) and (42) and by applying [27] (Proposition 2) to upper-bound the Bhattacharyya parameters from the entropy terms, we have:
$$P_b^{\mathrm{ub}}(1) \triangleq d_{\mathrm{TV}}^{\mathrm{ub}} + \frac{1}{\big|\mathcal{L}_{V|Y_1}^{(n)}\big|} \sum_{j \in \mathcal{L}_{V|Y_1}^{(n)}} \sqrt{H\big(U_1^{(j)} \,\big|\, U_1^{1:j-1}, Y_1^n\big)},$$
$$P_b^{\mathrm{ub}}(2) \triangleq 2 d_{\mathrm{TV}}^{\mathrm{ub}} + \frac{2}{\big|\mathcal{L}_{V|Y_1}^{(n)}\big|} \sum_{j \in \mathcal{L}_{V|Y_1}^{(n)}} \sqrt{H\big(U_1^{(j)} \,\big|\, U_1^{1:j-1}, Y_2^n\big)} + \frac{1}{\big|\mathcal{L}_{X|VY_2}^{(n)}\big|} \sum_{j \in \mathcal{L}_{X|VY_2}^{(n)}} \sqrt{H\big(U_2^{(j)} \,\big|\, U_2^{1:j-1}, V^n, Y_2^n\big)}.$$
To evaluate the secrecy performance, we compute an upper-bound $I^{\mathrm{ub}}(W_1, W_2; F_1, F_2, \tilde{Z}_2^n)$ on the information leakage $I(W_1, W_2; F_1, F_2, \tilde{Z}_2^n)$ for Eavesdropper 2. From Equation (45), we obtain:
$$I^{\mathrm{ub}}(W_1, W_2; F_1, F_2, \tilde{Z}_2^n) \triangleq 4 n d_{\mathrm{TV}}^{\mathrm{ub}} - 2 d_{\mathrm{TV}}^{\mathrm{ub}} \log d_{\mathrm{TV}}^{\mathrm{ub}} + \sum_{\ell=1}^{2} \big|\mathcal{I}_\ell^{(n)} \cup \mathcal{F}_\ell^{(n)}\big| - \sum_{j \in \mathcal{I}_1^{(n)} \cup \mathcal{F}_1^{(n)}} H\big(U_1^{(j)} \,\big|\, U_1^{1:j-1}, Z_2^n\big) - \sum_{j \in \mathcal{I}_2^{(n)} \cup \mathcal{F}_2^{(n)}} H\big(U_2^{(j)} \,\big|\, U_2^{1:j-1}, V^n, Z_2^n\big).$$
Due to the degradedness condition of the BS-BC and, consequently, by Lemma 1, the information leakage at Eavesdropper 1 will always be less than the one at Eavesdropper 2.
Finally, we evaluate the overall rate of the additional sequences { Φ 1 , Φ 2 } by computing:
$$\frac{1}{n}\big(|\Phi_1| + |\Phi_2|\big) = \frac{1}{n}\Big(\Big|\big(\mathcal{H}_{V|Y_1}^{(n)}\big)^C \cap \big(\mathcal{L}_{V|Y_1}^{(n)}\big)^C\Big| + \Big|\big(\mathcal{H}_{X|VY_2}^{(n)}\big)^C \cap \big(\mathcal{L}_{X|VY_2}^{(n)}\big)^C\Big|\Big).$$
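Putting the pieces together, Equations (53)–(56) can be evaluated as follows (a hypothetical sketch; H1_Y2 denotes the first-layer entropies given $Y_2^n$, and dTV_L and dTV_H are the estimates of Equations (51) and (52)):

```matlab
dTV    = dTV_L + dTV_H;                                            % d_TV^ub
Pb1_ub = dTV + mean(sqrt(H1_Y1(L1)));                              % Eq. (53)
Pb2_ub = 2*dTV + 2*mean(sqrt(H1_Y2(L1))) + mean(sqrt(H2_VY2(L2))); % Eq. (54)
I_ub   = 4*n*dTV - 2*dTV*log2(dTV) + numel([I1, F1, I2, F2]) ...
       - sum(H1_Z2([I1, F1])) - sum(H2_VZ2([I2, F2]));             % Eq. (55)
ratePhi = (numel(setdiff(HC1, L1)) + numel(setdiff(HC2, L2)))/n;   % Eq. (56)
```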
The performance of the polar coding scheme is graphically shown in Figure 10. As for the previous model, let $\rho_R$ be the normalized target rate at which the polar coding scheme operates, that is, $\rho_R \triangleq \bar{R}_1/R_1 = \bar{R}_2/R_2$. In Figure 10A, we evaluate the upper-bound $I_0^{\mathrm{ub}}(W_1, W_2; F_1, F_2, \tilde{Z}_2^n)$, which corresponds to the upper-bound on the information leakage defined in Equation (55) when we consider $d_{\mathrm{TV}}^{\mathrm{ub}} = 0$, as a function of the blocklength $n$ for different values of $\rho_R$. For this plot, we set $\beta^{(1,s)} = 0.30$ and $\beta^{(2,s)} = 0.36$. Notice that, once we set $d_{\mathrm{TV}}^{\mathrm{ub}} = 0$, neither $(\beta^{(1,r)}, \beta^{(2,r)})$ nor $(\beta^{(2,L)}, \beta^{(2,H)})$ impacts the information leakage. As we proved in Section 5.5.4, the secrecy performance improves as $n$ increases. Moreover, to satisfy a particular secrecy performance level, the polar code needs higher values of $n$ as the target rates approach the capacity.
In Figure 10B, we evaluate the upper-bounds P b , 0 ub ( 1 ) and P b , 0 ub ( 2 ) , which correspond to the bounds on the average bit error probability at the legitimate Receivers 1 and 2, respectively, when we set d TV ub = 0 , as a function of the blocklength n. For this plot, we set β ( 1 , r ) = β ( 2 , r ) = 0.24 and notice that the reliability performance will not depend on the values of ( β ( 1 , s ) , β ( 2 , s ) ) and ρ R . If we set d TV ub = 0 , then it is clear that it will not depend on ( β ( 2 , L ) , β ( 2 , H ) ) either. As shown theoretically in Section 5.5.3, the error probability becomes lower as the blocklength n increases.
Figure 10C plots the overall rate of the additional secret sequences computed as in Equation (56) when we set β ( 1 , r ) = β ( 2 , r ) = 0.24 , β ( 1 , s ) = 0.30 and β ( 2 , s ) = 0.36 . As mentioned in Section 5.2, we can see that this rate tends to be negligible for n sufficiently large.
Finally, Figure 10D plots the upper-bounds $d_{\mathrm{TV}}^{\mathrm{ub}}(L)$ and $d_{\mathrm{TV}}^{\mathrm{ub}}(H)$ defined in Equations (51) and (52), respectively, when we set $\beta^{(2,L)} = \beta^{(2,H)} = 0.36$. As we proved theoretically in Lemma 3, the total variation distance decays with the blocklength $n$. Notice that $d_{\mathrm{TV}}^{\mathrm{ub}}(L)$ is lower than $d_{\mathrm{TV}}^{\mathrm{ub}}(H)$, and, therefore, the bound on the total variation distance is practically governed by $d_{\mathrm{TV}}^{\mathrm{ub}}(H)$ ($d_{\mathrm{TV}}^{\mathrm{ub}} \approx d_{\mathrm{TV}}^{\mathrm{ub}}(H)$). This happens because, although we can compute exactly the Kullback–Leibler divergence as in Equation (A3) from the entropy terms estimated in the polar code construction, Pinsker's inequality, which is used to obtain $d_{\mathrm{TV}}^{\mathrm{ub}}(H)$ as in Equation (52), can be too loose for $n$ not sufficiently large. Consider the impact of $d_{\mathrm{TV}}^{\mathrm{ub}}$ on the reliability performance of the code. The average error probability bounds in Equations (53) and (54) are modeled as the sum of two terms, one depending directly on $d_{\mathrm{TV}}^{\mathrm{ub}}$ and the other depending on the polar construction (which has been plotted in Figure 10B). Since $d_{\mathrm{TV}}^{\mathrm{ub}}(H)$ is too loose, the reliability performance of the code is governed practically by the bound $d_{\mathrm{TV}}^{\mathrm{ub}}$ for small values of the blocklength $n$. Now, consider the impact of $d_{\mathrm{TV}}^{\mathrm{ub}}$ on the secrecy performance of the code. The bound on the information leakage in Equation (55) is also modeled as the sum of two terms, one depending only on the polar code construction (which has been plotted in Figure 10A) and the other depending on $d_{\mathrm{TV}}^{\mathrm{ub}}$. However, in this situation, $d_{\mathrm{TV}}^{\mathrm{ub}}$ impacts the information leakage approximately as $n \cdot d_{\mathrm{TV}}^{\mathrm{ub}}$, which means that this term totally governs the secrecy performance. Recall that this term follows from Equation (44), which bounds the impact of the encoding in Equation (36) on the conditional entropy term of the information leakage as a function of the total variation distance. Hence, we can conclude that this bound, which follows from applying [30] (Lemma 2.9), can be too loose for $n$ not sufficiently large.

7. Conclusions

We have described two polar coding schemes for two different models over the degraded broadcast channel: the DBC-NLD-LS and the DBC-LD-NLS. For both models, we have proven that the proposed polar coding schemes are asymptotically secrecy-capacity achieving, providing reliability and strong secrecy simultaneously. Then, we have discussed how to construct these polar codes in practice, and we have evaluated their performance for finite blocklengths by means of simulations. Although several polar code construction methods have been proposed in the literature, this paper, as far as we know, is the first to discuss practical constructions when the polar code must satisfy both reliability and secrecy constraints. In addition, we have evaluated the secrecy performance of the polar code in terms of strong secrecy, which has been possible by obtaining an upper-bound on the corresponding information leakage at the eavesdroppers. Indeed, we have shown that the proposed polar coding schemes can perform well in practice for a finite blocklength.
The criteria we have chosen for designing the polar codes are: to provide reliability and strong secrecy in one block of size n by using only a secret key that is negligible in terms of rate and to minimize the amount of random decisions for the SC encoding. For the first purpose, we have introduced the source of common randomness, and we have avoided the use of the chaining construction given in [9] (which is possible due to the degraded nature of the broadcast channel); for the second one, we have adapted the deterministic SC encoding given in [20]. These two types of randomness have different implications on the practical design: while the common randomness is uniformly distributed and can be provided by the communication system, the randomness for SC encoding is not and must be drawn by the encoder. In communication scenarios requiring several transmissions of size n, we have shown that one realization of the common randomness can be reused without worsening the performance.
Despite the good performance of the polar coding schemes, some issues still persist. How to avoid the transmission of the additional secret sequences is a problem that remains open. Although the length of the required secret key is asymptotically negligible in terms of rate, these additional transmissions can be problematic in practical scenarios. As pointed out in Remark 4, one can adopt the chaining construction in [9] to further reduce the length of these sequences, but this requires the transmission to take place over several blocks of size n and a very large memory capacity at the transmitter or the receiver side. Furthermore, although the amount of randomness required for SC encoding is negligible in terms of rate, how to replace the random decisions entirely by deterministic ones is a problem that remains unsolved. Another open problem is how to avoid the use of the common randomness, which would allow keyless secret communication over a single block of size n (keyless in the sense that the rate of the required secret key is negligible). Finally, to design polar codes based on the proposed performance evaluation, it seems necessary to find tighter upper-bounds on the total variation distance between the distribution induced by the encoder and the original distribution used in the code construction, particularly for the term that models the impact of storing uniformly-distributed sequences. Also, for the secrecy performance, it would be interesting to find a tighter upper-bound to evaluate the impact of the total variation distance on the information leakage.
Lastly, it is worth mentioning that having to know the statistics of the eavesdropper channels for the polar code construction may seem problematic. Nevertheless, for the polar code construction, one can consider virtual eavesdroppers with some target channel qualities. For DBC-LD-NLS, we can design a polar code according to the statistics of this virtual eavesdropper, and due to the degradedness condition of the channel, this code will perform well if the real eavesdroppers have worse channel quality (worst-case design). On the other hand, for the DBC-NLD-LS, one can simply consider different levels of secrecy depending on different target channel qualities. Depending on the channel quality of the real eavesdropper with respect to the virtual ones considered for the design, the polar coding scheme will provide a particular secrecy performance level.

Supplementary Materials

The MATLAB code used in this paper for Section 6 is available at https://0-www-mdpi-com.brum.beds.ac.uk/1099-4300/20/6/467/s1.

Author Contributions

Conceptualization, J.d.O.A. and J.R.F. Formal analysis, J.d.O.A. Funding acquisition, J.R.F. Investigation, J.d.O.A. and J.R.F. Methodology, J.d.O.A. and J.R.F. Software, J.d.O.A. Supervision, J.R.F. Validation, J.R.F. Writing, original draft, J.d.O.A.

Funding

This work is supported by the “Ministerio de Ciencia, Innovación y Universidades” and the “Agencia Estatal de Investigación” of the Spanish Government, ERDF funds (TEC2013-41315-R, TEC2015-69648-REDC, TEC2016-75067-C4-2-R) and the Catalan Government (2017 SGR 578 AGAUR).

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
DBC: Degraded Broadcast Channel
DBC-NLD-LS: Degraded Broadcast Channel with Non-Layered Decoding and Layered Secrecy
DBC-LD-NLS: Degraded Broadcast Channel with Layered Decoding and Non-Layered Secrecy
SC: Successive Cancellation
DMS: Discrete Memoryless Source
BEC: Binary Erasure Channel
BSC: Binary Symmetric Channel
BE-BC: Binary Erasure Broadcast Channel
BS-BC: Binary Symmetric Broadcast Channel

Appendix A. Proof of Lemmas 2 and 3

Consider a DMS $\big(\mathcal{V}_1 \times \cdots \times \mathcal{V}_L \times \mathcal{Y}_K \times \cdots \times \mathcal{Y}_1 \times \mathcal{Z}_M \times \cdots \times \mathcal{Z}_1,\ p_{V_1 \cdots V_L Y_K \cdots Y_1 Z_M \cdots Z_1}\big)$, the joint distribution of which satisfies the Markov chain condition $V_1 - \cdots - V_L - Y_K - \cdots - Y_1 - Z_M - \cdots - Z_1$. Consider an i.i.d. $n$-sequence $(V_1^n, \ldots, V_L^n, Y_K^n, \ldots, Y_1^n, Z_M^n, \ldots, Z_1^n)$ of this DMS, $n$ being any power of two. We define the polar transforms $(U_1^n, \ldots, U_L^n)$, where $U_\ell^n \triangleq V_\ell^n G_n$ for each $\ell \in [1, L]$, with joint distribution $p_{U_1^n \cdots U_L^n}$. Then, define $\mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)}$ and $\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}$ as in Equations (27) and (28), where $V_0 = U_0 \triangleq \emptyset$. Let $V_L \triangleq X$; if $L = 1$, notice that this DMS is the one considered for the code construction of the DBC-NLD-LS. Otherwise, if $L = K$, it is the one considered for the DBC-LD-NLS.
Now, consider the polar encoding procedures described for both models in Section 4.2 and Section 5.2. Let q ˜ U 1 n U L n be the joint distribution of ( U ˜ 1 n , , U ˜ L n ) after the encoding. For both models, we have:
$$\tilde{q}_{U_1^n \cdots U_L^n}\big(\tilde{u}_1^n, \ldots, \tilde{u}_L^n\big) = \prod_{\ell=1}^{L} \prod_{j=1}^{n} \tilde{q}_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}\big(\tilde{u}_\ell^{(j)} \,\big|\, \tilde{u}_\ell^{1:j-1}, \tilde{u}_{\ell-1}^n G_n\big),$$
where, for all $\ell \in [1, L]$,
$$\tilde{q}_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}\big(\tilde{u}_\ell^{(j)} \,\big|\, \tilde{u}_\ell^{1:j-1}, \tilde{v}_{\ell-1}^n\big) = \begin{cases} \frac{1}{2} & \text{if } j \in \mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)}, \\ p_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}\big(\tilde{u}_\ell^{(j)} \,\big|\, \tilde{u}_\ell^{1:j-1}, \tilde{v}_{\ell-1}^n\big) & \text{if } j \in \big(\mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)}\big)^C \cap \big(\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}\big)^C, \\ \mathbb{1}\big\{\tilde{u}_\ell^{(j)} = \xi_\ell^{(j)}\big(\tilde{u}_\ell^{1:j-1}, \tilde{v}_{\ell-1}^n\big)\big\} & \text{if } j \in \mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}, \end{cases}$$
$p_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}$ being the distribution induced by the original DMS and $\xi_\ell^{(j)}$ being the deterministic arg max function given in Equation (18) for the DBC-NLD-LS or in Equation (37) for the DBC-LD-NLS.
Additionally, consider another encoding process that constructs $(\check{U}_1^n, \ldots, \check{U}_L^n)$ by omitting the use of the deterministic arg max function and, instead, samples $\check{U}_\ell^{(j)}$ from the distribution:
$$\check{q}_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}\big(\check{u}_\ell^{(j)} \,\big|\, \check{u}_\ell^{1:j-1}, \check{v}_{\ell-1}^n\big) = \begin{cases} \frac{1}{2} & \text{if } j \in \mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)}, \\ p_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}\big(\check{u}_\ell^{(j)} \,\big|\, \check{u}_\ell^{1:j-1}, \check{v}_{\ell-1}^n\big) & \text{if } j \in \big(\mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)}\big)^C. \end{cases}$$
First, the following lemma shows that the joint distributions p U 1 n U L n and q ˇ U 1 n U L n are nearly statistically indistinguishable for sufficiently large n.
Lemma A1.
Let $\delta_n = 2^{-n^\beta}$ for some $\beta \in (0, \frac{1}{2})$, and define $\delta_n^{(1)} \triangleq \sqrt{2 n \delta_n \ln 2}$. Then,
$$\mathcal{V}\big(\check{q}_{U_1^n \cdots U_L^n},\ p_{U_1^n \cdots U_L^n}\big) \leq L\, \delta_n^{(1)}.$$
Proof. 
The Kullback-Leibler distance between p U 1 n U L n and q ˇ U 1 n U L n is:
$$D\big(p_{U_1^n \cdots U_L^n} \,\big\|\, \check{q}_{U_1^n \cdots U_L^n}\big) \overset{(a)}{=} \sum_{\ell=1}^{L} \sum_{j=1}^{n} \mathbb{E}_{p_{U_\ell^{1:j-1} V_{\ell-1}^n}}\Big[D\big(p_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n} \,\big\|\, \check{q}_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}\big)\Big] \overset{(b)}{=} \sum_{\ell=1}^{L} \sum_{j \in \mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)}} \Big(1 - H\big(U_\ell^{(j)} \,\big|\, U_\ell^{1:j-1}, V_{\ell-1}^n\big)\Big) \overset{(c)}{\leq} L\, \delta_n\, \big|\mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)}\big|,$$
where $(a)$ holds by the chain rule, the invertibility of $G_n$ and the fact that $U_1^n - U_2^n - \cdots - U_L^n$ (and $\check{U}_1^n - \check{U}_2^n - \cdots - \check{U}_L^n$) forms a Markov chain, $(b)$ follows from Equation (A2) and by applying [14] (Lemma 10), and $(c)$ holds by the definition of $\mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)}$ in Equation (27). Finally, since $\big|\mathcal{H}_{V_\ell|V_{\ell-1}}^{(n)}\big| \leq n$ and by using Pinsker's inequality, we obtain $\mathcal{V}\big(\check{q}_{U_1^n \cdots U_L^n},\ p_{U_1^n \cdots U_L^n}\big) \leq \sqrt{2 L n \delta_n \ln 2}$, which is at most $L\, \delta_n^{(1)}$ since $L \geq 1$. □
Now, we show that q ˇ U 1 n U L n and q ˜ U 1 n U L n are nearly indistinguishable for n large enough.
Lemma A2.
Let δ n = 2 n β for some β ( 0 , 1 2 ) . Then,
$$\mathcal{V}\big(\tilde{q}_{U_1^n \cdots U_L^n},\ \check{q}_{U_1^n \cdots U_L^n}\big) \leq \delta_n^{(2)},$$
where $\delta_n^{(2)} \triangleq L n \sqrt{2 \cdot 2\delta_n^{(1)}\big(2n - \log 2\delta_n^{(1)}\big) + \delta_n}$ and $\delta_n^{(1)}$ is defined as in Lemma A1.
Proof. 
The proof follows similar reasoning as the one for [20] (Lemma 2). Hence, define a coupling [29] for $(\check{U}_1^n, \ldots, \check{U}_L^n)$ and $(\tilde{U}_1^n, \ldots, \tilde{U}_L^n)$ such that $\check{U}_\ell\big[\big(\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}\big)^C\big] = \tilde{U}_\ell\big[\big(\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}\big)^C\big]$ for all $\ell \in [1, L]$. Thus, we have:
$$\mathcal{V}\big(\tilde{q}_{U_1^n \cdots U_L^n},\ \check{q}_{U_1^n \cdots U_L^n}\big) \overset{(a)}{\leq} \mathbb{P}\big[(\tilde{U}_1^n, \ldots, \tilde{U}_L^n) \neq (\check{U}_1^n, \ldots, \check{U}_L^n)\big] \overset{(b)}{\leq} \sum_{\ell=1}^{L} \mathbb{P}\big[\tilde{U}_\ell^n \neq \check{U}_\ell^n \,\big|\, \tilde{V}_{\ell-1}^n = \check{V}_{\ell-1}^n\big] \overset{(c)}{\leq} \sum_{\ell=1}^{L} \sum_{j=1}^{n} \mathbb{P}\big[\tilde{U}_\ell^{(j)} \neq \check{U}_\ell^{(j)} \,\big|\, \tilde{U}_\ell^{1:j-1} = \check{U}_\ell^{1:j-1}, \tilde{V}_{\ell-1}^n = \check{V}_{\ell-1}^n\big] \overset{(d)}{=} \sum_{\ell=1}^{L} \sum_{j \in \mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}} \mathbb{E}_{\check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n}\Big[1 - p_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}\big(u_\ell^{(j)\star} \,\big|\, \check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n\big)\Big],$$
where $(a)$ follows from the coupling lemma [29] (Proposition 4.7), $(b)$ holds by the union bound, the invertibility of $G_n$ and the fact that $\tilde{U}_1^n - \tilde{U}_2^n - \cdots - \tilde{U}_L^n$ (and $\check{U}_1^n - \check{U}_2^n - \cdots - \check{U}_L^n$) forms a Markov chain, $(c)$ also holds by the union bound, and $(d)$ follows from Equations (A1) and (A2), given that $\check{U}_\ell\big[\big(\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}\big)^C\big] = \tilde{U}_\ell\big[\big(\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}\big)^C\big]$, and from defining $u_\ell^{(j)\star} \triangleq \arg\max_{u \in \{0,1\}} p_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}\big(u \,\big|\, \check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n\big)$.
Next, for any $\ell \in [1, L]$ and $j \in [n]$, for sufficiently large $n$, we have:
$$\Big|H\big(U_\ell^{(j)} \,\big|\, U_\ell^{1:j-1}, V_{\ell-1}^n\big) - H\big(U_\ell^{(j)} \,\big|\, \check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n\big)\Big| \overset{(a)}{\leq} \Big|H\big(U_\ell^{1:j-1}, V_{\ell-1}^n\big) - H\big(\check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n\big)\Big| + \Big|H\big(U_\ell^{1:j}, V_{\ell-1}^n\big) - H\big(U_\ell^{(j)}, \check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n\big)\Big| \overset{(b)}{\leq} 2\, \mathcal{V}\big(\check{q}_{U_\ell^{1:j-1} U_{\ell-1}^n},\ p_{U_\ell^{1:j-1} U_{\ell-1}^n}\big) \log \frac{2^{2n}}{\mathcal{V}\big(\check{q}_{U_\ell^{1:j-1} U_{\ell-1}^n},\ p_{U_\ell^{1:j-1} U_{\ell-1}^n}\big)} \overset{(c)}{\leq} 2 \cdot 2\delta_n^{(1)}\big(2n - \log 2\delta_n^{(1)}\big),$$
where $(a)$ holds by the chain rule of entropy and the triangle inequality, $(b)$ follows from applying [30] (Lemma 2.9), the invertibility of $G_n$ and because $\mathcal{V}\big(p_{U_\ell^{(j)}|U_\ell^{1:j-1} U_{\ell-1}^n}\, \check{q}_{U_\ell^{1:j-1} U_{\ell-1}^n},\ p_{U_\ell^{1:j} U_{\ell-1}^n}\big) = \mathcal{V}\big(\check{q}_{U_\ell^{1:j-1} U_{\ell-1}^n},\ p_{U_\ell^{1:j-1} U_{\ell-1}^n}\big)$, and $(c)$ holds because $\mathcal{V}\big(\check{q}_{U_\ell^{1:j-1} U_{\ell-1}^n},\ p_{U_\ell^{1:j-1} U_{\ell-1}^n}\big) \leq \mathcal{V}\big(\check{q}_{U_{\ell-1}^n U_\ell^n},\ p_{U_{\ell-1}^n U_\ell^n}\big) \leq 2\delta_n^{(1)}$ (by using Lemma A1 and taking $L = 2$) and because the function $x \mapsto x \log x$ is monotonically decreasing for $x > 0$ small enough.
Thus, for any $\ell \in [1, L]$ and $j \in \mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}$, we have:
$$\begin{aligned} 2 \cdot 2\delta_n^{(1)}\big(2n - \log 2\delta_n^{(1)}\big) + \delta_n &\overset{(a)}{\geq} 2 \cdot 2\delta_n^{(1)}\big(2n - \log 2\delta_n^{(1)}\big) + H\big(U_\ell^{(j)} \,\big|\, U_\ell^{1:j-1}, V_{\ell-1}^n\big) \\ &\overset{(b)}{\geq} H\big(U_\ell^{(j)} \,\big|\, \check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n\big) \\ &= \mathbb{E}_{\check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n}\Big[h_2\Big(p_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}\big(u_\ell^{(j)\star} \,\big|\, \check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n\big)\Big)\Big] \\ &\geq \mathbb{E}_{\check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n}\bigg[\Big(1 - p_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}\big(u_\ell^{(j)\star} \,\big|\, \check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n\big)\Big) \log \frac{1}{1 - p_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}\big(u_\ell^{(j)\star} \,\big|\, \check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n\big)}\bigg] \\ &\overset{(c)}{\geq} \mathbb{E}_{\check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n}\Big[\Big(1 - p_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}\big(u_\ell^{(j)\star} \,\big|\, \check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n\big)\Big)^2\Big] \\ &\overset{(d)}{\geq} \Big(\mathbb{E}_{\check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n}\Big[1 - p_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}\big(u_\ell^{(j)\star} \,\big|\, \check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n\big)\Big]\Big)^2, \end{aligned}$$
where $(a)$ holds because, by definition, $H\big(U_\ell^{(j)} \,\big|\, U_\ell^{1:j-1}, V_{\ell-1}^n\big) \leq \delta_n$ if $j \in \mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}$, $(b)$ holds by Equation (A5), $(c)$ holds because $p_{U_\ell^{(j)}|U_\ell^{1:j-1} V_{\ell-1}^n}\big(u_\ell^{(j)\star} \,\big|\, \check{U}_\ell^{1:j-1}, \check{V}_{\ell-1}^n\big) \geq 1/2$ and $-\log(x) \geq x$ if $x \in [0, 1/2)$, and $(d)$ follows from Jensen's inequality.
Finally, by combining Equations (A4) and (A6) and because $\big|\mathcal{L}_{V_\ell|V_{\ell-1}}^{(n)}\big| \leq n$, we have $\mathcal{V}\big(\tilde{q}_{U_1^n \cdots U_L^n},\ \check{q}_{U_1^n \cdots U_L^n}\big) \leq L n \sqrt{2 \cdot 2\delta_n^{(1)}\big(2n - \log 2\delta_n^{(1)}\big) + \delta_n}$. □
Hence, by Lemma A1, Lemma A2 and by applying the triangle inequality, we obtain:
$$\mathcal{V}\big(\tilde{q}_{U_1^n \cdots U_L^n},\ p_{U_1^n \cdots U_L^n}\big) \leq \mathcal{V}\big(\tilde{q}_{U_1^n \cdots U_L^n},\ \check{q}_{U_1^n \cdots U_L^n}\big) + \mathcal{V}\big(\check{q}_{U_1^n \cdots U_L^n},\ p_{U_1^n \cdots U_L^n}\big) \leq L n \sqrt{2 \cdot 2\delta_n^{(1)}\big(2n - \log 2\delta_n^{(1)}\big) + \delta_n} + L\, \delta_n^{(1)}.$$
Consequently, since $\tilde{q}_{Y_K^n \cdots Y_1^n Z_M^n \cdots Z_1^n | V_1^n \cdots V_L^n} = p_{Y_K^n \cdots Y_1^n Z_M^n \cdots Z_1^n | V_1^n \cdots V_L^n}$ and by the invertibility of $G_n$, we obtain $\mathcal{V}\big(\tilde{q}_{V_1^n \cdots V_L^n Y_K^n \cdots Y_1^n Z_M^n \cdots Z_1^n},\ p_{V_1^n \cdots V_L^n Y_K^n \cdots Y_1^n Z_M^n \cdots Z_1^n}\big) = \mathcal{V}\big(\tilde{q}_{U_1^n \cdots U_L^n},\ p_{U_1^n \cdots U_L^n}\big)$, and this concludes the proof.

References and Notes

  1. Wyner, A. The wire-tap channel. Bell Syst. Tech. J. 1975, 54, 1355–1387.
  2. Csiszár, I.; Körner, J. Broadcast channels with confidential messages. IEEE Trans. Inf. Theory 1978, 24, 339–348.
  3. Maurer, U.; Wolf, S. Information-theoretic key agreement: From weak to strong secrecy for free. In Advances in Cryptology—EUROCRYPT 2000; Springer: Berlin/Heidelberg, Germany, 2000; pp. 351–368.
  4. Zou, S.; Liang, Y.; Lai, L.; Poor, H.V.; Shamai, S. Broadcast networks with layered decoding and layered secrecy: Theory and applications. Proc. IEEE 2015, 103, 1841–1856.
  5. Liang, Y.; Lai, L.; Poor, H.V.; Shamai, S. A broadcast approach for fading wiretap channels. IEEE Trans. Inf. Theory 2014, 60, 842–858.
  6. Ekrem, E.; Ulukus, S. Secrecy capacity of a class of broadcast channels with an eavesdropper. EURASIP J. Wirel. Commun. Netw. 2009, 2009.
  7. Arikan, E. Channel polarization: A method for constructing capacity-achieving codes for symmetric binary-input memoryless channels. IEEE Trans. Inf. Theory 2009, 55, 3051–3073.
  8. Mahdavifar, H.; Vardy, A. Achieving the secrecy capacity of wiretap channels using polar codes. IEEE Trans. Inf. Theory 2011, 57, 6428–6443.
  9. Şaşoğlu, E.; Vardy, A. A new polar coding scheme for strong security on wiretap channels. In Proceedings of the IEEE International Symposium on Information Theory (ISIT), Istanbul, Turkey, 7–12 July 2013; pp. 1117–1121.
  10. Renes, J.M.; Renner, R.; Sutter, D. Efficient one-way secret key agreement and private channel coding via polarization. In Advances in Cryptology—ASIACRYPT 2013; Springer: Berlin/Heidelberg, Germany, 2013; pp. 194–213.
  11. Wei, Y.; Ulukus, S. Polar coding for the general wiretap channel with extensions to multiuser scenarios. IEEE J. Sel. Areas Commun. 2016, 34, 278–291.
  12. Cihad Gulcu, T.; Barg, A. Achieving secrecy capacity of the wiretap channel and broadcast channel with a confidential component. arXiv 2014, arXiv:1410.3422.
  13. Chou, R.A.; Bloch, M.R. Polar coding for the broadcast channel with confidential messages: A random binning analogy. IEEE Trans. Inf. Theory 2016, 62, 2410–2429.
  14. Goela, N.; Abbe, E.; Gastpar, M. Polar codes for broadcast channels. IEEE Trans. Inf. Theory 2015, 61, 758–782.
  15. Chou, R.A.; Bloch, M.R.; Abbe, E. Polar coding for secret-key generation. IEEE Trans. Inf. Theory 2015, 61, 6213–6237.
  16. Wang, L.; Şaşoğlu, E. Polar coding for interference networks. In Proceedings of the 2014 IEEE International Symposium on Information Theory, Honolulu, HI, USA, 29 June–4 July 2014; pp. 311–315.
  17. Chou, R.A.; Yener, A. Polar coding for the multiple access wiretap channel via rate-splitting and cooperative jamming. In Proceedings of the 2016 IEEE International Symposium on Information Theory (ISIT), Barcelona, Spain, 10–15 July 2016; pp. 983–987.
  18. Hirche, C.; Morgan, C.; Wilde, M.M. Polar codes in network quantum information theory. IEEE Trans. Inf. Theory 2016, 62, 915–924.
  19. Bloch, M.; Barros, J. Physical-Layer Security: From Information Theory to Security Engineering; Cambridge University Press: Cambridge, UK, 2011.
  20. Chou, R.A.; Bloch, M.R. Using deterministic decisions for low-entropy bits in the encoding and decoding of polar codes. In Proceedings of the 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, IL, USA, 29 September–2 October 2015; pp. 1380–1385.
  21. Tal, I.; Vardy, A. How to construct polar codes. IEEE Trans. Inf. Theory 2013, 59, 6562–6582.
  22. Vangala, H.; Viterbo, E.; Hong, Y. A comparative study of polar code constructions for the AWGN channel. arXiv 2015, arXiv:1501.02473.
  23. Honda, J.; Yamamoto, H. Polar coding without alphabet extension for asymmetric models. IEEE Trans. Inf. Theory 2013, 59, 7829–7838.
  24. Throughout this paper, we assume binary polarization. An extension to q-ary alphabets is possible [25,26].
  25. Karzand, M.; Telatar, E. Polar codes for q-ary source coding. In Proceedings of the 2010 IEEE International Symposium on Information Theory, Austin, TX, USA, 12–18 June 2010; pp. 909–912.
  26. Şaşoğlu, E.; Telatar, E.; Arikan, E. Polarization for arbitrary discrete memoryless channels. In Proceedings of the IEEE Information Theory Workshop, Sicily, Italy, 11–16 October 2009; pp. 144–148.
  27. Arikan, E. Source polarization. In Proceedings of the 2010 IEEE International Symposium on Information Theory, Austin, TX, USA, 12–18 June 2010; pp. 899–903.
  28. Korada, S.B.; Urbanke, R.L. Polar codes are optimal for lossy source coding. IEEE Trans. Inf. Theory 2010, 56, 1751–1768.
  29. Levin, D.A.; Peres, Y.; Wilmer, E.L. Markov Chains and Mixing Times; American Mathematical Society: Providence, RI, USA, 2009.
  30. Csiszár, I.; Körner, J. Information Theory: Coding Theorems for Discrete Memoryless Systems; Cambridge University Press: Cambridge, UK, 2011.
  31. Pearl, J. Causality; Cambridge University Press: Cambridge, UK, 2009.
  32. Most of the code in MATLAB is adapted from https://ecse.monash.edu/staff/eviterbo/polarcodes.html.
  33. Cover, T.M.; Thomas, J.A. Elements of Information Theory; John Wiley & Sons: Hoboken, NJ, USA, 2012.
Figure 1. DBC with Non-Layered Decoding and Layered Secrecy (DBC-NLD-LS).
Figure 2. DBC with Layered Decoding and Non-Layered Secrecy (DBC-LD-NLS).
Figure 3. Polar code construction for the DBC-NLD-LS. The hatched area represents those indices $j \in (\mathcal{H}_{X|Y_1}^{(n)})^C \cap (\mathcal{L}_{X|Y_1}^{(n)})^C$, which can belong to the sets $\mathcal{I}_m^{(n)}$ ($m \in [1, M]$), $\mathcal{C}^{(n)}$, $\mathcal{F}^{(n)}$ or $\mathcal{T}^{(n)}$.
Figure 4. Bayesian graph plotting the dependencies between the random variables of different blocks that are involved in the secrecy analysis when we consider a transmission over several blocks of size n.
Figure 5. Polar code construction for the DBC-LD-NLS at the $\ell$-th layer. The hatched area represents those indices $j \in (\mathcal{H}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)})^C \cap (\mathcal{L}_{V_\ell|V_{\ell-1}Y_\ell}^{(n)})^C$, which can belong to the sets $\mathcal{I}_\ell^{(n)}$, $\mathcal{C}_\ell^{(n)}$ or $\mathcal{T}_\ell^{(n)}$.
Figure 6. Secrecy performance of the polar coding scheme for the DBC-NLD-LS over the BE-BC as a function of the blocklength $n$ and the normalized target rate $\rho_R$ when we set $\beta^{(r)} = 0.16$ and $\beta^{(s)} = 0.30$. (A) Upper-bound on the information about $(W_1, W_2)$ leaked to Eavesdropper 1, defined as in Equation (48). (B) Upper-bound on the information about $W_2$ leaked to Eavesdropper 2, defined as in Equation (49).
Figure 7. Secrecy performance of the polar coding scheme for DBC-NLD-LS over BE-BC as a function of n and β ( s ) , which defines δ n ( s ) for each n, when we set β ( r ) = 0.16 and ρ R = 0.90 . (A) Upper-bound on the information about ( W 1 , W 2 ) leaked to Eavesdropper 1 defined as in Equation (48). (B) Upper-bound on the information about W 2 leaked to Eavesdropper 2 defined as in Equation (49).
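Equations (48) and (49), which define the plotted bounds, are not reproduced in this excerpt. As a rough single-block proxy under the assumptions of the sketch after Figure 3: on an erasure channel the conditional entropy of the j-th polarized bit given the eavesdropper's observations equals the Bhattacharyya parameter of the corresponding bit-channel, so a secret index can leak at most $1 - Z$ bits. The helper below (reusing bec_bhattacharyya and numpy from that sketch) sums these contributions; it ignores the cross-block dependencies of Figure 4 that the paper's chaining analysis accounts for.

```python
def leakage_proxy(eps_eave, n, secret_set):
    # Rough single-block leakage proxy, not the paper's bound (48)/(49):
    # on a BEC, H(U_j | U^{j-1}, Z^n) equals the Bhattacharyya parameter
    # of the eavesdropper's j-th bit-channel, so index j contributes at
    # most 1 - Z_e[j] bits of leaked information.
    z_e = bec_bhattacharyya(eps_eave, n)
    return float(np.sum(1.0 - z_e[np.asarray(secret_set)]))
```

Since every secret index satisfies $Z \geq 1 - \delta_n^{(s)}$ by construction, the proxy is at most $n \, 2^{-n^{\beta^{(s)}}}$, in line with the exponential decay of the leakage bounds as n grows.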
Figure 8. Reliability performance of the polar coding scheme for DBC-NLD-LS over the BE-BC as a function of n and $\beta^{(r)}$, which defines $\delta_n^{(r)}$ for each n, when we set $\beta^{(s)} = 0.30$ and $\rho_R = 0.90$. The plotted quantity is the upper bound $P_{b}^{\mathrm{ub}}(1)$ on the average bit-error probability at legitimate Receiver 1, defined as in Equation (47).
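A matching sanity check for the reliability curve, again only a sketch and not Equation (47): under successive cancellation decoding, the block-error probability of a polar code is classically bounded by the sum of the Bhattacharyya parameters over the information indices, and the average bit-error probability is no larger than the block-error probability.

```python
def sc_union_bound(eps_legit, n, info_set):
    # Classical SC-decoding union bound (Arikan): P_block <= sum of the
    # Bhattacharyya parameters over the information indices, which in
    # turn upper-bounds the average bit-error probability.  A stand-in
    # for the paper's bound (47), not its exact expression.
    z_b = bec_bhattacharyya(eps_legit, n)
    return float(np.sum(z_b[np.asarray(info_set)]))
```

Under the assumed threshold convention, increasing $\beta^{(r)}$ shrinks $\delta_n^{(r)} = 2^{-n^{\beta^{(r)}}}$, so fewer indices qualify as reliable but each contributes less to the sum, which is the trade-off Figure 8 explores.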
Figure 9. Rate of the additional secret sequence $\Phi$, computed as in Equation (50), for DBC-NLD-LS over the BE-BC as a function of the blocklength n for different values of $(\beta^{(r)}, \beta^{(s)})$, which define $(\delta_n^{(r)}, \delta_n^{(s)})$ for each n. (A) Rate of $\Phi$ for different values of $\beta^{(s)}$ when $\beta^{(r)} = 0.16$ and $\rho_R = 0.90$. (B) Rate of $\Phi$ for different values of $\beta^{(r)}$ when $\beta^{(s)} = 0.30$ and $\rho_R = 0.90$.
Figure 10. Performance of the polar coding scheme for DBC-LD-NLS over the BS-BC as a function of the blocklength n when $\beta^{(1,r)} = \beta^{(2,r)} = 0.24$, $\beta^{(1,s)} = 0.30$, $\beta^{(2,s)} = 0.36$ and $\beta^{(2,H)} = \beta^{(2,L)} = 0.36$. (A) Upper bound on the information about $(W_1, W_2)$ leaked to Eavesdropper 2, defined as in Equation (55), for different normalized target rates $\rho_R$ when we set $d_{\mathrm{TV}}^{\mathrm{ub}} = 0$. (B) Upper bounds on the average error probability at legitimate Receivers 1 and 2, defined as in Equations (53) and (54), respectively, when $d_{\mathrm{TV}}^{\mathrm{ub}} = 0$. (C) Overall rate of the sequences $\{\Phi_1, \Phi_2\}$, computed as in Equation (56). (D) Terms $d_{\mathrm{TV}}^{\mathrm{ub}}(H)$ and $d_{\mathrm{TV}}^{\mathrm{ub}}(L)$ that contribute to the bound $d_{\mathrm{TV}}^{\mathrm{ub}}$ on the total variation distance, defined as in Equations (51) and (52), respectively.