Article

Comparative Study of Three Steganographic Methods Using a Chaotic System and Their Universal Steganalysis Based on Three Feature Vectors

1
LASTRE Laboratory, Lebanese University, 210 Tripoli, Lebanon
2
Institut d’Electronique et des Télécommunications de Rennes (IETR), UMR CNRS 6164, Université de Nantes—Polytech Nantes, Rue Christian Pauc CS 50609, CEDEX 3, 44306 Nantes, France
3
School of Electronics and Telecommunications, Hanoi University of Science and Technology, 1 Dai Co Viet, Hai Ba Trung, Hanoi, Vietnam
4
INSA de Rennes, CNRS, IETR, CEDEX 7, 35708 Rennes, France
*
Author to whom correspondence should be addressed.
Submission received: 27 June 2019 / Revised: 21 July 2019 / Accepted: 24 July 2019 / Published: 30 July 2019
(This article belongs to the Special Issue Entropy Based Data Hiding)

Abstract

In this paper, we first study the security enhancement of three steganographic methods by using a proposed chaotic system. The first method, the Enhanced Edge Adaptive Image Steganography Based on LSB Matching Revisited (EEALSBMR), operates in the spatial domain. The two other methods, the Enhanced Discrete Cosine Transform (EDCT) and the Enhanced Discrete Wavelet Transform (EDWT), operate in the frequency domain. The chaotic system is extremely robust and consists of a strong chaotic generator and a 2-D Cat map. Its main role is to secure the content of a message in case the message is detected. Secondly, three blind steganalysis methods, based on multi-resolution wavelet decomposition, are used to detect whether an embedded message is hidden in the tested image (stego image) or not (cover image). The steganalysis approach is based on the hypothesis that message-embedding schemes leave statistical evidence or structure in images that can be exploited for detection. The simulation results show that the Support Vector Machine (SVM) classifier and the Fisher Linear Discriminant (FLD) cannot distinguish between cover and stego images if the message size is smaller than 20% for the EEALSBMR steganographic method and smaller than 15% for the EDCT steganographic method. However, SVM and FLD can distinguish between cover and stego images with reasonable accuracy for the EDWT steganographic method, irrespective of the message size.

1. Introduction

Steganography is an increasingly important security domain; it aims to hide a message (secret information) in digital cover media without causing perceptual degradation (in this study, we use images as cover media). Many steganographic methods have been proposed in the spatial and frequency domains. In the spatial domain, pixels are directly used to hide secret messages; these techniques are normally easy to implement and have a high capacity. However, they are generally not robust against statistical attacks [1,2]. In the transform domain, coefficients of frequency transforms, such as the DCT (Discrete Cosine Transform), FFT (Fast Fourier Transform), and DWT (Discrete Wavelet Transform), are used to hide secret data. Generally, these techniques are more complex, but they are more robust against steganalysis, noise, and image-processing operations.
The main steganographic methods in the spatial domain [3,4,5,6,7,8,9,10,11,12,13,14,15,16,17] are LSB-based (Least Significant Bit). Recently, entropy has also been extensively used to support data-hiding algorithms [18,19,20]. The LSB methods entail replacing the least significant bit of pixels with a bit of the secret data. Among these methods, the EALSBMR method [3] is an edge adaptive scheme with respect to the message size; it embeds data according to the difference between two consecutive pixels in the cover image. To the best of our knowledge, this method offers the best trade-off (good PSNR, high embedding capacity, and, above all, adaptivity), but it suffers from low security in terms of message detection. For this reason, we have enhanced its security.
Frequency domain steganography, as a watermarking domain [21,22,23,24,25,26,27,28,29], is widely based on the DCT and DWT transforms. The DCT usually transforms an image representation into a frequency representation by grouping pixels into 8 × 8 pixel blocks and transforming each block, using the DCT transform, into 64 DCT coefficients. A message is then embedded into the DCT coefficients. The Forward Discrete Wavelet Transform is, in general, suitable for identifying areas in the cover image where a secret message can be effectively embedded due to excellent space-frequency localization properties. In particular, these properties allow exploiting the masking effect of a human visual system so that if a DWT coefficient is modified, it modifies only the region that corresponds to that coefficient. The Haar wavelet is the simplest possible wavelet that can achieve the DWT.
However, the aforementioned steganographic methods are not secure in terms of message detection. To protect the content of messages, chaos can be used. Indeed, chaotic sequences play an important role in information hiding and in security domains, such as cryptography, steganography, and watermarking, because of properties such as sensitivity to initial conditions and system parameters, ergodicity, uniformity, and pseudo-randomness. Steganography generally leaves traces that can be detected in stego images. This can allow an adversary, using steganalysis techniques, to divulge a hidden secret message. There are two types of opponents: passive and active. A passive adversary only examines communication to detect whether it contains hidden messages; in this case, the content of the communication is not modified by the adversary. An active adversary can intentionally cause disruption, distortion, or destruction of communication, even in the absence of evidence of secret communication. The main steganographic methods have been designed for the case of a passive adversary. In general, there are two kinds of steganalysis: specific and universal. Specific steganalysis is designed to attack one particular steganography algorithm. It can generally produce more accurate results, but it fails to produce satisfactory results if the secret message is inserted with a modified algorithm. Universal steganalysis, on the other hand, can be regarded as a general technique for detecting various types of steganography. Moreover, it can be used to detect new steganographic techniques for which a specific steganalysis does not yet exist. In other words, universal steganalysis is an irreplaceable tool for detection if the embedding algorithm is unknown or secret.
In this paper, we first integrate an efficient chaotic system into the three steganographic methods mentioned above to make them more secure. The chaotic system pseudo-chaotically chooses the pixel positions in the cover image where the bits of the secret message will be embedded. Thus, the inserted bits of the secret message become secure against message-recovery attacks because their positions are unknown.
Second, we study and apply three universal steganalysis methods to the aforementioned chaos-based steganographic methods. The first steganalysis method, developed by Farid [30], uses higher-order statistics of high-frequency wavelet sub-bands and their prediction errors to form the feature vectors. In the second steganalysis method, as formulated by Shi et al. [31], the statistical moments of the characteristic functions of the prediction-error image, the test image, and their wavelet sub-bands are selected as the feature vectors. The third steganalysis method, introduced by Wang et al. [32], uses the features that are extracted from both the empirical probability density function (PDF) moments and the normalized absolute characteristic function (CF). For the three steganalysis algorithms, we applied FLD analysis and the SVM method with the RBF kernel as classifiers between cover images and stego images.
The paper is organized as follows: In Section 2, we describe the proposed chaotic system. In Section 3, we present the three enhanced steganographic algorithms. In Section 4, we illustrate the experimental results and analyze the enhanced algorithms. In Section 5, we develop, in detail, the steganalysis techniques for the previous algorithms. In Section 6, we report the results of the steganalysis, and in the last section, we conclude our work.

2. Description of the Proposed Chaotic System

This system is made of a perturbed chaotic generator and a 2-D cat map. The chaotic generator supplies the dynamic keys $K_p$ for the permutation process, which provides the position of the new random pixel (see Figure 1). The chaotic system allows inserting a message in both a secretive and uniform manner [33,34,35,36,37,38,39,40].
The generator of discrete chaotic sequences exhibits orbits with very large lengths. It is based on two connected non-linear digital IIR filters (cells). The discrete PWLCM and SKEW TENT maps (non-linear functions) are used. A linear feedback shift register (m-LFSR) is then used to disturb each cell (Figure 2). The disturbing technique is associated with the cascading technique, which allows controlling and increasing the length of the orbits that are produced. The minimum orbit length of the generator output is calculated using Equation (1):
$$o_{min} = \mathrm{lcm}\left(\Delta_1 \times (2^{k_1} - 1),\ \Delta_2 \times (2^{k_2} - 1)\right)$$
In the above equation, $\mathrm{lcm}$ is the least common multiple, $k_1 = 23$ and $k_2 = 21$ are the degrees of the LFSRs' primitive polynomials, and $\Delta_1$ and $\Delta_2$ are the lengths of the cell outputs $s_1$ and $s_2$, respectively, without disturbance. The equations of the chaotic generator are formulated as follows:
$$s_i(n) = NLF_i\left(u_i(n-1), p_i\right), \quad i = 1, 2$$
$$u_i(n-1) = \mathrm{mod}\left(s_i(n-1) \times c_{i,1} + s_i(n-2) \times c_{i,2},\ 2^N\right), \quad i = 1, 2$$
$$s(n) = s_1(n) + s_2(n)$$
The two previously mentioned functions, PWLCM map and Skew map, are defined according to the following relations:
$$s_1(n) = NLF_1\left(u_1(n-1), p_1\right) = \begin{cases} 2^N \times \dfrac{u_1(n-1)}{p_1} & \text{if } 0 \le u_1(n-1) < p_1 \\[4pt] 2^N \times \dfrac{2^N - u_1(n-1)}{2^N - p_1} & \text{if } p_1 \le u_1(n-1) < 2^{N-1} \\[4pt] NLF_1\left(2^N - u_1(n-1), p_1\right) & \text{otherwise} \end{cases}$$
$$s_2(n) = NLF_2\left(u_2(n-1), p_2\right) = \begin{cases} 2^N \times \dfrac{u_2(n-1)}{p_2} & \text{if } 0 \le u_2(n-1) < p_2 \\[4pt] 2^N \times \dfrac{2^N - u_2(n-1)}{2^N - p_2} + 1 & \text{if } p_2 \le u_2(n-1) < 2^N \end{cases}$$
The control parameter $p_1$ is used for the PWLCM map and ranges from 1 to $2^{N-1} - 1$, and $p_2$ is the control parameter used for the Skew map and ranges from 1 to $2^N - 1$. $N = 32$ is the word length used for the simulations. The size of the secret key $K$, formed by all initial conditions and parameters of the chaotic generator, is $(6 \times 32 + 5 \times 32 + 31 + 23 + 21) = 427$ bits. It is large enough to resist a brute-force attack.
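To make this structure concrete, the following Python sketch implements a strongly simplified version of the two coupled cells. The coupling coefficients, seeds, and the way the m-LFSR disturbance is injected are illustrative assumptions only; the exact discrete maps, coefficients $c_{i,j}$, and perturbation schedule are those described in [33,34,35].

```python
import math

N = 32                        # word length in bits, as in the paper
FULL, HALF = 1 << N, 1 << (N - 1)

def pwlcm(u, p):
    """Discrete PWLCM non-linearity of cell 1 (simplified sketch)."""
    if u < p:
        return (u * FULL) // p % FULL
    if u < HALF:
        return ((FULL - u) * FULL) // (FULL - p) % FULL
    return pwlcm(FULL - u - 1, p)        # fold the upper half back (assumed convention)

def skew_tent(u, p):
    """Discrete skew tent non-linearity of cell 2 (simplified sketch)."""
    if u < p:
        return (u * FULL) // p % FULL
    return (((FULL - u) * FULL) // (FULL - p) + 1) % FULL

class Cell:
    """One recursive cell: u(n-1) = (c1*s(n-1) + c2*s(n-2)) mod 2^N, s(n) = NLF(u, p)."""
    def __init__(self, nlf, p, c1, c2, s_prev, s_prev2):
        self.nlf, self.p, self.c = nlf, p, (c1, c2)
        self.s = [s_prev, s_prev2]       # s(n-1), s(n-2)

    def step(self, disturbance=0):
        u = (self.c[0] * self.s[0] + self.c[1] * self.s[1]) % FULL
        out = self.nlf(u, self.p) ^ disturbance   # m-LFSR bits XORed on the LSBs
        self.s = [out, self.s[0]]
        return out

# Generator output s(n) = s1(n) + s2(n); seeds and coefficients below are arbitrary.
cell1 = Cell(pwlcm, p=123456789, c1=3, c2=5, s_prev=42, s_prev2=7)
cell2 = Cell(skew_tent, p=987654321, c1=7, c2=11, s_prev=13, s_prev2=99)
stream = [(cell1.step() + cell2.step()) % FULL for _ in range(8)]

# Minimum orbit length of Equation (1), with k1 = 23 and k2 = 21
# (delta1, delta2 are placeholder cycle lengths of the two undisturbed cells):
delta1, delta2 = 10**6, 10**6
o_min = math.lcm(delta1 * (2**23 - 1), delta2 * (2**21 - 1))
```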

Description of the Cat Map Used

The permutation process is based on the modified Cat map and is calculated in a very efficient manner using the equation below [37]:
$$\begin{bmatrix} M_{cn} \\ M_{ln} \end{bmatrix} = \mathrm{mod}\left( \begin{bmatrix} 1 & u \\ v & 1 + uv \end{bmatrix} \times \begin{bmatrix} M_l \\ M_c \end{bmatrix} + \begin{bmatrix} r_l + r_c \\ r_c \end{bmatrix},\ \begin{bmatrix} M \\ M \end{bmatrix} \right) + \begin{bmatrix} 1 \\ 1 \end{bmatrix}$$
In the above equation, $M_l$, $M_c$ and $M_{ln}$, $M_{cn}$ are the original and permuted square matrices of size $M \times M$, from which we calculate the $Ind$ matrix as follows:
$$M_l = \begin{bmatrix} 1 & 1 & \cdots & 1 \\ 2 & 2 & \cdots & 2 \\ \vdots & \vdots & & \vdots \\ M & M & \cdots & M \end{bmatrix}; \quad M_c = \begin{bmatrix} 1 & 2 & \cdots & M \\ 1 & 2 & \cdots & M \\ \vdots & \vdots & & \vdots \\ 1 & 2 & \cdots & M \end{bmatrix}$$
$$Ind = (M_{ln} - 1) + (M_{cn} - 1) \times M + 1$$
The dynamic key K p is structured as follows:
$$K_p = \left[k_{p1}, k_{p2}, \ldots, k_{pr}\right]$$
$$k_{pi} = \left[u_i, v_i, r_{li}, r_{ci}\right], \quad i = 1, 2, \ldots, r$$
In the above equations, $0 \le u_i, v_i, r_{li}, r_{ci} \le M - 1$ are the parameters of the Cat map, and $r$ is the number of rounds.
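The following NumPy sketch applies this permutation; the column-major, 1-based convention for $Ind$ mirrors the MATLAB-style indexing implied by the equations above and is our assumption.

```python
import numpy as np

def cat_map_ind(M, u, v, rl, rc):
    """One round of the modified 2-D cat map, returning the Ind matrix of
    permuted 1-based linear indices (a sketch of the equations above)."""
    Ml, Mc = np.meshgrid(np.arange(1, M + 1), np.arange(1, M + 1), indexing="ij")
    Mcn = (Ml + u * Mc + rl + rc) % M + 1
    Mln = (v * Ml + (1 + u * v) * Mc + rc) % M + 1
    return (Mln - 1) + (Mcn - 1) * M + 1

def permute(img, Kp):
    """Apply r rounds with the dynamic key Kp = [(u_i, v_i, rl_i, rc_i), ...].
    Column-major (MATLAB-style) flattening is assumed for Ind."""
    flat = img.flatten(order="F")
    for (u, v, rl, rc) in Kp:
        ind = cat_map_ind(img.shape[0], u, v, rl, rc).flatten(order="F")
        flat = flat[ind - 1]              # Ind is 1-based
    return flat.reshape(img.shape, order="F")
```

Because the transformation matrix has determinant 1, each round is a bijection on the $M \times M$ grid, so `Ind` is always a valid permutation.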

3. Enhanced Steganographic Algorithms

In this section, we describe three enhanced steganographic algorithms by using an efficient chaotic system.

3.1. Enhanced EALSBMR (EEALSBMR)

Below, we present the insertion procedure and the extraction procedure of the proposed enhancement of the EALSBMR method (EEALSBMR) [41].

3.1.1. Insertion Procedure

The flow diagram of the embedding scheme can be found in Figure 3.
The detailed embedding steps for this algorithm have been explained as follows:
Step 1:
Capacity estimation
• To estimate the insertion capacity, we arrange the cover image into a 1-D vector $V$, and we divide its content into non-overlapping embedding units (blocks) of two consecutive pixels $(p_i, p_{i+1})$. Following this, we calculate the absolute difference between the pixels of each block and accumulate the counts in the 31-element vector-difference $VD$, $t \in \{1, 2, 3, \ldots, 31\}$, from which we obtain $|EU(t)|$, where $EU(t)$ is the set of pixel pairs whose absolute difference is greater than or equal to $t$, as shown below:
$$EU(t) = \left\{ (p_i, p_{i+1})\ :\ |p_i - p_{i+1}| \ge t,\ \forall (p_i, p_{i+1}) \in V \right\}$$
  • For a given secret message M of size M bits, the threshold T used in the embedding process is determined by the following expression and pseudo-code (Algorithm 1):
$$T = \underset{t}{\arg\max} \left\{ 2 \times |EU(t)| \ge |M| \right\}$$
Algorithm 1 Pseudo-code determining the value of the threshold T

 1: procedure
 2:   number_pixels = 0;
 3:   for t = 31:-1:1 do
 4:     number_pixels = number_pixels + VD(t);
 5:     if (2 * number_pixels >= |M|) then
 6:       T = t;
 7:       break;
 8:     end if;
 9:   end for;
10: end procedure
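A minimal NumPy sketch of this capacity estimation is given below; it computes $|EU(t)|$ directly from the pair differences, which is equivalent to accumulating $VD(t)$ as in Algorithm 1.

```python
import numpy as np

def estimate_threshold(cover, msg_len):
    """Step 1 capacity estimation: largest t in [1, 31] with 2*|EU(t)| >= |M|."""
    v = cover.flatten().astype(np.int64)
    pairs = v[: v.size - v.size % 2].reshape(-1, 2)    # non-overlapping units
    d = np.abs(pairs[:, 0] - pairs[:, 1])
    for t in range(31, 0, -1):                         # scan from the largest t down
        if 2 * np.count_nonzero(d >= t) >= msg_len:    # 2*|EU(t)| >= |M|
            return t
    return 1                                           # worst case: embed anywhere
```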
Step 2:
Embedding process
  • The embedding process is achieved as follows: we divide the cover image into two sub-images; one includes the odd columns, and the other includes the even columns.
  • Following this, the chaotic system chooses a pixel position ( I n d ) from the odd sub-image; the second pixel position of the corresponding block must have the same I n d in the even image. If a pair of pixel units p i , p i + 1 satisfies Equation (8), then a 2 bit-message can be hidden (one bit by pixel); otherwise, the chaotic system chooses another I n d .
$$|p_i - p_{i+1}| \ge T, \quad \forall (p_i, p_{i+1}) \in V$$
• For each unit $(p_i, p_{i+1})$, we perform data-hiding based on the following four cases [42] (a code sketch of this rule is given after this list):
Case 1: if $LSB(p_i) = m_i$ and $f(p_i, p_{i+1}) = m_{i+1}$, then $(p_i', p_{i+1}') = (p_i, p_{i+1})$

Case 2: if $LSB(p_i) = m_i$ and $f(p_i, p_{i+1}) \ne m_{i+1}$, then $(p_i', p_{i+1}') = (p_i, p_{i+1} + r)$

Case 3: if $LSB(p_i) \ne m_i$ and $f(p_i - 1, p_{i+1}) = m_{i+1}$, then $(p_i', p_{i+1}') = (p_i - 1, p_{i+1})$

Case 4: if $LSB(p_i) \ne m_i$ and $f(p_i - 1, p_{i+1}) \ne m_{i+1}$, then $(p_i', p_{i+1}') = (p_i + 1, p_{i+1})$

In the above equations, $m_i$ and $m_{i+1}$ are the $i$th and $(i+1)$th secret bits of the message to be embedded; $r$ is a random value belonging to $\{-1, +1\}$, and $(p_i', p_{i+1}')$ denotes the pixel pair after data-hiding. The function $f$ is defined as follows:

$$f(a, b) = LSB\left(\left\lfloor \frac{a}{2} \right\rfloor + b\right)$$
• Readjustment if necessary: After hiding, $(p_i', p_{i+1}')$ may be out of the range [0, 255], or the new difference value $|p_i' - p_{i+1}'|$ may be less than the threshold $T$. In these cases, we need to readjust $p_i'$ and $p_{i+1}'$, and the new readjusted values, $p_i''$ and $p_{i+1}''$, are calculated as follows [3]:

$$(p_i'', p_{i+1}'') = \underset{(e_1, e_2)}{\arg\min} \left\{ |e_1 - p_i'| + |e_2 - p_{i+1}'| \right\}$$

with:

$$e_1 = p_i' + 4k_1, \quad e_2 = p_{i+1}' + 2k_2, \quad k_1, k_2 \in \mathbb{Z}$$

$k_1, k_2$ are two arbitrary integers; when

$$0 \le e_1, e_2 \le 255 \quad \text{and} \quad |e_1 - e_2| \ge T,$$

then:

$$p_i'' = e_1, \quad p_{i+1}'' = e_2$$
    The sequence follows as such for each new block position.
• Finally, we embed the parameter $T$ into the stego image, for example, into the first five or the last five pixels.
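Below is a compact Python sketch of this per-unit embedding rule (the four cases together with the function $f$); the readjustment of Equations (12) and (13) is applied separately when needed.

```python
import random

def f(a, b):
    """f(a, b) = LSB(floor(a/2) + b)."""
    return (a // 2 + b) & 1

def embed_unit(p1, p2, m1, m2):
    """Four-case LSBMR embedding for one unit (p_i, p_{i+1})."""
    if (p1 & 1) == m1 and f(p1, p2) == m2:
        return p1, p2                               # Case 1: already encodes (m1, m2)
    if (p1 & 1) == m1:
        return p1, p2 + random.choice((-1, 1))      # Case 2: +/-1 on p2 flips f
    if f(p1 - 1, p2) == m2:
        return p1 - 1, p2                           # Case 3: fix LSB(p1) with p1 - 1
    return p1 + 1, p2                               # Case 4: fix LSB(p1) with p1 + 1

# The paper's worked example: unit (141, 129), message bits (1, 0) -> Case 2.
print(embed_unit(141, 129, 1, 0))                   # (141, 130) or (141, 128)
```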

3.1.2. Extraction Procedure

  • Extract the parameter T from the stego image.
  • Divide the stego image into two sub-images; one includes the odd columns, and the other includes the even columns.
  • Generate a pseudo-chaotic position (using the same secret key K), as done in the insertion procedure, to obtain the same order of pixel unit position as the odd sub-image. The second pixel block has the same I n d in the even image.
• Verify that $|p_i^s - p_{i+1}^s| \ge T$, and then extract the two secret bits $(m_i, m_{i+1})$ of $M$ as follows:

$$m_i = LSB(p_i^s); \quad m_{i+1} = f(p_i^s, p_{i+1}^s), \quad \text{with } p_i^s = p_i' \text{ or } p_i''$$
    Otherwise, the chaotic system chooses another pseudo-chaotic position. The sequence follows as such for each unit position until all messages have been extracted.
  • Example of insertion:
The cover image is the “Peppers” image shown in Figure 4.

The embedded message is a 40 × 40 pixel image, shown in Figure 5.

The corresponding sequence of message bits begins as follows:
    M = 10001000100011001000110001100111001001111010010110
    11101011000110101011101000000110100010110010
The length of the binary message is 13,120 bits.

Capacity estimation produces the threshold T = 12.

Suppose that the pseudo-chaotic positions of a block to embed the two message bits $m_1 = 1$ and $m_2 = 0$ are (354, 375) and (354, 376), which correspond to the gray values 141 and 129 (see Figure 6).
Hiding the message bits:

$$LSB(141) = 1 = m_1$$
$$f(p_1, p_2) = LSB\left(\left\lfloor \frac{p_1}{2} \right\rfloor + p_2\right) = LSB(70 + 129) = 1 \ne m_2$$

We are in Case 2:

$$LSB(p_i) = m_i; \quad f(p_i, p_{i+1}) \ne m_{i+1}$$

Therefore, the new pixel values are as follows:

$$(p_1', p_2') = (p_1, p_2 + r) = (141, 130), \quad \text{with } r = 1$$

The difference between the new pixel values is:

$$d = |p_1' - p_2'| = |141 - 130| = 11 < T$$
Then we need to readjust the new pixel values. We test the values $-50 < k_1 < 50$ and $-50 < k_2 < 50$ until we obtain the smallest difference between the values $p_1'$ and $p_2'$ and the corresponding values $e_1$ and $e_2$ obtained with Equations (12) and (13). In our example, we find $k_1 = 0$ and $k_2 = -1$, and then $p_1'' = 141$, $p_2'' = 128$.
• Extraction of the message bits in the previous insertion example:

The extraction is performed using the following equations:

$$m_1 = LSB(p_1'') = LSB(141) = 1$$
$$m_2 = f(p_1'', p_2'') = LSB\left(\left\lfloor \frac{p_1''}{2} \right\rfloor + p_2''\right) = LSB(70 + 128) = LSB(198) = 0$$
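Using the helper $f$ from the sketch above, the extraction rule and this worked example can be checked in two lines:

```python
def extract_unit(p1, p2):
    """Recover (m_i, m_{i+1}) from a stego unit that satisfies |p1 - p2| >= T."""
    return p1 & 1, f(p1, p2)

# Worked example above: readjusted unit (141, 128) carries m1 = 1, m2 = 0.
assert extract_unit(141, 128) == (1, 0)
```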

3.2. Enhanced DCT Steganographic Method (EDCT)

The DCT transforms a signal or image from the spatial domain into the frequency domain [43,44]. A DCT expresses a sequence of finitely many data points in terms of a sum of cosine functions, oscillating at different frequencies. The 2D DCT is calculated as follows:
$$DCT(i, j) = \alpha_i \alpha_j \sum_{m=0}^{M-1} \sum_{n=0}^{N-1} C_{mn} \cos\left[\frac{\pi(2m+1)i}{2M}\right] \cos\left[\frac{\pi(2n+1)j}{2N}\right]$$

where:

$$\alpha_i = \begin{cases} \sqrt{1/M} & i = 0 \\ \sqrt{2/M} & 1 \le i \le M-1 \end{cases}, \quad \alpha_j = \begin{cases} \sqrt{1/N} & j = 0 \\ \sqrt{2/N} & 1 \le j \le N-1 \end{cases}$$
The block diagram of the proposed enhanced steganographic-based DCT transform has been shown in Figure 7.

3.2.1. Insertion Procedure

The embedding process consists of the following steps:
  • Read the cover image and the secret message.
  • Convert the secret message into a 1-D binary vector.
  • Divide the cover image into 8 × 8 blocks. Then apply the 2D DCT transformation to each block (from left to right, top to bottom).
  • Use the same chaotic system to generate a pseudo-chaotic I n d .
• Replace the LSB of each pseudo-chaotically located DCT coefficient with one bit of the secret message to hide.
  • Apply the 2D Inverse DCT transform to produce the stego image.
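A sketch of this pipeline is given below. Since the paper leaves the coefficient quantization implicit, rounding the DCT coefficients to integers before toggling their LSBs is our assumption, and `positions` stands in for the pseudo-chaotic $Ind$ stream.

```python
import numpy as np
from scipy.fft import dctn, idctn

def blockwise(img, fn):
    """Apply fn to each 8x8 block (dimensions assumed divisible by 8)."""
    out = np.empty(img.shape, dtype=float)
    for i in range(0, img.shape[0], 8):
        for j in range(0, img.shape[1], 8):
            out[i:i + 8, j:j + 8] = fn(img[i:i + 8, j:j + 8])
    return out

def edct_embed(cover, bits, positions):
    coeffs = np.rint(blockwise(cover.astype(float),
                               lambda b: dctn(b, norm="ortho"))).astype(np.int64)
    flat = coeffs.flatten(order="F")
    for bit, ind in zip(bits, positions):           # pseudo-chaotic 1-based indices
        flat[ind - 1] = (flat[ind - 1] & ~1) | bit  # LSB replacement (two's complement)
    coeffs = flat.reshape(cover.shape, order="F").astype(float)
    stego = blockwise(coeffs, lambda b: idctn(b, norm="ortho"))
    return np.clip(np.rint(stego), 0, 255).astype(np.uint8)
```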

3.2.2. Extraction Procedure

The extraction procedure consists of the following steps:
  • Read the stego image.
  • Divide the stego image into 8 × 8 blocks and then apply the 2D DCT to each block.
  • Use the same chaotic system to generate pseudo-chaotic I n d .
  • Extract the LSB of each pseudo-located coefficient.
  • Construct the secret image.

3.3. Enhanced DWT Steganographic Method (EDWT)

A secret image embedded in the low-frequency sub-band $(A)$ is generally more robust than in the other sub-bands, but it significantly decreases the visual quality of the image, as most of the image energy is normally concentrated in this sub-band. In contrast, the high-frequency sub-band $(D)$ contains the edges and textures of the image, to which the human eye is generally not sensitive; this allows secret information to be embedded without being perceived by the human eye. However, the sub-band $(D)$ is not robust against active attacks (filtering, compression, etc.). The compromise adopted by many DWT-based algorithms to achieve an accepted trade-off between imperceptibility and robustness is to embed the secret image in the middle-frequency sub-bands $(H)$ or $(V)$. In the block diagram of the proposed steganographic EDWT method shown in Figure 8, we embed the secret image in the sub-band $(H)$ of the cover image (the size of the secret message must be at most equal to the size of the sub-band $(H)$ of the cover image).

3.3.1. Insertion Procedure

The embedding process consists of the following steps:
  • Read the cover image and the secret image.
  • Transform the cover image into one level of decomposition using Haar Wavelet.
  • Permute the secret image in a pseudo-chaotic manner.
  • Fuse the DWT coefficients ( H ) of the cover image and the permuted secret image P S I as follows [45]:
$$X' = \alpha X + \beta \times PSI, \quad \alpha + \beta = 1; \quad \alpha \gg \beta$$

In the above equation, $X'$ is the modified DWT coefficient $(H)$, and $X$ is the original DWT coefficient $(H)$. $\alpha$ and $\beta$ are the embedding strength factors; they are chosen such that the resulting stego image has a large $PSNR$. In our experiments, we tested several values of $\beta$, and the best value was found to be approximately 0.01.
  • Apply Inverse Discrete Wavelet Transform (IDWT) to produce the stego image in the spatial domain.
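Using PyWavelets, the fusion step can be sketched as follows; the chaotic permutation of the secret image is assumed to have been applied already, and $\beta = 0.01$ follows the value found in our experiments.

```python
import pywt

def edwt_embed(cover, psi, alpha=0.99, beta=0.01):
    """Fuse the permuted secret image PSI into the horizontal sub-band H of a
    one-level Haar DWT: X' = alpha*X + beta*PSI, with alpha + beta = 1."""
    cA, (cH, cV, cD) = pywt.dwt2(cover.astype(float), "haar")
    cH_mod = alpha * cH + beta * psi          # PSI must match the H sub-band size
    return pywt.idwt2((cA, (cH_mod, cV, cD)), "haar")
```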

3.3.2. Extraction Procedure

The extraction procedure involves the following steps:
  • Read the stego image.
  • Transform the stego image into one level of decomposition using Haar Wavelet.
  • Apply inverse fusion transform to extract the permuted secret image as follows:
$$PSI = (X' - \alpha X) / \beta$$
    The extraction procedure is not blind, as we need the cover image to extract the permuted secret message.
  • Apply the inverse permutation procedure using the same chaotic system to obtain the secret image.
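The corresponding non-blind extraction simply inverts the fusion:

```python
def edwt_extract(stego, cover, alpha=0.99, beta=0.01):
    """PSI = (X' - alpha*X) / beta; the inverse chaotic permutation then
    recovers the secret image."""
    _, (cH_s, _, _) = pywt.dwt2(stego.astype(float), "haar")
    _, (cH_c, _, _) = pywt.dwt2(cover.astype(float), "haar")
    return (cH_s - alpha * cH_c) / beta
```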

4. Experimental Results and Analysis

In the experiments, we first create the stego images by applying the implemented steganographic methods to the standard gray-level cover images “Lena”, “Peppers”, and “Baboon” of 512 × 512 pixels, using “Boat” as the secret message with different sizes (embedding rates ranging from 5% to 40%). The six criteria used to evaluate the quality of the stego images are the Peak Signal-to-Noise Ratio ($PSNR$) [46], the Image Fidelity ($IF$), the Structural Similarity ($SSIM$), the entropy ($E$), the redundancy ($R$), and the image redundancy ($IR$). The first three can be represented by the following equations:
$$PSNR = 10 \times \log_{10}\left(\frac{\mathrm{Max}_{p_c}^2}{\frac{1}{M \times N} \sum_{i=0}^{M-1} \sum_{j=0}^{N-1} \left[p_c(i,j) - p_s(i,j)\right]^2}\right)$$

$$IF = 1 - \frac{\sum_{i=0}^{M-1} \sum_{j=0}^{N-1} \left[p_c(i,j) - p_s(i,j)\right]^2}{\sum_{i=0}^{M-1} \sum_{j=0}^{N-1} \left[p_c(i,j)\right]^2}$$

$$SSIM = \frac{(2\mu_c\mu_s + c_1)(2\,\mathrm{cov}_{cs} + c_2)}{(\mu_c^2 + \mu_s^2 + c_1)(\sigma_c^2 + \sigma_s^2 + c_2)}$$
In the above equations, $p_c(i, j)$ and $p_s(i, j)$ are the pixel values at the $i$th row and $j$th column of the cover and stego images, and $M$ and $N$ are the width and height of the considered cover image.

$\mu_c$, $\mu_s$ are the means of the cover and stego images; $\sigma_c^2$, $\sigma_s^2$ are their variances; $\mathrm{cov}_{cs}$ is the covariance between the cover and stego images; $c_1 = (k_1 L)^2$ and $c_2 = (k_2 L)^2$ are two variables used to stabilize the division with a weak denominator; $L$ is the dynamic range of the pixel values, and $k_1$, $k_2$ are two constants much smaller than 1. We considered $k_1 = k_2 = 0.05$.
The higher the $PSNR$, $IF$, and $SSIM$, the better the quality of the stego image. $PSNR$ values below 40 dB indicate fairly low quality; therefore, a high-quality stego image should remain above 40 dB.
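For reference, the first two measures can be computed directly from their definitions (a small sketch; $\mathrm{Max}_{p_c} = 255$ for 8-bit images):

```python
import numpy as np

def psnr(cover, stego, max_val=255.0):
    mse = np.mean((cover.astype(float) - stego.astype(float)) ** 2)
    return np.inf if mse == 0 else 10 * np.log10(max_val ** 2 / mse)

def image_fidelity(cover, stego):
    c, s = cover.astype(float), stego.astype(float)
    return 1 - np.sum((c - s) ** 2) / np.sum(c ** 2)
```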
Additionally, we used three other parameters to estimate the qualities of the stego images. These parameters have been listed as follows:
-
The Entropy E, given by the following relation:
$$E = -\sum_{i=0}^{2^L - 1} p(P_i) \log_2\left(p(P_i)\right)$$
L is already defined. p ( P i ) is the probability of the pixel value P i .
-
The Redundancy R is usually represented by the following formula:
$$R = \frac{E_{max} - E}{E}$$
Here, E m a x = 8 . However, this relationship is problematic because the value of the minimal entropy is not known. For that, Tasnime [47] proposed using the following relationship, which seems to be more precise:
$$IR = \frac{\sum_{i=1}^{L} |R_i - R_{opt}|}{R_{opt} \times (2^L - 1) + (S - R_{opt})}$$
This relationship is called the Image Redundancy ($IR$), with:
  • S being the size of the image under test;
  • R i being the number of occurrences of each pixel value;
  • R o p t being the optimal number of occurrences that each pixel value should have to get a non-redundant image.
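A sketch of these three measures for an 8-bit grayscale image follows; taking $R_{opt} = S / 2^L$, i.e., every gray level equally frequent, is our reading of [47]:

```python
import numpy as np

def entropy_redundancy(img, L=8):
    hist = np.bincount(img.flatten(), minlength=1 << L).astype(float)
    p = hist / hist.sum()
    E = -np.sum(p[p > 0] * np.log2(p[p > 0]))
    R = (L - E) / E                                   # E_max = L = 8 for 8-bit images
    S, R_opt = img.size, img.size / (1 << L)
    IR = np.sum(np.abs(hist - R_opt)) / (R_opt * ((1 << L) - 1) + (S - R_opt))
    return E, R, IR
```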
In the following section, we present and compare the performance of the three implemented steganographic methods.

4.1. Enhanced EALSBMR

The results obtained from the parameters P S N R , I F , and S S I M for the algorithm have been presented in Table 1; their values indicate the high quality of the stego images, even with a high embedding rate of 40%. We observe that the P S N R , I F , and S S I M values decrease, as expected, when the size of the secret message increases.
In Figure 9a–c, we show the “Baboon” cover image and the corresponding stego images for 5% and 40% embedding rates, respectively. The visual quality obtained from the “Baboon” stego images is very high because visually, it is impossible to discriminate between the cover and stego images.
To fix ideas, using the “Lena” image as the cover and approximately identical capacities, we globally compared the $PSNR$ obtained by the EEALSBMR method with that obtained by the methods of [4,5,6,17]. We observed that only the method proposed by Stoyanov et al. [17] produces a better $PSNR$ than the EEALSBMR method. However, that method is not adaptive.

4.2. Enhanced DCT Steganographic Method

The results obtained from this method, as presented in Table 2, indicate the high quality of the stego images, even with a high embedding rate. Additionally, even the visual quality obtained is very high, as shown in Figure 10.

4.3. Enhanced DWT Steganographic Method

Table 3 presents the results obtained from the EDWT algorithm, which indicate that the steganographic algorithm exhibits good performance. Furthermore, no visual trace can be found in the resulting stego images, as shown in Figure 11a–c.

4.4. Performance Comparison of the Three Steganographic Methods

The $PSNR$, $IF$, and $SSIM$ values in Table 1, Table 2 and Table 3 show that the EEALSBMR and EDCT methods, in comparison with the EDWT method, ensure a better quality of the stego images at the different embedding rates: there is approximately a 10 dB difference in $PSNR$ at a 5% embedding rate and a 5 to 8 dB difference at a 40% embedding rate.

4.5. Performance Using Parameters E, R and I R

The results obtained from the parameters $E$, $R$, and $IR$ for the three algorithms on the stego images with different embedding rates are presented in Table 4, Table 5 and Table 6. As we can see, these values are very close to the values obtained over the original images, given in Table 7. This is consistent with the previous results obtained from the parameters $PSNR$, $IF$, and $SSIM$ regarding the high quality of the stego images.

5. Universal Steganalysis

A good steganographic method should be imperceptible not only to human vision systems but also to computer analysis. Steganalysis is the art and science that detects whether a given image has a message hidden in it [1,48]. The extensive range of natural images and the wide range of data embedding algorithms make steganalysis a difficult task. In this work, we consider universal steganalysis to be based on statistical analysis.
Universal (blind) steganalysis attempts to detect hidden information without any knowledge about the steganographic algorithm. The idea is to extract the features of cover images and the features of stego images and then use them as the feature vectors that are used by a supervised classifier (SVM, FLD, neural networks…) to distinguish whether the image under test is a stego image. This procedure is illustrated in Figure 12. The left side of the flowchart displays the different steps of the learning process while the right side illustrates the different steps of the testing process.

5.1. Multi-Resolution Wavelet Decomposition

The DWT uses a sub-band coding algorithm to compute the wavelet transform quickly. Furthermore, it is easy to implement and reduces the computation time and the required resources. The DWT analyzes the signal at different frequency bands with different resolutions by decomposing it into a coarse approximation and detail information. The decomposition of the signal into different frequencies is achieved by applying separable low-pass $\hat{g}(n)$ and high-pass $\hat{h}(n)$ filters along the image axes. The DWT computes the approximation coefficient matrix $A$ and the detail coefficient matrices $H$, $V$, and $D$ (horizontal, vertical, and diagonal, respectively) of the input matrix $X$, as illustrated in Figure 13.

5.2. Feature Vector Extraction

As the amount of image data is enormous, it is not feasible to directly use the complete image data for analysis. Therefore, for steganalysis, it is useful to extract a certain amount of useful data features that represent the image instead of the image itself. The addition of a message to a cover image may not affect the visual appearance of the image, but it will affect some statistics. The features required for steganalysis should be able to detect these minor statistical disorders that are created during the data-hiding process.
Three feature-extraction techniques are used in this paper to detect the presence of a secret message; these methods calculate the statistical properties of the images by employing multi-resolution wavelet decomposition.

5.2.1. Method 1: Feature Vectors Extracted from the Empirical Moments of the PDF-Based Multi-Resolution Coefficients and Their Prediction Error

The multi-resolution wavelet decomposition employed here is based on separable quadrature mirror filters (QMFs). This decomposition splits the frequency space into multiple scales and orientations. It is accomplished by applying separable low-pass and high-pass filters along the image axes, generating vertical, horizontal, diagonal, and low-pass sub-bands. The horizontal, vertical, and diagonal sub-bands at scale $m = 1, 2, \ldots, n$ are denoted as $H_m$, $V_m$, and $D_m$.

In our work, the first set of features is extracted from the statistics over the coefficients $S_m(x, y)$ of each sub-band for the scales $m = 1$ to $n = 3$. These statistics are the mean $\mu$, variance $\sigma^2$, skewness $\xi$, and kurtosis $\kappa$. They can be represented as follows:
$$\mu = \frac{1}{N_x N_y} \sum_{x,y} S_m(x, y)$$
$$\sigma^2 = \frac{1}{N_x N_y} \sum_{x,y} \left(S_m(x, y) - \mu\right)^2$$
$$\xi = \frac{1}{N_x N_y\,\sigma^3} \sum_{x,y} \left(S_m(x, y) - \mu\right)^3$$
$$\kappa = \frac{1}{N_x N_y\,\sigma^4} \sum_{x,y} \left(S_m(x, y) - \mu\right)^4 - 3$$
From Equation (24), we can build the first feature vector $Z_s$ of $N_m \times N_{bd} \times n = 4 \times 3 \times 3 = 36$ elements, where $N_m$, $N_{bd}$, and $n$ are the numbers of moments, sub-bands, and scales. The feature vector $Z_s$ is represented as follows:

$$Z_s = [Z_1, Z_2, Z_3]$$

where:

$$Z_1 = [\mu_{H_1}, \mu_{V_1}, \mu_{D_1} \mid \sigma_{H_1}, \sigma_{V_1}, \sigma_{D_1} \mid \xi_{H_1}, \xi_{V_1}, \xi_{D_1} \mid \kappa_{H_1}, \kappa_{V_1}, \kappa_{D_1}]$$
$$Z_2 = [\mu_{H_2}, \mu_{V_2}, \mu_{D_2} \mid \sigma_{H_2}, \sigma_{V_2}, \sigma_{D_2} \mid \xi_{H_2}, \xi_{V_2}, \xi_{D_2} \mid \kappa_{H_2}, \kappa_{V_2}, \kappa_{D_2}]$$
$$Z_3 = [\mu_{H_3}, \mu_{V_3}, \mu_{D_3} \mid \sigma_{H_3}, \sigma_{V_3}, \sigma_{D_3} \mid \xi_{H_3}, \xi_{V_3}, \xi_{D_3} \mid \kappa_{H_3}, \kappa_{V_3}, \kappa_{D_3}]$$
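The following sketch extracts this 36-component vector; PyWavelets' 'db4' filters stand in for the separable QMFs of [30] (an assumption), and the component ordering differs from the listing above without affecting classification.

```python
import numpy as np
import pywt

def four_moments(sb):
    """Mean, variance, skewness, kurtosis of one sub-band (Equation (24))."""
    mu = sb.mean()
    var = ((sb - mu) ** 2).mean()
    sd = np.sqrt(var)
    return [mu, var,
            ((sb - mu) ** 3).mean() / sd ** 3,
            ((sb - mu) ** 4).mean() / sd ** 4 - 3]

def zs_features(img, n=3):
    """First feature vector Z_s: 4 moments x 3 detail sub-bands x n scales."""
    feats, approx = [], img.astype(float)
    for _ in range(n):
        approx, (h, v, d) = pywt.dwt2(approx, "db4")
        feats += four_moments(h) + four_moments(v) + four_moments(d)
    return np.array(feats)
```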
The second set of statistics is based on the prediction errors of the coefficients $S_m(x, y)$ given by an optimal linear predictor. The sub-band coefficients are correlated with their spatial, orientation, and scale neighbors. Several prediction techniques for the coefficients $S_{H_m}^p(x, y)$, $S_{V_m}^p(x, y)$, and $S_{D_m}^p(x, y)$ ($m$ = 1, 2, 3) may be used. In this work, we used a linear predictor, specifically the one proposed by Farid in [30], as shown below:
$$S_{H_m}^p(x, y) = w_1 S_{H_m}(x-1, y) + w_2 S_{H_m}(x+1, y) + w_3 S_{H_m}(x, y-1) + w_4 S_{H_m}(x, y+1) + w_5 S_{H_{m+1}}\left(\tfrac{x}{2}, \tfrac{y}{2}\right) + w_6 S_{D_m}(x, y) + w_7 S_{D_{m+1}}\left(\tfrac{x}{2}, \tfrac{y}{2}\right)$$

$$S_{V_m}^p(x, y) = w_1 S_{V_m}(x-1, y) + w_2 S_{V_m}(x+1, y) + w_3 S_{V_m}(x, y-1) + w_4 S_{V_m}(x, y+1) + w_5 S_{V_{m+1}}\left(\tfrac{x}{2}, \tfrac{y}{2}\right) + w_6 S_{D_m}(x, y) + w_7 S_{D_{m+1}}\left(\tfrac{x}{2}, \tfrac{y}{2}\right)$$

$$S_{D_m}^p(x, y) = w_1 S_{D_m}(x-1, y) + w_2 S_{D_m}(x+1, y) + w_3 S_{D_m}(x, y-1) + w_4 S_{D_m}(x, y+1) + w_5 S_{D_{m+1}}\left(\tfrac{x}{2}, \tfrac{y}{2}\right) + w_6 S_{H_m}(x, y) + w_7 S_{V_{m+1}}\left(\tfrac{x}{2}, \tfrac{y}{2}\right)$$
For more clarity, in Figure 14, we provide the block diagram for the prediction of coefficient S V 1 p ( x , y ) .
The parameters w i (scalar weighting values) of the error prediction coefficients of each sub-band for a given level m are adjusted to minimize the prediction error by minimizing the quadratic error function, as shown below:
$$E(w) = \left[S_m - Qw\right]^2$$
The columns of the matrix Q contain the neighboring coefficient magnitudes, as specified in Equations (25)–(27). The quadratic error function is minimized analytically as follows:
$$\frac{dE(w)}{dw} = -2Q^T\left(S_m - Qw\right) = 0$$
Then, we obtain:
$$w_{opt} = (Q^TQ)^{-1}Q^TS_m$$
For the optimal predictor, we use the log error given by the following equation to predict error coefficients of each sub-band for a given level m:
$$\epsilon_m^p = \log_2 S_m - \log_2\left(|Qw_{opt}|\right)$$
By using Equation (31), additional statistics are collected, namely the mean, variance, skewness, and kurtosis (see Equation (24)). The feature vector Z ϵ p is similar to Z s ; it is represented as follows:
$$Z_{\epsilon^p} = [Z_1^{\epsilon^p}, Z_2^{\epsilon^p}, Z_3^{\epsilon^p}]$$

where, for $m = 1, 2, 3$:

$$Z_m^{\epsilon^p} = [\mu_{\epsilon_{H_m}^p}, \mu_{\epsilon_{V_m}^p}, \mu_{\epsilon_{D_m}^p} \mid \sigma_{\epsilon_{H_m}^p}, \sigma_{\epsilon_{V_m}^p}, \sigma_{\epsilon_{D_m}^p} \mid \xi_{\epsilon_{H_m}^p}, \xi_{\epsilon_{V_m}^p}, \xi_{\epsilon_{D_m}^p} \mid \kappa_{\epsilon_{H_m}^p}, \kappa_{\epsilon_{V_m}^p}, \kappa_{\epsilon_{D_m}^p}]$$
Finally, the feature vector that will be used for the learning classifier is represented by Z = [ Z s | Z ϵ p ] . It contains 72 components.
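The optimal predictor and its log error reduce to a least-squares fit; the small constant guarding the logarithms is a numerical assumption:

```python
import numpy as np

def log_prediction_error(s, Q, eps=1e-12):
    """w_opt = (Q^T Q)^{-1} Q^T s via least squares, then the log error
    e = log2|s| - log2|Q w_opt| (Equation (31)).
    s: vector of sub-band coefficient magnitudes; Q: neighbour magnitudes."""
    w_opt, *_ = np.linalg.lstsq(Q, s, rcond=None)   # minimizes ||s - Q w||^2
    return np.log2(np.abs(s) + eps) - np.log2(np.abs(Q @ w_opt) + eps)
```

The returned error vector is then fed to the same four-moment computation as the raw sub-bands to build $Z_{\epsilon^p}$.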

5.2.2. Method 2: Feature Vectors Extracted from Empirical Moments of CF-Based Multi-Resolution

The first set of feature vectors Z s is extracted based on the CF and the wavelet decomposition, as proposed by Shi et al. [31]. The statistical moments of the characteristic function ϕ ( k ) of order n = 1 to 3 are represented for each sub-band ( A m , H m , V m , D m ) at different levels m = 1, 2, and 3 of the wavelet decomposition as follows:
$$M_{S_m}^n = \frac{\sum_{k=1}^{N/2} |\phi(k)| \times k^n}{\sum_{k=1}^{N/2} |\phi(k)|}$$

Here,

$$\phi(k) = \sum_{i=1}^{N} h(i) \exp\left(\frac{-j 2 \pi i k}{K}\right), \quad 1 \le k \le K$$

is a component of the characteristic function at frequency $k$, calculated from the histogram $h$ of the sub-band $S_m$, and $N$ is the total number of points of the histogram. Equation (32) allows us to build the first feature vector $Z_s$ of 12 × 3 = 36 components plus the 3 moments of the initial image, listed as follows:
$$Z_s = [M_I^1, M_I^2, M_I^3 \mid M_{A_1}^1, M_{A_1}^2, M_{A_1}^3 \mid M_{H_1}^1, M_{H_1}^2, M_{H_1}^3 \mid M_{V_1}^1, M_{V_1}^2, M_{V_1}^3 \mid M_{D_1}^1, M_{D_1}^2, M_{D_1}^3 \mid M_{A_2}^1, M_{A_2}^2, M_{A_2}^3 \mid M_{H_2}^1, M_{H_2}^2, M_{H_2}^3 \mid M_{V_2}^1, M_{V_2}^2, M_{V_2}^3 \mid M_{D_2}^1, M_{D_2}^2, M_{D_2}^3 \mid M_{A_3}^1, M_{A_3}^2, M_{A_3}^3 \mid M_{H_3}^1, M_{H_3}^2, M_{H_3}^3 \mid M_{V_3}^1, M_{V_3}^2, M_{V_3}^3 \mid M_{D_3}^1, M_{D_3}^2, M_{D_3}^3]$$
In the above equation, $M_I^1$, $M_I^2$, $M_I^3$ are the moments of the initial image.
The second category of features is calculated from the moments of prediction-error image and its wavelet decomposition.
Prediction-error image:
In steganalysis, we only care about the distortion caused by data-hiding. This distortion may be rather weak and, hence, masked by other kinds of noise, including noise caused by peculiar features of the image itself. To make steganalysis more effective, it is necessary to keep the noise due to the hiding process and eliminate most of the other noise. For this purpose, we calculate the moments of the characteristic functions of order $n$ = 1 to 3 of the prediction-error image and of its wavelet decomposition at the levels $m$ = 1, 2, and 3 (see Equation (32)). The prediction-error image is obtained by subtracting the predicted image (in which each pixel grayscale value is predicted from its neighboring pixels' grayscale values; see Equation (34)) from the cover image. Such features make steganalysis more efficient because the hidden data is usually unrelated to the cover media. The predicted pixel is expressed as follows:
$$\hat{x} = \begin{cases} \max(a, b) & \text{if } c \le \min(a, b) \\ \min(a, b) & \text{if } c \ge \max(a, b) \\ a + b - c & \text{otherwise} \end{cases}$$
In the above equation, $a$, $b$, $c$ are the context of the pixel $x$ under consideration, and $\hat{x}$ is the predicted value of $x$. The locations of $a$, $b$, $c$ are illustrated in Figure 15.
The feature vector $Z_{\epsilon^p}$ is represented as follows:

$$Z_{\epsilon^p} = [M_{\epsilon^p}^1, M_{\epsilon^p}^2, M_{\epsilon^p}^3 \mid M_{A_1}^1, M_{A_1}^2, M_{A_1}^3 \mid M_{H_1}^1, M_{H_1}^2, M_{H_1}^3 \mid M_{V_1}^1, M_{V_1}^2, M_{V_1}^3 \mid M_{D_1}^1, M_{D_1}^2, M_{D_1}^3 \mid M_{A_2}^1, M_{A_2}^2, M_{A_2}^3 \mid M_{H_2}^1, M_{H_2}^2, M_{H_2}^3 \mid M_{V_2}^1, M_{V_2}^2, M_{V_2}^3 \mid M_{D_2}^1, M_{D_2}^2, M_{D_2}^3 \mid M_{A_3}^1, M_{A_3}^2, M_{A_3}^3 \mid M_{H_3}^1, M_{H_3}^2, M_{H_3}^3 \mid M_{V_3}^1, M_{V_3}^2, M_{V_3}^3 \mid M_{D_3}^1, M_{D_3}^2, M_{D_3}^3]$$

where all moments are computed on the prediction-error image and its wavelet decomposition. In the above equation, $M_{A_1}^1$, $M_{A_1}^2$, $M_{A_1}^3$ are the 1st, 2nd, and 3rd order moments of the corresponding CFs from the sub-band $A_1$ of the first-level decomposition of the error image.
Finally, the feature vector that will be used for learning classification is Z = [ Z s | Z ϵ p ] , containing 78 components.
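Both building blocks of this method admit short sketches: the CF moments of Equation (32), computed from a sub-band histogram, and the prediction-error image of Equation (34). The 256-bin histogram and the context layout (a = left, b = upper, c = upper-left neighbor) are our assumptions, since Figure 15 is not reproduced here.

```python
import numpy as np

def cf_moments(subband, orders=(1, 2, 3), bins=256):
    """CF moments M^n = sum_k |phi(k)| k^n / sum_k |phi(k)| over k = 1..N/2,
    where phi is the DFT of the sub-band histogram."""
    hist, _ = np.histogram(subband.flatten(), bins=bins)
    phi = np.abs(np.fft.fft(hist))[1:bins // 2 + 1]   # skip DC, keep half
    k = np.arange(1, bins // 2 + 1)
    return [float(np.sum(phi * k ** n) / np.sum(phi)) for n in orders]

def prediction_error_image(img):
    """Prediction-error image using the predictor of Equation (34)."""
    x = img.astype(int)
    a, b, c = x[1:, :-1], x[:-1, 1:], x[:-1, :-1]     # left, upper, upper-left
    lo, hi = np.minimum(a, b), np.maximum(a, b)
    pred = np.where(c <= lo, hi, np.where(c >= hi, lo, a + b - c))
    return x[1:, 1:] - pred
```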

5.2.3. Method 3: Feature Vector Extracted from Empirical Moments Based on the CF and the PDF of the Image Prediction Error and Its Different Sub-Bands of the Multi-Resolution Decomposition

The first characteristic vector $Z_s$ combines two types of normalized moments: moments based on the probability density function and moments based on the characteristic function of the various sub-bands of the three-level multi-resolution decomposition of the gray image. We use the expression of Wang and Moulin [32] to calculate the moments of order $n$ = 1 to 6 of the initial image and its sub-bands $(A_m, H_m, V_m, D_m)$ of the three-level ($m$ = 1 to 3) wavelet decomposition, as shown below:
$$M_{S_m}^n = \frac{\sum_{k=1}^{N/2} |\phi(k)| \times \sin^n\left(\frac{\pi k}{K}\right)}{\sum_{k=1}^{N/2} |\phi(k)|}$$

Here,

$$\phi(k) = \sum_{i=1}^{N} h(i) \exp\left(\frac{-j 2 \pi i k}{K}\right), \quad 1 \le k \le K$$

is a component of the characteristic function at frequency $k$, estimated from the histogram. Equation (35) already provides a feature vector of 6 × 1 + 6 × (4 × 3) = 78 components. In addition, to improve the performance of the learning system, we calculate the moments of the sub-bands $A_2'$, $H_2'$, $V_2'$, $D_2'$ obtained from the decomposition of the diagonal sub-band $D_1$. Therefore, the total size of the vector $Z_s$ is 78 + (6 × 4) = 102 components.
$$Z_s = [M_I^i \mid M_{A_1}^i \mid M_{H_1}^i \mid M_{V_1}^i \mid M_{D_1}^i \mid M_{A_2}^i \mid M_{H_2}^i \mid M_{V_2}^i \mid M_{D_2}^i \mid M_{A_3}^i \mid M_{H_3}^i \mid M_{V_3}^i \mid M_{D_3}^i \mid M_{A_2'}^i \mid M_{H_2'}^i \mid M_{V_2'}^i \mid M_{D_2'}^i], \quad i = 1, 2, \ldots, 6$$
For example, $M_I^i = [M_I^1, M_I^2, M_I^3, M_I^4, M_I^5, M_I^6]$ are the first six order moments of the original image.
The second category of characteristics consists of the first six moments of the prediction error $\epsilon_m^p = \log_2 S_m - \log_2(|Qw_{opt}|)$ of the coefficients of each sub-band for a given level $m$, as shown below:

$$m_{\epsilon_m^p}^n = \frac{1}{N} \sum_{i=1}^{N} \left(\epsilon_m^p(i)\right)^n, \quad n = 1, 2, \ldots, 6$$
The vector of the second category is defined by $Z_{\epsilon^p}$, as shown below:

$$Z_{\epsilon^p} = [m_{\epsilon_{H_m}}^i \mid m_{\epsilon_{V_m}}^i \mid m_{\epsilon_{D_m}}^i], \quad m = 1, 2, 3; \quad i = 1, 2, \ldots, 6$$

The size of $Z_{\epsilon^p}$ is 3 × 6 × 3 = 54 components.
Finally, the feature vector to be used for classification by learning is Z = [ Z s | Z ϵ p ] . It has 156 components.
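The sin-weighted moments of Equation (35) differ from the CF moments of Equation (32) only in the weighting, as this sketch shows (same 256-bin histogram assumption):

```python
import numpy as np

def wang_cf_moments(subband, bins=256):
    """M^n = sum_k |phi(k)| sin^n(pi k / K) / sum_k |phi(k)|, n = 1..6."""
    hist, _ = np.histogram(subband.flatten(), bins=bins)
    phi = np.abs(np.fft.fft(hist))[1:bins // 2 + 1]
    w = np.sin(np.pi * np.arange(1, bins // 2 + 1) / bins)
    return [float(np.sum(phi * w ** n) / np.sum(phi)) for n in range(1, 7)]
```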

5.3. Classification

The last stage of the learning and test process of the universal steganalysis is classification (see Figure 12). Its objective is to group the images into two classes, class of the cover images and class of the stego images, according to their feature vectors. We adopt the Fisher linear discriminator (FLD) and the support vector machine (SVM) for training and testing.

5.3.1. FLD Classifier

Below, we reformulate the FLD classifier for our application with two classes. Let $Z = \{Z_1, Z_2, \ldots, Z_N\}$ be a set of feature vectors, each of dimension $n_d$. Among these vectors, $N_1$ vectors $Z_c$ are labeled 1, indicating cover images, and $N_2$ vectors $Z_s$ are labeled 2, indicating stego images, with $N = N_1 + N_2$. We form the projected values $Z_p = \{Z_{p1}, Z_{p2}, \ldots, Z_{pN}\}$ of dimension $N$ through linear combinations of the feature vectors as follows:

$$Z_p = W^t Z$$

In the above equation, $W$ is an orientation vector of dimension $n_d$.

In our study, the feature vectors $Z$ are projected into a space of two classes. This projection tends to maximize the distance between the projected class means $(M_c^p, M_s^p)$ while minimizing the projected class scatters $(S_c^p, S_s^p)$.
  • Learning process
    The learning process involves optimizing the following expression:
$$J(W) = \frac{|M_c^p - M_s^p|^2}{S_c^p + S_s^p}$$
    where:
$$M_c^p = \frac{1}{N_1} \sum_{Z_p \in Z_c^p} Z_p = \frac{1}{N_1} \sum_{Z \in Z_c} W^t Z = W^t M_c$$
    is the mean feature vector of cover class after projection, and
$$M_c = \frac{1}{N_1} \sum_{Z \in Z_c} Z$$
    is the mean feature vector of cover class of dimension n d .
    The mean feature vector of stego class after projection is represented as follows:
$$M_s^p = \frac{1}{N_2} \sum_{Z_p \in Z_s^p} Z_p = \frac{1}{N_2} \sum_{Z \in Z_s} W^t Z = W^t M_s$$
    where:
$$M_s = \frac{1}{N_2} \sum_{Z \in Z_s} Z$$
    is the mean feature vector of a stego class of dimension n d .
    The scatter matrix of the cover class after projection has been shown as follows:
$$S_c^p = \sum_{Z_p \in Z_c^p} \left(Z_p - M_c^p\right)^2 = \sum_{Z \in Z_c} \left(W^t Z - W^t M_c\right)^2 = \sum_{Z \in Z_c} W^t (Z - M_c)(Z - M_c)^t W = W^t S_c W$$
    where:
$$S_c = \sum_{Z \in Z_c} (Z - M_c)(Z - M_c)^t$$
    is the scatter matrix (of dimension n d × n d ) of a cover class.
    The scatter matrix of the projected samples of a stego class has been shown as follows:
$$S_s^p = \sum_{Z_p \in Z_s^p} \left(Z_p - M_s^p\right)^2 = \sum_{Z \in Z_s} \left(W^t Z - W^t M_s\right)^2 = \sum_{Z \in Z_s} W^t (Z - M_s)(Z - M_s)^t W = W^t S_s W$$
    where:
$$S_s = \sum_{Z \in Z_s} (Z - M_s)(Z - M_s)^t$$
    is a scatter matrix (of dimension n d × n d ) for the samples in the original feature space of a stego class.
    The within-class scatter matrix after projection is defined as follows:
$$S_c^p + S_s^p = W^t (S_c + S_s) W = W^t S_W W$$
    where:
$$S_W = S_c + S_s$$
    The difference between the projected means is expressed as follows:
$$(M_c^p - M_s^p)^2 = (W^t M_c - W^t M_s)^2 = W^t (M_c - M_s)(M_c - M_s)^t W = W^t S_B W$$
    where:
$$S_B = (M_c - M_s)(M_c - M_s)^t$$
    We can finally express the Fisher criterion (Equation (39)) in terms of S B and S W as follows:
$$J(W) = \frac{W^t S_B W}{W^t S_W W}$$
The solution of Equation (52) is given by [49]:

$$W_{opt} = S_W^{-1} (M_c - M_s)$$
  • Testing process
    The testing process (classification step) is conducted as follows:
    Let Z be the matrix containing the feature vectors of covers and stegos.
    The projection of Z on the orientation vector W o p t gives all projected values Z p .
$$Z_p(j) = \sum_{i=1}^{n_d} W_{opt}(i) \times Z(i, j) + b, \quad j = 1, 2, \ldots, N$$
$b$ is a discrimination threshold between the two classes; it can be fixed so that the decision boundary lies halfway between the projected means of the cover and stego classes:

$$b = -0.5 \times (M_c^p + M_s^p)$$
    with:
$$M_c^p = W_{opt}^t \times M_c, \quad M_s^p = W_{opt}^t \times M_s$$
In the above equations, $W_{opt}^t$ is the transpose of $W_{opt}$.
The result $Z_p(j)$, $j = 1, \ldots, N$, determines the cover or stego class of every test image: if $Z_p(j) \ge 0$, then the image under test is a cover; otherwise, it is a stego.
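The whole training and testing procedure above reduces to a few lines of linear algebra; in this sketch the projected-midpoint threshold is folded into $b$:

```python
import numpy as np

def fld_train(Zc, Zs):
    """Zc: N1 x nd cover features, Zs: N2 x nd stego features.
    Returns W_opt = S_w^{-1}(M_c - M_s) and the midpoint threshold b."""
    Mc, Ms = Zc.mean(axis=0), Zs.mean(axis=0)
    Sw = (Zc - Mc).T @ (Zc - Mc) + (Zs - Ms).T @ (Zs - Ms)
    W = np.linalg.solve(Sw, Mc - Ms)
    b = -0.5 * (W @ Mc + W @ Ms)
    return W, b

def fld_is_cover(W, b, Z):
    """Label an image cover when its projection Z_p = W.Z + b is non-negative."""
    return Z @ W + b >= 0
```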

5.3.2. SVM Classifier

According to numerous recent studies, the SVM classification method outperforms other data classification algorithms in terms of classification accuracy [50]. SVM performs classification by constructing a hyperplane that optimally separates the data into two categories.
Let $(Z_i, y_i)$, $1 \le i \le N$, be a set of training examples, where each example $Z_i \in \mathbb{R}^{n_d}$, $n_d$ being the dimension of the input space, belongs to a class labeled $y_i \in \{-1, 1\}$. SVM classification constructs a hyperplane $W^T Z + b = 0$ that best separates the data through the following minimization:

$$\min_{w, b, \zeta}\ \frac{1}{2} \|w\|^2 + C \sum_{i=1}^{N} \zeta_i \quad \text{subject to: } y_i(w \cdot Z_i + b) \ge 1 - \zeta_i$$
The variables $\zeta_i \ge 0$ are called slack variables, and they measure the error made at point $(Z_i, y_i)$. The parameter $C > 0$ can be viewed as a way to control overfitting: it sets the trade-off between regularization and constraint violation.
Problems related to quadratic optimization are a well-known class of mathematical programming problems, and many (rather intricate) algorithms exist to aid in solving them. Solutions involve constructing a dual problem where a Lagrange multiplier α i is associated with every constraint in the primary problem, as shown below:
$$L(\alpha) = \sum_i \alpha_i - \frac{1}{2} \sum_i \sum_j \alpha_i \alpha_j y_i y_j Z_i^T Z_j \quad \text{subject to: } \sum_i \alpha_i y_i = 0, \quad 0 \le \alpha_i \le C$$
The $\alpha_i$, or Lagrange multipliers, are also known as support values.
The linear classifier presented previously is very limited. In most cases, classes not only overlap, but the genuine separation functions are non-linear hyper-surfaces. The motivation for the following extension is that an SVM that can create a non-linear decision hyper-surface will be able to classify non-linearly separable data.
The idea is that the input space can always be mapped on to a higher dimensional feature space where the training set is separable.
The linear classifier relies on the dot product between vectors, $K(Z_i, Z_j) = Z_i^T Z_j$. If every data point is mapped to a high-dimensional space via some transformation $\Phi: Z \to \varphi(Z)$, the dot product becomes $K(Z_i, Z_j) = \varphi(Z_i)^T \varphi(Z_j)$. Then, in the dual formulation, we maximize the following:
$$L(\alpha) = \sum_{i=1}^{N} \alpha_i - \frac{1}{2} \sum_i \sum_j \alpha_i \alpha_j y_i y_j K(Z_i, Z_j) \quad \text{subject to: } \sum_i \alpha_i y_i = 0, \quad 0 \le \alpha_i \le C$$
Subsequently, the decision function turns into the following:
$$f(Z) = \mathrm{sgn}\left(\sum_{i=1}^{N} \alpha_i y_i K(Z_i, Z) + b\right)$$
It should be noted that the dual formulation only requires access to the kernel function and not the features Φ ( . ) , allowing one to solve the formulation in very high-dimensional feature spaces efficiently. This is also called the kernel trick.
There are many kernel functions in SVM. Therefore, determining how to select a good kernel function is also a research issue. However, for general purposes, there are some popular kernel functions [50,51], which have been listed as follows:
  • Linear Kernel:
$$K(Z_i, Z_j) = Z_i^T Z_j$$
  • Polynomial Kernel:
$$K(Z_i, Z_j) = \left(\gamma Z_i^T Z_j + r\right)^d, \quad \gamma > 0$$
  • RBF Kernel:
$$K(Z_i, Z_j) = \exp\left(-\gamma \|Z_i - Z_j\|^2\right), \quad \gamma > 0$$
  • Sigmoid Kernel:
$$K(Z_i, Z_j) = \tanh\left(\gamma Z_i^T Z_j + r\right)$$
Here, γ , r, and d are kernel parameters.
In our work, we used the RBF kernel function.
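With scikit-learn, the RBF-kernel classifier used in this work can be set up as follows; the C and gamma values are illustrative defaults, not the values tuned in our experiments.

```python
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

def train_svm(Z_train, y_train):
    """RBF-kernel SVM on the Section 5.2 feature vectors; labels y in {-1, +1}.
    Feature standardization is added here as a common practical step."""
    clf = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=10.0, gamma="scale"))
    return clf.fit(Z_train, y_train)
```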

6. Experimental Results of Steganalysis

In this section, we present some experimental results that were obtained from the studied steganalysis system that was applied to the enhanced steganographic methods in the spatial and frequency domain. For this purpose, the image dataset UCID [52,53] is used, which includes 1338 uncompressed color images, and all the images were converted to grayscale before conducting the experiments.
In our experiments, we first created the stego images using the following steganographic methods: Enhanced EALSBMR (EEALSBMR), Enhanced DCT steganography (EDCT), and Enhanced DWT steganography (EDWT), with different embedding rates of 5%, 10%, and 20%. Following this, we extracted the image features using the three feature-extraction techniques described above (the Farid, Shi, and Moulin techniques) for both the cover and stego images. Finally, we employed the FLD and SVM classifiers to classify the images as either containing a hidden message or not. The evaluation of the classification (binary classification) and of the steganalysis (and, indirectly, of the efficiency of the insertion methods) is performed by calculating the following parameters: the sensitivity, specificity, and precision of the confusion matrix and the Kappa coefficient (see Table 8 and Equation (64)):
$$Kappa = \frac{P_0 - P_a}{1 - P_a}$$
with:
$$P_0 = TP + TN; \quad P_a = (TP + FP) \times (TP + FN) + (FN + TN) \times (FP + TN)$$
In the above equation, $P_0$ is the total agreement probability (related to the accuracy), and $P_a$ is the agreement probability that arises by chance.
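These evaluation measures can be computed from the confusion-matrix entries as sketched below; the rates are assumed normalized so that TP + TN + FP + FN = 1, matching the definitions of $P_0$ and $P_a$ above.

```python
def steganalysis_scores(TP, TN, FP, FN):
    """Sensitivity, specificity, precision, accuracy and the Kappa coefficient."""
    Se = TP / (TP + FN)
    Sp = TN / (TN + FP)
    Pr = TP / (TP + FP)
    P0 = TP + TN                                     # accuracy
    Pa = (TP + FP) * (TP + FN) + (FN + TN) * (FP + TN)
    return Se, Sp, Pr, P0, (P0 - Pa) / (1 - Pa)
```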
Here is one possible interpretation of Kappa values:
  • Poor agreement = Less than 0.20
  • Fair agreement = 0.20 to 0.40
  • Moderate agreement = 0.40 to 0.60
  • Good agreement = 0.60 to 0.80
  • Very good agreement = 0.80 to 1.00

6.1. Classification Results Applied to the Steganographic Method EEALSBMR

In Table 9, Table 10, Table 11, Table 12, Table 13 and Table 14, we present the classification results (steganalysis) based on the FLD and SVM classifiers and the Farid, Shi, and Moulin features for the EEALSBMR insertion method with the different insertion rates of 5%, 10%, and 20%. The results show that steganalysis is not effective for any of these insertion rates. Indeed, the values $S_e$, $S_p$, and $P_r$ vary around 50%, so they are not informative and give no indication of the nature of the data. The value of the Kappa coefficient (lower than 0.2) confirms these results. The EEALSBMR steganographic method is therefore robust against statistical steganalysis techniques.

6.2. Classification Results Applied to the Steganographic Method EDCT

The classification results (steganalysis) provided in Table 15, Table 16, Table 17, Table 18, Table 19 and Table 20 for the EDCT insertion method show that with the FLD classifier, when the insertion rate is equal to or higher than 20%, steganalysis is very effective with Shi features and Moulin features, but it is less effective with Farid features. With the SVM classifier, except in the case of Shi features, when an insertion rate of 20% is applied, the results obtained are quite similar to those obtained from the EEALSBMR algorithm and, therefore, steganalysis is not effective. It should be noted that the FLD classifier is more effective for a feature vector of a high dimension than the SVM classifier.

6.3. Classification Results Applied to the Steganographic Method EDWT

With respect to the EDWT method, the results are provided in Table 21, Table 22, Table 23, Table 24, Table 25 and Table 26. These results, obtained with the FLD and SVM classifiers, indicate that the values of the parameters $S_e$, $S_p$, $P_r$, $A_c$, and $Kappa$ are high for all insertion rates and feature vectors (Farid, Shi, and Moulin). These results can easily inform us about the presence of hidden information; therefore, the steganalysis can be concluded to be very effective, and the insertion method is not robust. It should be noted that the steganalysis is very effective here because both the steganographic method and the feature vectors are based on multi-resolution wavelet decomposition.

6.4. Discussion

The enhanced adaptive LSB method of steganography in the spatial domain (EEALSBMR) and the frequency-domain methods (EDCT and EDWT) provide stego images with a good visual quality up to an embedding rate of 40%: the $PSNR$ is over 50 dB, and the distortion is not visible to the naked eye. The security of the message contents, in case the message is detected by an opponent, is ensured by the chaotic system. On the other hand, we applied universal steganalysis methods that can work with all known and unknown steganography algorithms. Universal steganalysis methods exploit the changes in certain inherent features of the cover images when a message is embedded. The accuracy of the classification (discrimination between the two classes, cover and stego) greatly relies on several factors, such as the choice of the right characteristic vectors, the classifier, and its parameters.

7. Conclusions

In this work, we first improved the structure and security of three steganographic methods, studied in the spatial and frequency domains, by integrating a robust proposed chaotic system into them. Following this, we built a statistical steganalysis system to evaluate the robustness of the three enhanced steganographic methods. In this system, we selected three different feature vectors, namely the higher-order statistics of high-frequency wavelet sub-bands and their prediction errors; the statistical moments of the characteristic functions of the prediction-error image, the test image, and their wavelet sub-bands; and both the empirical PDF moments and the normalized absolute CF moments. After this, we applied two types of classifiers, namely FLD and SVM with the RBF kernel.
Extensive experimental work has demonstrated that the proposed steganalysis system based on the multi-dimensional feature vectors can detect hidden messages using the EDWT steganographic method, irrespective of the message size. However, it cannot distinguish between cover and stego images using the EEALSBMR steganographic and EDCT methods if the message size is smaller than 20% and 15%, respectively.

Author Contributions

Funding acquisition, T.M.H.; Supervision, B.B., O.D. and M.K.; Writing—original draft preparation, D.B.; Writing—review & editing, S.E.A., T.M.H.

Funding

This work is supported by the National Foundation for Science and Technology Development (NAFOSTED) of Vietnam through the grant number 102.04-2018.06.

Acknowledgments

The authors thank the anonymous reviewers for useful comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Xia, Z.; Wang, X.; Sun, X.; Liu, Q.; Xiong, N. Steganalysis of LSB matching using differences between nonadjacent pixels. Multimed. Tools Appl. 2016, 75, 1947–1962.
  2. Mohammadi, F.G.; Abadeh, M.S. Image steganalysis using a bee colony based feature selection algorithm. Eng. Appl. Artif. Intell. 2014, 31, 35–43.
  3. Luo, W.; Huang, F.; Huang, J. Edge Adaptive Image Steganography Based on LSB Matching Revisited. IEEE Trans. Inf. Forensics Secur. 2010, 5, 201–214.
  4. Chan, C.K.; Cheng, L. Hiding data in images by simple LSB substitution. Pattern Recognit. 2004, 37, 469–474.
  5. Wu, H.C.; Wu, N.I.; Tsai, C.S.; Hwang, M.S. Image steganographic scheme based on pixel-value differencing and LSB replacement methods. IEE Proc.-Vis. Image Signal Process. 2005, 152, 611–615.
  6. Jung, K.; Ha, K.; Yoo, K. Image Data Hiding Method Based on Multi-Pixel Differencing and LSB Substitution Methods. In Proceedings of the 2008 International Conference on Convergence and Hybrid Information Technology, Daejeon, Korea, 28–30 August 2008; pp. 355–358.
  7. Huang, Q.; Ouyang, W. Protect fragile regions in steganography LSB embedding. In Proceedings of the 2010 Third International Symposium on Knowledge Acquisition and Modeling, Wuhan, China, 20–21 October 2010; pp. 175–178.
  8. Xi, L.; Ping, X.; Zhang, T. Improved LSB matching steganography resisting histogram attacks. In Proceedings of the 2010 3rd International Conference on Computer Science and Information Technology, Chengdu, China, 9–11 July 2010; Volume 1, pp. 203–206.
  9. Swain, G.; Lenka, S.K. Steganography using two sided, three sided, and four sided side match methods. CSI Trans. ICT 2013, 1, 127–133.
  10. Islam, S.; Modi, M.R.; Gupta, P. Edge-based image steganography. EURASIP J. Inf. Secur. 2014, 2014, 1–14.
  11. Mungmode, S.; Sedamkar, R.; Kulkarni, N. A Modified High Frequency Adaptive Security Approach using Steganography for Region Selection based on Threshold Value. Procedia Comput. Sci. 2016, 79, 912–921.
  12. Akhter, F. A Novel Approach for Image Steganography in Spatial Domain. arXiv 2015, arXiv:1506.03681.
  13. Iranpour, M.; Rahmati, M. An efficient steganographic framework based on dynamic blocking and genetic algorithm. Multimed. Tools Appl. 2015, 74, 11429–11450.
  14. Kumar, R.; Chand, S. A reversible high capacity data hiding scheme using pixel value adjusting feature. Multimed. Tools Appl. 2016, 75, 241–259.
  15. Muhammad, K.; Ahmad, J.; Farman, H.; Jan, Z. A new image steganographic technique using pattern based bits shuffling and magic LSB for grayscale images. arXiv 2016, arXiv:1601.01386.
  16. Kordov, K.; Stoyanov, B. Least Significant Bit Steganography using Hitzl-Zele Chaotic Map. Int. J. Electron. Telecommun. 2017, 63, 417–422.
  17. Stoyanov, B.P.; Zhelezov, S.K.; Kordov, K.M. Least significant bit image steganography algorithm based on chaotic rotation equations. C. R. L’Academie Bulgare Sci. 2016, 69, 845–850.
  18. Taleby Ahvanooey, M.; Li, Q.; Hou, J.; Rajput, A.R.; Chen, Y. Modern Text Hiding, Text Steganalysis, and Applications: A Comparative Analysis. Entropy 2019, 21, 355.
  19. Sadat, E.S.; Faez, K.; Saffari Pour, M. Entropy-Based Video Steganalysis of Motion Vectors. Entropy 2018, 20, 244.
  20. Yu, C.; Li, X.; Chen, X.; Li, J. An Adaptive and Secure Holographic Image Watermarking Scheme. Entropy 2019, 21, 460.
  21. Hashad, A.; Madani, A.S.; Wahdan, A.E.M.A. A robust steganography technique using discrete cosine transform insertion. In Proceedings of the 2005 International Conference on Information and Communication Technology, Cairo, Egypt, 5–6 December 2005; pp. 255–264.
  22. Fard, A.M.; Akbarzadeh-T, M.R.; Varasteh-A, F. A new genetic algorithm approach for secure JPEG steganography. In Proceedings of the 2006 IEEE International Conference on Engineering of Intelligent Systems, Islamabad, Pakistan, 22–23 April 2006; pp. 1–6.
  23. McKeon, R.T. Strange Fourier steganography in movies. In Proceedings of the 2007 IEEE International Conference on Electro/Information Technology, Chicago, IL, USA, 17–20 May 2007; pp. 178–182.
  24. Abdelwahab, A.; Hassaan, L. A discrete wavelet transform based technique for image data hiding. In Proceedings of the 2008 National Radio Science Conference, Tanta, Egypt, 18–20 March 2008; pp. 1–9.
  25. Singh, I.; Khullar, S.; Laroiya, D.S. DFT based image enhancement and steganography. Int. J. Comput. Sci. Commun. Eng. 2013, 2, 5–7.
  26. Samata, R.; Parghi, N.; Vekariya, D. An Enhanced Image Steganography Technique using DCT, Jsteg and Data Mining Bayesian Classification Algorithm. Int. J. Sci. Technol. Eng. (IJSTE) 2015, 2, 9–13.
  27. Karri, S.; Sur, A. Steganographic algorithm based on randomization of DCT kernel. Multimed. Tools Appl. 2015, 74, 9207–9230.
  28. Pan, J.S.; Li, W.; Yang, C.S.; Yan, L.J. Image steganography based on subsampling and compressive sensing. Multimed. Tools Appl. 2015, 74, 9191–9205.
  29. Ali, M.; Ahn, C.W.; Siarry, P. Differential evolution algorithm for the selection of optimal scaling factors in image watermarking. Eng. Appl. Artif. Intell. 2014, 31, 15–26.
  30. Farid, H. Detecting hidden messages using higher-order statistical models. In Proceedings of the International Conference on Image Processing, Rochester, NY, USA, 22–25 September 2002; Volume 2.
  31. Shi, Y.Q.; Zou, D.; Chen, W.; Chen, C. Image steganalysis based on moments of characteristic functions using wavelet decomposition, prediction-error image, and neural network. In Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, Amsterdam, The Netherlands, 6 July 2005; p. 4.
  32. Wang, Y.; Moulin, P. Optimized Feature Extraction for Learning-Based Image Steganalysis. IEEE Trans. Inf. Forensics Secur. 2007, 2, 31–45.
  33. Abutaha, M. Real-Time and Portable Chaos-Based Crypto-Compression Systems for Efficient Embedded Architectures. Ph.D. Thesis, University of Nantes, Nantes, France, 2017.
  34. Abu Taha, M.; El Assad, S.; Queudet, A.; Deforges, O. Design and efficient implementation of a chaos-based stream cipher. Int. J. Internet Technol. Secur. Trans. 2017, 7, 89–114.
  35. El Assad, S. Chaos based information hiding and security. In Proceedings of the 2012 International Conference for Internet Technology and Secured Transactions, London, UK, 10–12 December 2012; pp. 67–72.
  36. Song, C.Y.; Qiao, Y.L.; Zhang, X.Z. An image encryption scheme based on new spatiotemporal chaos. Opt.-Int. J. Light Electron Opt. 2013, 124, 3329–3334.
  37. Tataru, R.L.; Battikh, D.; El Assad, S.; Noura, H.; Déforges, O. Enhanced adaptive data hiding in spatial LSB domain by using chaotic sequences. In Proceedings of the 2012 Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Piraeus, Greece, 18–20 July 2012; pp. 85–88.
  38. El Assad, S.; Noura, H. Generator of Chaotic Sequences and Corresponding Generating System. International Patent No. WO2011121218A1, 28 March 2011.
  39. Farajallah, M.; El Assad, S.; Deforges, O. Fast and secure chaos-based cryptosystem for images. Int. J. Bifurc. Chaos 2015.
  40. El Assad, S.; Farajallah, M. A new chaos-based image encryption system. Signal Process. Image Commun. 2015.
  41. Battikh, D.; El Assad, S.; Bakhache, B.; Déforges, O.; Khalil, M. Enhancement of two spatial steganography algorithms by using a chaotic system: Comparative analysis. In Proceedings of the 8th International Conference for Internet Technology and Secured Transactions (ICITST-2013), London, UK, 9–12 December 2013; pp. 20–25.
  42. Mielikainen, J. LSB matching revisited. IEEE Signal Process. Lett. 2006, 13, 285–287.
  43. Habib, M.; Bakhache, B.; Battikh, D.; El Assad, S. Enhancement using chaos of a steganography method in DCT domain. In Proceedings of the 2015 Fifth International Conference on Digital Information and Communication Technology and its Applications (DICTAP), Beirut, Lebanon, 29 April–1 May 2015; pp. 204–209.
  44. Danti, A.; Acharya, P. Randomized embedding scheme based on DCT coefficients for image steganography. IJCA Spec. Issue Recent Trends Image Process. Pattern Recognit. 2010, 2, 97–103.
  45. Boora, M.; Gambhir, M. Arnold Transform Based Steganography. Int. J. Soft Comput. Eng. (IJSCE) 2013, 3, 136–140.
  46. Walia, E.; Jain, P.; Navdeep, N. An analysis of LSB & DCT based steganography. Glob. J. Comput. Sci. Technol. 2010, 10, 4–8.
  47. Omrani, T. Conception et cryptanalyse des cryptosystèmes légers pour l’IoT. Ph.D. Thesis, El Manar University, Tunis, Tunisia, 2019. (In French)
  48. Song, X.; Liu, F.; Luo, X.; Lu, J.; Zhang, Y. Steganalysis of perturbed quantization steganography based on the enhanced histogram features. Multimed. Tools Appl. 2015, 74, 11045–11071.
  49. Lee, C.K. Infrared Face Recognition. 2004. Available online: https://apps.dtic.mil/dtic/tr/fulltext/u2/a424713.pdf (accessed on 26 July 2019).
  50. Vapnik, V.N. Statistical Learning Theory; Adaptive and Learning Systems for Signal Processing, Communications, and Control; Wiley: Hoboken, NJ, USA, 1998.
  51. Vapnik, V.N. An overview of statistical learning theory. IEEE Trans. Neural Netw. 1999, 10, 988–999.
  52. Schaefer, G.; Stich, M. UCID: An uncompressed color image database. In Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, San Jose, CA, USA, 18–22 January 2004; pp. 472–480.
  53. Battikh, D.; El Assad, S.; Deforges, O.; Bakhache, B.; Khalil, M. Stéganographie basée chaos pour assurer la sécurité de l’information; Presses Académiques Francophones: Sarrebruck, France, 2015. (In French)
Figure 1. Proposed chaotic generator.
Figure 2. Chaotic generator.
Figure 3. EEALSBMR insertion procedure.
Figure 4. “Peppers” as cover image.
Figure 5. “Bike” as embedded message.
Figure 6. Pseudo-chaotic block selection and its corresponding gray value.
Figure 7. Diagram of the enhanced DCT-based steganographic method.
Figure 8. Diagram of the EDWT algorithm.
Figure 9. (a) Cover image; (b) stego image with an embedding rate of 5%; (c) stego image with an embedding rate of 40%.
Figure 10. (a) Cover image; (b) stego image with an embedding rate of 5%; (c) stego image with an embedding rate of 40%.
Figure 11. (a) Cover image; (b) stego image with an embedding rate of 5%; (c) stego image with an embedding rate of 40%.
Figure 12. Flowchart of the blind steganalysis process.
Figure 13. Multi-resolution wavelet decomposition.
Figure 14. Block diagram for the prediction of coefficient S_{V1}^{p}(x, y).
Figure 15. Prediction context of a pixel x.
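Figure 6 refers to pseudo-chaotic block selection. As a purely illustrative sketch, and not the authors' generator (Figures 1 and 2 show the actual design), the following uses a logistic map as a stand-in chaotic source and ranks its samples to obtain a repetition-free visiting order over the image blocks:

    import numpy as np

    def pseudo_chaotic_block_order(n_blocks, x0=0.7, mu=3.99):
        """Illustrative block-selection permutation driven by a chaotic
        sequence. The logistic map here is only a placeholder for the
        paper's much stronger chaotic generator."""
        samples = np.empty(n_blocks)
        x = x0
        for i in range(n_blocks):
            x = mu * x * (1.0 - x)      # logistic map iteration
            samples[i] = x
        return np.argsort(samples)      # rank the samples -> permutation

    order = pseudo_chaotic_block_order(64)   # e.g., an 8x8 grid of blocks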
Table 1. PSNR, IF, and SSIM values for the EEALSBMR method.

Embedding Rate   Cover Image   PSNR      IF       SSIM
5%               Baboon        68.3810   0.9999   0.9999
                 Lena          68.1847   0.9999   0.9999
                 Peppers       67.7160   0.9999   0.9999
10%              Baboon        65.5986   0.9999   0.9999
                 Lena          65.2821   0.9999   0.9999
                 Peppers       64.7763   0.9999   0.9999
20%              Baboon        62.3551   0.9999   0.9999
                 Lena          62.3559   0.9999   0.9996
                 Peppers       61.7066   0.9999   0.9995
30%              Baboon        60.6902   0.9998   0.9999
                 Lena          60.5630   0.9998   0.9990
                 Peppers       59.9585   0.9998   0.9992
40%              Baboon        59.4245   0.9997   0.9999
                 Lena          59.2608   0.9997   0.9985
                 Peppers       58.6662   0.9997   0.9988
Table 2. PSNR, IF, and SSIM values for the EDCT method.

Embedding Rate   Cover Image   PSNR      IF       SSIM
5%               Baboon        71.2372   0.9999   0.9999
                 Lena          71.1769   0.9999   0.9999
                 Peppers       70.4866   0.9999   0.9999
10%              Baboon        64.8846   0.9999   0.9999
                 Lena          64.9487   0.9999   0.9998
                 Peppers       64.1426   0.9999   0.9998
20%              Baboon        59.6895   0.9997   0.9999
                 Lena          59.6225   0.9997   0.9992
                 Peppers       58.9535   0.9997   0.9993
30%              Baboon        57.4212   0.9995   0.9998
                 Lena          57.3421   0.9995   0.9989
                 Peppers       56.7406   0.9995   0.9988
40%              Baboon        56.3421   0.9994   0.9997
                 Lena          56.2265   0.9994   0.9987
                 Peppers       55.4876   0.9994   0.9985
Table 3. PSNR, IF, and SSIM values for the EDWT method.

Embedding Rate   Cover Image   PSNR      IF       SSIM
5%               Baboon        59.1876   0.9999   0.9999
                 Lena          58.7673   0.9997   0.9999
                 Peppers       58.1699   0.9997   0.9999
10%              Baboon        56.2224   0.9997   0.9999
                 Lena          55.8085   0.9994   0.9999
                 Peppers       55.2086   0.9993   0.9999
20%              Baboon        53.3463   0.9988   0.9999
                 Lena          52.8205   0.9988   0.9999
                 Peppers       52.2269   0.9987   0.9999
30%              Baboon        52.0465   0.9984   0.9999
                 Lena          51.6471   0.9984   0.9999
                 Peppers       51.0509   0.9983   0.9999
40%              Baboon        51.3450   0.9982   0.9999
                 Lena          50.9536   0.9981   0.9999
                 Peppers       50.3417   0.9980   0.9999
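The PSNR and IF columns of Tables 1–3 follow the usual definitions for 8-bit grayscale images; a minimal sketch under that assumption is given below (SSIM is commonly computed with skimage.metrics.structural_similarity, whose exact configuration in the paper is not shown):

    import numpy as np

    def psnr(cover, stego):
        """Peak signal-to-noise ratio in dB for 8-bit images
        (undefined when the two images are identical)."""
        c = cover.astype(np.float64)
        s = stego.astype(np.float64)
        mse = np.mean((c - s) ** 2)
        return 10.0 * np.log10(255.0 ** 2 / mse)

    def image_fidelity(cover, stego):
        """IF = 1 - sum((c - s)^2) / sum(c^2)."""
        c = cover.astype(np.float64)
        s = stego.astype(np.float64)
        return 1.0 - np.sum((c - s) ** 2) / np.sum(c ** 2)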
Table 4. E, R, and IR for the EEALSBMR method.

Embedding Rate   Cover Image   E        R        IR
5%               Baboon        7.3586   0.0802   0.3805
                 Lena          7.4455   0.0693   0.3261
                 Peppers       7.5715   0.0536   0.2975
10%              Baboon        7.3586   0.0802   0.3805
                 Lena          7.4456   0.0693   0.3261
                 Peppers       7.5715   0.0535   0.2976
20%              Baboon        7.3585   0.0802   0.3805
                 Lena          7.4457   0.0693   0.3261
                 Peppers       7.5717   0.0535   0.2977
30%              Baboon        7.3584   0.0802   0.3805
                 Lena          7.4457   0.0693   0.3261
                 Peppers       7.5718   0.0535   0.2975
40%              Baboon        7.3578   0.0803   0.3806
                 Lena          7.4454   0.0693   0.3260
                 Peppers       7.5722   0.0535   0.2973
Table 5. E, R, and IR values for the EDCT method.

Embedding Rate   Cover Image   E        R        IR
5%               Baboon        7.3585   0.0802   0.3804
                 Lena          7.4456   0.0693   0.3261
                 Peppers       7.5716   0.0536   0.2976
10%              Baboon        7.3585   0.0802   0.3805
                 Lena          7.4456   0.0693   0.3262
                 Peppers       7.5717   0.0535   0.2976
20%              Baboon        7.3585   0.0802   0.3804
                 Lena          7.4457   0.0693   0.3263
                 Peppers       7.5725   0.0534   0.2973
30%              Baboon        7.3584   0.0802   0.3802
                 Lena          7.4459   0.0693   0.3261
                 Peppers       7.5730   0.0534   0.2969
40%              Baboon        7.3578   0.0803   0.3806
                 Lena          7.4462   0.0692   0.3257
                 Peppers       7.5734   0.0533   0.2973
Table 6. E, R, and IR values for the EDWT method.

Embedding Rate   Cover Image   E        R        IR
5%               Baboon        7.3581   0.0802   0.3805
                 Lena          7.4455   0.0693   0.3261
                 Peppers       7.5715   0.0536   0.2975
10%              Baboon        7.3580   0.0802   0.3806
                 Lena          7.4456   0.0693   0.3261
                 Peppers       7.5717   0.0535   0.2974
20%              Baboon        7.3580   0.0802   0.3806
                 Lena          7.4456   0.0693   0.3261
                 Peppers       7.5718   0.0535   0.2975
30%              Baboon        7.3580   0.0802   0.3805
                 Lena          7.4456   0.0693   0.3261
                 Peppers       7.5718   0.0535   0.2974
40%              Baboon        7.3580   0.0803   0.3806
                 Lena          7.4457   0.0693   0.3261
                 Peppers       7.5721   0.0533   0.2973
Table 7. E, R, and IR values for the cover images.

Cover Image   E        R        IR
Baboon        7.3585   0.0802   0.3805
Lena          7.4455   0.0693   0.3261
Peppers       7.5715   0.0536   0.2976
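The E and R columns of Tables 4–7 are consistent with the Shannon entropy over 256 gray levels and the relative redundancy R = 1 − E/8 (for instance, 1 − 7.3586/8 ≈ 0.0802 for Baboon). A sketch under that assumption follows; IR is the paper's own measure and is not reproduced here:

    import numpy as np

    def shannon_entropy(img):
        """Shannon entropy E (bits/pixel) of an 8-bit grayscale image
        given as a uint8 array."""
        hist = np.bincount(img.ravel(), minlength=256).astype(np.float64)
        p = hist / hist.sum()
        p = p[p > 0]
        return -np.sum(p * np.log2(p))

    def redundancy(img):
        """Relative redundancy R = 1 - E/8 for 8-bit images."""
        return 1.0 - shannon_entropy(img) / 8.0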
Table 8. Confusion matrix.

                          H0: Stego Image        H1: Cover Image
Test outcome positive     True Positive (TP)     False Positive (FP)
Test outcome negative     False Negative (FN)    True Negative (TN)

Derived measures:
Positive predictive value (PPV), or Precision:   Pr = TP / (TP + FP)
Negative predictive value (NPV):                 NPV = TN / (TN + FN)
True positive rate (TPR), or Sensitivity:        Se = TP / (TP + FN)
True negative rate (TNR), or Specificity:        Sp = TN / (TN + FP)
Accuracy:                                        Ac = (TP + TN) / (TP + FN + FP + TN)
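Every entry of Tables 9–26 below can be derived from the four normalized confusion-matrix cells with the formulas of Table 8; the sketch below also adds Cohen's kappa, which matches the tabulated Kappa values (for balanced cover/stego sets it reduces to 2·Ac − 1):

    def classification_metrics(tp, fp, fn, tn):
        """Metrics of Table 8 plus Cohen's kappa, computed from
        (normalized or raw) confusion-matrix entries."""
        total = tp + fp + fn + tn
        pr  = tp / (tp + fp)              # precision (PPV)
        npv = tn / (tn + fn)              # negative predictive value
        se  = tp / (tp + fn)              # sensitivity (TPR)
        sp  = tn / (tn + fp)              # specificity (TNR)
        ac  = (tp + tn) / total           # accuracy
        pe  = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / total ** 2
        kappa = (ac - pe) / (1.0 - pe)    # chance-corrected agreement
        return dict(Pr=pr, NPV=npv, Se=se, Sp=sp, Ac=ac, Kappa=kappa)

    # Reproduces (up to rounding) the 5% block of Table 9:
    print(classification_metrics(0.2744, 0.2714, 0.2256, 0.2286))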
Table 9. FLD classification evaluation of the EEALSBMR algorithm using Farid features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.2744             0.2714             Pr = 0.5027
H1                   0.2256             0.2286             NPV = 0.5033
Se = 0.5487          Sp = 0.4572        Ac = 0.5030        Kappa = 0.0060

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.2690             0.2645             Pr = 0.5042
H1                   0.2310             0.2355             NPV = 0.5048
Se = 0.5380          Sp = 0.4710        Ac = 0.5045        Kappa = 0.0090

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.2745             0.2459             Pr = 0.5275
H1                   0.2255             0.2541             NPV = 0.5298
Se = 0.5490          Sp = 0.5082        Ac = 0.5286        Kappa = 0.0572
Table 10. FLD classification evaluation of the EEALSBMR algorithm using Shi features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.2612             0.2405             Pr = 0.5207
H1                   0.2387             0.2595             NPV = 0.5208
Se = 0.5225          Sp = 0.5190        Ac = 0.5208        Kappa = 0.0415

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.2504             0.2448             Pr = 0.5057
H1                   0.2496             0.2552             NPV = 0.5056
Se = 0.5008          Sp = 0.5105        Ac = 0.5056        Kappa = 0.0112

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.3191             0.1946             Pr = 0.6212
H1                   0.1809             0.3054             NPV = 0.6280
Se = 0.6382          Sp = 0.6108        Ac = 0.6245        Kappa = 0.2490
Table 11. FLD classification evaluation of the EEALSBMR algorithm using Moulin features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.2489             0.2476             Pr = 0.5013
H1                   0.2511             0.2524             NPV = 0.5012
Se = 0.4977          Sp = 0.5048        Ac = 0.5012        Kappa = 0.0025

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.2559             0.2299             Pr = 0.5268
H1                   0.2441             0.2701             NPV = 0.5253
Se = 0.5117          Sp = 0.5403        Ac = 0.5260        Kappa = 0.0520

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.2990             0.1985             Pr = 0.6010
H1                   0.2010             0.3015             NPV = 0.6000
Se = 0.5980          Sp = 0.6030        Ac = 0.6005        Kappa = 0.2010
Table 12. SVM classification evaluation of the EEALSBMR algorithm using Farid features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.3438             0.3431             Pr = 0.5005
H1                   0.1562             0.1569             NPV = 0.5011
Se = 0.6876          Sp = 0.3137        Ac = 0.5007        Kappa = 0.0013

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.4006             0.3977             Pr = 0.5018
H1                   0.0994             0.1023             NPV = 0.5071
Se = 0.8011          Sp = 0.2046        Ac = 0.5029        Kappa = 0.0057

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.3251             0.3199             Pr = 0.5041
H1                   0.1749             0.1801             NPV = 0.5074
Se = 0.6503          Sp = 0.3602        Ac = 0.5052        Kappa = 0.0105
Table 13. SVM classification evaluation of the EEALSBMR algorithm using Shi features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.2220             0.2188             Pr = 0.5037
H1                   0.2780             0.2812             NPV = 0.5029
Se = 0.4440          Sp = 0.5625        Ac = 0.5032        Kappa = 0.0065

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.2189             0.2161             Pr = 0.5032
H1                   0.2811             0.2839             NPV = 0.5024
Se = 0.4377          Sp = 0.5678        Ac = 0.5028        Kappa = 0.0055

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.2282             0.1999             Pr = 0.5330
H1                   0.2718             0.3001             NPV = 0.5247
Se = 0.4564          Sp = 0.6002        Ac = 0.5283        Kappa = 0.0566
Table 14. SVM classification evaluation of the EEALSBMR algorithm using Moulin features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.2275             0.2264             Pr = 0.5013
H1                   0.2725             0.2736             NPV = 0.5010
Se = 0.4550          Sp = 0.5472        Ac = 0.5011        Kappa = 0.0023

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.2412             0.2380             Pr = 0.5034
H1                   0.2588             0.2620             NPV = 0.5031
Se = 0.4825          Sp = 0.5240        Ac = 0.5032        Kappa = 0.0065

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.2922             0.2684             Pr = 0.5212
H1                   0.2078             0.2316             NPV = 0.5271
Se = 0.5844          Sp = 0.4632        Ac = 0.5238        Kappa = 0.0476
Table 15. FLD classification evaluation of the EDCT algorithm using Farid features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.2524             0.2454             Pr = 0.5070
H1                   0.2476             0.2546             NPV = 0.5069
Se = 0.5048          Sp = 0.5091        Ac = 0.5070        Kappa = 0.0139

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.2617             0.2238             Pr = 0.5390
H1                   0.2383             0.2762             NPV = 0.5368
Se = 0.5234          Sp = 0.5524        Ac = 0.5379        Kappa = 0.0758

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.3104             0.1719             Pr = 0.6436
H1                   0.1896             0.3281             NPV = 0.6337
Se = 0.6208          Sp = 0.6562        Ac = 0.6385        Kappa = 0.2770
Table 16. FLD classification evaluation of the EDCT algorithm using Shi features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.2548             0.2343             Pr = 0.5209
H1                   0.2452             0.2657             NPV = 0.5200
Se = 0.5095          Sp = 0.5314        Ac = 0.5205        Kappa = 0.0410

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.3242             0.1893             Pr = 0.6313
H1                   0.1758             0.3107             NPV = 0.6386
Se = 0.6484          Sp = 0.6213        Ac = 0.6349        Kappa = 0.2697

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.4409             0.0635             Pr = 0.8741
H1                   0.0591             0.4365             NPV = 0.8807
Se = 0.8817          Sp = 0.8730        Ac = 0.8773        Kappa = 0.7547
Table 17. FLD classification evaluation of the EDCT algorithm using Moulin features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.2611             0.2499             Pr = 0.5110
H1                   0.2389             0.2501             NPV = 0.5115
Se = 0.5223          Sp = 0.5002        Ac = 0.5112        Kappa = 0.0225

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.2780             0.2136             Pr = 0.5655
H1                   0.2220             0.2864             NPV = 0.5633
Se = 0.5560          Sp = 0.5728        Ac = 0.5644        Kappa = 0.1288

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.3739             0.1243             Pr = 0.7505
H1                   0.1261             0.3757             NPV = 0.7487
Se = 0.7478          Sp = 0.7514        Ac = 0.7496        Kappa = 0.4992
Table 18. SVM classification evaluation of the EDCT algorithm using Farid features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.0653             0.0591             Pr = 0.5249
H1                   0.4347             0.4409             NPV = 0.5035
Se = 0.1307          Sp = 0.8817        Ac = 0.5062        Kappa = 0.0124

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.0848             0.0644             Pr = 0.5683
H1                   0.4152             0.4356             NPV = 0.5120
Se = 0.1695          Sp = 0.8712        Ac = 0.5204        Kappa = 0.0408

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.1734             0.0843             Pr = 0.6729
H1                   0.3266             0.4157             NPV = 0.5600
Se = 0.3469          Sp = 0.8314        Ac = 0.5891        Kappa = 0.1783
Table 19. SVM classification evaluation of the EDCT algorithm using Shi features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.3156             0.3138             Pr = 0.5014
H1                   0.1844             0.1862             NPV = 0.5024
Se = 0.6312          Sp = 0.3724        Ac = 0.5018        Kappa = 0.0036

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.3572             0.3266             Pr = 0.5224
H1                   0.1428             0.1734             NPV = 0.5485
Se = 0.7145          Sp = 0.3469        Ac = 0.5307        Kappa = 0.0613

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.4217             0.2220             Pr = 0.6551
H1                   0.0783             0.2780             NPV = 0.7803
Se = 0.8434          Sp = 0.5560        Ac = 0.6997        Kappa = 0.3994
Table 20. SVM classification evaluation of the EDCT algorithm using Moulin features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.3053             0.3020             Pr = 0.5027
H1                   0.1947             0.1980             NPV = 0.5042
Se = 0.6107          Sp = 0.3960        Ac = 0.5033        Kappa = 0.0067

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.3021             0.2924             Pr = 0.5082
H1                   0.1979             0.2076             NPV = 0.5120
Se = 0.6042          Sp = 0.4152        Ac = 0.5097        Kappa = 0.0194

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.3264             0.2427             Pr = 0.5736
H1                   0.1736             0.2573             NPV = 0.5971
Se = 0.6528          Sp = 0.5147        Ac = 0.5837        Kappa = 0.1674
Table 21. FLD classification evaluation of the EDWT algorithm using Farid features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.4786             0.0150             Pr = 0.9695
H1                   0.0214             0.4850             NPV = 0.9577
Se = 0.9571          Sp = 0.9699        Ac = 0.9635        Kappa = 0.9270

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.4941             0.0056             Pr = 0.9888
H1                   0.0059             0.4944             NPV = 0.9882
Se = 0.9882          Sp = 0.9888        Ac = 0.9885        Kappa = 0.9770

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.4993             0.0005             Pr = 0.9990
H1                   0.0007             0.4995             NPV = 0.9987
Se = 0.9987          Sp = 0.9990        Ac = 0.9989        Kappa = 0.9977
Table 22. FLD classification evaluation of the EDWT algorithm using Shi features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.4048             0.0470             Pr = 0.8961
H1                   0.0952             0.4530             NPV = 0.8263
Se = 0.8095          Sp = 0.9061        Ac = 0.8578        Kappa = 0.7156

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.4536             0.0311             Pr = 0.9358
H1                   0.0464             0.4689             NPV = 0.9100
Se = 0.9072          Sp = 0.9377        Ac = 0.9225        Kappa = 0.8450

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.4753             0.0232             Pr = 0.9534
H1                   0.0247             0.4768             NPV = 0.9508
Se = 0.9507          Sp = 0.9535        Ac = 0.9521        Kappa = 0.9042
Table 23. FLD classification evaluation of the EDWT algorithm using Moulin features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.3946             0.0650             Pr = 0.8587
H1                   0.1054             0.4350             NPV = 0.8049
Se = 0.7891          Sp = 0.8701        Ac = 0.8296        Kappa = 0.6592

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.4394             0.0387             Pr = 0.9191
H1                   0.0606             0.4613             NPV = 0.8839
Se = 0.8789          Sp = 0.9227        Ac = 0.9008        Kappa = 0.8015

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.4603             0.0321             Pr = 0.9348
H1                   0.0397             0.4679             NPV = 0.9218
Se = 0.9206          Sp = 0.9358        Ac = 0.9282        Kappa = 0.8564
Table 24. SVM classification evaluation of the EDWT algorithm using Farid features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.4770             0.0230             Pr = 0.9541
H1                   0.0230             0.4770             NPV = 0.9541
Se = 0.9541          Sp = 0.9541        Ac = 0.9541        Kappa = 0.9082

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.4893             0.0058             Pr = 0.9883
H1                   0.0107             0.4942             NPV = 0.9789
Se = 0.9787          Sp = 0.9884        Ac = 0.9835        Kappa = 0.9670

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.4984             0.0084             Pr = 0.9835
H1                   0.0016             0.4916             NPV = 0.9967
Se = 0.9968          Sp = 0.9832        Ac = 0.9900        Kappa = 0.9800
Table 25. SVM classification evaluation of the EDWT algorithm using Shi features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.3366             0.1658             Pr = 0.6700
H1                   0.1634             0.3342             NPV = 0.6716
Se = 0.6731          Sp = 0.6684        Ac = 0.6708        Kappa = 0.3415

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.4107             0.1371             Pr = 0.7497
H1                   0.0893             0.3629             NPV = 0.8024
Se = 0.8213          Sp = 0.7257        Ac = 0.7735        Kappa = 0.5470

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.4605             0.1175             Pr = 0.7967
H1                   0.0395             0.3825             NPV = 0.9063
Se = 0.9210          Sp = 0.7650        Ac = 0.8430        Kappa = 0.6859
Table 26. SVM classification evaluation of the EDWT algorithm using Moulin features.

Embedding rate 5%    H0: Stego Images   H1: Cover Images
H0                   0.3707             0.1108             Pr = 0.7699
H1                   0.1293             0.3892             NPV = 0.7506
Se = 0.7413          Sp = 0.7785        Ac = 0.7599        Kappa = 0.5198

Embedding rate 10%   H0: Stego Images   H1: Cover Images
H0                   0.4332             0.0725             Pr = 0.8567
H1                   0.0668             0.4275             NPV = 0.8649
Se = 0.8665          Sp = 0.8550        Ac = 0.8608        Kappa = 0.7215

Embedding rate 20%   H0: Stego Images   H1: Cover Images
H0                   0.4672             0.0724             Pr = 0.8659
H1                   0.0328             0.4276             NPV = 0.9288
Se = 0.9345          Sp = 0.8552        Ac = 0.8949        Kappa = 0.7897
