Zeroing Neural Network Approaches Based on Direct and Indirect Methods for Solving the Yang–Baxter-like Matrix Equation

Jiang, Wendong; Lin, Chia-Liang; Katsikis, Vasilios N.; Mourtas, Spyridon D.; Stanimirović, Predrag S.; Simos, Theodore E.

doi:10.3390/math10111950

Open AccessArticle

Zeroing Neural Network Approaches Based on Direct and Indirect Methods for Solving the Yang–Baxter-like Matrix Equation

¹

Department of Digital Media Art, School of Art and Design, Fuzhou University of International Studies and Trade, Fuzhou 350200, China

²

General Department, National & Kapodistrian University of Athens, GR-34400 Euripus Campus, 15772 Athens, Greece

³

Department of Visual Communications, Huzhou University, Huzhou 313000, China

⁴

Department of Economics, Division of Mathematics and Informatics, National and Kapodistrian University of Athens, Sofokleous 1 Street, 10559 Athens, Greece

⁵

Faculty of Sciences and Mathematics, University of Niš, Višegradska 33, 18000 Niš, Serbia

⁶

Department of Medical Research, China Medical University Hospital, China Medical University, Taichung 40402, Taiwan

⁷

Data Recovery Key Laboratory of Sichun Province, Neijing Normal University, Neijiang 641100, China

⁸

Section of Mathematics, Department of Civil Engineering, Democritus University of Thrace, 67100 Xanthi, Greece

^*

Author to whom correspondence should be addressed.

Mathematics 2022, 10(11), 1950; https://0-doi-org.brum.beds.ac.uk/10.3390/math10111950

Submission received: 12 May 2022 / Revised: 1 June 2022 / Accepted: 3 June 2022 / Published: 6 June 2022

(This article belongs to the Special Issue Selected Papers from the International Conference of Numerical Analysis and Applied Mathematics (ICNAAM))

Download

Browse Figures

Versions Notes

Abstract

:

This research introduces three novel zeroing neural network (ZNN) models for addressing the time-varying Yang–Baxter-like matrix equation (TV-YBLME) with arbitrary (regular or singular) real time-varying (TV) input matrices in continuous time. One ZNN dynamic utilizes error matrices directly arising from the equation involved in the TV-YBLME. Moreover, two ZNN models are proposed using basic properties of the YBLME, such as the splitting of the YBLME and sufficient conditions for a matrix to solve the YBLME. The Tikhonov regularization principle enables addressing the TV-YBLME with an arbitrary input real TV matrix. Numerical experiments, including nonsingular and singular TV input matrices, show that the suggested models deal effectively with the TV-YBLME.

Keywords:

Yang–Baxter-like matrix equation (YBLME); zeroing neural network (ZNN); dynamical system; Tikhonov regularization

MSC:

15A24; 65F20; 68T05

1. Introduction, Motivation, and Preliminaries

The Yang–Baxter equation [1,2] is a consistency equation that is frequently encountered in physics. The practice has shown that solving a Yang–Baxter equation is a central topic in braid groups [3], knot theory [4], statistical mechanics [5], and quantum theory [6]. Let

A \in R^{n \times n}

be the input matrix and

X \in R^{n \times n}

be the unknown matrix of interest in the following quadratic matrix equation:

X A X = A X A,

(1)

which we refer to as a Yang–Baxter-like matrix equation (YBLME) since its form is similar to the classic parameter-free Yang–Baxter equation [1,2]. Notice that (1) has two apparent solutions,

X = 0

and

X = A

, but its nonlinearity makes finding non-trivial solutions generally challenging. In the current research, the zeroing (or Zhang) neural network (ZNN) method was employed to solve the following time-varying YBLME (TV-YBLME):

X (t) A (t) X (t) = A (t) X (t) A (t),

(2)

for an arbitrary input real-valued time-varying (TV) matrix

A (t) \in R^{n \times n}

. More precisely, this research proposes and investigates one ZNN model based on an immediate solution to the TV-YBLME and two ZNN models based on indirect methods to solve the TV-YBLME, such as the model proposed in [7]. It is worth mentioning that the two models based on indirect methods arise from the splitting of the YBLME [8] and from sufficient conditions for a matrix to solve the YBLME [9]. Furthermore, unlike the models presented in [7], the three models proposed in this research solve the TV-YBLME for both nonsingular and singular square input matrices

A (t)

because they utilize the Tikhonov regularization procedure.

The ZNN method originated from the Hopfield neural network and was developed by Zhang et al. in [10] for producing online solutions to TV problems. Notice that the majority of ZNN-designed dynamical systems are classified as recurrent neural networks (RNNs) used to find the zeros of equations. The ZNN method, as a result of its in-depth examination, has been broadly utilized to solve a variety of TV problems; the main applications include problems of generalized inversions [11,12,13], matrix equations systems [14,15], problems of tensor and matrix inversions [16], problems of quadratic optimizations [17], linear equations systems [15,18], and various matrix function approximations [19,20].

The first step in creating the ZNN evolution is to create an appropriate error function

Z (t) \in R^{n \times n}

(or Zhang function [21], or error matrix equation (EME)) that is suited to the underlying problem. The second step exploits the subsequent dynamical flow:

\dot{Z} (t) = \frac{d Z (t)}{d t} = - λ F (Z (t)),

(3)

where

(\dot{})

denotes the time derivative, the scaling parameter

λ > 0

is used to accelerate the convergence, whereas

F (\cdot) : R^{n \times n} \to R^{n \times n}

signifies element-wise usage of an increasing and odd activation function (AF) on

Z (t)

. The time derivative of the time-varying matrix

Z (t) = [z_{i j} (t)]

is the matrix

\frac{d Z (t)}{d t} = \dot{Z} (t) = [\frac{d z_{i j} (t)}{d t}]

. More precisely, the time derivative

\dot{Z} (t)

of

Z (t)

represents the derivative of

Z (t)

by the scalar t. Further, a matrix

Z (t) = [z_{i j} (t)] \in R^{n \times m}

is differentiable if the derivative

\frac{d z_{i j} (t)}{d t}

of each element

z_{i j} (t)

exists at each point in its domain. Our research will explore the linear ZNN dynamics (3) that satisfy

F = I

, which yield the following:

\dot{Z} (t) = - λ Z (t) .

(4)

The following are the main points of this work:

Two novel ZNN models (ZNN2 and ZNN3) are introduced based on principles to find indirect numerical solutions to the TV-YBLME for an arbitrary input real TV matrix.
Application of the Tikhonov regularization enables usability of the proposed dynamical systems in solving the TV-YBLME with the arbitrary (regular or singular) input real TV matrix.
In particular, the ZNN model from [7] (ZNN1), based on a straightforward error matrix corresponding to the TV-YBLME, is extended to an arbitrary input real TV matrix using the Tikhonov principle.
Four numerical experiments, including nonsingular and singular input matrices, are presented to confirm the efficiency of the proposed dynamics in addressing the TV-YBLME solving.

Some of the generic notations are also worth mentioning:

I_{n}

signifies the unit matrix with dimensions

n \times n

;

O_{n}

and

1_{n}

signify the matrix consisting of zeros and ones, respectively, with dimensions

n \times n

;

vec (\cdot)

signifies the vectorization procedure; ⊗ signifies the Kronecker product;

{∥\cdot∥}_{F}

signifies the matrix Frobenius norm; ⊙ signifies the Hadamard (or element-wise) product.

This paper is organized as follows. The ZNN1 model is defined and analyzed in Section 2. Further, ZNN2 and ZNN3 models are defined and analyzed in Section 3. The findings of four numerical experiments for solving the TV-YBLME with nonsingular and singular input matrices are presented and discussed in Section 4. Finally, Section 5 contains the final thoughts and conclusions.

2. ZNN Model Based on Direct Solution to the TV-YBLME

In this section, we introduce and analyze a ZNN model, called ZNN1, based on a direct numerical solution to the TV-YBLME for a random real TV matrix. Assuming a TV smooth matrix

A (t) \in R^{n \times n}

(2) is utilized in deploying the ZNN dynamics. That is, the ZNN1 model considers the EME used in [7]:

Z (t) = X (t) A (t) X (t) - A (t) X (t) A (t),

(5)

where

X (t)

is the desirable direct solution of the TV-YBLME. Additionally, the following is the time derivative of (5), i.e.,

\frac{d Z (t)}{d t} = \dot{Z} (t)

:

\begin{matrix} \dot{Z} (t) = & \dot{X} (t) A (t) X (t) + X (t) \dot{A} (t) X (t) + X (t) A (t) \dot{X} (t) \\ - \dot{A} (t) X (t) A (t) - A (t) \dot{X} (t) A (t) - A (t) X (t) \dot{A} (t) . \end{matrix}

(6)

After that, the following can be obtained by merging (5) and (6) with the ZNN method using the linear AF (4):

\begin{array}{l} \dot{X} (t) & A (t) X (t) + X (t) \dot{A} (t) X (t) + X (t) A (t) \dot{X} (t) - \dot{A} (t) X (t) A (t) - A (t) \dot{X} (t) A (t) \\ - A (t) X (t) \dot{A} (t) = - λ (X (t) A (t) X (t) - A (t) X (t) A (t)), \end{array}

(7)

or equivalently:

\begin{matrix} \dot{X} (t) & A (t) X (t) + X (t) A (t) \dot{X} (t) - A (t) \dot{X} (t) A (t) = - λ (X (t) A (t) X (t) \\ - A (t) X (t) A (t)) - X (t) \dot{A} (t) X (t) + \dot{A} (t) X (t) A (t) + A (t) X (t) \dot{A} (t) . \end{matrix}

(8)

The dynamics (8) are adjusted using the Kronecker product and vectorization [7]:

\begin{matrix} ({(A (t) X (t))}^{T} \otimes I_{n} + I_{n} \otimes X (t) A (t) - A (t) \otimes A (t)) vec (\dot{X} (t)) = vec (- λ (X (t) A (t) X (t) \\ - A (t) X (t) A (t)) - X (t) \dot{A} (t) X (t) + \dot{A} (t) X (t) A (t) + A (t) X (t) \dot{A} (t)) . \end{matrix}

(9)

As a result, setting:

\begin{matrix} M (t) = & {(A (t) X (t))}^{T} \otimes I_{n} + I_{n} \otimes X (t) A (t) - A (t) \otimes A (t), \\ b (t) = & vec (- λ (X (t) A (t) X (t) - A (t) X (t) A (t)) - X (t) \dot{A} (t) X (t) \\ + \dot{A} (t) X (t) A (t) + A (t) X (t) \dot{A} (t)), \\ x (t) = & vec (X (t)), \dot{x} (t) = vec (\dot{X} (t)), \end{matrix}

(10)

the following ZNN model from [7] is derived:

M (t) \dot{x} (t) = b (t) .

(11)

It is observable that

M (t)

is a singular or nonsingular mass matrix when

A (t)

is singular or nonsingular, respectively. In order to extend the results from [7], the Tikhonov regularization [22] is employed to address the singularity problem in (11). If a constant diagonal matrix is chosen as the regularization matrix, (11) is changed into:

(M (t) + β_{1} I_{n^{2}}) \dot{x} (t) = b (t),

(12)

such that

β_{1} \geq 0

denotes the parameter of regularization. The ZNN model (12) is referred to as the ZNN1 model, and it may be handled effectively using a suitable ode MATLAB solver. Theorem 1 proves that the ZNN1 model exponentially converges to the theoretical solution of the TV-YBLME based on the input matrix

A (t)

.

Theorem 1.

Let

A (t) \in R^{n \times n}

be differentiable. Starting from any initial condition

x (0)

, the ZNN1 model (12) exponentially converges to the exact solution

x^{*} (t) = vec (X^{*} (t))

, where

X^{*} (t)

is the exact solution of the TV-YBLME (2) based on the input matrix

A (t)

.

Proof.

We define the following Lyapunov energy function:

l (t) : = \frac{1}{2} {∥Z (t)∥}_{F}^{2}

where

Z (t)

refers to (5), to prove the global asymptotic convergence of the ZNN model (7). Considering the

(i, j)

th element

z_{i j} (t)

of

Z (t)

,

i, j \in {1, \dots, n}

, we have the following:

\begin{matrix} l (t) & = \frac{1}{2} {∥Z (t)∥}_{F}^{2} = \frac{1}{2} [z_{11}^{2} (t) + z_{12}^{2} (t) + \dots + z_{1 n}^{2} (t) + \dots + z_{n n}^{2} (t)] \\ = \frac{1}{2} tr [Z^{T} (t) Z (t)] . \end{matrix}

Replacement of the ZNN rule (4) into the derivative of the energy function

l (t)

:

\begin{matrix} \dot{l} (t) & = [z_{11} (t) {\dot{z}}_{11} (t) + z_{12} (t) {\dot{z}}_{12} (t) + \dots + z_{1 n} (t) {\dot{z}}_{1 n} (t) + \dots + z_{n n} (t) {\dot{z}}_{n n} (t)] \\ = tr [Z^{T} (t) \dot{Z} (t)], \end{matrix}

gives:

\begin{matrix} \dot{l} (t) & = tr [Z^{T} (t) \dot{Z} (t)] = - λ tr [Z^{T} (t) Z (t)] = - λ \sum_{i, j = 1}^{n} z_{i j} (t) z_{i j} (t) \\ = - λ \sum_{i, j = 1}^{n} z_{i j}^{2} (t) < 0, \end{matrix}

which guarantees the final negative-definiteness of

{\dot{l}}_{i j} (t)

. That is to say,

{\dot{l}}_{i j} (t) < 0

for any

z_{i j} (t) \neq 0

, and

{\dot{l}}_{i j} (t) = 0

for

z_{i j} (t) = 0

. In addition, as

z_{i j} (t) \to \infty

,

l_{i j} (t) \to \infty

. By the Lyapunov theory, the equilibrium point

z_{i j} (t) = 0

globally converges to zero for any

i, j \in {1, \dots, n}

. Therefore, we have

Z (t) \to O_{n}

as

t \to \infty

. So, if

{∥Z (t)∥}_{F} = 0

, then the neural state matrix

X (t)

is the exact solution

X^{*} (t)

to (2). If

{∥Z (t)∥}_{F} > 0

then

\dot{l} (t) < 0

and it will converge to the global asymptotic stable point; that is, we have

{∥Z (t)∥}_{F}^{2} = 0

or the neural state matrix

X (t)

will converge to

X^{*} (t)

.

Furthermore, solving the linear first-order differential equation

{\dot{z}}_{i j} (t) = - λ z_{i j} (t)

, yields readily

z_{i j} (t) = exp (- λ t) z_{i j} (0)

. In other words, the matrix-valued error function

Z (t)

is expressed explicitly as:

Z (t) = Z (0) exp (- λ t),

which indicates that

X (t)

exponentially converges to

X^{*} (t)

with the convergence rate

λ > 0

. That is, starting from an initial state

X (0)

, the state matrix

X (t)

of (7) derived from (5) exponentially converges to

X^{*} (t)

.

In summary, the state matrix

X (t)

of (7) converges to

X^{*} (t)

globally and exponentially, starting from an initial state

X (0)

. Furthermore, because of the derivation process, we know that (12) is an equivalent vectored form of (7) so that (12) converges exponentially to

x^{*} (t) = vec (X^{*} (t))

. The proof has been completed. □

3. ZNN Models Based on Indirect Methods for Solving the TV-YBLME

This section presents and analyzes ZNN models, ZNN2 and ZNN3, based on indirect numerical methods for solving the TV-YBLME for an arbitrary real TV matrix. It is worth mentioning that the utilized indirect numerical methods are based on splitting the YBLME [8] and the sufficient conditions for a matrix to solve the YBLME [9].

3.1. ZNN Model Based on Splitting the TV-YBLME

The YBLME splitting, as presented in [8] (Lemma 3.1), enables us to solve two matrix equations instead of one equation in (2). As a result, if

A \in R^{n \times n}

is singular (in general) and

W \in R^{n \times n}

is taken as

A X = W

, it gives a guarantee that W satisfies the conditions in [8] (Lemma 3.1). That is, a matrix W that is taken as

A X = W

and satisfies

W^{2} = A W A

makes (1) consistent. More precisely, assuming a TV smooth matrix

A (t) \in R^{n \times n}

, we multiply (2) on the left by

A (t)

and then set

W (t) = A (t) X (t)

, we obtain the following system of matrix equations:

\{\begin{matrix} W (t) W (t) = A (t) W (t) A (t) \\ W (t) = A (t) X (t), \end{matrix}

(13)

where

X (t)

is the desirable solution to the problem. According to (13), the ZNN2 model assumes the next EME group for solving the TV-YBLME for an arbitrary

A (t)

:

\{\begin{matrix} Z_{1} (t) = W (t) W (t) - A (t) W (t) A (t) \\ Z_{2} (t) = A (t) X (t) - W (t) . \end{matrix}

(14)

Furthermore, the following are the time derivatives of EMEs included in (14):

\{\begin{matrix} {\dot{Z}}_{1} (t) = \dot{W} (t) W (t) + W (t) \dot{W} (t) - \dot{A} (t) W (t) A (t) - A (t) \dot{W} (t) A (t) - A (t) W (t) \dot{A} (t) \\ {\dot{Z}}_{2} (t) = \dot{A} (t) X (t) + A (t) \dot{X} (t) - \dot{W} (t) . \end{matrix}

(15)

Thereafter, the following can be obtained by merging (14) and (15) with the ZNN method based on the linear model (4):

\{\begin{matrix} \dot{W} (t) & W (t) + W (t) \dot{W} (t) - \dot{A} (t) W (t) A (t) - A (t) \dot{W} (t) A (t) - A (t) W (t) \dot{A} (t) \\ = - λ (W (t) W (t) - A (t) W (t) A (t)) \dot{A} (t) X (t) + A (t) \dot{X} (t) - \dot{W} (t) \\ = - λ (A (t) X (t) - W (t)) . \end{matrix}

(16)

The design (16) is equivalent to:

\{\begin{matrix} \dot{W} (t) W (t) + W (t) \dot{W} (t) - A (t) \dot{W} (t) A (t) = \\ - λ (W (t) W (t) - A (t) W (t) A (t)) - \dot{A} (t) W (t) A (t) - A (t) W (t) \dot{A} (t) A (t) \dot{X} (t) - \dot{W} (t) \\ = - λ (A (t) X (t) - W (t)) - \dot{A} (t) X (t) . \end{matrix}

(17)

The dynamical systems involved in (17) are adjusted as follows using the Kronecker product and vectorization:

\{\begin{matrix} (W^{T} (t) \otimes I_{n} + I_{n} \otimes W (t) - A^{T} (t) \otimes A (t)) vec (\dot{W} (t)) = \\ vec (- λ (W (t) W (t) - A (t) W (t) A (t)) - \dot{A} (t) W (t) A (t) - A (t) W (t) \dot{A} (t)) \\ (I_{n} \otimes A (t)) vec (\dot{X} (t)) - vec (\dot{W} (t)) = \\ vec (- λ (A (t) X (t) - W (t)) - \dot{A} (t) X (t)) . \end{matrix}

(18)

As a result, setting:

\begin{matrix} M (t) & = [\begin{matrix} W^{T} (t) \otimes I_{n} + I_{n} \otimes W (t) \dot{W} (t) - A^{T} (t) \otimes A (t) & O_{n} \\ - I_{n^{2}} & I_{n} \otimes A (t) \end{matrix}], \\ b (t) & = [\begin{matrix} vec (- λ (W (t) W (t) - A (t) W (t) A (t)) - \dot{A} (t) W (t) A (t) - A (t) W (t) \dot{A} (t)) \\ vec (- λ (A (t) X (t) - W (t)) - \dot{A} (t) X (t)) \end{matrix}], \\ x (t) & = [\begin{matrix} vec (W (t)) \\ vec (X (t)) \end{matrix}], \dot{x} (t) = [\begin{matrix} vec (\dot{W} (t)) \\ vec (\dot{X} (t)) \end{matrix}], \end{matrix}

(19)

the following ZNN model is derived:

M (t) \dot{x} (t) = b (t) .

(20)

The mass matrix

M (t)

is singular or nonsingular if

A (t)

is singular or nonsingular, respectively. The Tikhonov regularization is employed to address the singularity problem, and (20) is changed into:

(M (t) + β_{2} I_{2 n^{2}}) \dot{x} (t) = b (t),

(21)

where

β_{2} \geq 0

denotes the parameter of regularization. The ZNN flow (21) is referred to as the ZNN2 model, and it may be handled effectively using a suitable ode MATLAB solver. Theorem 2 proves that the ZNN2 model exponentially converges to the theoretical solution of the TV-YBLME based on the input matrix

A (t)

.

Theorem 2.

Let

A (t) \in R^{n \times n}

be differentiable. Starting from any initial condition

x (0)

, the ZNN2 model (21) converges exponentially to the exact solution

x^{*} (t) = vec (X^{*} (t))

, where

X^{*} (t)

is the exact solution of the TV-YBLME (2) based on the input matrix

A (t)

.

Proof.

From [8] (Lemma 3.1), solving the matrix equation group defined in (13) results in a TV solution of the TV-YBLME. The EME is constructed as in (14), in keeping with the ZNN method and the matrix equation group (13), to produce the solution

X^{*} (t)

that correlates with the TV solution of the TV-YBLME based on the input matrix

A (t)

. Thereafter, the model (16) is derived by using the linear design formula for zeroing (14). Setting

Z (t)

as the EME of (14) and following the same procedure as in Theorem 1, it is proved that the state matrix

X (t)

of (16), starting from any initial state

X (0)

, globally and exponentially converges to

X^{*} (t)

. Consequently, when

t \to \infty

, the solution of (16) converges to

X^{*} (t)

. Because of the derivation process, it is clear that (21) is just an equivalent vector form of (16), and it converges to

x^{*} (t) = vec (X^{*} (t))

. The proof has been completed. □

3.2. ZNN Model Based on Sufficient Conditions for a Solution

From [9] (Theorem 2.1), we know that if B is a matrix that satisfies

A B = B A = B^{2}

, then B is a solution of (1). As a result, assuming a TV smooth matrix

A (t) \in R^{n \times n}

, we obtain the following system of matrix equations with respect to the unknown matrix

X (t)

:

\{\begin{matrix} A (t) X (t) = X (t) A (t) \\ X (t) X (t) = X (t) A (t) . \end{matrix}

(22)

According to (22), the ZNN3 model assumes the next EME group for solving the TV-YBLME for an arbitrary

A (t)

:

\{\begin{matrix} Z_{1} (t) = X (t) A (t) - A (t) X (t) \\ Z_{2} (t) = X (t) A (t) - X (t) X (t) . \end{matrix}

(23)

Furthermore, the following is the time derivative of EMEs involved in (23):

\{\begin{matrix} {\dot{Z}}_{1} (t) = \dot{X} (t) A (t) + X (t) \dot{A} (t) - \dot{A} (t) X (t) - A (t) \dot{X} (t) \\ {\dot{Z}}_{2} (t) = \dot{X} (t) A (t) + X (t) \dot{A} (t) - \dot{X} (t) X (t) - X (t) \dot{X} (t) . \end{matrix}

(24)

Thereafter, the following can be obtained by merging (23) and (24) with the ZNN method using the linear AF (4):

\{\begin{matrix} \dot{X} (t) A (t) + X (t) \dot{A} (t) - \dot{A} (t) X (t) - A (t) \dot{X} (t) = - λ (X (t) A (t) - A (t) X (t)) \\ \dot{X} (t) A (t) + X (t) \dot{A} (t) - \dot{X} (t) X (t) - X (t) \dot{X} (t) = - λ (X (t) A (t) - X (t) X (t)), \end{matrix}

(25)

which is equivalent to:

\{\begin{matrix} \dot{X} (t) A (t) - A (t) \dot{X} (t) = - λ (X (t) A (t) - A (t) X (t)) - X (t) \dot{A} (t) + \dot{A} (t) X (t) \\ \dot{X} (t) A (t) - \dot{X} (t) X (t) - X (t) \dot{X} (t) = - λ (X (t) A (t) - X (t) X (t)) - X (t) \dot{A} (t) . \end{matrix}

(26)

Two dynamics involved in (26) are adjusted as follows using the Kronecker product and vectorization:

\{\begin{matrix} (A^{T} (t) \otimes I_{n} - I_{n} \otimes A (t)) vec (\dot{X} (t)) = vec (- λ (X (t) A (t) - A (t) X (t)) - X (t) \dot{A} (t) + \dot{A} (t) X (t)) \\ (A^{T} (t) \otimes I_{n} - X^{T} (t) \otimes I_{n} - I_{n} \otimes X (t)) vec (\dot{X} (t)) = vec (- λ (X (t) A (t) - X (t) X (t)) - X (t) \dot{A} (t)) . \end{matrix}

(27)

As a result, setting:

\begin{matrix} M (t) & = [\begin{matrix} A^{T} (t) \otimes I_{n} - I_{n} \otimes A (t) \\ A^{T} (t) \otimes I_{n} - X^{T} (t) \otimes I_{n} - I_{n} \otimes X (t) \end{matrix}], \\ b (t) & = [\begin{matrix} vec (- λ (X (t) A (t) - A (t) X (t)) - X (t) \dot{A} (t) + \dot{A} (t) X (t)) \\ vec (- λ (X (t) A (t) - X (t) X (t)) - X (t) \dot{A} (t)) \end{matrix}], \\ x (t) & = vec (X (t)), \dot{x} (t) = vec (\dot{X} (t)), \end{matrix}

(28)

the following ZNN model is derived:

M^{T} (t) M (t) \dot{x} (t) = M^{T} (t) b (t),

(29)

where

M^{T} (t) M (t)

is a singular or nonsingular mass matrix when

A (t)

is singular or nonsingular, respectively. The Tikhonov regularization is employed to address the singularity problem, and (29) is changed into:

(M^{T} (t) M (t) + β_{3} I_{n^{2}}) \dot{x} (t) = M^{T} (t) b (t),

(30)

where

β_{3} \geq 0

denotes the parameter of regularization. The ZNN model (30) is referred to as the ZNN3 model, and it may be handled effectively using a suitable ode MATLAB solver. Theorem 3 proves that the ZNN3 model exponentially converges to the theoretical solution of the TV-YBLME based on the input matrix

A (t)

.

Theorem 3.

Let

A (t) \in R^{n \times n}

be differentiable. Starting from any initial condition

x (0)

, the ZNN3 model (30) converges exponentially to the exact solution

x^{*} (t) = vec (X^{*} (t))

, where

X^{*} (t)

is the exact solution to the TV-YBLME (2) of the input matrix

A (t)

.

Proof.

From [9] (Theorem 2.1), solving the matrix equation group defined in (22) results in a TV solution of the TV-YBLME. The EME is constructed as in (23), in keeping with the ZNN method and the matrix equation group (22), to produce the solution

X^{*} (t)

that correlates with the TV solution of the TV-YBLME based on the input matrix

A (t)

. After that, the model (25) is derived by using the linear design formula for zeroing (23). Setting

Z (t)

as the EME of (23) and following the same procedure as Theorem 1, it is proved that the state matrix

X (t)

of (25), starting from an arbitrary initial state

X (0)

, globally and exponentially converges to

X^{*} (t)

. Consequently, when

t \to \infty

, the solution of (25) converges to the solution

X^{*} (t)

. In addition, we know that (30) is an equivalent form of (25) due to the derivation process, and converges to

x^{*} (t) = vec (X^{*} (t))

. The proof has been completed. □

4. Simulation Results

This section analyzes and compares the performance of the ZNN1 (12), ZNN2 (21), and ZNN3 (30) models in four numerical experiments, which solve the TV-YBLME with both nonsingular and singular input matrices

A (t)

. During the computation in all experiments, the time interval is limited to

[0, 10]

, with the ZNN gain parameter

λ = 10

and with successive values of the Tikhonov regularization parameter equal to

β_{1}

=1e-3,

β_{2}

=1e-1, and

β_{3}

=1e-8. Moreover, note that the symbols

ZNN 1

,

ZNN 2

, and

ZNN 3

in the legends in Figure 1 indicate the solutions generated by the ZNN1, ZNN2, and ZNN3 models, respectively. Finally, the MATLAB solver ode45 was employed, with the initial condition of

W (t)

in the ZNN2 model assigned to

W (0) = A (0) X (0)

.

4.1. Experiment 1

The TV-YBLME with the following nonsingular input matrix is solved in this simulation:

A (t) = [\begin{matrix} 7 + cos (t) & 6 + sin (2 t) & 5 - cos (2 t) \\ 5 + sin (t) & 8 + sin (t) & - 4 - sin (t) \\ 3 + cos (t) & - 6 + cos (t) & 9 - sin (2 t) \end{matrix}] .

The initial condition of

X (t)

employed in all models tested in this example is:

X (0) = [\begin{matrix} - 1 & 1 & 1 \\ 1 & - 1 & - 1 \\ 1 & - 1 & - 1 \end{matrix}] .

4.2. Experiment 2

The input of the TV-YBLME in this experiment is the nonsingular matrix:

A (t) = [\begin{matrix} t + 1 & t & t & t \\ t & t + 1 & t & t \\ t & t & t + 1 & t \\ t & t & t & t + 1 \end{matrix}] + (1 + sin (t)) ⊙ 1_{4} .

The initial condition for

X (t)

used in all models in this experiment is given by:

X (0) = [\begin{matrix} 0.4 & - 0.1 & 0.2 & - 0.5 \\ - 0.1 & 0.7 & - 0.4 & - 0.2 \\ 0.1 & - 0.4 & 0.3 & 0.1 \\ - 0.4 & - 0.2 & 0 & 0.6 \end{matrix}] .

4.3. Experiment 3

The TV-YBLME, which involves the following singular input matrix of the rank

rank (A (t)) = 1

, is solved in this simulation experiment:

A (t) = [\begin{matrix} 2 - 1 / 2 sin (t) & 2 + 1 / 2 sin (t) & 0 \\ 2 - 1 / 2 sin (t) & 2 + 1 / 2 sin (t) & 0 \\ 2 - 1 / 2 sin (t) & 2 + 1 / 2 sin (t) & 0 \end{matrix}] .

The following is the initial condition of

X (t)

that is used in all models tested in this example:

X (0) = [\begin{matrix} 7 & - 3 & - 3 \\ 7 & - 3 & - 3 \\ 7 & - 3 & - 3 \end{matrix}] .

4.4. Experiment 4

This experiment is concerned with the solution of the TV-YBLME with the singular input matrix of

rank (A (t)) = 4

:

A (t) = [\begin{matrix} 1 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 \\ 0 & 1 - sin (t) & sin (t) & 0 & 0 \\ cos (t) & 0 & 0 & 2 - sin (t) & 1 \\ cos (t) & 0 & 0 & 2 - sin (t) & 1 \end{matrix}] .

The initial conditions of

X (t)

(denoted by IC1:

X_{1} (0)

and IC2:

X_{2} (0)

) equally utilized in all models compared in this example are:

X_{1} (0) = [\begin{matrix} 1 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0.01 & - 0.01 \\ - 0.5 & 0 & 0 & 2 & - 2 \\ - 0.5 & 0 & 0 & 2 & - 2 \end{matrix}], X_{2} (0) = [\begin{matrix} 1 & 0 & 0 & 0 & 0 \\ 1 & 1 & 4 & 0 & 0 \\ 0 & 1 & 4 & 0.01 & - 0.01 \\ - 0.5 & 0 & 0 & 4 & - 4 \\ - 0.5 & 0 & 0 & 4 & - 4 \end{matrix}] .

4.5. Numerical Experiments Analysis—Findings and Comparison

The strengths of the ZNN1, ZNN2, and ZNN3 models for solving the TV-YBLME based on nonsingular and singular matrices

A (t)

are examined through four experiments presented in Section 4.1, Section 4.2, Section 4.3 and Section 4.4. The graphs generated by the ZNN1, ZNN2, and ZNN3 models are presented in Figure 1. Notice that the arrangement of Figure 1 and Figure 2 is as follows: the figures of the first column depict the tracking errors of the ZNN models, i.e.,

{∥Z (t)∥}_{F}

of the ZNN1 model and

{∥Z_{i} (t)∥}_{F}, i = 1, 2

, of the ZNN2 and ZNN3 models; the figures of the second column depict the residual errors for solving the TV-YBLME, i.e.,

{∥X (t) A (t) X (t) - A (t) X (t) A (t)∥}_{F}

; the figures in the third column depict the trajectories of solutions produced by the tested dynamical systems along with the obvious solution

A (t)

. Values marked with

X_{Z N N 1}, X_{Z N N 2}

, and

X_{Z N N 3}

are appropriate to the solutions generated by ZNN1, ZNN2, and ZNN3, respectively.

From the numerical experience of this section, the following observations may be noted. Overall, the ZNN3 model’s error functions have lower values than the ZNN1 and ZNN2 models’ error functions in all experiments, as depicted in Figure 1a,d,g and Figure 2a,d, while the ZNN2 model’s error function

∥Z_{2} (t)∥

has the fastest convergence speed in Section 4.1, Section 4.2 and Section 4.3, as depicted in Figure 1a,d,g, respectively. The values of the residual norm

{∥X (t) A (t) X (t) - A (t) X (t) A (t)∥}_{F}

show that the tested models have similar convergence speeds in Section 4.1, Section 4.2 and Section 4.3 as depicted in Figure 1b,e,h, respectively, while the ZNN3 model has the fastest convergence speed in Section 4.4, as depicted in Figure 2b. In addition, the residual errors

{∥X (t) A (t) X (t) - A (t) X (t) A (t)∥}_{F}

of the ZNN3 model receive lower values than the ZNN1 and ZNN2 models in Section 4.2, Section 4.3 and Section 4.4, as depicted in Figure 1e and Figure 2b,e, respectively. For the initial conditions used, all the ZNN models produce the same solution in Section 4.1, the ZNN1 and ZNN2 models produce the same solution in Section 4.2, while all the ZNN models produce different solutions in Section 4.3 and Section 4.4. Furthermore, all the ZNN models produce different solutions in Section 4.4 under the initial conditions IC1 and IC2. The general conclusion is that the change of the initial value in the system dynamics initiates a different solution. This is conditioned by the fact that the closest solution can change with respect to different initial states. It is worth noting—in Figure 1c,f,i and Figure 2c,f—that all the solutions produced by the ZNN models are different from the obvious solution

A (t)

.

The following are some general conclusions based on the presented simulation experiments. The ZNN3 model, which is based on an indirect method for solving the TV-YBLME, performs better than the ZNN1 and ZNN2 models, based on direct and indirect methods for solving the TV-YBLME, respectively. Furthermore, the ZNN3 model provides the smallest Frobenius norm for both the residual errors and error functions. However, because all models usually produce different TV-YBLME solutions for the same initial conditions, we can conclude that all models are efficient for addressing the TV-YBLME. It is also worth noting that the bigger the value of the acceleration parameter

λ

, the quicker the models will converge.

5. Conclusions

This study addresses the problem of solving the TV-YBLME for a random real TV matrix by employing the ZNN neural design. As a result, three ZNN models were defined, analyzed, and compared. The first ZNN model, ZNN1, exploits an immediate method to solve the TV-YBLME from [7], whereas the other two, ZNN2 and ZNN3, utilize indirect methods to solve the TV-YBLME based on the fundamental properties of the Yang–Baxter matrix equation. According to four numerical experiments, all models efficiently address the TV-YBLME, including nonsingular and singular input matrices. However, the ZNN3 model converges to the TV-YBLME solution quicker than the ZNN1 and ZNN2 models. One interesting observation is that the dynamics based on the indirect approaches achieve a quicker convergence (ZNN2) and smallest residuals (ZNN3). Furthermore, because all three tested models usually produce different TV-YBLME solutions for the same initial conditions, we can conclude that all models are efficient and valuable for addressing the TV-YBLME.

The following are possible research topics:

Research involving the fuzzy control parameters in the ZNN design. Corresponding results are presented in [23,24,25,26]. Future research could use cautiously selected fuzzy parameters to specify a certain rate of adaptation in the ZNN dynamics and corresponding improvements.
Since all types of noise significantly impact the ZNN model accuracies, noise sensitivity is a shortcoming of the proposed ZNN1, ZNN2, and ZNN3 models. As a result, future studies might concentrate on adapting the ZNN1, ZNN2, and ZNN3 models to a noise-handling ZNN dynamical system class. Such research will be a continuation of [27] from the constant matrix case to the time-varying case and from the direct ZNN model to various ZNN models.
One could expect that further developments of ZNN evolutions (arising from different properties of solutions to the Yang–Baxter equation) will be possible.

Author Contributions

W.J.: validation, investigation. C.-L.L.: formal analysis, investigation. V.N.K.: conceptualization, methodology, validation, formal analysis, investigation, writing—original draft. S.D.M.: conceptualization, methodology, validation, formal analysis, investigation, writing—original draft. P.S.S.: conceptualization, methodology, validation, formal analysis, investigation, writing—original draft. T.E.S.: methodology, formal analysis, investigation. All authors have read and agreed to the published version of the manuscript.

Funding

Predrag Stanimirović is supported by the Ministry of Education, Science, and Technological Development, Republic of Serbia, grant no. 451-03-68/2022-14/200124, and by the Science Fund of the Republic of Serbia, (no. 7750185, Quantitative Automata Models: Fundamental Problems and Applications—QUAM).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Yang, C.N. Some exact results for the many-body problem in one dimension with repulsive delta-function interaction. Phys. Rev. Lett. 1967, 19, 1312–1315. [Google Scholar] [CrossRef]
Baxter, R.J. Partition function of the eight-vertex lattice model. Ann. Phys. 1972, 70, 193–228. [Google Scholar] [CrossRef]
Matsumoto, D.K.; Shibukawa, Y. Quantum Yang-Baxter equation, braided semigroups, and dynamical Yang-Baxter maps. Tokyo J. Math. 2015, 38, 227–237. [Google Scholar] [CrossRef]
Przytycki, J.H. Knot theory: From Fox 3-colorings of links to Yang-Baxter homology and Khovanov homology. In Knots, Low-Dimensional Topology and Applications; Springer: Cham, Switzerland, 2019; Volume 284, pp. 115–145. [Google Scholar] [CrossRef] [Green Version]
Vieira, R.S.; Lima-Santos, A. Solutions of the Yang-Baxter equation for (n + 1) (2n + 1)-vertex models using a differential approach. J. Stat. Mech. 2021, 2021, 053103. [Google Scholar] [CrossRef]
Tsuboi, Z. Quantum groups, Yang-Baxter maps and quasi-determinants. Nucl. Phys. B 2018, 926, 200–238. [Google Scholar] [CrossRef]
Zhang, H.; Wan, L. Zeroing neural network methods for solving the Yang-Baxter-like matrix equation. Neurocomputing 2020, 383, 409–418. [Google Scholar] [CrossRef]
Kumar, A.; Cardoso, J.R.; Singh, G. Explicit solutions of the singular Yang-Baxter-like matrix equation and their numerical computation. Mediterr. J. Math. 2022, 19, 85. [Google Scholar] [CrossRef]
Ding, J.; Zhang, C.; Rhee, N.H. Further solutions of a Yang-Baxter-like matrix equation. East Asian J. Appl. Math. 2013, 3, 352–362. [Google Scholar] [CrossRef]
Zhang, Y.; Ge, S.S. Design and analysis of a general recurrent neural network model for time-varying matrix inversion. IEEE Trans. Neural Netw. 2005, 16, 1477–1490. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Stanimirović, P.S.; Katsikis, V.N.; Zhang, Z.; Li, S.; Chen, J.; Zhou, M. Varying-parameter Zhang neural network for approximating some expressions involving outer inverses. Optim. Methods Softw. 2020, 35, 1304–1330. [Google Scholar] [CrossRef]
Kornilova, M.; Kovalnogov, V.; Fedorov, R.; Zamaleev, M.; Katsikis, V.N.; Mourtas, S.D.; Simos, T.E. Zeroing neural network for pseudoinversion of an arbitrary time-varying matrix based on singular value decomposition. Mathematics 2022, 10, 1208. [Google Scholar] [CrossRef]
Simos, T.E.; Katsikis, V.N.; Mourtas, S.D.; Stanimirović, P.S.; Gerontitis, D. A higher-order zeroing neural network for pseudoinversion of an arbitrary time-varying matrix with applications to mobile object localization. Inf. Sci. 2022, 600, 226–238. [Google Scholar] [CrossRef]
Stanimirović, P.S.; Katsikis, V.N.; Li, S. Integration enhanced and noise tolerant ZNN for computing various expressions involving outer inverses. Neurocomputing 2019, 329, 129–143. [Google Scholar] [CrossRef]
Katsikis, V.N.; Mourtas, S.D.; Stanimirović, P.S.; Zhang, Y. Solving complex-valued time-varying linear matrix equations via QR decomposition with applications to robotic motion tracking and on angle-of-arrival localization. IEEE Trans. Neural Netw. Learn. Syst. 2021, 1–10. [Google Scholar] [CrossRef]
Ma, H.; Li, N.; Stanimirović, P.S.; Katsikis, V.N. Perturbation theory for Moore–Penrose inverse of tensor via Einstein product. Comput. Appl. Math. 2019, 38, 111. [Google Scholar] [CrossRef]
Katsikis, V.N.; Mourtas, S.D.; Stanimirović, P.S.; Li, S.; Cao, X. Time-varying mean-variance portfolio selection problem solving via LVI-PDNN. Comput. Oper. Res. 2022, 138, 105582. [Google Scholar] [CrossRef]
Stanimirović, P.S.; Katsikis, V.N.; Li, S. Hybrid GNN-ZNN models for solving linear matrix equations. Neurocomputing 2018, 316, 124–134. [Google Scholar] [CrossRef]
Katsikis, V.N.; Stanimirović, P.S.; Mourtas, S.D.; Li, S.; Cao, X. Generalized Inverses: Algorithms and Applications; Mathematics Research Developments; Chapter Towards Higher Order Dynamical Systems; Nova Science Publishers, Inc.: Hauppauge, NY, USA, 2021; pp. 207–239. [Google Scholar]
Katsikis, V.N.; Mourtas, S.D.; Stanimirović, P.S.; Zhang, Y. Continuous-time varying complex QR decomposition via zeroing neural dynamics. Neural Process. Lett. 2021, 53, 3573–3590. [Google Scholar] [CrossRef]
Zhang, Y.; Guo, D. Zhang Functions and Various Models; Springer: Berlin/Heidelberg, Germany, 2015. [Google Scholar] [CrossRef]
Khalaf, G.; Shukur, G. Choosing ridge parameter for regression problems. Commun. Stat. Theory Methods 2005, 34, 1177–1182. [Google Scholar] [CrossRef]
Dai, J.; Chen, Y.; Xiao, L.; Jia, L.; He, Y. Design and analysis of a hybrid GNN-ZNN model with a fuzzy adaptive factor for matrix inversion. IEEE Trans. Ind. Inform. 2022, 18, 2434–2442. [Google Scholar] [CrossRef]
Jia, L.; Xiao, L.; Dai, J.; Cao, Y. A novel fuzzy-power zeroing neural network model for time-variant matrix Moore-Penrose inversion with guaranteed performance. IEEE Trans. Fuzzy Syst. 2021, 29, 2603–2611. [Google Scholar] [CrossRef]
Jia, L.; Xiao, L.; Dai, J.; Qi, Z.; Zhang, Z.; Zhang, Y. Design and application of an adaptive fuzzy control strategy to zeroing neural network for solving time-variant QP problem. IEEE Trans. Fuzzy Syst. 2021, 29, 1544–1555. [Google Scholar] [CrossRef]
Katsikis, V.N.; Stanimirović, P.S.; Mourtas, S.; Xiao, L.; Karabašević, D.; Stanujkić, D. Zeroing Neural Network with fuzzy parameter for computing pseudoinverse of arbitrary matrix. IEEE Trans. Fuzzy Syst. 2021, 1. [Google Scholar] [CrossRef]
Shi, T.; Tian, Y.; Sun, Z.; Liu, K.; Jin, L.; Yu, J. Noise-tolerant neural algorithm for online solving Yang–Baxter-type matrix equation in the presence of noises: A control-based method. Neurocomputing 2021, 424, 84–96. [Google Scholar] [CrossRef]

Figure 1. The ZNN error tracking and the convergence and trajectories of the solutions in Section 4.1, Section 4.2 and Section 4.3. (a) Section 4.1: ZNN error tracking. (b) Section 4.1: Solutions convergence. (c) Section 4.1: Solutions trajectories. (d) Section 4.2: ZNN error tracking. (e) Section 4.2: Solutions convergence. (f) Section 4.2: Solutions trajectories. (g) Section 4.3: ZNN error tracking. (h) Section 4.3: Solutions convergence. (i) Section 4.3: Solutions trajectories.

Figure 2. The ZNN error tracking, convergence, and trajectories of the solutions in Section 4.4 under IC1 and IC2. (a) Section 4.4 under IC1: ZNN error tracking. (b) Section 4.4 under IC1: Solutions convergence. (c) Section 4.4 under IC1: Solutions trajectories. (d) Section 4.4 under IC2: ZNN error tracking. (e) Section 4.4 under IC2: Solutions convergence. (f) Section 4.4 under IC2: Solutions trajectories.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jiang, W.; Lin, C.-L.; Katsikis, V.N.; Mourtas, S.D.; Stanimirović, P.S.; Simos, T.E. Zeroing Neural Network Approaches Based on Direct and Indirect Methods for Solving the Yang–Baxter-like Matrix Equation. Mathematics 2022, 10, 1950. https://0-doi-org.brum.beds.ac.uk/10.3390/math10111950

AMA Style

Jiang W, Lin C-L, Katsikis VN, Mourtas SD, Stanimirović PS, Simos TE. Zeroing Neural Network Approaches Based on Direct and Indirect Methods for Solving the Yang–Baxter-like Matrix Equation. Mathematics. 2022; 10(11):1950. https://0-doi-org.brum.beds.ac.uk/10.3390/math10111950

Chicago/Turabian Style

Jiang, Wendong, Chia-Liang Lin, Vasilios N. Katsikis, Spyridon D. Mourtas, Predrag S. Stanimirović, and Theodore E. Simos. 2022. "Zeroing Neural Network Approaches Based on Direct and Indirect Methods for Solving the Yang–Baxter-like Matrix Equation" Mathematics 10, no. 11: 1950. https://0-doi-org.brum.beds.ac.uk/10.3390/math10111950

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Zeroing Neural Network Approaches Based on Direct and Indirect Methods for Solving the Yang–Baxter-like Matrix Equation

Abstract

1. Introduction, Motivation, and Preliminaries

2. ZNN Model Based on Direct Solution to the TV-YBLME

3. ZNN Models Based on Indirect Methods for Solving the TV-YBLME

3.1. ZNN Model Based on Splitting the TV-YBLME

3.2. ZNN Model Based on Sufficient Conditions for a Solution

4. Simulation Results

4.1. Experiment 1

4.2. Experiment 2

4.3. Experiment 3

4.4. Experiment 4

4.5. Numerical Experiments Analysis—Findings and Comparison

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI