Next Article in Journal
Venture Capital Contracting with Ambiguity Sharing and Effort Complementarity Effect
Previous Article in Journal
The Basic Algorithm for the Constrained Zero-One Quadratic Programming Problem with k-diagonal Matrix and Its Application in the Power System
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Nash Equilibrium Investment-Reinsurance Strategies for an Insurer and a Reinsurer with Intertemporal Restrictions and Common Interests

1
School of Business Administration, Hunan University, Changsha 410082, China
2
School of Mathematics, Hunan University, Changsha 410082, China
3
School of Business, Hunan Normal University, Changsha 410081, China
*
Author to whom correspondence should be addressed.
Submission received: 21 December 2019 / Revised: 9 January 2020 / Accepted: 13 January 2020 / Published: 19 January 2020

Abstract

:
This paper investigates the generalized multi-period mean-variance investment-reinsurance optimization model in a discrete-time framework for a general insurance company that contains a reinsurer and an insurer. The intertemporal restrictions and the common interests of the reinsurer and the insurer are considered. The common goal of the reinsurer and the insurer is to maximize the expectation of the weighted sum of their wealth processes and minimize the corresponding variance. Based on the game method, we obtain the Nash equilibrium investment-reinsurance strategies for the above-proposed model and find out the equilibrium strategies when unilateral interest is considered. In addition, the Nash equilibrium investment-reinsurance strategies are deduced under two special premium calculated principles (i.e., the expected value premium principle and the variance value premium principle). We theoretically study the effect of the intertemporal restrictions on Nash equilibrium investment-reinsurance strategies and find the effect depends on the value of some parameters, which differs from the previous researches that generally believed that intertemporal restrictions would make investors avoid risks. Finally, we perform corresponding numerical analyses to verify our theoretical results.

1. Introduction

In recent decades, investment-reinsurance optimization problem, as a hot research topic in the field of actuarial insurance, has been widely studied by many experts and scholars (e.g., Browne [1], Schmidli et al. [2], Zeng and Li [3], Zhu et al. [4], Hu and Wang [5], Deng et al. [6], and so on). With the rapid development of society, the relationship between companies is getting closer. At present, the research on the relationship between companies in the insurance market is mainly divided into two aspects. On the one hand, the optimal investment-reinsurance between two insurance companies is studied in the framework of zero-sum game or non-zero-sum game (e.g., Zeng [7], Li et al. [8], Bensoussan et al. [9], Pun and Wong [10], Pun et al. [11], Pun [12], Deng et al. [13], Wang et al. [14], and so on). On the other hand, the interests of both insurance companies and reinsurance companies have gradually been studied since any reinsurance contract is obviously a mutual agreement between the insurer and reinsurer, strategies that consider only the unilateral interest of the insurer are likely to be unacceptable to the reinsurer. When the reinsurance company and insurance company are independent individuals, some literature regarded them as the leader and follower of the Stackelberg game, and studied the optimal reinsurance problem (e.g., Chen and Shen [15] and Chen and Shen [16], etc.). When the reinsurer and insurer belong to two different divisions of the same large insurance company, some scholars studied the optimal investment-reinsurance problem by targeting the common interests of the two divisions (e.g., Wang et al. [17], Chen et al. [18], Li et al. [19], Zhou et al. [20], etc.).
Nowadays, many large financial companies, in addition to insurance departments, also have reinsurance departments, such as the American International Group, Starr Insurance and Reinsurance Limited, China Reinsurance (Group) Company, Zurich Financial Services and so on. Therefore, it is of great practical significance to study the investment-reinsurance optimization problem for a large insurance company with both insurer and reinsurer sectors. Li et al. [19] studied the management of an insurance company that includes both insurer and reinsurer sectors, with the aim of maximizing the expected utility of the weighted sum of the terminal wealth of the two sectors. Zhou et al. [20] and Huang et al. [21] researched robust optimal reinsurance and investment problems under different objective functions for an ambiguity-averse manager (AAM) who holds shares of an insurance company and a reinsurance company. Zhao et al. [22] studied time-consistent investment-reinsurance strategies towards the joint interests of the insurer and the reinsurer.
However, most of the research on investment-reinsurance optimization is studied under the continuous-time framework, the research in discrete-time is rare. Actually, the discrete-time setting is more reasonable for decision makers because they will not trade continuously, otherwise it will cause plenty of transaction costs. Brandt [23] also pointed out that the continuous-time strategies are often inadmissible in discrete time because they may generate a negative wealth. Especially for special institutional investors, such as insurers and reinsurers, their wealth values are more likely to be negative, because in addition to investment risks, they also need to face random claim risk in the future (see Zhou et al. [24]).
Therefore, compared with the optimal investment-reinsurance research in continuous time, it is more appropriate and practical to study the optimal investment-reinsurance strategy under the framework of discrete-time. Xiao et al. [25] first studied a generalized multi-period investment-reinsurance optimization problem in discrete time, in which the insurer and reinsurer were studied as two independent individuals. In view of the fact that many large financial institutions include both insurance and reinsurance sectors, we investigate the optimal investment-reinsurance optimization problem under the discrete-time framework when the reinsurer and insurer have common interest objectives.
As we all know, mean-variance and expected utility preference are the most commonly used optimization objectives in the research of investment-reinsurance optimization. The classical mean-variance model was proposed by Markowitz [26], which measures the return and risk by expectation and variance respectively. However, in the traditional expected utility theory, investment risk cannot be directly quantified separately from returns. The strategies obtained under the mean-variance model may be more popular with investors who prefer to quantify the investment risk and return. But the classical mean-variance model only considers the terminal performance and ignores the intermediate performance, which leads to a high probability of bankruptcy in the early stage of investment (refer to Zhu et al. [27], Zhou et al. [28]). For more literature of portfolio optimization considering intertemporal restrictions can be referred to Costa and Nabholz [29], Costa and Araujo [30], Costa and de Oliveira [31], Cui et al. [32], Zhou et al. [28] and Xiao et al. [25], and so on. Most of the above literature that considers the intertemporal restrictions generally directly believes that the intertemporal restrictions will lead investors to avoid risks and adopt more conservative investment strategies, but does not give the corresponding results in theory. In this paper, we will theoretically study the effect of intertemporal restrictions on investment-reinsurance strategies. Consider whether the intertemporal restrictions will make the decisions more aggressive or more conservative? Under the discrete-time framework, we will construct the generalized multi-period mean-variance investment-reinsurance optimization model with intertemporal restrictions and the common interests of the reinsurer and insurer, which can be regarded as two different departments of a large financial company.
Note that the dynamic programming approach cannot be directly used to solve the multi-period mean-variance optimization problems because the dynamic variance measure violates the time consistency. Usually, there exist two approaches to this problem. The first approach (called the embedding method) is proposed by Li and Ng [33], and the corresponding optimal strategy is called the pre-commitment strategy (or time-inconsistent strategy) due to the future changes are not considered. To overcome the time-inconsistency, Basak and Chabakauri [34] and Björk and Murgoci [35] proposed the second method, called as the game method, by which the time-consistent strategy can be obtained for decision-makers. More precisely, the game method is to treat the mean-variance optimization problem as a non-cooperative game, in which there is a separate player at every discrete time point. The Nash equilibrium strategy obtained by this method satisfies the time consistency. Since then, based on the game method, various multi-period mean-variance optimization problems have been studied, such as Zhou et al. [28], Björk and Murgoci [36] and Zhou et al. [37] and so on. In the paper, we will apply the game method to deduce the Nash equilibrium investment-reinsurance strategies under the generalized multi-period mean-variance investment-reinsurance optimization model.
Inspired by the above literature, in a discrete-time setting, we build a generalized multi-period mean-variance investment-reinsurance optimization model with consideration of intertemporal restrictions and the common interests of the reinsurer and insurer. Both the reinsurer and insurer can invest their idle assets in a risky asset and a riskless asset. The insurer can transfer claim risk by purchasing a proportional reinsurance contract from the reinsurer. The common goal of the insurer and the reinsurer is to maximize the expectation of the weighted sum of their wealth processes and minimize the corresponding variance by finding the Nash equilibrium investment-reinsurance strategies. We use the game method provided by Björk and Murgoci [35] to obtain the Nash equilibrium investment-reinsurance strategies for the above optimization model, and deduce Nash equilibrium strategies under the expected premium principle and variance premium principle, respectively. In addition, we find out the Nash equilibrium investment-reinsurance strategies when unilateral interest is considered. Furthermore, we study the effect of the intertemporal restrictions on the Nash equilibrium strategies and perform corresponding numerical analyses to verify our theoretical results and give some economic explanations.
Our paper differs from previous research in at least three respects. (i) We first build the generalized multi-period mean-variance investment-reinsurance optimization model in discrete-time framework for a large insurance company with both reinsurer and insurer sectors. We take the weighted sum of the wealth processes of the insurer and reinsurer as their common optimization objective. (ii) Under the framework of generalized multi-period mean-variance, we first deduce Nash equilibrium investment-reinsurance strategies (which satisfy time consistency) rather than the traditional pre-commitment strategy and obtain the equilibrium value function. (iii) We first theoretically analyze the effect of intertemporal restrictions on Nash equilibrium strategies, and find that it depends on the value of model parameters, which is different from the existing literature that directly believes that the intertemporal restrictions can make investors avoid risks (e.g., [25,28], etc.). In particular, we find that the effect of intertemporal restrictions on Nash equilibrium investment strategies depends on the reduction speed of risk aversion parameters and the value of risk-free benefits. When computing the reinsurance premium under the expected value premium principle, the effect of intertemporal restrictions on Nash equilibrium reinsurance strategy depends not only on the reduction speed of risk aversion parameters and the value of risk-free benefits but also on the weight parameter. When computing the reinsurance premium under the variance value premium principle, the effect of intertemporal restrictions on Nash equilibrium reinsurance strategy is related to a number of other factors, in addition to those mentioned above.
The rest of the paper is constructed as follows. Section 2 formulates a generalized multi-period mean-variance investment-reinsurance optimization model with consideration of intertemporal restrictions and the common objective. In Section 3, we obtain the Nash equilibrium strategies and equilibrium value function by using the game method. Moreover, we deduce the Nash equilibrium strategies in some special settings and discuss some properties of equilibrium strategies. Section 4 conducts some numerical examples to illustrate the effects of intertemporal restrictions on Nash equilibrium strategies. Finally, Section 5 concludes the paper.

2. Model Setup

In this section, we consider a general insurance company with both insurer and reinsurer sectors. Both the reinsurer and the insurer can invest their idle assets in the financial market. Meanwhile, the insurer can purchase the reinsurance contract from the reinsurer to diversify its claim risk.

2.1. Financial Market and Wealth Processes Dynamics

Suppose that both the insurer and the reinsurer will join into the financial market at time 0 with initial capital w 0 1 and w 0 2 , respectively. They can engage in investment activities and reinsurance business during the time horizon [ 0 , T ] , where T > 0 is a constant. Both the insurer and the reinsurer can invest their idle assets in a risk-free asset and a risky asset, where the risk-free asset has determinate return s t > 1 . Note that we assume that the insurer and reinsurer focus on two different risk assets in their investment, while the two risk assets with random return e t 1 and e t 2 , respectively. The amount invested in the above two risk assets at the beginning of the t-th time period is denoted by u t 1 and u t 2 , respectively. The remaining wealth (i.e., w t 1 u t 1 and w t 2 u t 2 ) is invested in the risk-free asset. In addition to investment activities, the insurer can also buy a proportional reinsurance contract from the reinsurer to diversify its claim risk. The proportion of claims borne by the insurer at time t is recorded as q t , and q t [ 0 , 1 ] , t = 0 , 1 , , T 1 . The reinsurer will cover the remaining (i.e., 1 q t ) at time t. For example, when facing policy-holder insurance claims z t , the insurer is only liable for claim amount q t z t , and the rest of claim amount ( 1 q t ) z t is undertaken by the reinsurer. In the meantime, the reinsurance contract stipulates that the insurer has to pay the given premium δ t ( q t ) to the reinsurer from his/her own premium income c t , where c t is assumed to be a certain value. Then, for t = 0 , 1 , , T 1 , the dynamics of wealth for the insurer and the reinsurer are respectively expressed as
w t + 1 1 = s t w t 1 + P t 1 u t 1 + c t δ t ( q t ) q t z t ,
and
w t + 1 2 = s t w t 2 + P t 2 u t 2 + δ t ( q t ) ( 1 q t ) z t ,
where P t 1 = e t 1 s t , P t 2 = e t 2 s t .
Analogue to the existing research, we give the following assumptions:
  • Assumption 1. z 0 , z 1 , , z T 1 are statistically independent.
  • Assumption 2. For t = 0 , 1 , , T 1 , e t = [ e t 1 , e t 2 ] , μ t i = E ( P t i ) , σ t i = V a r ( P t i ) , θ t = C o v ( P t 1 , P t 2 ) , μ ˜ t = E ( z t ) , ν ˜ t = E [ ( z t ) 2 ] and σ ˜ t = V a r ( z t ) , where i = 1 , 2 .
  • Assumption 3. t = k l ( · ) = 0 and t = k l ( · ) = 1 for k > l .

2.2. Generalized Mean-Variance Model with Common Objective and Intertemporal Restrictions

Considering that both the insurer and the reinsurer belong to the same large insurance company, they have a common interest goal, which is to achieve the maximum expectation of the weighted sum of their wealth processes and minimum corresponding variance. Therefore, the interests of both the insurer and the reinsurer should be considered when formulating investment-reinsurance strategies. Supposing that the insurer and the reinsurer have the common optimization objective. Based on the dynamic wealth processes (1) and (2), under the generalized mean-variance formulation, the classical multi-period investment-reinsurance optimization problem with intertemporal restrictions and common interests can be posed as follows
max π t = 1 T ξ t { E [ α w t 1 + ( 1 α ) w t 2 ] η t V a r [ α w t 1 + ( 1 α ) w t 2 ] } s . t . w t + 1 1 = s t w t 1 + c t + P t 1 u t 1 δ t ( q t ) q t z t , t = 0 , 1 , , T 1 ; w t + 1 2 = s t w t 2 + P t 2 u t 2 + δ t ( q t ) ( 1 q t ) z t , t = 0 , 1 , , T 1 .
Note that π = ( π 0 , π 1 , , π T 1 ) and π k = ( u k 1 , u k 2 , q k ) , k = 0 , 1 , , T 1 . In addition, the α is the weighting efficient which satisfies α [ 0 , 1 ] . The optimal solution of the multi-period mean-variance model (henceforth, the pre-commitment strategy) is time-inconsistent. In other words, the pre-commitment strategy is not updated with the information accumulated over time. However, the objective in a pragmatic decision changes over time. Refer to Björk and Murgoci [35] and Basak and Chabakauri [34], the time-varying mean-variance optimization objective function is given by
J k ( w k 1 , w k 2 , π ) = t = k + 1 T ξ t { E k [ α w t 1 + ( 1 α ) w t 2 ] η t V a r k [ α w t 1 + ( 1 α ) w t 2 ] } .
Definition 1.
Given a fixed control law π ^ = ( π ^ 0 , π ^ 1 , , π ^ T 1 ) . For k = 0 , 1 , , T 1 , denote
π ( k ) = ( π k , π ^ k + 1 , , π ^ T 1 ) , π ^ ( k ) = ( π ^ k , π ^ k + 1 , , π ^ T 1 ) ,
where π k = ( u k 1 , u k 2 , q k ) is arbitrarily control variable. If for all k = 0 , 1 , , T 1 , the following conditions hold
max π k J k ( w k 1 , w k 2 , π ( k ) ) = J k ( w k 1 , w k 2 , π ^ ( k ) ) .
Then, π is called the Nash equilibrium strategy for model (3) and the equilibrium value function is given by
V k ( w k 1 , w k 2 ) = J k ( w k 1 , w k 2 , π ^ ( k ) ) .
According to Definition 1, we know that the Nash equilibrium strategy is time-consistent. Thus, the Nash equilibrium strategy and the equilibrium value function can be called as the time-consistent reinsurance-investment strategy and the optimal value function.

3. Nash Equilibrium Strategies and Equilibrium Value Function

Based on the definition of the Nash equilibrium strategies for k = 0 , 1 , , T 1 , we can find the equilibrium value function V k ( w k 1 , w k 2 ) satisfies the following proposition.
Proposition 1.
The equilibrium value function V k ( w k 1 , w k 2 ) satisfies the following recursive formula.
V k ( w k 1 , w k 2 ) = max π k E k [ V k + 1 ( w k + 1 1 , w k + 1 2 ) ] m = k + 2 T ξ m η m V a r k [ f k + 1 , m ( w k + 1 1 , w k + 1 2 ) ] + ξ k + 1 E k [ α w k + 1 1 + ( 1 α ) w k + 1 2 ] ξ k + 1 η k + 1 V a r k [ α w k + 1 1 + ( 1 α ) w k + 1 2 ] , k = 0 , 1 , , T 2 ,
and
V T 1 ( w T 1 1 , w T 1 2 ) = max π T 1 ξ T E T 1 [ α w T 1 + ( 1 α ) w T 2 ] ξ T η T V a r T 1 [ α w T 1 + ( 1 α ) w T 2 ] ,
where
f k , τ ( w k 1 , w k 2 ) = f k + 1 , τ ( w k + 1 1 , w k + 1 2 ) , τ > k , τ , k = 0 , 1 , , T 1 ; α w k 1 + ( 1 α ) w k 2 , τ = k , k = 0 , 1 , , T 1 .
Proof. 
Please refer to Appendix A for the specific proof process. □
Based on Definition 1 and Proposition 1, we have the following theorem.
Theorem 1.
If α ( 0 , 1 ) , Nash equilibrium investment-reinsurance strategies π ^ t = ( u ^ t 1 , u ^ t 2 , q ^ t ) , t = 0 , 1 , , T 1 , for model (3) are shown as follows.
u ^ t 1 = ( μ t 1 σ t 2 μ t 2 θ t ) m = t + 1 T ξ m i = t + 1 m 1 s i 2 α ( σ t 1 σ t 2 ( θ t ) 2 ) m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 ,
u ^ t 2 = ( μ t 2 σ t 1 μ t 1 θ t ) m = t + 1 T ξ m i = t + 1 m 1 s i 2 ( 1 α ) ( σ t 1 σ t 2 ( θ t ) 2 ) m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 ,
q ^ t = arg max 0 q t 1 m = t + 1 T ξ m i = t + 1 m 1 s i { ( 1 2 α ) δ t ( q t ) μ ˜ t [ α q t + ( 1 α ) ( 1 q t ) ] } m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 [ α q t + ( 1 α ) ( 1 q t ) ] 2 σ ˜ t ,
and the equilibrium value function V t ( w t 1 , w t 2 ) and the function f t , τ ( w t 1 , w t 2 ) are given by
V t ( w t 1 , w t 2 ) = m = t + 1 T ξ m i = t m 1 s i [ α w t 1 + ( 1 α ) w t 2 ] + κ t ,
f t , τ ( w t 1 , w t 2 ) = i = t τ 1 s i [ α w t 1 + ( 1 α ) w t 2 ] + γ t , τ ,
where κ t and γ t , τ satisfy the following equations.
κ t = k = t T 1 m = k + 1 T ξ m i = k + 1 m 1 s i { α c k μ ˜ k [ α q ^ k + ( 1 α ) ( 1 q ^ k ) ] + ( 1 2 α ) δ k ( q ^ k ) } + k = t T 1 [ ( μ k 1 ) 2 σ k 2 2 μ k 1 μ k 2 θ k + ( μ k 2 ) 2 σ k 1 ] 4 [ σ k 1 σ k 2 ( θ k ) 2 ] m = k + 1 T ξ m i = k + 1 m 1 s i m = k + 1 T ξ m η m i = k + 1 m 1 s i 2 k = t T 1 m = k + 1 T ξ m η m i = k + 1 m 1 s i 2 [ ( 1 α ) ( 1 q ^ k ) + α q ^ k ] 2 σ ˜ k , γ t , τ = k = t τ 1 i = k + 1 τ 1 s i { α c k + ( 1 2 α ) δ k ( q ^ k ) μ ˜ k [ α q ^ k + ( 1 α ) ( 1 q ^ k ) ] } + k = t τ 1 [ ( μ k 1 ) 2 σ k 2 2 μ k 1 μ k 2 θ k + ( μ k 2 ) 2 σ k 1 ] 2 [ σ k 1 σ k 2 ( θ k ) 2 ] m = k + 1 T ξ m i = k + 1 m 1 s i i = k + 1 τ 1 s i m = k + 1 T ξ m η m i = k + 1 m 1 s i 2 .
Proof. 
Please refer to Appendix B for the specific proof process. □
From Theorem 1, we can find that the equilibrium investment strategies of the reinsurer and the insurer are both dependent on the weight coefficient α . In addition, we can find that | u ^ t 1 | is a decrease function of α , while | u ^ t 2 | is an increase function of α . That is, with the increase of the weight coefficient α , the insurer will shrink the investment position, and the reinsurer will expand the investment position at each period time t. The larger weight coefficient α means that the common optimization objective function is more sensitive to the insurer’s decision. In this case, the insurer and reinsurer will reach a consensus, that is, shrinking the investment position of risk asset 1 and expanding the investment position of risk asset 2 simultaneously.
From Theorem 1, we can find that the insurer and the reinsurer not only concern the terminal performance but also concern the intermediate performance of their portfolio. However, when they only concern the performance of terminal wealth, we only need to let ξ t = 0 for t = 1 , 2 , , T 1 and ξ T = 1 . In this situation, the Nash equilibrium investment strategies are given in the following remark.
Remark 1.
If α ( 0 , 1 ) , ξ t = 0 for t = 1 , 2 , , T 1 , ξ T = 1 , the Nash equilibrium investment strategies u ^ t 1 and u ^ t 2 for model (3) are given by
u ^ t 1 = μ t 1 σ t 2 μ t 2 θ t 2 α η T [ σ t 1 σ t 2 ( θ t ) 2 ] i = t + 1 T 1 s i ,
u ^ t 2 = μ t 2 σ t 1 μ t 1 θ t 2 ( 1 α ) η T [ σ t 1 σ t 2 ( θ t ) 2 ] i = t + 1 m 1 s i .
At present, most of the literature considering intertemporal restrictions directly believe that intertemporal restrictions will lead investors to avoid risks and adopt more conservative investment strategies (for example, Zhou et al. [28], Xiao et al. [25], etc.), but they do not give corresponding theoretical results. Next, we theoretically analyze the impact of intertemporal restrictions on the Nash equilibrium investment strategies. Consider whether the intertemporal restrictions will make the investment strategy more aggressive to obtain more wealth or make the investment strategy more conservative to avoid investment risk?
Refer to Zhu et al. [27], the number of investment bankruptcies in the early stage is larger than that in the later stage. Therefore, a larger penalty should be set for the earlier intertemporal restrictions in the mathematical formulation. In other words, the investors have a higher risk aversion coefficient in the initial stage of investment. Thus, the risk aversion coefficient is set to be in exponential form in this paper. That is, η t = η T x ( T t ) , t = 0 , 1 , , T 1 , where x > 1 and η T is a positive constant given arbitrarily. Then, we can draw the following conclusions.
Corollary 1.
Suppose that α ( 0 , 1 ) , s t = s > 1 and η t = η T x ( T t ) , where x > 1 , x s 2 , t = 0 , 1 , , T 1 , η T is a positive constant. We have the following conclusions.
  • When x s 2 x T t s 2 ( T t ) > 1 s s T t 1 ( 1 s T t ) , for t = 0 , 1 , , T 1 , we have
    | u ^ t 1 | w i t h i n t e r t e m p o r a l r e s t r i c t i o n s > | u ^ t 1 | w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s ,
    | u ^ t 2 | w i t h i n t e r t e m p o r a l r e s t r i c t i o n s > | u ^ t 2 | w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s .
  • When x s 2 x T t s 2 ( T t ) = 1 s s T t 1 ( 1 s T t ) , for t = 0 , 1 , , T 1 , we have
    | u ^ t 1 | w i t h i n t e r t e m p o r a l r e s t r i c t i o n s = | u ^ t 1 | w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s ,
    | u ^ t 2 | w i t h i n t e r t e m p o r a l r e s t r i c t i o n s = | u ^ t 2 | w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s .
  • When x s 2 x T t s 2 ( T t ) < 1 s s T t 1 ( 1 s T t ) , for t = 0 , 1 , , T 1 , we have
    | u ^ t 1 | w i t h i n t e r t e m p o r a l r e s t r i c t i o n s < | u ^ t 1 | w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s ,
    | u ^ t 2 | w i t h i n t e r t e m p o r a l r e s t r i c t i o n s < | u ^ t 2 | w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s .
Proof. 
Please refer to Appendix C. □
From Corollary 1, we note that the Nash equilibrium investment strategies are not influenced by the intertemporal restrictions when the reduction speed of risk aversion parameters and the return rate of the riskless asset meet the relationship x s 2 x T t s 2 ( T t ) = 1 s s T t 1 ( 1 s T t ) . When x s 2 x T t s 2 ( T t ) > 1 s s T t 1 ( 1 s T t ) , the intertemporal restrictions will stimulate investment. The insurer and the reinsurer tend to take more investment risk in order to gain more wealth when the intertemporal restrictions are considered. When x s 2 x T t s 2 ( T t ) < 1 s s T t 1 ( 1 s T t ) , the intertemporal restrictions will discourage investment. Investment strategies that consider the intertemporal restrictions are more conservative than that without considering the intertemporal restrictions. In conclusion, when the risk aversion coefficient is in exponential form, whether the intertemporal restrictions will make the investment strategy more radical or more conservative depends on the reduction speed of risk aversion parameters and the value of risk-free benefits.
Furthermore, we study some special cases of model (3). When we only consider the interest of the insurer/reinsurer, that is, α = 1 / α = 0 , the Nash equilibrium investment-reinsurance strategies can be deduced in a similar way.
Remark 2.
If α = 1 , i.e., only the insurer’s interest is considered, model (3) degenerates into the classical investment-reinsurance optimization problem. In this case, the Nash equilibrium strategies ( u ^ t 1 , q ^ t ) , t = 0 , 1 , , T 1 , are given by
u ^ t 1 = μ t 1 m = t + 1 T ξ m i = t + 1 m 1 s i 2 σ t 1 m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 ,
q ^ t = arg max 0 q t 1 m = t + 1 T ξ m i = t + 1 m 1 s i [ μ ˜ t q t + δ t ( q t ) ] m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 ( q t ) 2 σ ˜ t .
The equilibrium value function becomes
V t ( w t 1 ) = m = t + 1 T ξ m i = t m 1 s i w t 1 + κ t ,
and the function f t , τ ( w t 1 ) is given by
f t , τ ( w t 1 ) = i = t τ 1 s i w t 1 + γ t , τ ,
where
κ t = k = t T 1 m = k + 1 T ξ m i = k + 1 m 1 s i ( c k δ k ( q ^ k ) μ ˜ k q ^ k ) + k = t T 1 ( μ k 1 ) 2 4 σ k 1 m = k + 1 T ξ m i = k + 1 m 1 s i m = k + 1 T ξ m η m i = k + 1 m 1 s i 2 k = t T 1 ( q ^ k ) 2 σ ˜ k m = k + 1 T ξ m η m i = k + 1 m 1 s i 2 , γ t , τ = k = t τ 1 i = k + 1 τ 1 s i ( c k δ k ( q ^ k ) μ ˜ k q ^ k ) + k = t τ 1 ( μ k 1 ) 2 2 σ k 1 m = k + 1 T ξ m i = k + 1 m 1 s i i = k + 1 τ 1 s i m = k + 1 T ξ m η m i = k + 1 m 1 s i 2 .
Remark 3.
If α = 0 , i.e., only the reinsurer’s interest is considered, the Nash equilibrium strategies ( u ^ t 2 , q ^ t ) , t = 0 , 1 , , T 1 , are given by
u ^ t 2 = μ t 2 2 σ t 2 m = t + 1 T ξ m i = t + 1 m 1 s i m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 ,
q ^ t = arg max 0 q t 1 m = t + 1 T ξ m i = t + 1 m 1 s i [ δ t ( q t ) μ ˜ t ( 1 q t ) ] m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 σ ˜ t ( 1 q t ) 2 .
The equilibrium value function becomes
V t ( w t 2 ) = m = t + 1 T ξ m i = t m 1 s i w t 2 + κ t ,
and the function f t , τ ( w t 2 ) is given by
f t , τ ( w t 2 ) = i = t T 1 s i w t 2 + γ t , τ ,
where
κ t = k = t T 1 m = k + 1 T ξ m i = k + 1 m 1 s i [ δ k q ^ k μ ˜ k ( 1 q ^ k ) ] + k = t T 1 ( μ k 2 ) 2 4 σ k 2 m = k + 1 T ξ m i = k + 1 m 1 s i m = k + 1 T ξ m η m i = k + 1 m 1 s i 2 k = t T 1 m = k + 1 T ξ m η m i = k + 1 m 1 s i 2 σ ˜ k ( 1 q ^ k ) 2 , γ t , τ = k = t τ 1 i = k + 1 τ 1 s i [ δ k ( q ^ k ) μ ˜ k ( 1 q ^ k ) ] + k = t τ 1 ( μ k 2 ) 2 2 σ k 2 m = k + 1 T ξ m i = k + 1 m 1 s i i = k + 1 τ 1 s i m = k + 1 T ξ m η m i = k + 1 m 1 s i 2 .
As shown in Remarks 2 and 3, we can find that V t ( w t 1 ) and f t , τ ( w t 1 ) only depend on w t 1 , i.e., the insurer’s wealth at time t; the function V t ( w t 2 ) and f t , τ ( w t 2 ) only depend on w t 2 , i.e., the reinsurer’s wealth at time t. This is because in both cases only unilateral interests are targeted.
Note that model (3) is an investment-reinsurance optimization problem with a generalized premium calculated principle. Next, we will find the corresponding Nash equilibrium strategies under two classical premium calculation criteria principle (i.e., the expected value premium principle and variance value premium principle). The results are presented in Section 3.1 and Section 3.2.

3.1. Nash Equilibrium Investment-Reinsurance Strategies under the Expected Value Premium Principle

In this subsection, we assume that the reinsurance premium is computed under the following expected value premium principle, i.e., for t = 0 , 1 , , T 1 ,
δ t ( q t ) = ( 1 + β t ) ( 1 q t ) μ ˜ t ,
where β t > 0 is the reinsurer’s safety loading. For t = 0 , 1 , , T 1 , let
a t = 1 α 2 α 1 + β t μ ˜ t 2 ( 2 α 1 ) σ ˜ t m = t + 1 T ξ m i = t + 1 m 1 s i m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 .
Then, Nash equilibrium investment-reinsurance strategies shown in Theorem 1 can be reduced as follows.
Corollary 2.
Suppose that α ( 0 , 1 ) and compute the reinsurance premium under the expected value premium principle, the Nash equilibrium strategies { ( u ^ t 1 , u ^ t 2 , q ^ t ) , t = 0 , 1 , , T 1 } can be derived for model (3), where u ^ t 1 and u ^ t 2 are consistent with (8) and (9) in Theorem 1, respectively; q ^ t can be shown as follows:
q ^ t = 0 , i f a t < 0 ; a t , i f 0 a t 1 ; 1 , i f a t > 1 . , f o r α 1 2 ; q t [ 0 , 1 ] , f o r α = 1 2 .
As shown in Corollary 2, the Nash equilibrium reinsurance strategy depends on the weight coefficient α when computing the reinsurance premium under the expected value premium principle. In addition, we can find that the reinsurance strategy can be taken as any value within [ 0 , 1 ] when α = 0.5 . This cause is that the common wealth α w t 1 + ( 1 α ) w t 2 is independent with the weight coefficient α if α = 0.5 .
When the reinsurer and the insurer only concern the performance of terminal wealth, i.e., ξ t = 0 , t = 1 , 2 , , T 1 and ξ T = 1 , a t can be reduce as
a t = 1 α 2 α 1 + β t μ ˜ t 2 ( 2 α 1 ) η T σ ˜ t 1 i = t + 1 T 1 s i ,
where t = 0 , 1 , , T 1 , and then the detailed Nash equilibrium strategies are given by the following remark.
Remark 4.
If α ( 0 , 1 ) , ξ t = 0 for t = 1 , 2 , , T 1 , ξ T = 1 , and compute the reinsurance premium under the expected value premium principle, then the Nash equilibrium investment strategies u ^ t 1 and u ^ t 2 for model (3) are consistent with (14) and (15), respectively; the Nash equilibrium reinsurance strategy q ^ t for model (3) can be shown as follows.
q ^ t = 0 , i f a t < 0 ; a t , i f 0 a t 1 ; 1 , i f a t > 1 . , f o r α 1 2 ; q t [ 0 , 1 ] , f o r α = 1 2 .
Remark 4 shows that the insurer and the reinsurer only concern the performance of terminal wealth, ignoring the intertemporal expectations and variances. Compared with Corollary 2 and Remark 4, we note that the former strategy is related to the risk aversion coefficient in each period, while the latter strategy only depends on the risk aversion coefficient at the terminal moment.
Furthermore, similar to Corollary 1, we want to study the effect of intertemporal restrictions on equilibrium reinsurance strategy in the intermediate case when the risk aversion coefficient is exponential and the reinsurance premium is computed under the expected value premium principle.
Corollary 3.
Suppose that α ( 0 , 1 ) , s t = s > 1 and η t = η T x ( T t ) , where x > 1 and x s 2 , t = 0 , 1 , , T 1 , η T is a positive constant; the reinsurance premium is computed under the expected value premium principle.
If α > 0.5 and 0 a t 1 , we have the following conclusions.
  • When x s 2 x T t s 2 ( T t ) > 1 s s T t 1 ( 1 s T t ) , for t = 0 , 1 , , T 1 , we have
    q ^ t w i t h i n t e r t e m p o r a l r e s t r i c t i o n s > q ^ t w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s .
  • When x s 2 x T t s 2 ( T t ) = 1 s s T t 1 ( 1 s T t ) , for t = 0 , 1 , , T 1 , we have
    q ^ t w i t h i n t e r t e m p o r a l r e s t r i c t i o n s = q ^ t w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s .
  • When x s 2 x T t s 2 ( T t ) < 1 s s T t 1 ( 1 s T t ) , for t = 0 , 1 , , T 1 , we have
    q ^ t w i t h i n t e r t e m p o r a l r e s t r i c t i o n s < q ^ t w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s .
If α < 0.5 and 0 a t 1 , we have the following conclusions.
  • When x s 2 x T t s 2 ( T t ) > 1 s s T t 1 ( 1 s T t ) , for t = 0 , 1 , , T 1 , we have
    q ^ t w i t h i n t e r t e m p o r a l r e s t r i c t i o n s < q ^ t w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s .
  • When x s 2 x T t s 2 ( T t ) = 1 s s T t 1 ( 1 s T t ) , for t = 0 , 1 , , T 1 , we have
    q ^ t w i t h i n t e r t e m p o r a l r e s t r i c t i o n s = q ^ t w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s .
  • When x s 2 x T t s 2 ( T t ) < 1 s s T t 1 ( 1 s T t ) , for t = 0 , 1 , , T 1 , we have
    q ^ t w i t h i n t e r t e m p o r a l r e s t r i c t i o n s > q ^ t w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s .
Corollary 3 indicates that when computing the reinsurance premium under the expected value premium principle, the influence of intertemporal restrictions on the Nash equilibrium reinsurance strategy is not only related to the reduction speed of risk aversion parameters (i.e., x) and the value of risk-free benefits (i.e., s), but also related to the weight parameter (i.e., α ).
Remark 5.
Assume that the reinsurance premium is computed under the expected value premium principle. When α takes the boundary point, model (3) degenerates into investment-reinsurance optimization problems considering unilateral interest, i.e., the insurer’s interest is considered when α = 1 ; the reinsurer’s interest is considered when α = 0 . The optimal investment-reinsurance strategies in both cases are given by Table 1.
From Remark 5, we can find that u ^ t 1 and u ^ t 2 are consistent with (22) in Remark 2 and (27) in Remark 3, respectively.

3.2. Nash Equilibrium Investment-Reinsurance Strategies under the Variance Value Premium Principle

In this subsection, we assume that the reinsurance premium is computed under the following variance value premium principle, i.e., for t = 0 , 1 , , T 1 ,
δ t ( q t ) = ( 1 q t ) μ ˜ t + β t ( 1 q t ) 2 ν ˜ t .
Let
a ^ t = 1 α σ ˜ t m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 ( 2 α 1 ) σ ˜ t m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 + β t ν ˜ t m = t + 1 T ξ m i = t + 1 m 1 s i .
Corollary 4.
Suppose that α ( 0 , 1 ) and the reinsurance premium is computed under the variance value premium principle, we can obtain the Nash equilibrium strategies ( u ^ t 1 , u ^ t 2 , q ^ t ) , t = 0 , , T 1 , for model (3), where u ^ t 1 and u ^ t 2 are consistent with (8) and (9) in Theorem 1, respectively; q ^ t can be shown as follows:
q ^ t = 0 , i f a ^ t < 0 ; a ^ t , i f 0 a ^ t 1 ; 1 , i f a ^ t > 1 . , f o r α 1 2 ; q t [ 0 , 1 ] , f o r α = 1 2 .
Comparing Corollary 2 and Corollary 4, we can find that the premium calculation principle does not change the investment strategies, but only affects the reinsurance strategy.
Similarly, when they only concern the terminal wealth’s performance, that is, ξ t = 0 for t = 1 , 2 , , T 1 and ξ T = 1 , the results are as follows.
Remark 6.
If α ( 0 , 1 ) , ξ t = 0 for t = 1 , 2 , , T 1 , ξ T = 1 , and the reinsurance premium is computed under the variance value premium principle, we can obtain the Nash equilibrium strategies { ( u ^ t 1 , u ^ t 2 , q ^ t ) , t = 0 , , T 1 } for model (3), where u ^ t 1 and u ^ t 2 are consistent with (14) and (15) in Remark 1, respectively; q ^ t can be shown as follows:
q ^ t = 0 , i f a ^ t < 0 ; a ^ t , i f 0 a ^ t 1 ; 1 , i f a ^ t > 1 . , f o r α 1 2 ; q t [ 0 , 1 ] , f o r α = 1 2 .
Here, the parameter a ^ t is defined as a ^ t = β t ν ˜ t ( 1 α ) η T σ ˜ t i = t + 1 T 1 s i β t ν ˜ t + ( 2 α 1 ) η T σ ˜ t i = t + 1 T 1 s i , t = 0 , 1 , , T 1 .
Comparing with Corollary 4 and Remark 6, we note that the strategy considering intertemporal restrictions is related to the risk aversion factors in the intermediate stage, while the strategy only considering the terminal wealth performance is only related to the risk aversion factor at the terminal stage.
Similar to Corollary 1 and Corollary 3, we want to study the effect of intertemporal restrictions on equilibrium reinsurance strategy in the intermediate case when the risk aversion coefficient is exponential and the reinsurance premium is computed under the variance value premium principle.
Corollary 5.
Suppose α ( 0 , 1 ) , s t = s > 1 and η t = η T x ( T t ) , where x > 1 and x s 2 , t = 0 , 1 , , T 1 , η T is a positive constant; the reinsurance premium is computed under the variance value premium principle. For t = 0 , 1 , , T 1 , if 0 a ^ t 1 , we have the following conclusions.
  • When s T t 1 ( 2 α 1 ) σ ˜ t η T s T t 1 + β t ν ˜ t > ( 1 s ) [ x T t s 2 ( T t ) ] ( 2 α 1 ) σ ˜ t η T ( 1 s ) [ x T t s 2 ( T t ) ] + β t ν ˜ t ( 1 s T t ) ( x s 2 ) , we have
    q ^ t w i t h i n t e r t e m p o r a l r e s t r i c t i o n s > q ^ t w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s .
  • When s T t 1 ( 2 α 1 ) σ ˜ t η T s T t 1 + β t ν ˜ t = ( 1 s ) [ x T t s 2 ( T t ) ] ( 2 α 1 ) σ ˜ t η T ( 1 s ) [ x T t s 2 ( T t ) ] + β t ν ˜ t ( 1 s T t ) ( x s 2 ) , we have
    q ^ t w i t h i n t e r t e m p o r a l r e s t r i c t i o n s = q ^ t w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s .
  • When s T t 1 ( 2 α 1 ) σ ˜ t η T s T t 1 + β t ν ˜ t < ( 1 s ) [ x T t s 2 ( T t ) ] ( 2 α 1 ) σ ˜ t η T ( 1 s ) [ x T t s 2 ( T t ) ] + β t ν ˜ t ( 1 s T t ) ( x s 2 ) , we have
    q ^ t w i t h i n t e r t e m p o r a l r e s t r i c t i o n s < q ^ t w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s .
Remark 7.
Assume the reinsurance premium is computed under the variance value premium principle. When α takes the boundary point, model (3) degenerates into investment-reinsurance optimization problems considering unilateral interest, i.e., the insurer’s interest is considered when α = 1 ; the reinsurer’s interest is considered when α = 0 . The optimal investment-reinsurance strategies in both cases are given by Table 2.
From Remark 7, we can find that u ^ t 1 and u ^ t 2 are consistent with (22) in Remark 2 and (27) in Remark 3, respectively. That is, the way premiums are calculated does not affect the form of the optimal investment strategy. Furthermore, it is worth noting that the optimal reinsurance strategy must be between 0 and 1 if α = 1 and the optimal reinsurance strategy is always 1 if α = 0 . In other words, when only the interest of the insurer is considered, the claim proportion borne by the insurer is between 0 and 1; when only the interest of the reinsurer is considered, the reinsurer bears no claim risk at all.

4. Numerical Analysis

In this section, some numerical simulations are conducted to show the results presented in Section 3. Given T = 100 , w 0 1 = w 0 2 = 1 . Referring to the parameter setting in Li and Ng [33] and Xiao et al. [25], for t = 0 , 1 , , T 1 , the model parameter values in this paper are given by Table 3.
The risk aversion coefficient is set to be in exponential form, η t = η T x ( T t ) , t = 0 , 1 , , T 1 , where x > 1 and η T = 1 . Next, we will show the evolutions process of the Nash equilibrium strategies under different settings, and we assume that ξ 1 , ξ 2 , , ξ T have the following situations.
Case I in Table 4 indicates that decisions made by the reinsurer and the insurer will take all the intertemporal restrictions into account. Case II in Table 4 means that the reinsurer and the insurer only concern their terminal wealth’s performance. According to the parameter settings, we will analyze the effect of the intertemporal restrictions on the Nash equilibrium strategies. Since the premium principle does not affect the investment strategies, we will not repeatedly demonstrate the influence of model parameters on the Nash equilibrium investment strategies under different premium principles. However, the selection of premium principles will affect the formulation of a reinsurance strategy, therefore it is necessary to discuss the effect of the premium principle on the Nash equilibrium reinsurance strategies. The detailed simulations are shown in Section 4.1 and Section 4.2.

4.1. Simulations of the Nash Equilibrium Investment Strategies

The evolution of the Nash equilibrium investment strategies will be discussed when the insurer and the reinsurer have the common optimization objective. Using the parameter settings shown above, we can deduce the Nash equilibrium investment strategies’ path. For details, see Figure 1, Figure 2, Figure 3 and Figure 4.
Figure 1 presents the evolution paths of the Nash equilibrium investment strategies when x = 1.2 and the weight coefficient is α = 0.2 . Figure 1a,b consider the effect of the intertemporal restrictions on the Nash equilibrium investment strategies (i.e., case I vs. case II). When x = 1.2 , the condition x s 2 x T t s 2 ( T t ) 1 s s T t 1 ( 1 s T t ) in Corollary 1 is satisfied. By comparing the investment strategies with and without intertemporal restrictions in Figure 1a,b, we can find that when the investment decision-making process considers intertemporal restrictions, the investors will reduce his/her investment position invested in the risk assessment (i.e., u ^ t 1 and u ^ t 2 will decrease) compared to the investment strategy without intertemporal restrictions. This phenomenon is consistent with the conclusions shown in Corollary 1. This means that intertemporal restrictions considerations will lead to an increase in the amount of investment in the riskless asset ( w t 1 u ^ t 1 and w t 2 u ^ t 2 ), indicating that investors will adopt conservative strategies to reduce the investment risk.
Figure 2 depicts the evolution paths of the Nash equilibrium investment strategies when x = 1.01 and the weight coefficient is α = 0.2 . The effect of intertemporal restrictions on the Nash equilibrium investment strategies is considered in this simulation (i.e., cases I and II). When x = 1.01 , the condition x s 2 x T t s 2 ( T t ) 1 s s T t 1 ( 1 s T t ) in Corollary 1 is satisfied. As shown in Figure 2, when the investment decision-making process considers intertemporal restrictions, investors will increase their exposure to risky assets compared to the investment strategy without intertemporal restrictions, which is consistent with the conclusions shown in Corollary 1. This means that the intertemporal restrictions will encourage investors to adopt radical strategies to increase their wealth. Compared with Figure 1 and Figure 2, we note that the effect of intertemporal restrictions on the Nash equilibrium investment strategies is related to the reduction speed of risk aversion parameters and the value of risk-free benefits.
In Figure 1 and Figure 2, we assume that the weight coefficient satisfies α < 0.5 (i.e., α = 0.2 ), which means that the interest of the reinsurer is relatively important in consideration of the common goal. Next, we want to test whether the above conclusions hold when the weight coefficient α > 0.5 (for example, α = 0.8 in Figure 3 and Figure 4).
Comparing Figure 1 with Figure 3, Figure 2 with Figure 4, we find that the weight coefficient α can not change the effect of intertemporal constraints on investment strategies. In other words, whether the intertemporal restrictions will make the investment strategy more radical or more conservative depends only on the reduction speed of risk aversion parameters and the value of risk-free benefits, but not on the size of the weight parameter.
Moreover, in Figure 1, Figure 2, Figure 3 and Figure 4, we note that the Nash equilibrium investment strategies are increasing with respect to t. In other words, at the beginning of investment range, the insurer and the reinsurer are relatively conservative in order to avoid bankruptcy; the later, the more money they invest in risk assets. This phenomenon is because as time goes on, the insurer’s and reinsurer’s wealth keeps accumulating and their risk aversion coefficient gradually decreases, so they have enough ability and confidence to invest more funds in risky assets.

4.2. Simulations of the Nash Equilibrium Reinsurance Strategy

Next, we proceed to illustrate the evolution of the Nash equilibrium reinsurance strategy. On the basis of the parameters given above, we deduce the corresponding path of the Nash equilibrium reinsurance strategy. For details, see Figure 5, Figure 6, Figure 7 and Figure 8.
Under the expected value premium principle, Figure 5 shows the evolution of the Nash equilibrium reinsurance strategy when α < 0.5 (e.g., α = 0.2 ). As two different departments of the same large insurance company, the insurer and the reinsurer have a common interest goal, but their positions in the head office are different. α < 0.5 means that the interest of the reinsurer may be more concerned than that of the insurer. Therefore, the reinsurer may not be willing to bear the risk of claim, especially at the early stage. From Figure 5a, we can find that the reinsurer will not assume any claim risk in the previous investment periods (i.e., q t = 1 when t 76 in case I and t 65 in case II), and the reinsurer will only undertake the part or all of claim risk with the approach of the end of period (Note that ( 1 q t ) denotes the proportion of claim undertaken by reinsurer at time t, where t = 0 , 1 , , T 1 ). A similar analysis can be obtained from Figure 5b.
In addition, in the case of α < 0.5 , Figure 5a,b respectively describe the impacts of intertemporal restrictions on the Nash equilibrium reinsurance strategy when the reduction speed of risk aversion parameters takes different values. As shown in Figure 5a, we can find that the intertemporal restrictions can cause the reinsurer to be more concerned about its performance in the various intermediate periods, which will lead to longer time for the reinsurer not to bear the risk of claim (e.g., the reinsurance strategy of case I is q t = 1 when t 76 , while that of case II is q t = 1 when t 65 , this indicates that the reinsurer in the case I is less willing to take on the risk of claim compared to the reinsurer in the case II). When x = 1.2 , the condition x s 2 x T t s 2 ( T t ) 1 s s T t 1 ( 1 s T t ) in Corollary 3 is satisfied. That is, when the reinsurance strategy is between 0 and 1, the intertemporal restrictions reduce the underwriting ratio of the reinsurer, i.e., ( 1 q t ) decreases. That is to say, intertemporal restrictions restrain the reinsurer’s enthusiasm to bear claim risk in this case.
From Figure 5b, we can find that intertemporal restrictions have a completely opposite effect on the Nash equilibrium reinsurance strategy. Intertemporal restrictions will make the reinsurer more confident about its own wealth level in this case and therefore more willing to take on claim risk as early as possible (e.g., the reinsurance strategy of case I is q t = 1 when t 58 , while that of case II is q t = 1 when t 65 , this indicates that the reinsurer in the case I is more willing to bear the claim risks compared to the reinsurer in the case II). When x = 1.01 , the condition x s 2 x T t s 2 ( T t ) 1 s s T t 1 ( 1 s T t ) in Corollary 3 is satisfied. That is, when the reinsurance strategy is between 0 and 1, the intertemporal restrictions increase the underwriting ratio of the reinsurer, i.e., ( 1 q t ) increases. In other words, intertemporal restrictions stimulate the reinsurer to bear the claim risk when x = 1.01 and α < 0.5 . In summary, when the reinsurer’s interest is more valued, the impact of intertemporal restrictions on Nash equilibrium reinsurance strategy is related to the values of reduction speed of risk aversion parameters and the value of risk-free benefits.
Figure 6 depicts the Nash equilibrium reinsurance strategy changes over time under expected value premium principle when the insurer’s interest is more valued (i.e., α > 0.5 . As an example, we take α = 0.8 here). Similar to the analysis in Figure 5, from Figure 6, we find the insurer might want the reinsurer to assume more claim risks since the bankruptcy probability is higher in early stages of investment. With the insurer’s wealth accumulating, the insurer gradually improves the retention level q t of claim risk until it fully assumes all claim risk.
In addition, as shown in Figure 6a, we can find that, when the decision-making process does not consider intertemporal restrictions, the insurer will increase the retention level q t after t = 65 . When the decision-making process considers intertemporal restrictions, the insurer will increase the retention level q t after t = 76 . When the reinsurance strategy falls between 0 and 1, the intertemporal restrictions will reduce the insurer’s reservation level. In other words, the intertemporal restrictions will aggravate the insurer’s degree of risk aversion and make the insurer more reluctant to bear the claim risk prematurely when x = 1.2 and α > 0.5 . This is the opposite of the effect shown in Figure 6a. From Figure 6b, we can find that, when the decision-making process considers intertemporal restrictions, the insurer is more willing to assume a certain percentage or all of the claim risk as early as possible. Furthermore, the intertemporal restrictions will increase the insurer’s reservation level when the reinsurance strategy falls between 0 and 1. That is to say, intertemporal restrictions make the insurer more willing to take the claim risk when x = 1.01 and α > 0.5 . Combining Figure 6a,b, we find when the reinsurance premium is computed under the expected value premium principle, the effect of intertemporal restrictions on the Nash equilibrium reinsurance strategies depends not only on the reduction speed of risk aversion parameters and the value of risk-free benefits, but also on the weight parameter.
Figure 7 shows the Nash equilibrium reinsurance strategy changes over time under the variance value premium principle in the case of reinsurer’s interest is more valued (i.e., α < 0.5 . For example, α = 0.2 ). From Figure 7, we can find that at the initial period, the reinsurer doesn’t want to take any the claim risk (i.e., 1 q t = 0 ), and with the reinsurer accumulated to a certain degree of wealth, he/she has enough ability to take on all claim risks (i.e., 1 q t = 1 ). As close to the end of investment, the insurer also has some ability to take the claim risk, then the retention level of the insurer (i.e., q t ) will be increased from 0 to a higher level.
Figure 7a,b depict the impacts of intertemporal restrictions on Nash equilibrium reinsurance strategy when x takes different values. As shown in Figure 7a, at the beginning of the period, the intertemporal restrictions will cause the reinsurer to pay more attention to his/her performance, which also lead to a longer time for the reinsurer to be unwilling to assume claim risk. At the end of the investment, the reinsurance strategy is between 0 and 1, and the intertemporal restrictions will increase the underwriting ratio of the reinsurer. From Figure 7b, at the beginning of the investment, the intertemporal restrictions make the reinsurer more confident about its level of wealth and therefore more willing to take on claim risk earlier. At the end of the investment, the reinsurance strategy is between 0 and 1, and the intertemporal restrictions will make the reinsurer reduce its underwriting proportion.
Figure 8 depicts the Nash equilibrium reinsurance strategies change over time under the variance value premium principle in the case of the insurer’s interest is more valued (i.e., α > 0.5 . For example, α = 0.8 ). From Figure 8, we can find that the insurer will pass all claim risk to the reinsurer since the bankruptcy probability is higher in the early stages of investment. With the insurer’s wealth accumulating, the insurer has enough ability to bear claim risk; therefore, the insurer will gradually increase the retention level q t . As shown in Figure 8a, the reinsurance strategy in case I will be shifted to the right by some units compared to that in case II. This illustrates the intertemporal restrictions will cause the insurer to pay more attention to its performance, which will lead to a longer time for the insurer to be unwilling to assume claim risk. When the reinsurance strategy is between 0 and 1, the intertemporal restrictions would encourage the insurer to reduce its reservation proportion. That is to say, the intertemporal restrictions can make the insurer more cautious about claim risk when α = 0.8 and x = 1.2 . From Figure 8b, we can find that the intertemporal restrictions make the insurer more confident about its level of wealth and more willing to take claims early (e.g., as shown in Figure 8a, the reinsurance strategy in case I will be shifted to the left by some units compared to that in case II). When the reinsurance strategy is between 0 and 1, the intertemporal restrictions would encourage the insurer to increase its reservation proportion. In other words, the intertemporal restrictions will make the insurer more willing to bear claim risks in this case.

5. Conclusions

In the paper, under the discrete-time framework, we construct a generalized multi-period mean-variance model to study the investment-reinsurance optimization problem of the reinsurer and the insurer, which can be regarded as two different departments of a general insurance company. The reinsurer and the insurer have a common interest goal to maximize the expectation of the weighted sum of their wealth processes and minimize corresponding variance. In addition, the intertemporal restrictions are considered in our model. By using the game method, we deduce the Nash equilibrium investment-reinsurance strategies for the generalized premium calculated principle, and derive the Nash equilibrium investment-reinsurance strategies under two special premium calculated principles (i.e., the expected value premium principle and the variance value premium principle). Furthermore, we theoretically analyze the effect of the intertemporal restrictions on Nash equilibrium strategies and perform corresponding numerical analyses to verify our theoretical results.
We find that the effect of the intertemporal restrictions on Nash equilibrium strategies is related to the setting of model parameters. (i) The effect of intertemporal restrictions on Nash equilibrium investment strategies depends on the reduction speed of risk aversion parameters and the value of risk-free benefits. (ii) When computing the reinsurance premium under the expected value premium principle, the effect of intertemporal restrictions on the Nash equilibrium reinsurance strategy depends not only on the reduction speed of risk aversion parameters and the value of risk-free benefits but also on the weight parameter. (iii) When computing the reinsurance premium under the variance value premium principle, the effect of intertemporal restrictions on the Nash equilibrium reinsurance strategy depends on a number of other factors, in addition to those mentioned above.
However, the research in this paper has some limitations. For example, this article studies equilibrium investment-reinsurance strategies under general or special reinsurance premium calculation rules but does not involve the pricing of reinsurance premiums. With the method in Chen and Shen [15], Chen and Shen [16] and Anthropelos and Boonen [38], how to find a reasonable reinsurance premium calculation formula to maximize the benefits of both the insurer and reinsurer in the discrete-time framework is an urgent problem to be studied. Furthermore, with reference to the methods of Bensoussan et al. [9], Pun and Wong [10], Pun et al. [11], and so on, the study of the game problem between two insurance companies in discrete-time framework is also a good expandable study direction of this paper in the future.

Author Contributions

Supervision, Z.Z.; writing—original draft, Y.B.; writing—review and editing, H.X. and R.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research is supported by the National Natural Science Foundation of China (Nos. 71771082, 71801091, 71701068), Hunan Provincial Natural Science Foundation of China (No. 2017JJ1012), and the Scientific Research Fund of Hunan Provincial Education Department, China (No.17K057).

Acknowledgments

The authors are grateful to the anonymous reviewers and the editor for the valuable comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. The Proof of Proposition 1

When k = T 1 , from the definition of J k ( w k 1 , w k 2 , π ) , we have
V T 1 ( w T 1 1 , w T 1 2 ) = max π T 1 ξ T [ E T 1 ( α w T 1 + ( 1 α ) w T 2 ) η T V a r T 1 ( α w T 1 + ( 1 α ) w T 2 ) ] .
Therefore, Proposition 1 holds for k = T 1 . For k = 0 , 1 , , T 2 , J k ( w k 1 , w k 2 , π ) can be expressed as
J k ( w k 1 , w k 2 , π ) = t = k + 1 T ξ t [ E k ( α w t 1 + ( 1 α ) w t 2 ) η t V a r k ( α w t 1 + ( 1 α ) w t 2 ) ] = t = k + 2 T ξ t [ E k ( α w t 1 + ( 1 α ) w t 2 ) η t V a r k ( α w t 1 + ( 1 α ) w t 2 ) ] + ξ k + 1 [ E k ( α w k + 1 1 + ( 1 α ) w k + 1 2 ) η k + 1 V a r k ( α w k + 1 1 + ( 1 α ) w k + 1 2 ) ] .
Based on the law of iterated expectations and the law of total variance, we obtain
E k ( α w t 1 + ( 1 α ) w t 2 ) = E k [ E k + 1 ( α w t 1 + ( 1 α ) w t 2 ) ] ,
V a r k ( α w t 1 + ( 1 α ) w t 2 ) = E k [ V a r k + 1 ( α w t 1 + ( 1 α ) w t 2 ) ] + V a r k [ E k + 1 ( α w t 1 + ( 1 α ) w t 2 ) ] .
Then, J k ( w k 1 , w k 2 , π ) can be rewritten as
J k ( w k 1 , w k 2 , π ) = E k t = k + 2 T ξ t [ E k + 1 ( α w t 1 + ( 1 α ) w t 2 ) η t V a r k + 1 ( α w t 1 + ( 1 α ) w t 2 ) ] t = k + 2 T ξ t η t V a r k [ E k + 1 ( α w t 1 + ( 1 α ) w t 2 ) ] + ξ k + 1 [ E k ( α w k + 1 1 + ( 1 α ) w k + 1 2 ) η k + 1 V a r k ( α w k + 1 1 + ( 1 α ) w k + 1 2 ) ] , = E k [ J k + 1 ( w k + 1 1 , w k + 1 2 , π ) ] t = k + 2 T ξ t η t V a r k [ E k + 1 ( α w t 1 + ( 1 α ) w t 2 ) ] + ξ k + 1 [ E k ( α w k + 1 1 + ( 1 α ) w k + 1 2 ) η k + 1 V a r k ( α w k + 1 1 + ( 1 α ) w k + 1 2 ) ] .
Let f k , t ( w k 1 , w k 2 ) = E k [ α w t 1 + ( 1 α ) w t 2 ] | π ^ ( k ) = E k [ f k + 1 , t ( w k + 1 1 , w k + 1 2 ) ] . Additionally, due to the fact that V k ( w k 1 , w k 2 ) = max π k J k ( w k 1 , w k 2 , π ( k ) ) = J k ( w k 1 , w k 2 , π ^ ( k ) ) , then, for k = 0 , 1 , , T 2 , we have
V k ( w k 1 , w k 2 ) = max π k E k [ V k + 1 ( w k + 1 1 , w k + 1 2 ) ] t = k + 2 T ξ t η t V a r k [ f k + 1 , t ( w k + 1 1 , w k + 1 2 ) ] + ξ k + 1 [ E k ( α w k + 1 1 + ( 1 α ) w k + 1 2 ) η k + 1 V a r k ( α w k + 1 1 + ( 1 α ) w k + 1 2 ) ] .
Hence, Proposition 1 is proved.

Appendix B. The Proof of Theorem 1

When t = T 1 , we obtain
V T 1 ( w T 1 1 , w T 1 2 ) = max π T 1 ξ T E T 1 [ α w T 1 + ( 1 α ) w T 2 ] ξ T η T V a r T 1 [ α w T 1 + ( 1 α ) w T 2 ] = max π T 1 α ξ T [ s T 1 w T 1 1 + μ T 1 1 u T 1 1 + c T 1 δ T 1 ( q T 1 ) q T 1 μ ˜ T 1 ] + ( 1 α ) ξ T [ s T 1 w T 1 2 + μ T 1 2 u T 1 2 + δ T 1 ( q T 1 ) ( 1 q T 1 ) μ ˜ T 1 ] ξ T η T { α 2 σ T 1 1 ( u T 1 1 ) 2 + ( 1 α ) 2 σ T 1 2 ( u T 1 2 ) 2 + 2 α ( 1 α ) θ T 1 u T 1 1 u T 1 2 + [ α q T 1 + ( 1 α ) ( 1 q T 1 ) ] 2 σ ˜ T 1 } .
Then, we have
u ^ T 1 1 = μ T 1 1 σ T 1 2 μ T 1 2 θ T 1 2 η T α [ σ T 1 1 σ T 1 2 ( θ T 1 ) 2 ] ,
u ^ T 1 2 = μ T 1 2 σ T 1 1 μ T 1 1 θ T 1 2 η T ( 1 α ) [ σ T 1 1 σ T 1 2 ( θ T 1 ) 2 ] ,
and
q ^ T 1 = arg max 0 q T 1 1 { ( 1 2 α ) δ T 1 ( q T 1 ) η T [ α q T 1 + ( 1 α ) ( 1 q T 1 ) ] 2 σ ˜ T 1 μ ˜ T 1 [ α q T 1 + ( 1 α ) ( 1 q T 1 ) ] } .
Thus, the equilibrium value function V T 1 ( w T 1 1 , w T 1 2 ) can be rewritten as
V T 1 ( w T 1 1 , w T 1 2 ) = ξ T s T 1 [ α w T 1 1 + ( 1 α ) w T 1 2 ] + α ξ T c T 1 + ξ T ( 1 2 α ) δ T 1 ( q ^ T 1 ) + ξ T [ ( μ T 1 1 ) 2 σ T 1 2 + ( μ T 1 2 ) 2 σ T 1 1 2 μ T 1 1 μ T 1 2 θ T 1 ] 4 η T [ σ T 1 1 σ T 1 2 ( θ T 1 ) 2 ] ξ T μ ˜ T 1 [ α q ^ T 1 + ( 1 α ) ( 1 q ^ T 1 ) ] ξ T η T [ α q ^ T 1 + ( 1 α ) ( 1 q ^ T 1 ) ] 2 σ ˜ T 1 = ξ T s T 1 [ α w T 1 1 + ( 1 α ) w T 1 2 ] + κ T 1 ,
and
f T 1 , T ( w T 1 1 , w T 1 2 ) = s T 1 [ α w T 1 1 + ( 1 α ) w T 1 2 ] + α c T 1 + ( 1 2 α ) δ T 1 ( q ^ T 1 ) + ( μ T 1 1 ) 2 σ T 1 2 + ( μ T 1 2 ) 2 σ T 1 1 2 μ T 1 1 μ T 1 2 θ T 1 2 η T [ σ T 1 1 σ T 1 2 ( θ T 1 ) 2 ] μ ˜ T 1 [ α q ^ T 1 + ( 1 α ) ( 1 q ^ T 1 ) ] , = s T 1 [ α w T 1 1 + ( 1 α ) w T 1 2 ] + γ T 1 , T .
where
κ T 1 = α ξ T c T 1 + ( 1 2 α ) ξ T δ T 1 ( q ^ T 1 ) μ ˜ T 1 ξ T [ α q ^ T 1 + ( 1 α ) ( 1 q ^ T 1 ) ] + ξ T [ ( μ T 1 1 ) 2 σ T 1 2 + ( μ T 1 2 ) 2 σ T 1 1 2 μ T 1 1 μ T 1 2 θ T 1 ] 4 η T [ σ T 1 1 σ T 1 2 ( θ T 1 ) 2 ] ξ T η T [ α q ^ T 1 + ( 1 α ) ( 1 q ^ T 1 ) ] 2 σ ˜ T 1 , γ T 1 , T = α c T 1 + ( 1 2 α ) δ T 1 ( q ^ T 1 ) + ( μ T 1 1 ) 2 σ T 1 2 + ( μ T 1 2 ) 2 σ T 1 1 2 μ T 1 1 μ T 1 2 θ T 1 2 η T [ σ T 1 1 σ T 1 2 ( θ T 1 ) 2 ] μ ˜ T 1 [ α q ^ T 1 + ( 1 α ) ( 1 q ^ T 1 ) ] .
Then, Theorem 1 holds for t = T 1 . Suppose Theorem 1 holds for t = j + 1 , , T 2 , then when t = j , from the conclusion in Proposition 1, we have
V j ( w j 1 , w j 2 ) = max π j E j [ V j + 1 ( w j + 1 1 , w j + 1 2 ) ] m = j + 2 T ξ m η m V a r j [ f j + 1 , m ( w j + 1 1 , w j + 1 2 ) ] + ξ j + 1 E j [ α w j + 1 1 + ( 1 α ) w j + 1 2 ] ξ j + 1 η j + 1 V a r j [ α w j + 1 1 + ( 1 α ) w j + 1 2 ] = max π j m = j + 2 T ξ m i = j + 1 m 1 s i E j [ α w j + 1 1 + ( 1 α ) w j + 1 2 ] + κ j + 1 m = j + 2 T ξ m η m i = j + 1 m 1 s i 2 V a r j [ α w j + 1 1 + ( 1 α ) w j + 1 2 ] + ξ j + 1 E j [ α w j + 1 1 + ( 1 α ) w j + 1 2 ] ξ j + 1 η j + 1 V a r j [ α w j + 1 1 + ( 1 α ) w j + 1 2 ] = max π j α m = j + 1 T ξ m i = j + 1 m 1 s i [ s j w j 1 + μ j 1 u j 1 + c j δ j ( q j ) μ ˜ j q j ] + ( 1 α ) m = j + 1 T ξ m i = j + 1 m 1 s i [ s j w j 2 + μ j 2 u j 2 + δ j ( q j ) ( 1 q j ) μ ˜ j ] m = j + 1 T ξ m η m i = j + 1 m 1 s i 2 { α 2 σ j 1 ( u j 1 ) 2 + ( 1 α ) 2 σ j 2 ( u j 2 ) 2 + 2 α ( 1 α ) θ j u j 1 u j 2 + [ α q j + ( 1 α ) ( 1 q j ) ] 2 σ ˜ j } + κ j + 1 .
Then, we have
u ^ j 1 = ( μ j 1 σ j 2 μ j 2 θ j ) m = j + 1 T ξ m i = j + 1 m 1 s i 2 α ( σ j 1 σ j 2 θ j 2 ) m = j + 1 T ξ m η m i = j + 1 m 1 s i 2 ,
u ^ j 2 = ( μ j 2 σ j 1 μ j 1 θ j ) m = j + 1 T ξ m i = j + 1 m 1 s i 2 ( 1 α ) ( σ j 1 σ j 2 θ j 2 ) m = j + 1 T ξ m η m i = j + 1 m 1 s i 2 ,
q ^ j = arg max 0 q j 1 m = j + 1 T ξ m i = j + 1 m 1 s i { ( 1 2 α ) δ j ( q j ) μ ˜ j [ α q j + ( 1 α ) ( 1 q j ) ] } m = j + 1 T ξ m η m i = j + 1 m 1 s i 2 [ α q j + ( 1 α ) ( 1 q j ) ] 2 σ ˜ j .
Thus, the equilibrium value function V j ( w j 1 , w j 2 ) can be rewritten as
V j ( w j 1 , w j 2 ) = m = j + 1 T ξ m i = j m 1 s i [ α w j 1 + ( 1 α ) w j 2 ] + m = j + 1 T ξ m i = j + 1 m 1 s i { α c j + ( 1 2 α ) δ j ( q ^ j ) } + m = j + 1 T ξ m i = j + 1 m 1 s i [ ( μ j 1 ) 2 σ j 2 + ( μ j 2 ) 2 σ j 1 2 μ j 1 μ j 2 θ j ] 4 m = j + 1 T ξ m η m i = j + 1 m 1 s i 2 [ σ j 1 σ j 2 ( θ j ) 2 ] + κ j + 1 m = j + 1 T ξ m i = j + 1 m 1 s i [ α q ^ j + ( 1 α ) ( 1 q ^ j ) ] μ ˜ j m = j + 1 T ξ m η m i = j + 1 m 1 s i 2 [ α q ^ j + ( 1 α ) ( 1 q ^ j ) ] 2 σ ˜ j = m = j + 1 T ξ m i = j m 1 s i [ α w j 1 + ( 1 α ) w j 2 ] ,
and
f j , τ ( w j 1 , w j 2 ) = E j [ f j + 1 , τ ( w j + 1 1 , w j + 1 2 ) ] = [ α w j 1 + ( 1 α ) w j 2 ] i = j τ 1 s i + [ α c j + ( 1 2 α ) δ j ( q ^ j ) ] i = j + 1 τ 1 s i + m = j + 1 T ξ m i = j + 1 m 1 s i i = j + 1 τ 1 s i [ ( μ j 1 ) 2 σ j 2 + ( μ j 2 ) 2 σ j 1 2 μ j 1 μ j 2 θ j ] 2 m = j + 1 T ξ m η m i = j + 1 m 1 s i 2 [ σ j 1 σ j 2 ( θ j ) 2 ] [ α q ^ j + ( 1 α ) ( 1 q ^ j ) ] μ ˜ j i = j + 1 τ 1 s i + γ j + 1 , τ = [ α w j 1 + ( 1 α ) w j 2 ] i = j τ 1 s i + γ j , τ ,
where
κ j = κ j + 1 + m = j + 1 T ξ m i = j + 1 m 1 s i { α c j + ( 1 2 α ) δ j ( q ^ j ) } + m = j + 1 T ξ m i = j + 1 m 1 s i [ ( μ j 1 ) 2 σ j 2 + ( μ j 2 ) 2 σ j 1 2 μ j 1 μ j 2 θ j ] 4 m = j + 1 T ξ m η m i = j + 1 m 1 s i 2 [ σ j 1 σ j 2 ( θ j ) 2 ] m = j + 1 T ξ m i = j + 1 m 1 s i [ α q ^ j + ( 1 α ) ( 1 q ^ j ) ] μ ˜ j m = j + 1 T ξ m η m i = j + 1 m 1 s i 2 [ α q ^ j + ( 1 α ) ( 1 q ^ j ) ] 2 σ ˜ j , γ j , τ = γ j + 1 , τ + [ α c j + ( 1 2 α ) δ j ( q ^ j ) ] i = j + 1 τ 1 s i [ α q ^ j + ( 1 α ) ( 1 q ^ j ) ] μ ˜ j i = j + 1 τ 1 s i + m = j + 1 T ξ m i = j + 1 m 1 s i i = j + 1 τ 1 s i [ ( μ j 1 ) 2 σ j 2 + ( μ j 2 ) 2 σ j 1 2 μ j 1 μ j 2 θ j ] 2 m = j + 1 T ξ m η m i = j + 1 m 1 s i 2 [ σ j 1 σ j 2 ( θ j ) 2 ] .
It also shows that
κ j = k = j T 1 m = k + 1 T ξ m i = k + 1 m 1 s i { α c k + ( 1 2 α ) δ k ( q ^ k ) μ ˜ k [ α q ^ k + ( 1 α ) ( 1 q ^ k ) ] } + k = j T 1 [ ( μ k 1 ) 2 σ k 2 + ( μ k 2 ) 2 σ k 1 2 μ k 1 μ k 2 θ k ] 4 [ σ k 1 σ k 2 ( θ k ) 2 ] m = k + 1 T ξ m i = k + 1 m 1 s i m = k + 1 T ξ m η m i = k + 1 m 1 s i 2 k = j T 1 m = k + 1 T ξ m η m i = k + 1 m 1 s i 2 [ α q ^ k + ( 1 α ) ( 1 q ^ k ) ] 2 σ ˜ k , γ j , τ = k = j τ 1 { α c k + ( 1 2 α ) δ k ( q ^ k ) μ ˜ k [ α q ^ k + ( 1 α ) ( 1 q ^ k ) ] } i = k + 1 τ 1 s i + k = j τ 1 ( μ k 1 ) 2 σ k 2 + ( μ k 2 ) 2 σ k 1 2 μ k 1 μ k 2 θ k 2 [ σ k 1 σ k 2 ( θ k ) 2 ] m = k + 1 T ξ m i = k + 1 m 1 s i i = k + 1 τ 1 s i 2 m = k + 1 T ξ m η m i = k + 1 m 1 s i 2 .
Combining the above proofs, we have Theorem 1 holds for t = j . Based on mathematical induction, Theorem 1 is proved.

Appendix C. The Proof of Corollary 1

When the riskless rate of return is constant and the risk aversion coefficient is in exponential form, the Nash equilibrium investment strategies in Theorem 1 become
u ^ t 1 = [ μ t 1 σ t 2 μ t 2 θ t ] 2 α [ σ t 1 σ t 2 ( θ t ) 2 ] m = t + 1 T ξ m s m t 1 m = t + 1 T ξ m η T x T m s 2 ( m t 1 ) ,
u ^ t 2 = μ t 2 σ t 1 μ t 1 θ t 2 ( 1 α ) [ σ t 1 σ t 2 ( θ t ) 2 ] m = t + 1 T ξ m s m t 1 m = t + 1 T ξ m η T x T m s 2 ( m t 1 ) .
When considering the intertemporal restrictions, i.e., ξ t = 1 , t = 1 , 2 , , T , from Equations (A20) and (A21), we can get that
| u ^ t 1 | w i t h i n t e r t e m p o r a l r e s t r i c t i o n s = μ t 1 σ t 2 μ t 2 θ t σ t 1 σ t 2 ( θ t ) 2 1 2 α ( 1 s T t ) ( 1 s ) 1 η T x s 2 [ x T t s 2 ( T t ) ] ,
| u ^ t 2 | w i t h i n t e r t e m p o r a l r e s t r i c t i o n s = μ t 2 σ t 1 μ t 1 θ t σ t 1 σ t 2 ( θ t ) 2 1 2 ( 1 α ) ( 1 s T t ) ( 1 s ) 1 η T x s 2 [ x T t s 2 ( T t ) ] .
When the intertemporal restrictions are not considered, i.e., ξ t = 0 , t = 1 , 2 , , T 1 , ξ T = 1 , from Equations (A20) and (A21), we can get that
| u ^ t 1 | w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s = μ t 1 σ t 2 μ t 2 θ t σ t 1 σ t 2 ( θ t ) 2 1 2 α 1 η T s T t 1 ,
| u ^ t 2 | w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s = μ t 2 σ t 1 μ t 1 θ t σ t 1 σ t 2 ( θ t ) 2 1 2 ( 1 α ) 1 η T s T t 1 .
Then,
| u ^ t 1 | w i t h i n t e r t e m p o r a l r e s t r i c t i o n s | u ^ t 1 | w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s = μ t 1 σ t 2 μ t 2 θ t σ t 1 σ t 2 ( θ t ) 2 1 2 α η T ( 1 s T t ) ( 1 s ) x s 2 [ x T t s 2 ( T t ) ] 1 s T t 1 ,
| u ^ t 2 | w i t h i n t e r t e m p o r a l r e s t r i c t i o n s | u ^ t 2 | w i t h o u t i n t e r t e m p o r a l r e s t r i c t i o n s = μ t 2 σ t 1 μ t 1 θ t σ t 1 σ t 2 ( θ t ) 2 1 2 ( 1 α ) η T ( 1 s T t ) ( 1 s ) x s 2 [ x T t s 2 ( T t ) ] 1 s T t 1 .
The condition ( 1 s T t ) ( 1 s ) x s 2 [ x T t s 2 ( T t ) ] 1 s T t 1 > 0 is equivalent to the condition x s 2 x T t s 2 ( T t ) > 1 s s T t 1 ( 1 s T t ) . Then, three cases in Corollary (1) are proved.

References

  1. Browne, S. Optimal Investment Policies for a Firm with a Random Risk Process: Exponential Utility and Minimizing the Probability of Ruin. Math. Oper. Res. 1995, 20, 937–958. [Google Scholar] [CrossRef] [Green Version]
  2. Schmidli, H. On minimizing the ruin probability by investment and reinsurance. Ann. Appl. Probab. 2002, 12, 890–907. [Google Scholar] [CrossRef]
  3. Zeng, Y.; Li, Z. Optimal time-consistent investment and reinsurance policies for mean-variance insurers. Insur. Math. Econ. 2011, 49, 145–154. [Google Scholar] [CrossRef]
  4. Zhu, H.; Deng, C.; Yue, S.; Deng, Y. Optimal reinsurance and investment problem for an insurer with counterparty risk. Insur. Math. Econ. 2015, 61, 242–254. [Google Scholar] [CrossRef]
  5. Hu, D.; Wang, H. Time-consistent investment and reinsurance under relative performance concerns. Commun. Stat. Theory Methods 2018, 47, 1693–1717. [Google Scholar] [CrossRef]
  6. Deng, C.; Bian, W.; Wu, B. Optimal reinsurance and investment problem with default risk and bounded memory. Int. J. Control. 2019, 1–13. [Google Scholar] [CrossRef]
  7. Zeng, X. A stochastic differential reinsurance game. J. Appl. Probab. 2010, 47, 335–349. [Google Scholar] [CrossRef] [Green Version]
  8. Li, D.; Rong, X.; Zhao, H. Stochastic differential game formulation on the reinsurance and investment problem. Int. J. Control. 2015, 88, 1861–1877. [Google Scholar] [CrossRef]
  9. Bensoussan, A.; Siu, C.C.; Yam, S.C.P.; Yang, H. A class of non-zero-sum stochastic differential investment and reinsurance games. Automatica 2014, 50, 2025–2037. [Google Scholar] [CrossRef] [Green Version]
  10. Pun, C.S.; Wong, H.Y. Robust non-zero-sum stochastic differential reinsurance game. Insur. Math. Econ. 2016, 68, 169–177. [Google Scholar] [CrossRef]
  11. Pun, C.S.; Siu, C.C.; Wong, H.Y. Non-zero-sum reinsurance games subject to ambiguous correlations. Oper. Res. Lett. 2016, 44, 578–586. [Google Scholar] [CrossRef]
  12. Pun, C.S. Robust Stochastic Control and High-Dimensional Statistics with Applications in Finance. Ph.D. Thesis, The Chinese University of Hong Kong, Hong Kong, China, 2016. [Google Scholar]
  13. Deng, C.; Zeng, X.; Zhu, H. Non-zero-sum stochastic differential reinsurance and investment games with default risk. Eur. J. Oper. Res. 2018, 264, 1144–1158. [Google Scholar] [CrossRef]
  14. Wang, N.; Zhang, N.; Jin, Z.; Qian, L. Robust non-zero-sum investment and reinsurance game with default risk. Insur. Math. Econ. 2019, 84, 115–132. [Google Scholar] [CrossRef]
  15. Chen, L.; Shen, Y. On a New Paradigm of Optimal Reinsurance: A Stochastic Stackelberg Differential Game between an Insurer and a Reinsurer. ASTIN Bull. 2018, 48, 905–960. [Google Scholar] [CrossRef] [Green Version]
  16. Chen, L.; Shen, Y. Stochastic Stackelberg differential reinsurance games under time-inconsistent mean-variance framework. Insur. Math. Econ. 2019, 88, 120–137. [Google Scholar] [CrossRef]
  17. Wang, Y.; Rong, X.; Zhao, H. Optimal investment strategies for an insurer and a reinsurer with a jump diffusion risk process under the CEV model. J. Comput. Appl. Math. 2018, 328, 414–431. [Google Scholar] [CrossRef]
  18. Chen, S.; Liu, Y.; Weng, C. Dynamic risk-sharing game and reinsurance contract design. Insur. Math. Econ. 2019, 86, 216–231. [Google Scholar] [CrossRef]
  19. Li, D.; Rong, X.; Zhao, H. The optimal investment problem for an insurer and a reinsurer under the constant elasticity of variance model. IMA J. Manag. Math. 2016, 27, 255–280. [Google Scholar] [CrossRef]
  20. Zhou, J.; Yang, X.; Huang, Y. Robust optimal investment and proportional reinsurance toward joint interests of the insurer and the reinsurer. Commun. Stat. Theory Methods 2017, 46, 10733–10757. [Google Scholar] [CrossRef]
  21. Huang, Y.; Ouyang, Y.; Tang, L.; Zhou, J. Robust optimal investment and reinsurance problem for the product of the insurer’s and the reinsurer’s utilities. J. Comput. Appl. Math. 2018, 344, 532–552. [Google Scholar] [CrossRef]
  22. Zhao, H.; Weng, C.; Shen, Y.; Zeng, Y. Time-consistent investment-reinsurance strategies towards joint interests of the insurer and the reinsurer under CEV models. Sci. China Math. 2017, 60, 317–344. [Google Scholar] [CrossRef]
  23. Brandt, M.W. Portfolio choice problems. In Handbook of Financial Econometrics: Tools and Techniques; Elsevier: Amsterdam, The Netherlands, 2010; pp. 269–336. [Google Scholar]
  24. Zhou, Z.; Ren, T.; Xiao, H.; Liu, W. Time-consistent investment and reinsurance strategies for insurers under multi-period mean-variance formulation with generalized correlated returns. J. Manag. Sci. Eng. 2019, 4, 142–157. [Google Scholar] [CrossRef]
  25. Xiao, H.; Ren, T.; Bai, Y.; Zhou, Z. Time-Consistent Investment-Reinsurance Strategies for the Insurer and the Reinsurer under the Generalized Mean-Variance Criteria. Mathematics 2019, 7, 857. [Google Scholar] [CrossRef] [Green Version]
  26. Markowitz, H. Portfolio selection. J. Financ. 1952, 7, 77–91. [Google Scholar]
  27. Zhu, S.S.; Li, D.; Wang, S.Y. Risk control over bankruptcy in dynamic portfolio selection: A generalized mean-variance formulation. IEEE Trans. Autom. Control 2004, 49, 447–457. [Google Scholar] [CrossRef]
  28. Zhou, Z.; Xiao, H.; Yin, J.; Zeng, X.; Lin, L. Pre-commitment vs. time-consistent strategies for the generalized multi-period portfolio optimization with stochastic cash flows. Insur. Math. Econ. 2016, 68, 187–202. [Google Scholar] [CrossRef]
  29. Costa, O.; Nabholz, R.D.B. Multiperiod mean-variance optimization with intertemporal restrictions. J. Optim. Theory Appl. 2007, 134, 257. [Google Scholar] [CrossRef]
  30. Costa, O.L.; Araujo, M.V. A generalized multi-period mean–variance portfolio optimization with Markov switching parameters. Automatica 2008, 44, 2487–2497. [Google Scholar] [CrossRef]
  31. Costa, O.L.; de Oliveira, A. Optimal mean-variance control for discrete-time linear systems with Markovian jumps and multiplicative noises. Automatica 2012, 48, 304–315. [Google Scholar] [CrossRef]
  32. Cui, X.; Li, X.; Li, D. Unified framework of mean-field formulations for optimal multi-period mean-variance portfolio selection. IEEE Trans. Autom. Control 2014, 59, 1833–1844. [Google Scholar] [CrossRef] [Green Version]
  33. Li, D.; Ng, W.L. Optimal dynamic portfolio selection: Multiperiod mean-variance formulation. Math. Financ. 2000, 10, 387–406. [Google Scholar] [CrossRef]
  34. Basak, S.; Chabakauri, G. Dynamic mean-variance asset allocation. Rev. Financ. Stud. 2010, 23, 2970–3016. [Google Scholar] [CrossRef]
  35. Björk, T.; Murgoci, A. A General Theory of Markovian Time Inconsistent Stochastic Control Problems. Available online: http://ssrn.com/abstract=1694759 (accessed on 10 December 2019).
  36. Björk, T.; Murgoci, A. A theory of Markovian time-inconsistent stochastic control in discrete time. Financ. Stoch. 2014, 18, 545–592. [Google Scholar] [CrossRef]
  37. Zhou, Z.; Zeng, X.; Xiao, H.; Ren, T.; Liu, W. Multiperiod portfolio optimization for asset-liability management with quadratic transaction costs. J. Ind. Manag. Optim. 2019, 15, 1493–1515. [Google Scholar] [CrossRef]
  38. Anthropelos, M.; Boonen, T.J. Nash Equilibria in Optimal Reinsurance Bargaining. Available online: https://arxiv.org/abs/1909.01739 (accessed on 9 December 2019).
Figure 1. Nash equilibrium investment strategies for (a) the insurer and (b) the reinsurer ( α = 0.2 and x = 1.2 ).
Figure 1. Nash equilibrium investment strategies for (a) the insurer and (b) the reinsurer ( α = 0.2 and x = 1.2 ).
Mathematics 08 00139 g001
Figure 2. Nash equilibrium investment strategies for (a) the insurer and (b) the reinsurer ( α = 0.2 and x = 1.01 ).
Figure 2. Nash equilibrium investment strategies for (a) the insurer and (b) the reinsurer ( α = 0.2 and x = 1.01 ).
Mathematics 08 00139 g002
Figure 3. Nash equilibrium investment strategies for (a) the insurer and (b) the reinsurer ( α = 0.8 and x = 1.2 ).
Figure 3. Nash equilibrium investment strategies for (a) the insurer and (b) the reinsurer ( α = 0.8 and x = 1.2 ).
Mathematics 08 00139 g003
Figure 4. Nash equilibrium investment strategies for (a) the insurer and (b) the reinsurer ( α = 0.8 and x = 1.01 ).
Figure 4. Nash equilibrium investment strategies for (a) the insurer and (b) the reinsurer ( α = 0.8 and x = 1.01 ).
Mathematics 08 00139 g004
Figure 5. Nash equilibrium reinsurance strategy under the expected value premium principle ( α = 0.2 ).
Figure 5. Nash equilibrium reinsurance strategy under the expected value premium principle ( α = 0.2 ).
Mathematics 08 00139 g005
Figure 6. Nash equilibrium reinsurance strategy under the expected value premium principle ( α = 0.8 ).
Figure 6. Nash equilibrium reinsurance strategy under the expected value premium principle ( α = 0.8 ).
Mathematics 08 00139 g006
Figure 7. Nash equilibrium reinsurance strategy under the variance value premium principle ( α = 0.2 ).
Figure 7. Nash equilibrium reinsurance strategy under the variance value premium principle ( α = 0.2 ).
Mathematics 08 00139 g007
Figure 8. Nash equilibrium reinsurance strategy under the variance value premium principle ( α = 0.8 ).
Figure 8. Nash equilibrium reinsurance strategy under the variance value premium principle ( α = 0.8 ).
Mathematics 08 00139 g008
Table 1. Optimal strategy when the expected value premium principle and unilateral interests are considered.
Table 1. Optimal strategy when the expected value premium principle and unilateral interests are considered.
CasesOptimal Investment StrategyOptimal Reinsurance Strategy
α = 1 u ^ t 1 = μ t 1 m = t + 1 T ξ m i = t + 1 m 1 s i 2 σ t 1 m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 q ^ t = β t μ ˜ t 2 σ ˜ t m = t + 1 T ξ m i = t + 1 m 1 s i m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 1
α = 0 u ^ t 2 = μ t 2 2 σ t 2 m = t + 1 T ξ m i = t + 1 m 1 s i m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 q ^ t = 0 1 β t μ ˜ t 2 σ ˜ t m = t + 1 T ξ m i = t + 1 m 1 s i m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 1
Table 2. Optimal strategy when the variance value premium principle and unilateral interests are considered.
Table 2. Optimal strategy when the variance value premium principle and unilateral interests are considered.
CasesOptimal Investment StrategyOptimal Reinsurance Strategy
α = 1 u ^ t 1 = μ t 1 m = t + 1 T ξ m i = t + 1 m 1 s i 2 σ t 1 m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 q ^ t = m = t + 1 T ξ m i = t + 1 m 1 s i β t ν ˜ t m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 σ ˜ t + m = t + 1 T ξ m i = t + 1 m 1 s i β t ν ˜ t
α = 0 u ^ t 2 = μ t 2 2 σ t 2 m = t + 1 T ξ m i = t + 1 m 1 s i m = t + 1 T ξ m η m i = t + 1 m 1 s i 2 q ^ t = 1
Table 3. The model parameter values.
Table 3. The model parameter values.
μ t 1 μ t 2 σ t 1 σ t 2 θ t β t s t
1.162 1.228 0.0146 0.0289 0.0145 0.8 1.09
Table 4. Parameter setting in two cases.
Table 4. Parameter setting in two cases.
Case I ξ t = 1 , t = 1 , 2 , , T
Case II ξ t = 0 for t = 1 , 2 , , T 1 and ξ T = 1

Share and Cite

MDPI and ACS Style

Bai, Y.; Zhou, Z.; Gao, R.; Xiao, H. Nash Equilibrium Investment-Reinsurance Strategies for an Insurer and a Reinsurer with Intertemporal Restrictions and Common Interests. Mathematics 2020, 8, 139. https://0-doi-org.brum.beds.ac.uk/10.3390/math8010139

AMA Style

Bai Y, Zhou Z, Gao R, Xiao H. Nash Equilibrium Investment-Reinsurance Strategies for an Insurer and a Reinsurer with Intertemporal Restrictions and Common Interests. Mathematics. 2020; 8(1):139. https://0-doi-org.brum.beds.ac.uk/10.3390/math8010139

Chicago/Turabian Style

Bai, Yanfei, Zhongbao Zhou, Rui Gao, and Helu Xiao. 2020. "Nash Equilibrium Investment-Reinsurance Strategies for an Insurer and a Reinsurer with Intertemporal Restrictions and Common Interests" Mathematics 8, no. 1: 139. https://0-doi-org.brum.beds.ac.uk/10.3390/math8010139

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop