Experimental Study of Cloud-to-Ground Lightning Nowcasting with Multisource Data Based on a Video Prediction Method

Guo, Shuchang; Wang, Jinyan; Gan, Ruhui; Yang, Zhida; Yang, Yi

doi:10.3390/rs14030604


Key Laboratory of Climate Resource Development and Disaster Prevention in Gansu Province, College of Atmospheric Sciences, Lanzhou University, Lanzhou 730000, China


Author to whom correspondence should be addressed.

Remote Sens. 2022, 14(3), 604; https://0-doi-org.brum.beds.ac.uk/10.3390/rs14030604

Submission received: 21 December 2021 / Revised: 25 January 2022 / Accepted: 25 January 2022 / Published: 27 January 2022

(This article belongs to the Special Issue Remote Sensing of Lightning and Its Applications in Atmospheric Electricity Studies)


Abstract:

The generation and dissipation of lightning is a complex nonlinear process, and nowcasts based on extrapolation or numerical models often differ substantially from observations. In this study, a multiple-input, multiple-output lightning nowcasting model, the Convolutional Long- and Short-Term Memory Lightning Forecast Net (CLSTM-LFN), is constructed using deep learning video prediction methods to improve 0 to 3 h lightning nowcasts. The inputs to CLSTM-LFN are the historical lightning occurrence frequency and physical variables from numerical model products that are significantly related to lightning occurrence; the two sources are merged to provide effective information for lightning nowcasting in time and space. Batch forecasting tests show that CLSTM-LFN can effectively forecast lightning occurrence areas for 0 to 3 h and that its nowcasts are better than those of a traditional lightning parameterization scheme and those obtained with a single input data source. An analysis of input variable importance shows that the role of numerical model products increases significantly with forecast time and that the relative importance of convective available potential energy is significantly larger than that of the other physical variables.

Keywords:

lightning nowcasting; video prediction; deep learning; permutation importance

1. Introduction

Lightning is a strong discharge phenomenon that occurs in the atmosphere and can cause severe damage to power, transportation, communication and other facilities. It is often accompanied by heavy rain, strong winds and other severe convective weather, which cause many casualties and considerable property damage. Therefore, obtaining lightning forecasts with high spatial and temporal resolution is a research priority for meteorological departments in various countries.

Nowcasting usually refers to weather forecasting from 0 to 6 h [1] and is a forecast service aimed at mitigating the impact of hazardous weather, such as sudden local severe storms. The occurrence of lightning is closely related to meteorological conditions. Observations from satellites [2] and sounding balloons [3] make it possible to determine whether meteorological conditions are conducive to lightning. The K index indicates the static stability of the layer from 850 hPa to 500 hPa, and lightning usually occurs in the high value area of the K index [4]. Strong thunderstorms usually occur in the high value area of convective available potential energy (CAPE) [5] and the strong shear area of deep convection [6]. Since sounding data are observations from a single station at a single moment, they cannot represent the large-scale weather state or how it changes over time, which limits the nowcasting results.

Radar data have high spatial and temporal resolution and a wide detection range, which can quickly provide real-time weather observations and, therefore, are often used for monitoring and warning of catastrophic weather [7,8]. Numerous studies have shown that lightning usually occurs in regions where large vertical velocities [9] and high concentrations of ice crystal particles are present [10]. According to the statistics of lightning and radar echoes, the area where radar echoes of 40 dBZ are located at altitudes greater than 7 km [11] or the occurrence of 30 dBZ radar echoes in the −15 °C to −20 °C isothermal layer that have been observed in more than two consecutive scans can be regarded as the basis for lightning occurrence [12]. Dual-polarization radar is good at detecting information on ice crystal particle content and location, and the volume of ice crystal particles obtained from the inversion of dual-polarization radar can be used to forecast lightning according to different thresholds [13]. The method of identifying lightning occurrence areas based on radar data is obtained by using the statistics of historical samples from some areas but has some limitations in applicability in other areas. Meanwhile, a study pointed out that the correlation between lightning and radar echoes in linear convective systems is weaker than that of supercells [14]. The occurrence of lightning develops more rapidly, and the nowcasting results obtained based on statistical methods rapidly decrease in accuracy and produce large errors after a certain time limit.

In contrast to the above methods for lightning nowcasting, the use of numerical weather prediction models provides a more accurate dynamical and thermal state of the atmosphere. The PR92 lightning parameterization scheme establishes a relationship between lightning occurrence frequency and cloud top height, which has different forecast formulas for land and ocean areas [15]. The potential electrical energy establishes a relationship with the lightning density based on the microphysical variables within the cloud and considers the occurrence of lightning when its dissipation exceeds a prespecified threshold [16]. Wang et al. (2010) used the Global/Regional Assimilation and Prediction Enhanced System to construct a forecast equation based on the relationship between radar echoes and lightning occurrence frequency while using the cloud top temperature as a limiting condition. After testing two cases of lightning in South China, the nowcasting results of lightning occurrence location and density were more consistent with the observed results [17]. Based on the study of lightning initiation and discharge theory [18,19], some scholars have coupled the initiation and discharge parameterization scheme into the numerical model so that the numerical model has the ability to simulate the electric field characteristics within clouds and forecast lightning [20,21]. In recent years, numerical weather models have improved initial forecast model fields by assimilating sounding, radar, satellite and other observations, but there are still some biases in the nowcasting results [22], and the bias is not regular in time and space.

The continuous development of artificial intelligence technology in recent years has led to a series of breakthroughs in image recognition, speech recognition, autonomous driving, etc. The application of artificial intelligence technology to weather forecasting is gradually becoming popular. A deep neural network is a simulation of biological neurons, a dynamic network with strong nonlinear expression ability and self-adaptive capability generated by the superposition of large-scale neurons. The analysis of meteorological observations and products from numerical models using deep learning methods can identify areas of hazardous weather occurrence at future moments, and the mathematical–physical process is relatively similar to application scenarios such as image recognition and video prediction. The more commonly used existing methods are convolutional neural networks (CNNs) and convolutional long- and short-term memory networks (ConvLSTMs), and some researchers have made progress in pattern recognition [23,24] and lightning forecasting [25,26] using these methods. Semantic segmentation classifies images at the pixel level and is commonly used in areas such as the detection of organ lesions in medical images and the classification of remote sensing images. Zhou et al. (2019) used NCEP Final Operational Global Analysis data (FNL) and Global Forecast System forecast data (GFS) to conduct convolution operations on the grid points around each forecast grid point and judged whether lightning, hail and other strong convective weather would occur through binary classification in the fully connected layer [27]. Zhou et al. (2020) used Himawari-8 satellite and radar echo data and replaced two-dimensional convolution in a semantic segmentation network with three-dimensional convolution to obtain the probability of lightning occurrence in the forecast area [28]. The model obtained high forecast scores for the 0 to 1 h lightning nowcasting results.

Video prediction usually refers to analyzing object movements in the first few frames of still pictures or videos and predicting future images based on changes in their positions and forms [29]. For example, by analyzing the movement trajectory of pedestrians, the surrounding building environment and other factors, the target is tracked and the trajectory is predicted so that the driver can be alerted to react in time to avoid traffic accidents. Compared with image classification and semantic segmentation, video prediction methods are more suitable for dealing with dynamic image changes because they maintain continuity in the temporal dimension, and they have been successfully applied in autonomous driving [30] and short-term prognosis of precipitation [31,32].

Lightning usually occurs in small- and mesoscale convective systems, which change rapidly and are short-lived, and its development is nonlinear and complex. Forecasts based solely on extrapolating historical observations, or solely on parameterization schemes applied to numerical model products, often differ substantially from actual observations. Merging observation data with numerical model products is a more promising way to improve lightning nowcasting results. With a video prediction method, the future trend can be inferred from the lightning characteristics at past times and further constrained by the meteorological conditions indicated by the numerical model products, which can effectively improve the lightning nowcasting results.

Based on the above research progress, in this study, a CNN module is added to the ConvLSTM to construct the lightning nowcasting model CLSTM-LFN. The CNN module extracts the factors affecting lightning occurrence from the numerical model products. The ConvLSTM module then performs forward prediction on the spatiotemporal sequence composed of the historical lightning occurrence frequency and these factors, yielding lightning occurrence areas for the next 0 to 3 h. Finally, the importance of the input variables is assessed using the permutation importance method.

Section 2 describes the data and the study area. Section 3 introduces the structure of the deep learning model and the experimental design. In Section 4, we illustrate the nowcasting results, and Section 5 demonstrates the variable importance analysis results. The conclusions and discussion are provided in Section 6.

2. Data

2.1. Lightning Data

The lightning data used in this study are from the National Lightning Monitoring Network of China, which can process the received data in real time. The observation range covers most areas of China and is used to detect the time, location and intensity of cloud-to-ground lightning flashes. The average observation range of a single lightning monitoring station is approximately 300 km [33], and the detection efficiency is close to 90% [34], which provides an excellent prerequisite for lightning research. In this study, the lightning occurrence frequency (number of lightning occurrences in a 1-h period) is obtained by preprocessing the lightning observation data, which is one of the input variables to the CLSTM-LFN.

2.2. WRF Model Prediction Products

The numerical model products are one of the inputs to CLSTM-LFN. The numerical model used in this study is the Weather Research and Forecasting (WRF) Model, a new-generation mesoscale weather forecasting model developed by the National Center for Atmospheric Research (NCAR) and other institutions that is highly portable and easy to maintain. The WRF model is driven by GFS forecast data and is initialized once a day at 12:00 (UTC) with a forecast length of 36 h. The first 12 h are treated as spin-up time, and the forecasts for the following 24 h, output at 1-h intervals, are used as input variables to the CLSTM-LFN. All numerical model products used by CLSTM-LFN on a given day therefore come from the same model run, and products with different forecast lead times are selected to forecast lightning at different times. For example, to forecast lightning for the time periods 1 July 2020 00:00 (UTC, same below) to 1 July 2020 03:00 and 1 July 2020 09:00 to 1 July 2020 12:00, the numerical model is initialized at 30 June 2020 12:00 and the forecast lead times are 12 h to 15 h and 21 h to 24 h, respectively.
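The lead-time bookkeeping in the example above can be sketched in a few lines of Python. This is only an illustration: the function name `wrf_run_and_lead` is hypothetical, valid times are assumed to fall on whole hours, and a valid time of exactly 12:00 is assigned to the earlier run (lead time 24 h), as in the example in the text.

```python
from datetime import datetime, timedelta

INIT_HOUR_UTC = 12  # daily WRF initialization time (from the text)
SPIN_UP_HOURS = 12  # spin-up period discarded from each run (from the text)

def wrf_run_and_lead(valid_time):
    """Return (initialization time, lead time in hours) for a UTC valid time.

    Assumption: each valid time uses the most recent 12:00 UTC run whose
    lead time is at least SPIN_UP_HOURS and at most 24 h at 12:00 UTC.
    """
    # Start from the 12:00 UTC run of the previous calendar day.
    init = datetime(valid_time.year, valid_time.month, valid_time.day,
                    INIT_HOUR_UTC) - timedelta(days=1)
    lead = int((valid_time - init).total_seconds() // 3600)
    # After 12:00 UTC the lead would exceed 24 h for 13:00 onward; those
    # hours still belong to the same run, because the next run is in spin-up.
    return init, lead

init, lead = wrf_run_and_lead(datetime(2020, 7, 1, 3))
print(init, lead)
```

For the two periods in the example, the function returns the 30 June 2020 12:00 run with lead times of 12 h to 15 h and 21 h to 24 h.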

The center point of the simulated area is 37.52°N, 101.33°E, with a grid resolution of 9 km × 9 km and a horizontal grid size of 570 × 500, covering most of the land area and part of the marine area of China (Figure 1a). Considering the detection efficiency of the lightning observatory and the quality of the WRF model products, the region selected for this study is 19°N–42°N, 98°E–126°E, with a grid number of 256 × 256. The area considered for this study is an excerpt of the WRF model domain, covering most of central-eastern China and parts of western China (Figure 1b). The model has 41 vertical levels, and the upper model level is at 50 hPa. The Purdue Lin scheme is used for the microphysical scheme, and the longwave and shortwave radiation schemes are the RRTM and Goddard schemes, respectively. The boundary layer scheme is the MYNN2.5 scheme, and the land surface scheme is the Noah-MP scheme.

Thirteen variables from the WRF model products, such as maximum vertical velocity and CAPE, are selected in this study. They are chosen because lightning is a strong indicator of severe convective weather, and physical variables such as storm helicity, CAPE and precipitation provide indications for lightning nowcasting on a macroscopic scale. Several studies have also shown that physical variables, such as radar echo [35], water vapor [36], ice crystal particles [13] and vertical velocity [37], have strong correlations with the occurrence of lightning. The details of the input variables to the CLSTM-LFN from numerical model products are shown in Table 1.

3. Method

3.1. Preprocessing of Lightning Data

The preprocessing of lightning data consists of two main parts: quality control and increasing the density of the lightning occurrence frequency. The observations of the lightning detection network contain some errors [38], and it cannot be determined whether an isolated lightning detection corresponds to a real flash. Even if such flashes are real, the associated convection is weak and they are difficult to forecast. Therefore, quality control mainly consists of rejecting isolated lightning: a detection is rejected if no other lightning is observed within 20 km of its location.
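The isolated-flash rejection described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names are hypothetical, distances are computed with the haversine formula on a spherical Earth, and all flashes passed in are assumed to belong to the same time window (the text does not state the window used for the neighbor search).

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius (spherical approximation)

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))

def reject_isolated(flashes, radius_km=20.0):
    """Keep only flashes that have at least one other flash within radius_km.

    flashes: list of (lat, lon) tuples, assumed to share one time window.
    """
    kept = []
    for i, (lat_i, lon_i) in enumerate(flashes):
        has_neighbor = any(
            j != i and haversine_km(lat_i, lon_i, lat_j, lon_j) <= radius_km
            for j, (lat_j, lon_j) in enumerate(flashes)
        )
        if has_neighbor:
            kept.append((lat_i, lon_i))
    return kept

flashes = [(30.0, 110.0), (30.05, 110.05), (35.0, 120.0)]
print(reject_isolated(flashes))  # the third, isolated flash is dropped
```

The pairwise search is quadratic in the number of flashes; for large samples a spatial index would be a natural replacement.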

Lightning occurs mainly in mesoscale weather systems, typically on spatial scales of tens to hundreds of kilometers. The common method used in previous studies to mark grid points is to convert the lightning to the nearest grid point. Due to the high resolution of the grid, the meteorological conditions between the grid points adjacent to the lightning occurrence location are relatively similar. After labeling by the above method, the grid points with similar meteorological conditions will be labeled with the opposite result (detection vs. non-detection), which will have a large impact on the training of the neural network model and is not conducive to the convergence of the model. On the other hand, the occurrence of lightning is a small probability event, and statistically, the number of grids in which lightning occurs in the sample used in this study only accounts for 4% of the total number of grids. The process of neural network training gradually reduces the error between the output result and the grid labeling result. The neural network model will recognize lightning as noise if the number of grid points with lightning occurrence is small, resulting in the area of lightning occurrence forecast by the neural network model being smaller than the actual observation. It is necessary to increase the density of the lightning observation frequency to improve the model training results. Therefore, in this study, the number of lightning observations within a 20 km radius of the grid point is used as the lightning occurrence frequency of the grid point.
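A minimal sketch of this labeling step is given below. It assumes, for simplicity, that flash positions have already been projected to planar coordinates in kilometers relative to the grid origin and that the grid spacing is a uniform 9 km; the function name `gridded_frequency` and the coordinate convention are illustrative choices, not taken from the paper.

```python
import math

def gridded_frequency(flashes_km, n_rows, n_cols, spacing_km=9.0, radius_km=20.0):
    """Count, for every grid point, the flashes within radius_km of it.

    flashes_km: list of (x, y) positions in km, with x along columns and
    y along rows, measured from grid point (row 0, column 0).
    """
    freq = [[0] * n_cols for _ in range(n_rows)]
    # Number of grid cells to search on each side of the nearest grid point.
    reach = int(radius_km // spacing_km) + 1
    for x, y in flashes_km:
        col0 = round(x / spacing_km)
        row0 = round(y / spacing_km)
        for r in range(max(0, row0 - reach), min(n_rows, row0 + reach + 1)):
            for c in range(max(0, col0 - reach), min(n_cols, col0 + reach + 1)):
                if math.hypot(c * spacing_km - x, r * spacing_km - y) <= radius_km:
                    freq[r][c] += 1
    return freq

# One flash located exactly on grid point (2, 2) of a 5 x 5 grid.
freq = gridded_frequency([(18.0, 18.0)], n_rows=5, n_cols=5)
print(sum(map(sum, freq)))
```

With these settings, a flash located exactly on a grid point contributes to 13 grid points instead of the single point that nearest-grid-point labeling would mark, which is the densification effect described above.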

3.2. Training Set and Test Set

The input variables of the CLSTM-LFN include the historical lightning occurrence frequency for the 3 h before the forecast start time and the hourly WRF model products for the next 3 h. For example, to forecast lightning from 1 July 2020 03:00 to 1 July 2020 06:00, the input variables to the CLSTM-LFN are the hourly lightning occurrence frequency from 1 July 2020 00:00 to 1 July 2020 03:00 and the hourly WRF model products from 1 July 2020 03:00 to 1 July 2020 05:00 (the WRF model is initialized at 30 June 2020 12:00).

The target output of traditional methods for forecasting lightning occurrence areas is generally the presence or absence of lightning (1 or 0). Grid points with higher lightning occurrence frequencies are often accompanied by stronger convective weather, and several studies have shown a strong positive correlation between radar echoes [39,40], CAPE [41] and lightning occurrence frequency. Among the grid points with lightning, large differences in lightning occurrence frequency correspond to large differences in meteorological conditions, so marking all of them as 1 is, to some extent, not conducive to model convergence. Therefore, although this study focuses only on the location of lightning occurrence, the output of the CLSTM-LFN is the lightning occurrence frequency in the next 3 h, and lightning is considered to occur when the output is greater than a threshold N. A small value of N leads to a higher false alarm rate, and a large value reduces the hit rate. After a series of tests, it is concluded that the CLSTM-LFN achieves a higher hit rate and a lower false alarm rate when N = 5.

Summer is the peak lightning period in the forecast area [42], and the samples selected for this study are from June to August 2020. The data from June and July form the training set (1270 samples), and the data from August form the test set (216 samples); each sample contains an hourly lightning forecast for the next 3 h. Since the CLSTM-LFN has 660,252 parameters, too few training samples can easily lead to overfitting, so data augmentation is used to improve the model performance [43,44,45]. In this study, the input variables and labels of the training set were simultaneously rotated by 90°, 180° and 270° in the horizontal plane to generate three additional training sets (5080 samples in total).
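The rotation-based augmentation can be sketched as follows. This is an illustrative pure-Python version in which each sample is represented as a list of square 2D fields (one per time step or variable); the helper names `rotate90` and `augment` are hypothetical, and the rotation direction (counterclockwise) is an arbitrary choice, since the text does not specify one.

```python
def rotate90(field):
    """Rotate a square 2D field (list of rows) by 90 degrees counterclockwise."""
    n = len(field)
    return [[field[c][n - 1 - r] for c in range(n)] for r in range(n)]

def augment(inputs, labels):
    """Return the original (inputs, labels) pair plus 90/180/270-degree copies.

    inputs, labels: lists of square 2D fields. Inputs and labels are rotated
    together so that they stay spatially aligned.
    """
    pairs = [(inputs, labels)]
    for _ in range(3):
        inputs = [rotate90(f) for f in inputs]
        labels = [rotate90(f) for f in labels]
        pairs.append((inputs, labels))
    return pairs

field = [[1, 2],
         [3, 4]]
pairs = augment([field], [field])
print(len(pairs))        # four versions of each training sample
print(1270 * len(pairs)) # total number of training samples after augmentation
```

Because the study area is a square 256 × 256 grid, rotations by multiples of 90° preserve the array shape, so the rotated samples can be fed to the network unchanged.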

3.3. Neural Network Structure

According to the characteristics of video forecasting and the data structure used in this study, the neural network model needs to obtain the lightning trends from historical lightning occurrence frequencies and merge them in time and space with the numerical products from the WRF model. Therefore, this study designs the lightning nowcasting model Convolutional Long- and Short-term Memory Lightning Forecast Net (CLSTM-LFN) based on ConvLSTM, which consists of two networks, CNN and ConvLSTM Net, for extracting features from the WRF model products and for composing spatiotemporal sequence forward prediction.

In the input variables to the CLSTM-LFN, the temporal dimension of the historical lightning occurrence frequency is 3, and the north–south and east–west dimensions are consistent with the number of grid points in the study area, which is 256 × 256. Thus, the dimension of the historical lightning occurrence frequency is [3, 256, 256, 1], namely [time, rows, columns, variables].

The WRF model products are hourly fields of 13 variables, such as maximum reflectivity (R_max) and CAPE. Because the lightning occurrence frequency and the WRF model products contain different numbers of variables, they cannot be merged directly into a single spatiotemporal sequence along the time dimension. A two-dimensional convolutional network, called CNN Net, is therefore first applied to the WRF model products to extract features and compress the number of variables to 1. CNN Net processes the WRF model products for each hour separately, so its input has no temporal dimension; the north–south and east–west dimensions are again 256 × 256, giving input dimensions of [256, 256, 13], namely [rows, columns, variables]. The convolved results are then merged with the lightning occurrence frequency along the time dimension, covering the period from 3 h before to 3 h after the forecast start time and forming a spatiotemporal sequence of dimensions [6, 256, 256, 1]. Subsequently, ConvLSTM Net extracts features in time and space and performs spatiotemporal sequence prediction to produce hourly nowcasts of lightning occurrence areas for the next 0 to 3 h. The network structure and data flow are detailed in Figure 2.

The structures of CNN and ConvLSTM Net are shown in Figure 3. CNN Net consists of 4 convolutional layers: the number of convolutional kernels in each layer is (128, 64, 32, 1), the convolutional kernel dimension is 2 × 2, the stride is 1 and the activation function is a rectified linear unit (ReLU) [46]. Meanwhile, the padding module is added in the convolution process to keep the grid resolution constant before and after the convolution. For the WRF model products after the convolution and reshaping operation of CNN Net, the 13 variables were compressed into 1 variable, and the time dimension was added. Then, the tensor dimension changed from [256, 256, 13] to [1, 256, 256, 1]. The ConvLSTM Net consists of the ConvLSTM2D and Conv3D modules with (128, 64, 32, 1) and (32, 16, 1) convolution kernels, respectively, and both of the activation functions are ReLU, where ConvLSTM2D has a convolutional kernel size of 2 × 2 and a stride of 1. The last layer of the network is set to return all sequences (return_sequences = True) to obtain a spatiotemporal sequence of time length 6. Since the purpose of this study is to forecast lightning hourly for the next 3 h, the Conv3D module is added with a convolution kernel size of 2 × 2 × 1 and a stride of 1. The convolution operation is performed on pairs in the time dimension to compress the length of the time series from 6 to 3.

3.3.1. 2D and 3D Convolution Layers

Convolutional neural networks are one of the important models in the field of deep learning and can extract features from the original image. The early classical CNN model LeNet-5 was designed using the error back propagation algorithm, and after continuous improvement, progress has been made in face recognition and robot navigation [47,48,49,50,51].

The output feature map of a convolution layer is obtained by applying a set of convolution kernels to the feature maps of the previous layer and passing the result through an activation function. The two-dimensional and three-dimensional convolution layers act on planar and volumetric inputs, respectively, and the formula is expressed as follows:

x_{j}^{l} = f\left(\sum_{i \in M_{j}} x_{i}^{l-1} * k_{ij}^{l} + b_{j}^{l}\right) \qquad (1)

where $x_{j}^{l}$ is the output feature map, $f(\cdot)$ is the activation function, $*$ is the convolution operation, $k_{ij}^{l}$ is the convolution kernel, $b_{j}^{l}$ is the bias and $M_{j}$ is the set of input variables.
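To make Equation (1) concrete, the following pure-Python sketch evaluates one 2D convolution layer with a ReLU activation. It is a didactic example rather than the Keras layer used in the study. Two assumptions are made: the "convolution" is implemented as cross-correlation, as is usual in deep learning libraries, and zero padding is applied on the bottom and right edges so that a 2 × 2 kernel with stride 1 keeps the grid size unchanged.

```python
def relu(value):
    """Rectified linear unit, the activation f in Equation (1)."""
    return max(0.0, value)

def conv2d_same(inputs, kernels, biases):
    """Evaluate x_j = relu(sum_i x_i * k_ij + b_j) for every output map j.

    inputs:  list of input maps x_i, each a list of rows
    kernels: kernels[j][i] is the 2D kernel k_ij linking input i to output j
    biases:  biases[j] is the scalar bias b_j
    Positions outside the input are treated as zeros (bottom/right padding).
    """
    rows, cols = len(inputs[0]), len(inputs[0][0])
    outputs = []
    for j, bias in enumerate(biases):
        out = [[0.0] * cols for _ in range(rows)]
        for r in range(rows):
            for c in range(cols):
                total = bias
                for i, x in enumerate(inputs):
                    for u, kernel_row in enumerate(kernels[j][i]):
                        for v, weight in enumerate(kernel_row):
                            if r + u < rows and c + v < cols:
                                total += x[r + u][c + v] * weight
                out[r][c] = relu(total)
        outputs.append(out)
    return outputs

x = [[[1, 2],
      [3, 4]]]                  # one input map
k = [[[[1, 0],
       [0, 1]]]]                # one output map, one 2 x 2 kernel
print(conv2d_same(x, k, biases=[0.0]))
```

In CNN Net, the same operation is applied with 13 input maps and (128, 64, 32, 1) output maps in successive layers, which is how the 13 WRF variables are compressed into a single feature map.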

3.3.2. ConvLSTM

Shi et al. (2015) proposed ConvLSTM networks based on fully connected LSTM networks (FC-LSTM) [31]. FC-LSTM is good at processing data in which earlier and later elements of a sequence are strongly correlated and is often used for time series prediction. In addition to their temporal continuity, radar and precipitation data have strong spatial structure, and this spatial information can be lost when they are processed with FC-LSTM. ConvLSTM replaces the fully connected operations of FC-LSTM with convolution operations to extract features, which allows it to capture sufficient spatial information; the structure diagram is shown in Figure 4. In the diagram, $i_{t}$, $f_{t}$ and $o_{t}$ represent the input gate, forget gate and output gate, respectively; $x_{t}$, $h_{t}$ and $c_{t}$ represent the input variable, hidden variable and storage unit (cell state), respectively; t represents the time step of the network; σ is the sigmoid function with an output range of (0, 1); tanh represents the hyperbolic tangent function with an output range of (−1, 1); W and b are the weights to be trained and the bias, respectively.
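For reference, the ConvLSTM update equations in the form given by Shi et al. (2015) are reproduced below using the symbols defined above, where $*$ denotes convolution and $\circ$ the element-wise (Hadamard) product. The terms involving $W_{ci}$, $W_{cf}$ and $W_{co}$ are the peephole connections of the original formulation; whether the implementation used in this study retains them is not stated here, so they should be read as part of the general formulation rather than as a description of CLSTM-LFN.

```latex
\begin{aligned}
i_{t} &= \sigma\left(W_{xi} * x_{t} + W_{hi} * h_{t-1} + W_{ci} \circ c_{t-1} + b_{i}\right) \\
f_{t} &= \sigma\left(W_{xf} * x_{t} + W_{hf} * h_{t-1} + W_{cf} \circ c_{t-1} + b_{f}\right) \\
c_{t} &= f_{t} \circ c_{t-1} + i_{t} \circ \tanh\left(W_{xc} * x_{t} + W_{hc} * h_{t-1} + b_{c}\right) \\
o_{t} &= \sigma\left(W_{xo} * x_{t} + W_{ho} * h_{t-1} + W_{co} \circ c_{t} + b_{o}\right) \\
h_{t} &= o_{t} \circ \tanh\left(c_{t}\right)
\end{aligned}
```

The only structural difference from FC-LSTM is that the input-to-state and state-to-state transitions are convolutions, so $x_{t}$, $h_{t}$ and $c_{t}$ remain gridded fields rather than flattened vectors.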

3.4. Network Training

In this study, the deep learning framework Keras was used for network construction and training, which integrates several mainstream deep learning algorithms and has the advantages of simplicity and high modularity. The Adam optimizer was used in the training process, and the default settings were used for the parameters [52]. The loss function is the mean square error (MSE) with the following expression.

MSE = \frac{1}{m} \sum_{i=1}^{m} \left(y_{i} - f(x_{i})\right)^{2} \qquad (2)

where m is the number of samples, and $y_{i}$ and $f(x_{i})$ are the observed value (true value) and the neural network model prediction of the ith sample, respectively.
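Equation (2) can be written directly in Python. The sketch below treats the observations and predictions as flat sequences of numbers; applying it to gridded output would require flattening the fields first, which is an assumption about usage rather than a description of how Keras computes the loss internally.

```python
def mean_squared_error(observed, predicted):
    """Mean square error between observed values y_i and predictions f(x_i)."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(y - f) ** 2 for y, f in zip(observed, predicted)]
    return sum(squared_errors) / len(squared_errors)

print(mean_squared_error([0, 2, 4], [0, 1, 6]))
```

Because the errors are squared, grid points with large lightning frequencies dominate the loss, which is consistent with the choice of regressing on frequency rather than on a binary label.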

The neural network model is trained and predicted using NVIDIA’s general-purpose parallel computing architecture CUDA and the graphics processor Tesla V100. Experimental results show that CLSTM-LFN can complete the read-in of input data and the prediction of hour-by-hour lightning occurrence areas for the next 3 h in less than 5 min to meet the operational requirements.

3.5. Controlled Experimental Design

To investigate the effect of a single input data source on the CLSTM-LFN model and to compare it with traditional lightning forecasting methods, the control experiment consists of a lightning parameterization scheme and an empirical forecasting method, and different data sources are input to CLSTM-LFN to test the impact. The experimental design is as follows:

PR92: The PR92 parameterization scheme predicts lightning according to the relationship between lightning occurrence frequency F and cloud top height H [15]. The cloud top height is determined based on the thresholds of radar echo (20 dBZ) and temperature (0 °C), and the forecast results of this parameterization scheme are included in the WRF model. The PR92 scheme contains both land and ocean scenarios, and the forecast equation on land is

F = 3.44 \times 10^{-5} \times H^{4.9} \qquad (3)
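As a quick illustration of Equation (3), the land formula can be evaluated as below. The units are an assumption taken from the original PR92 formulation rather than from the text above: cloud top height H in kilometers and flash frequency F in flashes per minute. The function name is hypothetical.

```python
def pr92_land_flash_rate(cloud_top_height_km):
    """PR92 land relationship F = 3.44e-5 * H**4.9.

    Assumed units: H in km, F in flashes per minute.
    """
    if cloud_top_height_km < 0:
        raise ValueError("cloud top height must be non-negative")
    return 3.44e-5 * cloud_top_height_km ** 4.9

for height_km in (8.0, 10.0, 12.0):
    print(height_km, round(pr92_land_flash_rate(height_km), 2))
```

The steep power law means that small errors in the modeled cloud top height translate into large errors in the predicted flash rate, which is relevant to the forecast bias discussed for this control experiment.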

dBZ_from_WRF: Based on the statistical relationship between lightning and radar echoes, a study has shown that lightning usually occurs if a reflectivity of 40 dBZ occurs at altitudes where temperatures are <0 °C [53]. Therefore, the forecast results of the numerical model products for radar echo and temperature were used to forecast the lightning occurrence area.

CLSTM-LFN-O: A variant of the CLSTM-LFN model with historical lightning occurrence frequency as single input data, trained using only ConvLSTM Net due to the lack of WRF model products.

CLSTM-LFN-W: The same network structure as CLSTM-LFN-O, with input data containing only WRF model products.

4. Forecast Results

4.1. Nowcasting Results and Scoring Test

The nowcasting results were evaluated with three types of scores: threat score (TS) [54], false alarm rate (FAR) and probability of detection (POD) [55]. As mentioned in the previous section, lightning usually occurs in mesoscale convective systems with spatial scales typically of tens to hundreds of kilometers, so the scores in this study are calculated with the neighborhood method. If the forecast value at a grid point is greater than the threshold N and lightning is observed within 2 grid points of that point, the forecast is counted as a hit. For each experiment, a series of tests is conducted to determine the threshold N that yields a high hit rate and a low false alarm rate.
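A minimal sketch of neighborhood-based scoring is given below. Only the definition of a hit comes from the text; the rest is an assumption made for illustration: the neighborhood is taken as a square of ±2 grid points, a false alarm is a forecast point with no observed lightning in its neighborhood, a miss is an observed point with no forecast lightning in its neighborhood, and the scores use the standard contingency-table formulas. The function name is hypothetical.

```python
import math

def neighborhood_scores(forecast, observed, threshold=5, radius=2):
    """Compute TS, POD, FAR and bias with a square neighborhood.

    forecast: 2D list of forecast lightning frequencies
    observed: 2D list of truthy/falsy values marking observed lightning
    """
    rows, cols = len(forecast), len(forecast[0])
    fc = [[forecast[r][c] > threshold for c in range(cols)] for r in range(rows)]
    ob = [[bool(observed[r][c]) for c in range(cols)] for r in range(rows)]

    def any_nearby(mask, r, c):
        """True if mask is set anywhere within `radius` grid points of (r, c)."""
        for rr in range(max(0, r - radius), min(rows, r + radius + 1)):
            for cc in range(max(0, c - radius), min(cols, c + radius + 1)):
                if mask[rr][cc]:
                    return True
        return False

    hits = false_alarms = misses = 0
    for r in range(rows):
        for c in range(cols):
            if fc[r][c]:
                if any_nearby(ob, r, c):
                    hits += 1
                else:
                    false_alarms += 1
            if ob[r][c] and not any_nearby(fc, r, c):
                misses += 1

    def ratio(numerator, denominator):
        return numerator / denominator if denominator else math.nan

    return {
        "TS": ratio(hits, hits + misses + false_alarms),
        "POD": ratio(hits, hits + misses),
        "FAR": ratio(false_alarms, hits + false_alarms),
        "bias": ratio(hits + false_alarms, hits + misses),
    }

forecast = [[0] * 5 for _ in range(5)]
observed = [[0] * 5 for _ in range(5)]
forecast[0][0] = 10   # forecast near the observed flash: counted as a hit
forecast[4][4] = 10   # forecast far from any observation: false alarm
observed[1][1] = 1
print(neighborhood_scores(forecast, observed))
```

Relaxing the spatial match in this way rewards forecasts that place lightning in approximately the right location, which suits a phenomenon organized on scales much larger than the 9 km grid spacing.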

The scores of CLSTM-LFN and the control experiments are shown in Table 2. Regardless of which data source is used, the forecasts of the deep learning models are better than those of the radar-echo-based empirical formula and the PR92 parameterization scheme, indicating that CLSTM-LFN is more suitable for identifying the complex nonlinear evolution of lightning. The TS scores of CLSTM-LFN and CLSTM-LFN-O decrease with increasing forecast time, in both cases by more than 50%. The TS scores of CLSTM-LFN-W are approximately 0.11 at all forecast times, with a small range of variation. At a forecast time of 0 to 1 h, both CLSTM-LFN and CLSTM-LFN-O produce better forecasts, because the position and morphology of the lightning change little at the beginning of the forecast and the historical lightning observations provide more valid information. At forecast times of 1 to 3 h, CLSTM-LFN, which merges the WRF model products, outperforms the other models. Compared with CLSTM-LFN-O, the TS scores of the three hourly forecasts within 0 to 3 h are improved by 9.75%, 5.23% and 10.09%, respectively. PR92 and dBZ_from_WRF failed to achieve effective lightning forecasts, with both having TS scores less than 0.1, because the numerical model forecasts of radar echoes and cloud top height are biased and the statistical relationships between these physical variables and lightning occurrence do not transfer well to this setting.

4.2. Case Study

Lightning at 24 August 2020 15:00 (UTC) occurred mainly in south-central Guizhou, eastern Yunnan and northwestern Hubei, with a zonal distribution, and the convective system then gradually moved to the southeast (Figure 5). The nowcasting results of CLSTM-LFN and CLSTM-LFN-O show that the changes in the convective system are small at the initial forecast time, and both achieve effective forecasts for 0 to 1 h lightning (Figure 5a,d). For the 1 to 3 h period, the forecast range of CLSTM-LFN-O is smaller than the observed range, and more misses occur in the border area of Guizhou and Guangxi (red lines in Figure 5e,f). The CLSTM-LFN forecasts of lightning are closer to the observed results owing to the indication provided by the R_max products of the numerical model (Figure 5b,c). However, in the border area of Jiangsu and Anhui, both CLSTM-LFN and CLSTM-LFN-O produced false alarms (blue lines in Figure 5) because of the moderate values of CAPE in the numerical model products (Figure 5j–l) and a small amount of lightning at the initial forecast moment.

Further performance diagrams were used to quantitatively evaluate the forecast results (Figure 6), and TS, POD, FAR and bias scores are included in the diagram [56]. The TS scores of CLSTM-LFN are higher than those of CLSTM-LFN-O at different forecast times. The POD scores for the CLSTM-LFN hourly forecast results are 0.81, 0.79 and 0.81, and the POD scores for the CLSTM-LFN-O hourly forecast results are 0.70, 0.67 and 0.62. CLSTM-LFN forecasts a larger range of lightning occurrences and a higher FAR, so the bias values are all greater than 1.

In the case of 7 August 2020 03:00, lightning mainly occurred in the central-eastern part of the Sichuan region (Figure 7). Because the lightning occurrence frequency input to CLSTM-LFN-O was low in this area, the historical observations gave little indication, and CLSTM-LFN-O failed to achieve an effective forecast of the lightning occurrence area (Figure 7d–f). The numerical model products show high CAPE at the lightning occurrence location (Figure 7g,i), and the CLSTM-LFN forecast is significantly improved by the interaction of multisource data. At 05:00 on 7 August 2020 in the southern part of Zhejiang, the numerical model products also forecasted high CAPE, but CLSTM-LFN and CLSTM-LFN-O failed to forecast lightning in this region because no lightning was present at the initial moment (blue lines in Figure 7).

The performance diagram (Figure 8) shows that the forecasting results of CLSTM-LFN are significantly better than those of CLSTM-LFN-O. The bias values of CLSTM-LFN are all close to 1.0, and the model maintains a high probability of detection and a low false alarm rate. The bias values of CLSTM-LFN-O are less than 1.0 because the forecasted lightning area is small. The POD scores for the CLSTM-LFN hourly forecasts are 0.59, 0.56 and 0.44, and those for the CLSTM-LFN-O hourly forecasts are 0.13, 0.08 and 0.01.

In the two cases above, the convective systems evolved slowly, the WRF model products provided effective information, and the CLSTM-LFN forecast results were satisfactory. In the period from 06:00 to 08:00 on 19 August 2020, CLSTM-LFN successfully captured a wide range of lightning from northeastern Shandong to eastern Guizhou and forecast the gradually expanding lightning area in northern Anhui fairly accurately (blue line in Figure 9c). However, CLSTM-LFN failed to forecast the strip of lightning in southeastern Shaanxi and the cluster of lightning in southeastern Jiangxi (red lines in Figure 9b,c). In southeastern Shaanxi, there was only scattered lightning at 06:00, and the lightning area then expanded rapidly from 07:00 to 08:00. The cluster in southeastern Jiangxi was concentrated at the border between southern Jiangxi and Fujian at the initial moment, and the convective system then moved rapidly to the northwest, which weakened the indication provided by the historical lightning observation frequency.

In the WRF model products, the areas where lightning was missed likewise show neither strong radar echoes nor high CAPE values (Figure 9d,e). Taken together, the weakened indication of historical lightning frequency and the bias of the numerical model products are important reasons for the CLSTM-LFN lightning misses.

5. Variable Importance Analysis

During neural network training, different input variables contribute differently to the model, and identifying the variables with greater influence is important for understanding how lightning develops. Permutation importance is a common method for variable importance analysis [57]. The main idea is to first train the neural network with all input variables and calculate the test scores of the original forecasts. Each input variable is then randomly reordered and fed to the trained model, and the scores of the new forecasts are calculated. The importance of an input variable is judged by how much the scores change after the reordering. Based on the input variables of CLSTM-LFN, three sets of tests were designed as follows.

Exp_obs: The historical lightning occurrence frequency is randomly reordered as a whole, which produces false observations in areas where no lightning occurs and removes lightning from areas where it does occur.

Exp_WRF_whole: The 13 variables of the WRF model products are randomly reordered as a whole to compare their relative importance with that of historical lightning occurrence frequency (Exp_obs).

Exp_WRF_sequence: The 13 variables of the WRF model products are randomly reordered one at a time to compare the influence of individual physical variables on the forecast results.
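The permutation procedure behind these three tests can be sketched on toy data. The example below is an illustration only: `toy_model` is a hypothetical stand-in for a trained network, each sample is reduced to two scalar inputs named `obs` and `cape`, and the reordering is assumed to be done across samples. None of these names or choices come from the CLSTM-LFN code.

```python
import random


def threat_score(pred, obs):
    """TS = hits / (hits + misses + false alarms) for boolean sequences."""
    hits = sum(p and o for p, o in zip(pred, obs))
    misses = sum((not p) and o for p, o in zip(pred, obs))
    false_alarms = sum(p and (not o) for p, o in zip(pred, obs))
    denom = hits + misses + false_alarms
    return hits / denom if denom else 0.0


def toy_model(sample):
    # Hypothetical "trained" model: it relies only on the "obs" input.
    return sample["obs"] > 0.7


def relative_importance(samples, labels, variable, rng):
    """Relative drop in TS after shuffling one input variable across samples."""
    ts_original = threat_score([toy_model(s) for s in samples], labels)
    values = [s[variable] for s in samples]
    rng.shuffle(values)  # break the link between this variable and the labels
    shuffled = [dict(s, **{variable: v}) for s, v in zip(samples, values)]
    ts_shuffled = threat_score([toy_model(s) for s in shuffled], labels)
    return (ts_original - ts_shuffled) / ts_original


rng = random.Random(0)
samples = [{"obs": rng.random(), "cape": rng.random()} for _ in range(500)]
labels = [s["obs"] > 0.7 for s in samples]

imp_obs = relative_importance(samples, labels, "obs", rng)
imp_cape = relative_importance(samples, labels, "cape", rng)
print(imp_obs, imp_cape)
```

Because the toy model ignores `cape`, shuffling it leaves the score unchanged and its importance is zero, while shuffling `obs` removes most of the skill. In the real experiments the model is kept fixed in the same way and only the inputs are reordered.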

The relative importance {TS}_{relative}^{t, p} was used to determine the importance of the variables based on the change in TS scores before and after the disruption (4), with higher relative importance indicating that the variable is more important [58]. Furthermore, based on the calculation of {TS}_{relative}^{t, p}, we define the variable r_{relative}^{t} to compare the relative importance of historical lightning occurrence frequency and WRF model products at different forecast times (5).

{TS}_{relative}^{t, p} = ({TS}_{original}^{t, p} - {TS}_{shuffled}^{t, p}) / {TS}_{original}^{t, p}    (4)

r_{relative}^{t} = {TS}_{relative}^{t, WRF_whole} / {TS}_{relative}^{t, obs}    (5)

where t represents the forecast time, p is the disrupted forecast variable, {TS}_{shuffled}^{t, p} are the TS scores for Exp_obs, Exp_WRF_whole and Exp_WRF_sequence at different forecast times, and {TS}_{original}^{t, p} is the original forecast score without random reordering (TS scores for CLSTM-LFN in Section 4.1).
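Equations (4) and (5) reduce to two one-line functions. In the sketch below, the original TS is the CLSTM-LFN 1 h score from Table 2, whereas the two shuffled TS values are hypothetical placeholders chosen only to show the arithmetic; they are not results from the study.

```python
def ts_relative(ts_original, ts_shuffled):
    """Equation (4): fractional drop in TS after a variable is shuffled."""
    return (ts_original - ts_shuffled) / ts_original


def r_relative(ts_rel_wrf_whole, ts_rel_obs):
    """Equation (5): importance of WRF products relative to lightning observations."""
    return ts_rel_wrf_whole / ts_rel_obs


ts_original = 0.518      # CLSTM-LFN 1 h TS (Table 2)
ts_shuffled_obs = 0.10   # hypothetical TS for Exp_obs (placeholder, not from the paper)
ts_shuffled_wrf = 0.45   # hypothetical TS for Exp_WRF_whole (placeholder, not from the paper)

rel_obs = ts_relative(ts_original, ts_shuffled_obs)
rel_wrf = ts_relative(ts_original, ts_shuffled_wrf)
ratio = r_relative(rel_wrf, rel_obs)
print(f"rel_obs = {rel_obs:.3f}, rel_wrf = {rel_wrf:.3f}, r = {ratio:.3f}")
```

With these placeholder values the ratio is below 1, meaning that shuffling the observations costs more skill than shuffling the WRF products. A ratio that grows with t indicates that the WRF products gain weight relative to the observations.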

The results of Exp_obs and Exp_WRF_whole showed that the relative importance of both variables decreased with increasing forecast time in the 0 to 3 h period (color bar in Figure 10). The calculated r_{relative}^{t} increases with increasing forecast time, indicating that the importance of numerical model products increases gradually compared with historical lightning occurrence frequency (blue line in Figure 10). The reason is that, as the forecast time increases, the lightning location and intensity keep changing, so the valid information provided by historical observations decreases and the meteorological conditions represented by the numerical model carry relatively more weight.

The results of Exp_WRF_sequence showed that the importance of each variable decreases with increasing forecast time (Figure 11). The relative importance of CAPE remains above 0.5 and is significantly greater than that of other physical variables, while the relative importance of microphysical variables, such as QGRAUP and R_max, decreases more rapidly with increasing forecast time. Generally, a larger CAPE tends to produce stronger convective activity [59], and several studies have shown a strong correlation between CAPE values and thunderstorms [60,61], so CAPE provides an indication of lightning occurrence at the level of the atmospheric circulation. With increasing forecast time, the deviation of the microphysical variables forecasted by the numerical model gradually increases, causing their relative importance to decrease rapidly. However, the type, content and spatial distribution of ice-phase particles are correlated to some extent with the location of lightning [62,63,64], so these variables still provide a reference for lightning occurrence at the microphysical level and remain indispensable inputs to the nowcasting model.

As the case in Section 4.2 shows (Figure 9c,e), lightning usually occurs in areas of high CAPE. However, this does not mean that lightning will necessarily occur wherever CAPE is high. The correspondence between them is not strong enough, and effective lightning nowcasting still needs the joint indication of multisource data.

6. Conclusions

The lightning nowcasting model CLSTM-LFN is obtained by merging the lightning observation frequency and WRF model products into a spatial–temporal sequence and training with feature extraction and video prediction methods. Batch forecast tests for August 2020 show that CLSTM-LFN can achieve effective forecasts of 0 to 3 h lightning occurrence areas after merging multisource data, which is a significant improvement over a single input data source and the traditional lightning parameterization scheme. However, both historical lightning occurrence frequency and numerical model products are less indicative during the initiation and dissipation phases of lightning, and the CLSTM-LFN forecast results for such processes still need to be improved.

The relative importance analysis of the CLSTM-LFN input variables showed that the indication provided by numerical model products gradually increases with forecast time relative to that of historical lightning occurrence frequency, but both provide effective indication within the 0 to 3 h forecast time. The numerical model forecasts of CAPE have smaller errors than those of the microphysical variables, so the relative importance of CAPE is significantly greater than that of other input variables.

As the forecast time increases, the lightning forecast depends more on the numerical model products. How to extract useful information from the large number of numerical model products and extend the forecast time will be the focus of future work.

Author Contributions

Conceptualization, S.G., J.W. and Y.Y.; Data curation, S.G.; Investigation, S.G.; Methodology, S.G., J.W. and Y.Y.; Project administration, Y.Y.; Writing—original draft, S.G.; Writing—review & editing, S.G., J.W., R.G., Z.Y. and Y.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Key Research and Development Program of China (2020YFA0608402) and the Natural Science Foundation of Gansu Province of China (21JR7RA501).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The NCEP GFS forecast data are available from the NCAR Research Data Archive (https://rda.ucar.edu/datasets/ds084.1/, accessed on 1 June 2020).

Acknowledgments

This work was supported by the Supercomputing Center of Lanzhou University.

Conflicts of Interest

The authors declare no conflict of interest.

References

Wilson, J.W.; Feng, Y.; Min, C.; Roberts, R.D. Nowcasting Challenges during the Beijing Olympics: Successes, Failures, and Implications for Future Nowcasting Systems. Weather. Forecast. 2010, 25, 1691–1714.
Wilson, J.W.; Crook, N.A.; Mueller, C.K.; Sun, J.; Dixon, M. Nowcasting Thunderstorms: A Status Report. Bull. Am. Meteorol. Soc. 1998, 79, 2079–2100.
Kuk, B.; Kim, H.; Ha, J.; Lee, H.; Lee, G. A Fuzzy Logic Method for Lightning Prediction Using Thermodynamic and Kinematic Parameters from Radio Sounding Observations in South Korea. Weather. Forecast. 2012, 27, 205–217.
Mazany, R.A.; Businger, S.; Gutman, S.I.; Roeder, W. A Lightning Prediction Index That Utilizes GPS Integrated Precipitable Water Vapor. Weather. Forecast. 2002, 17, 1034–1047.
Solomon, R.; Baker, M. Electrification of New Mexico Thunderstorms. Mon. Weather. Rev. 1994, 122, 1878–1886.
Brooks, H.E. Severe Thunderstorms and Climate Change. Atmos. Res. 2013, 123, 129–138.
Klazura, G.E.; Imy, D.A. A Description of the Initial Set of Analysis Products Available from the NEXRAD WSR-88D System. Bull. Am. Meteorol. Soc. 1993, 74, 1293–1312.
Dixon, M.; Wiener, G. TITAN: Thunderstorm Identification, Tracking, Analysis, and Nowcasting—A Radar-Based Methodology. J. Atmos. Ocean. Technol. 1993, 10, 785–797.
Vincent, B.R.; Carey, L.D.; Schneider, D.; Keeter, K.; Gonski, R. Using WSR-88D Reflectivity Data for the Prediction of Cloud-to-Ground Lightning: A Central North Carolina Study. Natl. Wea. Dig. 2004, 27, 35–44.
Lang, T.J.; Rutledge, S.A. Relationships between Convective Storm Kinematics, Precipitation, and Lightning. Mon. Weather. Rev. 2002, 130, 2492–2506.
Martinez, M. The Relationship between Radar Reflectivity and Lightning Activity at Initial Stages of Convective Storms. In Proceedings of the American Meteorological Society, 82nd Annual Meeting, First Annual Student Conference, Orlando, FL, USA, 12–13 January 2002.
Mosier, R.M.; Schumacher, C.; Orville, R.E.; Carey, L.D. Radar Nowcasting of Cloud-to-Ground Lightning over Houston, Texas. Weather. Forecast. 2011, 26, 199–212.
Hayashi, S.; Umehara, A.; Nagumo, N.; Ushio, T. The Relationship between Lightning Flash Rate and Ice-Related Volume Derived from Dual-Polarization Radar. Atmos. Res. 2021, 248, 105166.
Steiger, S.M.; Orville, R.E.; Carey, L.D. Total Lightning Signatures of Thunderstorm Intensity over North Texas. Part I: Supercells. Mon. Weather. Rev. 2007, 135, 3281–3302.
Price, C.; Rind, D. A Simple Lightning Parameterization for Calculating Global Lightning Distributions. J. Geophys. Res. Atmos. 1992, 97, 9919–9933.
Lynn, B.H.; Yair, Y.; Price, C.; Kelman, G.; Clark, A.J. Predicting Cloud-to-Ground and Intracloud Lightning in Weather Forecast Models. Weather. Forecast. 2012, 27, 1470–1488.
Wang, F.; Zhang, Y.; Dong, W. A Lightning Activity Forecast Scheme Developed for Summer Thunderstorms in South China. J. Meteorol. Res. 2010, 24, 631–640.
Jayaratne, E.; Saunders, C.; Hallett, J. Laboratory Studies of the Charging of Soft-Hail during Ice Crystal Interactions. Q. J. R. Meteorol. Soc. 1983, 109, 609–630.
Saunders, C.; Keith, W.; Mitzeva, R. The Effect of Liquid Water on Thunderstorm Charging. J. Geophys. Res. Atmos. 1991, 96, 11007–11017.
Fierro, A.O.; Mansell, E.R.; MacGorman, D.R.; Ziegler, C.L. The Implementation of an Explicit Charging and Discharge Lightning Scheme within the WRF-ARW Model: Benchmark Simulations of a Continental Squall Line, a Tropical Cyclone, and a Winter Storm. Mon. Weather. Rev. 2013, 141, 2390–2415.
Xu, L.; Zhang, Y.; Wang, F.; Zheng, D. Coupling of Electrification and Discharge Processes with WRF Model and Its Preliminary Verification. Chin. J. Atmos. Sci. 2012, 36, 1041–1052.
Sun, J.; Xue, M.; Wilson, J.W.; Zawadzki, I.; Ballard, S.P.; Onvlee-Hooimeyer, J.; Joe, P.; Barker, D.M.; Li, P.-W.; Golding, B.; et al. Use of NWP for Nowcasting Convective Precipitation: Recent Progress and Challenges. Bull. Am. Meteorol. Soc. 2014, 95, 409–426.
Badrinarayanan, V.; Kendall, A.; Cipolla, R. SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 2481–2495.
Yuan, Z.; Zhou, X.; Yang, T. Hetero-ConvLSTM: A Deep Learning Approach to Traffic Accident Prediction on Heterogeneous Spatio-Temporal Data. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK, 19–23 August 2018; pp. 984–992.
Geng, Y.; Li, Q.; Lin, T.; Zhang, J.; Xu, L.; Yao, W.; Zheng, D.; Lyu, W.; Huang, H. A Heterogeneous Spatiotemporal Network for Lightning Prediction. In Proceedings of the 2020 IEEE International Conference on Data Mining (ICDM), Sorrento, Italy, 17–20 November 2020; pp. 1034–1039.
Lin, T.; Li, Q.; Geng, Y.-A.; Jiang, L.; Xu, L.; Zheng, D.; Yao, W.; Lyu, W.; Zhang, Y. Attention-Based Dual-Source Spatiotemporal Neural Network for Lightning Forecast. IEEE Access 2019, 7, 158296–158307.
Zhou, K.; Zheng, Y.; Li, B.; Dong, W.; Zhang, X. Forecasting Different Types of Convective Weather: A Deep Learning Approach. J. Meteorol. Res. 2019, 33, 797–809.
Zhou, K.; Zheng, Y.; Dong, W.; Wang, T. A Deep Learning Network for Cloud-to-Ground Lightning Nowcasting with Multisource Data. J. Atmos. Ocean. Technol. 2020, 37, 927–942.
Oprea, S.; Martinez-Gonzalez, P.; Garcia-Garcia, A.; Castro-Vargas, J.A.; Orts-Escolano, S.; Garcia-Rodriguez, J.; Argyros, A. A Review on Deep Learning Techniques for Video Prediction. IEEE Trans. Pattern Anal. Mach. Intell. 2020.
Hu, A.; Cotter, F.; Mohan, N.; Gurau, C.; Kendall, A. Probabilistic Future Prediction for Video Scene Understanding. In Proceedings of the European Conference on Computer Vision, Glasgow, UK, 23–28 August 2020; pp. 767–785.
Shi, X.; Chen, Z.; Wang, H.; Yeung, D.-Y.; Wong, W.-K.; Woo, W. Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting. In Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada, 7–12 December 2015; pp. 802–810.
Shi, X.; Gao, Z.; Lausen, L.; Wang, H.; Yeung, D.-Y.; Wong, W.; Woo, W. Deep Learning for Precipitation Nowcasting: A Benchmark and A New Model. Adv. Neural Inf. Processing Syst. 2017, 30, 5617–5627.
Yang, X.; Sun, J.; Li, W. An Analysis of Cloud-to-Ground Lightning in China during 2010–13. Weather. Forecast. 2015, 30, 1537–1550.
Xia, R.; Zhang, D.-L.; Wang, B. A 6-Yr Cloud-to-Ground Lightning Climatology and Its Relationship to Rainfall over Central and Eastern China. J. Appl. Meteorol. Climatol. 2015, 54, 2443–2460.
McCaul, W.E.; LaCasse, K.; Goodman, S. Use of High-Resolution WRF Simulations to Forecast Lightning Threat. In Proceedings of the 23rd Severe Storms Conference, St. Louis, MO, USA, 6–10 November 2006.
Guerova, G.; Dimitrova, T.; Georgiev, S. Thunderstorm Classification Functions Based on Instability Indices and GNSS IWV for the Sofia Plain. Remote Sens. 2019, 11, 2988.
Yair, Y.; Lynn, B.; Price, C.; Kotroni, V.; Lagouvardos, K.; Morin, E.; Mugnai, A.; de Llasat, M.C. Predicting the Potential for Lightning Activity in Mediterranean Storms Based on the Weather Research and Forecasting (WRF) Model Dynamic and Microphysical Fields. J. Geophys. Res. Atmos. 2010, 115.
Chen, S.; Du, Y.; Fan, L.; He, H.; Zhong, D. A Lightning Location System in China: Its Performances and Applications. IEEE Trans. Electromagn. Compat. 2002, 44, 555–560.
Hu, M. Assimilation of Lightning Data Using Cloud Analysis within the Rapid Refresh. In Proceedings of the 4th Conference on the Meteorological Applications of Lightning Data, Phoenix, AZ, USA, 13 January 2009; American Meteorological Society: Boston, MA, USA, 2009; p. 29.
Reap, R.M.; MacGorman, D.R. Cloud-to-Ground Lightning: Climatological Characteristics and Relationships to Model Fields, Radar Observations, and Severe Local Storms. Mon. Weather. Rev. 1989, 117, 518–535.
Dewan, A.; Ongee, E.T.; Rafiuddin, M.; Rahman, M.M.; Mahmood, R. Lightning Activity Associated with Precipitation and CAPE over Bangladesh. Int. J. Climatol. 2018, 38, 1649–1660.
Zheng, Y.; Chen, J.; Zhu, P. Climatological Distribution and Diurnal Variation of Mesoscale Convective Systems over China and Its Vicinity during Summer. Chin. Sci. Bull. 2008, 53, 1574–1586.
Howard, A.G. Some Improvements on Deep Convolutional Neural Network Based Image Classification. arXiv 2013, arXiv:1312.5402.
Ratner, A.J.; Ehrenberg, H.R.; Hussain, Z.; Dunnmon, J.; Ré, C. Learning to Compose Domain-Specific Transformations for Data Augmentation. Adv. Neural Inf. Processing Syst. 2017, 30, 3239.
Cubuk, E.D.; Zoph, B.; Mane, D.; Vasudevan, V.; Le, Q.V. AutoAugment: Learning Augmentation Policies from Data. arXiv 2019, arXiv:1805.09501.
Nair, V.; Hinton, G. Rectified Linear Units Improve Restricted Boltzmann Machines. In Proceedings of the ICML, Haifa, Israel, 21–24 June 2010; Volume 27, pp. 807–814.
LeCun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-Based Learning Applied to Document Recognition. Proc. IEEE 1998, 86, 2278–2324.
Tivive, F.H.C.; Bouzerdoum, A. An Eye Feature Detector Based on Convolutional Neural Network. In Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, Sydney, Australia, 28–31 August 2005; Volume 1, pp. 90–93.
Szarvas, M.; Yoshizawa, A.; Yamamoto, M.; Ogata, J. Pedestrian Detection with Convolutional Neural Networks. In Proceedings of the IEEE Intelligent Vehicles Symposium, Las Vegas, NV, USA, 6–8 June 2005; pp. 224–229.
Muller, U.; Ben, J.; Cosatto, E.; Flepp, B.; Cun, Y.L. Off-Road Obstacle Avoidance through End-to-End Learning. In Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada, 4–5 December 2006; pp. 739–746.
Lauer, F.; Suen, C.Y.; Bloch, G. A Trainable Feature Extractor for Handwritten Digit Recognition. Pattern Recognit. 2007, 40, 1816–1824.
Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. arXiv 2017, arXiv:1412.6980.
Buechler, D.E.; Goodman, S.J. Echo Size and Asymmetry: Impact on NEXRAD Storm Identification. J. Appl. Meteorol. Climatol. 1990, 29, 962–969.
Schaefer, J.T. The Critical Success Index as an Indicator of Warning Skill. Weather. Forecast. 1990, 5, 570–575.
Brownlee, K.A. Statistical Theory and Methodology in Science and Engineering; Wiley: Hoboken, NJ, USA, 1965.
Roebber, P.J. Visualizing Multiple Measures of Forecast Quality. Weather. Forecast. 2009, 24, 601–608.
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32.
Zhou, K.; Zheng, Y.; Wang, T. Very Short-Range Lightning Forecasting with NWP and Observation Data: A Deep Learning Approach. Acta Meteorol. Sin. 2021, 79, 1–14.
Moncrieff, M.W.; Miller, M.J. The Dynamics and Simulation of Tropical Cumulonimbus and Squall Lines. Q. J. R. Meteorol. Soc. 1976, 102, 373–394.
Williams, E.R.; Geotis, S.; Renno, N.; Rutledge, S.; Rasmussen, E.; Rickenbach, T. A Radar and Electrical Study of Tropical “Hot Towers”. J. Atmos. Sci. 1992, 49, 1386–1395.
Wissmeier, U.; Goler, R. A Comparison of Tropical and Midlatitude Thunderstorm Evolution in Response to Wind Shear. J. Atmos. Sci. 2009, 66, 2385–2401.
Carey, L.D.; Rutledge, S.A. The Relationship between Precipitation and Lightning in Tropical Island Convection: A C-Band Polarimetric Radar Study. Mon. Weather. Rev. 2000, 128, 2687–2710.
Gauthier, M.L.; Petersen, W.A.; Carey, L.D.; Christian, H.J., Jr. Relationship between Cloud-to-Ground Lightning and Precipitation Ice Mass: A Radar Study over Houston. Geophys. Res. Lett. 2006, 33.
Sherwood, S.C.; Phillips, V.T.; Wettlaufer, J. Small Ice Crystals and the Climatology of Lightning. Geophys. Res. Lett. 2006, 33.

Figure 1. The WRF model domain (a) and the study region range (b); the colors represent the height of the terrain. The abbreviations for Shandong, Shaanxi, Anhui, Sichuan, Hubei, Guizhou, Jiangxi, Fujian, Yunnan, Guangxi, Zhejiang and Jiangsu are SD, SX, AH, SC, HuB, GZ, JX, FJ, YN, GX, ZJ and JS, respectively.

Figure 2. CLSTM-LFN structure and data flow. The numbers in square brackets represent the variable dimensions, and t and t + 1 represent the starting forecast moment and 1 h after the starting forecast moment, respectively.

Figure 3. CNN Net (a) and ConvLSTM Net (b) structures.

Figure 4. ConvLSTM structure.

Figure 5. 24 August 2020 15:00 to 24 August 2020 17:00 CLSTM-LFN forecast results (a–c), CLSTM-LFN-O forecast results (d–f) and WRF model prediction products for R_max (g–i) and CAPE (j–l). Green shading in (a–f) represents lightning forecast results, and black dots represent lightning observations.

Figure 6. Performance diagram of 24 August 2020 15:00 to 24 August 2020 17:00 CLSTM-LFN forecast results (green circles) and CLSTM-LFN-O forecast results (yellow circles). The magenta lines represent the TS scores, the black dashed line represents bias scores and the number in the circle represents the forecast time.

Figure 7. 7 August 2020 03:00 to 7 August 2020 05:00 CLSTM-LFN forecast results (a–c), CLSTM-LFN-O forecast results (d–f) and WRF model products for CAPE (g–i). Green shading represents lightning nowcasting results, and black dots represent lightning observations.

Figure 8. Performance diagram of 7 August 2020 03:00 to 7 August 2020 05:00 CLSTM-LFN forecast results (green circles) and CLSTM-LFN-O forecast results (yellow circles). The magenta lines represent the TS scores, the black dashed line represents bias scores and the number in the circle represents the forecast time.

Figure 9. 19 August 2020 06:00 to 19 August 2020 08:00 CLSTM-LFN forecast results (a–c) and WRF model products for R_max (d) and CAPE (e).

Figure 10. Relative importance of historical lightning occurrence frequency and WRF model products at different forecast times (color bar). The blue line represents r_{relative}^{t}.

Figure 11. Relative importance of each variable of the WRF model products (Exp_WRF_sequence) at different forecast times.

Table 1. Input variables to the CLSTM-LFN from numerical model products and descriptions.

Variables	Description	Units
W_max	maximum vertical velocity component of wind	m/s
helicity	storm relative helicity	m²/s²
RAINNC	accumulated total grid scale precipitation	mm
QVAPOR	water vapor mixing ratio	g/kg
QCLOUD	cloud water mixing ratio	g/kg
QRAIN	rain water mixing ratio	g/kg
QICE	ice mixing ratio	g/kg
QSNOW	snow mixing ratio	g/kg
QGRAUP	graupel mixing ratio	g/kg
CAPE	convective available potential energy	J/kg
R_max	maximum radar reflectivity	dBZ
R₆	radar reflectivity at 6 km above ground level	dBZ
R₉	radar reflectivity at 9 km above ground level	dBZ

Table 2. CLSTM-LFN and multiple controlled experiment forecast results scores.

Experiments	Forecast Time	TS	FAR	POD	Threshold (N)
CLSTM-LFN	1 h	0.518	0.367	0.741	5.0
	2 h	0.342	0.569	0.625
	3 h	0.240	0.693	0.523
CLSTM-LFN-O	1 h	0.472	0.337	0.621	5.0
	2 h	0.325	0.552	0.544
	3 h	0.218	0.666	0.387
CLSTM-LFN-W	1 h	0.114	0.869	0.467	2.0
	2 h	0.112	0.87	0.455
	3 h	0.105	0.873	0.382
PR92	0–3 h	0.053	0.94	0.304	0.0
dBZ_from_WRF	0–3 h	0.007	0.869	0.007	0.0

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
