Abstract

Multimodality brain image registration technology is the key technology to determine the accuracy and speed of brain diagnosis and treatment. In order to achieve high-precision image registration, a fast subpixel registration algorithm based on single-step DFT combined with phase correlation constraint in multimodality brain image was proposed in this paper. Firstly, the coarse positioning at the pixel level was achieved by using the downsampling cross-correlation model, which reduced the Fourier transform dimension of the cross-correlation matrix and the multiplication of the discrete Fourier transform matrix, so as to speed up the coarse registration process. Then, the improved DFT multiplier of the matrix multiplication was used in the neighborhood of the coarse point, and the subpixel fast location was achieved by the bidirectional search strategy. Qualitative and quantitative simulation experiment results show that, compared with comparison registration algorithms, our proposed algorithm could greatly reduce space and time complexity without losing accuracy.

1. Introduction

Medical image registration technology is a widely used image processing technology in the field of medicine image analysis [1]. It plays an important role in human 3D modeling, multisource medical image fusion, the lesion feature detection and extraction, and other auxiliary diagnoses [2]. Brain medical diagnosis has a high demand for accuracy, and brain CT image registration technology is the key technology to determine the accuracy and speed of brain diagnosis and treatment [3]. The research motivation of this article is shown in Figure 1.

Different forms of images express different information and different functions. Combining the two can simultaneously express information from many aspects of the human body in one image. The internal structure and function of the human body can be reflected through the image, providing intuitive human anatomy, physiology, and pathology information. At this time, the image configuration technology needs to solve the problem of position registration of fusion between images. When there is moderate noise in the image and there is translation and scaling between the multimodality images, phase correlation image registration technology is an effective method for subpixel image registration. This paper proposes an improved algorithm based on Guizar-Sicairos registration, which can quickly search for the offset between registered images and greatly reduce the time and space complexity of registration without losing the registration accuracy.

Medical image registration can be divided into single-mode image registration and multimode image registration from the imaging mode. Single modality means that multiple images to be registered are acquired by the same imaging technology, and multimodality means that the registered images are acquired by different imaging technologies [4]. Since the imaging principles of different imaging modes are different, the images they acquire have different characteristics, and the sensitivity to different tissues is also different. Therefore, the object information of different modals is also different [5]. In order to help doctors better understand the disease situation, it is necessary to fuse a variety of multimodality images combined with more information so as to make diagnoses. For example, low-quality US images captured in real time during surgery and high-quality CT (or MRI) images before surgery are used to balance the accuracy and real time required clinically during computer-assisted intervention [6]. However, the premise of multimodal image fusion is to register multimodal images, so multimodal image registration technology is one of the research hotspots in medical image processing and is widely used in modern computer-aided interventional medicine [7]. In clinical applications, increasing clinical needs and image imaging modes have injected new impetus into multimodal medical image registration [8].

In the field of medical analysis, image registration achieves subpixel registration accuracy, but there is a huge challenge, especially in multimodality image registration [9]. In order to effectively register multimodal images, subpixel accurate feature-point location is needed. The inevitable error of the actual acquisition system makes it impossible to obtain completely accurate features of the two images, and the corresponding images are offset, stretched, and rotated in spatial domain. The research result shows that when the deviation of two complex images exceeds 0.1 pixels, the quality of the corresponding feature points will be seriously affected, which seriously affects the registration accuracy. Therefore, the high-precision registration of two multimodalities images is the first step of medical image analysis. We need to further study the key issues of multimodality registration algorithms, find the optimal algorithms that are more in line with clinical development, and further improve them reasonably.

Nowadays, typical image registration methods include registration algorithm based on control points, registration algorithm based on image features, and registration algorithm based on regional cross-correlation [10]. In particular, the registration method based on regional cross-correlation is to use cross-correlation technology to get the relative offset between image pairs, which has good robustness. Since the regional cross-correlation algorithm based on spectral operation improves the efficiency of registration, it is a common fast regional cross-correlation algorithm. Therefore, the phase correlation-based registration method has been widely studied due to its advantages of high accuracy, low computational complexity, small amount of computation, strong antinoise, and optical blur invariance [11]. The mainstream model of brain CT image registration is based on the maximum mutual information as a measurement and combined with efficient and high-precision optimization search algorithm. Because of the large amount of mutual information calculation and the slow speed of registration, the performance of the optimization search algorithm has a great impact on the efficiency and accuracy of registration. As for multimodality brain CT images, it is an effective way to improve the speed and accuracy of registration.

Phase correlation method includes spatial domain-based phase correlation and frequency domain-based phase correlation. The early correlation-registration algorithm mainly uses image translation parameters, and its image registration accuracy can achieve pixel level. Then, on this basis, Fourier–Mellin transform is used to expand image registration to the case of rotation, translation, and scaling, but its registration accuracy is also only pixel level [12]. Chen et al. proposed Fourier transform based on matrix multiplication, which can be used for subpixel registration of multimodality image [13]. Its accuracy is better than that of the traditional subpixel registration method, but the processing efficiency of large-scale multimodality medicine data is not high because the calculation efficiency of this improved method is obviously low in the process of pixel-level displacement. In order to solve the problem of mismatch (inappropriate problem) between registration accuracy and computational complexity, Claus et al. proposed an novel and effective image registration algorithm, which has the same registration accuracy as the standard fast Fourier transform and is considered to be one of the most reliable algorithms in the image registration algorithm based on the phase correlation method. For ease of description, the registration algorithm is called as SSDFT for the image registration algorithm based on the single-step discrete Fourier transform [14].

In this paper, we improve the performance of single-step discrete Fourier transform by reducing the dimension of Fourier transform cross-correlation matrix and the number of DFT matrix multiplication used to locate the peak value. Compared with other studies, the innovation of this study is summarized as follows:(1)Compared with single-step discrete Fourier transform (SSDFT), the improved algorithm proposed in this paper can quickly search the offset between the registered images(2)The method proposed enhances the registration performance of single-step discrete Fourier transform and greatly reduces the time and space complexity of registration without losing the registration accuracy(3)The algorithm proposed is robust and insensitive to noise.

The organization structure of this paper is as follows: Section 2 describes the single-step DFT registration algorithm in detail and introduces the multimodality medical image registration theory and implementation process; Section 3 proposes our improved multimodality registration algorithm, and the process of image registration is given in Section 4. Section 5 selects the different multimodality images with manual transformation to test and verify our proposed registration algorithm and gives the simulation results and analysis; we summarize the whole paper in the last section.

2. Materials and Methods

2.1. Multimodality Brain Image Registration

Image registration refers to comparing and matching two images F(x) and R(y) obtained at different times or under different conditions. According to the spatial transformation relationship obtained from the corresponding point location information of the two images, we can define a similarity measurement function that maximizes the similarity between the two images after the spatial transformation. In other words, each point on the image F(x) has a corresponding unique point on the image R(y), and these two points should be for the same physical space location. The mathematical model of image registration is shown in the following equation:where S is the similarity measurement function; T is the transformation space; R(y) represents the reference image, and represents a transformed frequency domain image. The main task of image registration is to find the optimal spatial transformation function T to make S reach the maximum, which can achieve an exact matching between the registered image and the reference image. The model is written as follows:

The process of registration is also the process of solving the global optimal value of the similarity measure function and its corresponding spatial transformation parameters. The search range of the parameters is called as the search space, and the number of parameters is called the degree of freedom in the spatial transformation model. The number of parameters is related to the spatial transformation model, and the degrees of freedom of different transformation models are also different [15]. Take 3D rigid body transformation as an example, spatial transformation matrix can denoted as , where are the displacement offset of the registered image with respect to the three directions of the coordinate axes ; represent the rotation angles in three directions around the coordinate axes .

The calculation of image registration parameters can be roughly divided into two types: image registration parameters based on grayscale and image registration parameters based on features. Gray-based image registration uses the gray data for registration, which can effectively avoid errors caused by feature extraction [16]. A predefined registration measurement function in the process of medicine image registration is designed to measure the difference between two images and then search for an optimal transformation to maximize the similarity of both images. This method has the characteristics of high accuracy and strong robustness and can achieve automatic registration without preprocessing [17]. Correlation methods and mutual information methods are commonly used at present. The framework of gray-based registration is shown in Figure 2.

Feature-based image registration methods use feature sets extracted from images to establish a correspondence relationship between feature sets [18]. Registration parameter is the key to success, but these methods require manual participation to complete the extraction of image features. Feature-based image registration method is divided into three steps: feature extraction, feature matching, and spatial transformation. Corresponding image features are firstly extracted based on the characteristic of the image, such as corners, edges, and curvature [19]. Secondly, a matching algorithm is used to match the corresponding features between the registered image and the reference image. Finally, the best matching function of the two images is achieved by changing the transformation parameters between the registered image and the reference image. The basic steps of the feature-based image registration method are shown in Figure 3.

In feature-based image registration methods, corner points are often used to express features of medicine image. A corner point is a point where the gray level of an image changes drastically and can also be defined as the intersection of two edges in an image. Corners have the advantages of stability and abundant information. In addition, their corners also have the advantages of rotation invariance, affine invariance, and scale invariance, which is very suitable for matching medical images with different modalities [20].

2.2. Single-Step Discrete Fourier Transform

Correlation algorithm is a statistics-based registration [5]. It is assumed that the time interval between the two images taken successively is small enough, and there is only a small linear displacement between the images [21]; suppose that the gray distribution function of the two images is and , and and are displacement offsets of relative to in and axes, respectively. The normalized mean square error (NRMSE) [22] between and can be expressed as follows:where represents the cross-correlation coefficient between and . The cross-correlation value is defined as follows:where is the Fourier transform of image . In order to get the accurate peak position, the traditional subpixel image registration algorithm has huge storage cost and time consumption because of scaling the image and processing the whole upsampling matrix. In order to overcome this performance limitation, there are two steps proposed by Guizar-Sicairos algorithm [23] to improve the efficiency of registration: (1) the fast Fourier transform (FFT) with the upsampling coefficient  = 2 is used to calculate the cross-correlation surface peak coordinates between images so as to obtain the initial subpixel motion estimation; (2) then, the accurate peak value is searched in a small window area near the initial estimation. Through the discrete Fourier transform of the small window area with the initial estimation as the center and its size as , an upsampling cross-correlation surface is obtained, without the need of zero-padding to the product. For the realization of this process, equation (1) can also be rewritten as the matrix product of size , , and , respectively. Therefore, the peak location can be found on the result matrix with size . It can be seen that, compared with the conventional FFT image registration method, the computational complexity of this method is greatly improved.

3. The Improved Single-Step DFT for Brain Image Registration

Although Guizar-Sicairos algorithm is a novel and fast algorithm for subpixel registration, its main disadvantage is that most of the registration time is spent in the first step to find the initial estimation. To solve this problem, this paper improves the Guizar-Sicairos registration algorithm to reduce the time cost. Therefore, the improved algorithm is to reduce the time of initial estimation of peak position and the time of accurate registration. The framework of brain image registration is shown in Figure 4. Due to the consistency of neighborhood structures in multimodal brain images, the directions around key points where gray values change severely are used as dominant orientations. To adapt for multimodality registration, the SURF descriptor is modified according to the gradient reversals. Due to the great difference between different brain images, the existing algorithm can achieve pixel-level registration. For example, literature [8] makes full use of the flexibility of NSCT for image decomposition and the accuracy of SURF for feature location, as well as the quickness of SURF for feature extraction. The main work of this paper is to design a novel and fast subpixel large-scale translation image registration algorithm. Therefore, rough registration is represented by a blue box, and our proposed method is located in a red box.

Assuming that and are corresponding Fourier transforms, the following equation can be obtained:

The cross-power spectrum of two images is defined as follows:

The translation theorem of Fourier transform ensures that the phase of cross-power spectrum between images is equal to the phase difference between images. If the inverse discrete Fourier transform is applied to in frequency domain, then the unit pulse function will be obtained at :

It can be seen that the amplitude of the transformed surface is almost zero except for the amplitude at , so it can be used to measure the translation between two images. Since the normalization is adopted in formula (7), the phase correlation has excellent anti-interference characteristics. After the above analysis, the right side of formula (4) can be rewritten as follows:where and are the downsampling coefficients along the direction and direction, respectively, and is the Kronecker impulse function. Taking advantage of the selection feature of function , thus equation (5) can be written as follows:

It can be seen that the equation represents the cross-correlation of the images after dimension reduction and sampling, where the size of the cross-correlation matrix before and after sampling is and , respectively. By comparing equation (4) and equation (6), it can be inferred that as long as the peak value falls within the downsampling cross-correlation matrix, the peak positions in the matrix are the same. Therefore, the key idea of the improved algorithm in this paper is to do downsampling of the Fourier transform for the two registered images and then use the same Guizar-Sicairos method to find the initial estimation of the peak position. The complexity of the time complexity of the improved registration algorithm is .

In order to obtain accurate reconstruction based on downsampling in frequency domain, the proposed registration algorithm in this paper needs to calculate an overlapping form of cross-correlation. Because the cross-power spectrum performs K downsampling in two directions and the cross-correlation time spectrum is located near in spatial domain, the overlap of cross-correlation function may change the initial peak position, resulting in the wrong estimation of the downsampling matrix. In order to solve this problem, the improved algorithm in this paper normalizes the original cross-correlation function before downsampling, and its expression can be denoted as follows [8]:

It can be seen that the inverse Fourier transform of the complex exponential is a delta function, which is to say that the position of the single peak is related to the offset . In order to reduce the edge effect, a window function can be used in the Fourier transform to intercept, or zero-padding. Therefore, the peak curve obtained by the cross-correlation function is independent of the image. As long as the peak position in the original normalized cross-correlation before downsampling is less than , the overlapping cross-correlation will not change the initial peak position. Experimental results show that the cross-correlation surface has an obvious peak value, but the peak value surface is very broad, not much larger than other peaks. The cross-correlation surface has a sharp peak value. It means these local peaks may be considered as the position of global peaks in the wrong position area. Therefore, using the inverse Fourier transform of normalized cross-power spectrum, the matching performance will be very accurate and stable even in the presence of noise.

In order to correctly locate the peak value in the overlapping regularization phase correlation matrix, it is assumed that the subpixel translation boundaries along the x-axis and y-axis are and , respectively. If the sampling coefficient is 2, it can be obtained the maximum subpixel displacement and in two directions. In addition, the upper limits of the sampling coefficient are and , which will make the subpixel displacement less than 2. If these upper limits are not satisfied, the initial peak value will fall outside the boundary of the original cross-power spectrum. It will make the peak value near these values and mistake the surface peak position, so this method cannot determine the subpixel displacement robustly.

Through the above theoretical analysis, it can be seen that the improved algorithm proposed in this paper mainly uses the downsampling cross-correlation function to solve the overlapping model and reduces the Fourier transform dimension of the cross-correlation matrix and the multiplication number of the discrete Fourier transform matrix to speed up the registration process.

4. Brain Image Fine-Search Strategy for Registration

The second step of Guizar-Sicairos method is to search for an accurate peak value in a small window area near the initial estimate. Combined with the above analysis, we can achieve the goal of acceleration by reducing the number of multiplication and addition of complex matrix (cumulative multiplication) needed to locate the peak value in the upsampling cross-correlation window. In addition, according to the derivation of literature [14], the correlation function can be written as the product of three matrices, namely:

The aim is to find the subpixel peak in the result matrix . Through the correlation surface analysis, it can be seen that there is a correlation peak in the cross-correlation, and its upsampling form is similar to a parabola shape, and the contour is monotonically increasing. Therefore, the improved algorithm adopts a forward and backward search strategy to reduce the total number of multiplication and addition operations of complex matrix, and its search process is referred to literature [4].

The main steps of this improved algorithm are as follows: (1) coarse positioning: the algorithm uses the improved strategy proposed in Section 3 to calculate the peak coordinates of cross-correlation surfaces between images and obtain the pixel-level translation ; (2) fine positioning: the DFT of improved matrix multiplication and search strategy are used to obtain the -times sampled and neighborhood area of the coarse positioning point , and the pixel-level translation is obtained by calculating the phase correlation of the upsampling area. Considering the resampling multiple , the subpixel-level translation is obtained. Therefore, the translation combined with phase correlation and resampled image registration is written as follows:

In this paper, phase correlation is used to obtain coarse position points in the original image and fine position point is obtained after resampling -times. Because the phase correlation-based coarse positioning has the pixel-level accuracy, setting the fine positioning area as the size of with the coarse positioning point as the center can ensure the subpixel-level accurate positioning point is in this area. In order to obtain high positioning accuracy and not increase too much calculation cost, the inverse Fourier transform will be carried out in the region with the size of to obtain the fine positioning peak value if the upsampling times is taken as 100. In other words, the positioning accuracy will reach 0.001 pixels.

It can be seen from the theoretical analysis that the improved fine-searching process in this paper will also reduce the calculation time for large-scale multimode brain imaging registration. However, both the Guizar-Sicairos algorithm and its improved algorithms use the same inverse DFT matrix, and most of the calculation time is spent on the generation of the matrix, rather than looking for the peak value. Therefore, the proposed improvement algorithm is limited to large-scale multimode brain imaging registration, but its calculation time will not exceed the original registration algorithm.

5. Simulation Experiment and Results

5.1. Experimental Data

This experiment uses the brain PET/CT data provided by the Second Affiliated Hospital of China Medical University (CMU) to experimentally test our proposed registration algorithm. The data obtained by PET/CT imaging equipment include a PET image volume data and a CT image volume data. The specific information of the experimental data is shown in Table 1. In addition, this experiment also applies the registration algorithm to the brain data from Vanderbilt University and will give the registration result so as to verify the accuracy of the proposed algorithm. In order to verify the registration accuracy of multimodal data acquired in the real environment, the CT brain image data acquired from a patient in Guangzhou General Military Hospital (GMH) are also selected, and the registration results are given, where the image sample is shown in Figure 5. The resolution, pixel size, and grayscale range of each image data are shown in Table 1.

5.2. Evaluation Index

In order to measure the registration accuracy well in the experiment, two completely aligned different modal images are selected. We adopted a random deformation operation to one image and then take the deformed image as a floating image (registered image), and the other is a reference image. The introduced random deformation operation is the gold standard to test whether the deformation obtained by registration is accurate. We measure the accuracy of image registration by calculating the difference between the gold standard and the registration estimation. In our experiment, the target registration error (THE) is the valuation index, and its definition is as follows:where and are two images that need to calculate THE; and are the gray values of the same points in two images; is the image domain of ; and is the number of pixels. THE measures the similarity of the two images by calculating the root mean square error of gray value. The smaller the THE value of the reference image and the registered image is, the closer the gray value of the same point is in the two images, the more similar the whole image is, and the better the registration effect is.

5.3. Experiment and Results

In order to verify the effectiveness of our proposed registration algorithm for multimodality brain images, this paper performs experimental simulation on MATLAB 7.6 platform. To fairly test the performance of the improved image registration algorithm in this paper, the Guizar-Sicairos algorithm [23], the RSTT registration algorithm proposed by Foroosh and Balci [21], the normalized cross-correlation algorithm (NCC), and the Fast-RST methods proposed by Zhou et al. [24] were selected. This paper uses the medicine images with different sizes as simulation images, and images with different scales are implemented by interpolation. In this paper, the registration accuracy of the Guizar-Sicairos method and the improved algorithm proposed in this paper are both 0.01 pixels. The calculation time required for the initial coarse peak estimation and fine-search step is shown in Figures 6 and 7. It can be seen that compared with the Guizar-Sicairos method, our proposed registration algorithm in this paper greatly reduces the time to obtain the initial peak position.

For images with different sizes, the sampling coefficient is set to M/32. If the image size is not considered, the time required for peak estimation is roughly the same as the time required for a matrix of size 32 × 32. For large-size images with a size of 512 × 512, the estimated time is close to 2 milliseconds, while the traditional Guizar-Sicairos method requires about 200 seconds. During the fine registration process, our improved method in this paper improves the performance of the Guizar-Sicairos algorithm, as shown in Figure 7. In the entire simulation experiment, the step size is set to , and the performance comparison results of all algorithms are shown in Figure 8.

Overall, the registration time of our proposed registration method is faster than that of other comparison methods. In addition, it can be seen from the experimental results that the registration error of the Guizar-Sicairos method is 0.000471, and the error of our improved method in this paper is 0.000460. The experimental process also shows that our algorithm in this paper can adapt to various translation situations. It only needs to adaptively calculate the sampling step in the initial positioning step, while the improved algorithm in [3] can only process weak translation. In order to quantitatively evaluate the accuracy of the registration algorithm, some evaluation criterion is designed for quantitative analysis. Table 2 and Figure 9 show the effectiveness under different registration algorithms. According to the definition of target registration error (THE), if the evaluation index is close to 1, this registration algorithm is the most accurate [20]. In addition, by changing the required subpixel accuracy, this paper also tests the registration accuracy and calculation time of different comparison algorithm, as shown in Table 3 and Figure 10, where the image size is . Experimental results show that our improved algorithm in this paper will greatly reduce the time and space complexity for multimodality brain image registration and obtain the same subpixel accuracy as the original Guizar-Sicairos algorithm.

5.4. Discussion

Although Guizar-Sicairos algorithm is a novel and fast algorithm for subpixel registration, its main disadvantage is that most of the registration time is spent in the first step to find the initial estimation. To solve this problem, this paper improves the Guizar-Sicairos registration algorithm to reduce the time cost. Therefore, our improved algorithm is to reduce the time of initial estimation of peak position and the time of accurate registration. Due to the consistency of neighborhood structures in multimodal brain images, the directions around key points where gray values change severely are used as dominant orientations. To adapt for multimodality registration, the SURF descriptor is modified according to the gradient reversals. Due to the great difference between different brain images, the existing algorithm can achieve pixel-level registration. For example, literature [8] makes full use of the flexibility of NSCT for image decomposition and the accuracy of SURF for feature location, as well as the quickness of SURF for feature extraction.

Multimodality brain image registration technology is the key technology to determine the accuracy and speed of brain diagnosis and treatment. In order to achieve high-precision image registration, a novel and fast subpixel large-scale translation image registration algorithm was proposed. Firstly, the coarse positioning at the pixel level was achieved by using the downsampling cross-correlation model, which reduced the Fourier transform dimension of the cross-correlation matrix and the multiplication of the discrete Fourier transform matrix, so as to speed up the coarse registration process. Then, the improved DFT multiplier of the matrix multiplication was used in the neighborhood of the coarse point, and the subpixel fast location was achieved by the bidirectional search strategy. Simulation experiment results show that, compared with common image registration algorithms, our proposed algorithm could greatly reduce space and time complexity without losing accuracy.

6. Conclusion

When there is moderate noise in the image and there is translation and scaling between the multimodality images, phase correlation image registration technology is an effective method for subpixel image registration. This paper proposes an improved algorithm based on Guizar-Sicairos registration, which can quickly search for the offset between registered images and greatly reduce the time and space complexity of registration without losing the registration accuracy. Theoretical analysis and experimental verification show that our proposed multimodality brain image registration algorithm has high matching accuracy and antinoise performance, can be well applied to medicine image registration with big-scale translation, and is suitable for medicine analysis engineering applications.

Data Availability

All relevant data are within the paper.

Conflicts of Interest

The authors declare that there are no conflicts of interest.