Next Article in Journal
Path Diversity Improved Opportunistic Routing for Underwater Sensor Networks
Previous Article in Journal
On the Energy Efficiency of On-Off Keying Transmitters with Two Distinct Types of Batteries
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Color Feature-Based Object Tracking through Particle Swarm Optimization with Improved Inertia Weight

1
Chinese Academy of Science, Changchun Institute of Optics Fine Mechanics and Physics, 3888 Dongnanhu Road, Changchun 130033, China
2
University of Chinese Academy of Science, 19 Yuquan Road, Beijing 100049, China
*
Author to whom correspondence should be addressed.
Submission received: 9 March 2018 / Revised: 5 April 2018 / Accepted: 18 April 2018 / Published: 23 April 2018
(This article belongs to the Section Physical Sensors)

Abstract

:
This paper presents a particle swarm tracking algorithm with improved inertia weight based on color features. The weighted color histogram is used as the target feature to reduce the contribution of target edge pixels in the target feature, which makes the algorithm insensitive to the target non-rigid deformation, scale variation, and rotation. Meanwhile, the influence of partial obstruction on the description of target features is reduced. The particle swarm optimization algorithm can complete the multi-peak search, which can cope well with the object occlusion tracking problem. This means that the target is located precisely where the similarity function appears multi-peak. When the particle swarm optimization algorithm is applied to the object tracking, the inertia weight adjustment mechanism has some limitations. This paper presents an improved method. The concept of particle maturity is introduced to improve the inertia weight adjustment mechanism, which could adjust the inertia weight in time according to the different states of each particle in each generation. Experimental results show that our algorithm achieves state-of-the-art performance in a wide range of scenarios.

1. Introduction

In recent years, object tracking technology has been a hot topic in the field of computer vision. For decades, a large number of scholars worldwide have been engaged in the study of object tracking algorithms [1,2,3,4,5,6]. However, object tracking remains a challenging issue due to object complexity, background complexity, and object occlusion. Although there are many sophisticated object tracking algorithms, each algorithm has its own established conditions and application scope.
The algorithms representative of the traditional object tracking field include the centroid tracking algorithm, the related tracking algorithm, the gate tracking algorithm, the optical flow algorithm, and the mean-shift tracking algorithm. The centroid tracking algorithm and the gate tracking algorithm have satisfactory real-time performance and low complexity, but their stability is poor. In contrast, the related tracking algorithm and the optical flow algorithm have satisfactory stability, but they have difficulty guaranteeing real-time. Compared to the advantages and disadvantages of the previous algorithms, the mean-shift tracking algorithm is the most widely used in engineering. However, the mean-shift tracking algorithm is more susceptible to the background, which is problematic since it is easy to lose the target in a complex environment. In particular, there is no robust algorithm proposed for target occlusion. Most of the solutions are confined to a specific environment and require a large amount of research.
In recent years, many new algorithms have emerged in the field of target tracking. The most popular and representative new algorithms include correlation-filtering-based tracking methods and deep-learning-based tracking methods. Typical algorithms for correlation-filtering-based tracking include the kernelized correlation filter (KCF), the discriminative scale space tracker (DSST) and the spatio-temporal context (STC). The framework of these algorithms is roughly the same. In general, the principle of the correlation filter is that a high (correlation peak) is generated for each interested target and a low response is generated for the background in the scene. In 2010, Bolme proposed a Minimum Output Sum of Squared Error (MOSSE) tracking method based on adaptive correlation filtering in Reference [7]. The algorithm is a pioneering work of introducing correlation filters for visual tracking. In 2015, Henriques proposed a kernelized correlation filter (KCF) which introduced a ridge regression problem with a circulant matrix to apply kernel methods [8]. In 2015, Martin replaced the grayscale features in the MOSSE algorithm with the histogram of oriented gradient (HOG) features that were commonly used in the field of tracking and recognition and proposed a DSST tracking method in Reference [9]. The algorithm relieved the scaling issue using the scale pyramid feature and a discriminative correlation filter. However, because of the complexity of the algorithm, the computational efficiency is low. In 2015, deep learning became very interesting in the tracking field. The use of deep learning could better extract the characteristics of the target and thus better express the target. The algorithm proposed in Reference [10] is an influential algorithm in deep learning. The core idea put forward in Reference [10] was based on the basic tracking framework KCF. The author focused on changing the target characteristics and replacing the HOG features with deep convolution features. The paper proposes learning and expressing the target using the layered convolution feature. They adaptively learn correlation filters on each convolutional layer to encode the target appearance and improve tracking precision. However, the drawback of deep learning lies in the training and speed of the network. Even though the hierarchical convolutional features (HCF) and other offline training are still slow, it is difficult to achieve real-time applications.
Modern tracking algorithms have achieved satisfactory results in the tracking effect, but most are very complex and significantly sacrifice the operation efficiency of the algorithm. For engineering applications, we predict that the algorithm will be simple and easy to use due to its better performance than traditional algorithms and the fact that it can guarantee real-time performance.
In the target tracking system, the selection of target features is related to the tracking effect of the subsequent algorithm. Color features are the most widely used visual features in video tracking. The main reason for this is that color is usually related to objects or scenes contained in images, and when the object rotates and varies in size, the color feature can improve the flexibility of the tracking algorithm. In this paper, the most widely used color histogram is used as the target feature. We have a targeted and reasonable weighting on the histogram to improve the robustness of the algorithm in a complex environment.
Particle swarm optimization (PSO) has recently been a fast-growing optimization algorithm based on swarm intelligence. The algorithm has good multi-peak search ability, which means it has a good ability to search for a global optimal value and it converges quickly. When the target is occluded during the tracking process, the object similarity function will appear multi-peak. The particle swarm optimization algorithm is used to optimize the object similarity function, which could cope with the target occlusion tracking problem.
In 1995, Dr. Kennedy and Dr. Eberhart proposed a particle swarm optimization algorithm for the optimization of nonlinear functions [11]. The algorithm has the advantages of simple operation and easy implementation, and is widely used in the fields of neural network training, deep learning, and system optimization [12,13,14]. The operation of the algorithm requires some parameters such as inertia weight and the acceleration coefficient. The choice of parameters directly affects the convergence speed and algorithm accuracy. In 1998, Eberhart and Shi proposed a new parameter called inertia weight for the first time [15]. Simulation results showed that this new parameter had a significant and effective impact on the particle swarm optimizer, and when inertia weight was 0.7298, the algorithm had a satisfactory convergence. In 2006, according to the number of iterations, Chatterjee and Siarry proposed a non-linear decreasing adjustment mechanism to adjust the inertia weight [16]. The mechanism allows the inertia weight to be reduced non-linearly from the maximum value to the minimum value. In 2008, Hongguo Zhu presented a random inertia weight adjustment mechanism to improve its global optimization performance [17]. In 2013, Martins Akugbe Arasomwan proposed two adaptive chaotic inertia weight strategies based on the swarm success rate in Reference [18]. In 2013, a new PSO algorithm known as the Gompertz increasing inertia weight (GIIW) was proposed by Wakeel Ahmed, which had satisfactory performance with quick convergence capability [19].
However, the inertial weight adjustment mechanisms proposed in the above paragraph have some limitations, including a constant or random inertia weight adjustment mechanism that is not universally applicable, the linear or non-linear inertia weight adjustment mechanism adjusts inertial weight only by the number of iterations without taking the state of each particle into account, and the adaptive inertia weight adjustment mechanism is generally adjusted by a feedback parameter. However, the feedback parameter cannot be used as a reliable criterion for evaluating inertial weight.
To deal with these problems, this paper introduces the concept of the particle maturity rate to improve the adjustment of inertia weight by taking full account of the different situations of each particle in each generation. The inertial weight is adjusted accurately according to the state of particles to optimize the solution of the object similarity function. The improved particle swarm optimization object tracking algorithm can reduce the number of iterations to achieve the same fitness, which improves the operation efficiency and realizes the tracking of object occlusion.
The main contributions of this paper are summarized below.
(1)
We use a weighted color histogram as the object feature description. The object is normalized to a unit circle to reduce the contribution of the object edge pixel in the object feature, which effectively solves the problem of non-rigid deformation, rotation, and scale variation during the object tracking. At the same time, the influence of partial occlusion on the object feature description is reduced.
(2)
When the target is blocked in the object tracking process, the object similarity function will reach the “multi-peak” target. Combined with the multi-peak search ability of the particle swarm optimization algorithm, the algorithm is used to optimize the object similarity function to achieve accurate positioning under the object occlusion situation.
(3)
An improved particle swarm optimization object tracking algorithm is proposed. The method solves the limitation of the inertia weight adjustment mechanism when the particle swarm optimization algorithm is applied in the field of object tracking.
(4)
Experimental results show that the particle swarm optimization object tracking algorithm with improved inertia weight can significantly improve computational efficiency.
(5)
Experiments show that the proposed method shows more advantages compared with eleven state-of-the-art algorithms. It is a satisfactory solution to non-rigid deformation, rotation, scale variation, and object occlusion during object tracking.
The rest of the paper is organized as follows. Section 2 introduces the color feature used in the paper and the corresponding similarity measure in detail. The optimized inertia weight adjustment mechanism and the improved particle swarm optimization object tracking algorithm are presented in Section 3. The experimental details of the improved algorithm compared with the eleven most advanced algorithms are discussed in Section 4. In Section 5, we conclude the paper.

2. Related Work

2.1. Color Feature

Ever since color indexing was proposed by Swain and Ballard in 1991 [20], color features began to be widely used in object tracking. The following is a brief introduction to color feature representation and the corresponding similarity measure used in the paper.
The color feature has less dependence on the size, orientation, and viewing angle of the image, which is why it has higher robustness. Firstly, the image-oriented color feature needs to choose the appropriate color space to describe the color features. Then, we quantify the color features. Finally, we need to define a similarity criterion to measure the color similarity between the images. There are several ways to describe color features, such as histograms, color sets, and color moments. The histogram is not the best nonparametric representation, but it is mostly adequate for use in the tracking process. The histogram is the most widely used color feature in many color tracking fields [21,22,23]. When the image features cannot take all possible values, several zero values will appear in the histogram. These zero values have a large effect on the calculation, which cannot accurately match the two images. In order to solve this problem, we can use the weighted color histogram to represent the color features.
A weighted color histogram of the image region in which the center position of the image is ( x n , y n ) and the image height and weight are h n and w n is calculated as the image feature. In order to reduce the influence of the target scale, the target is normalized. { x i * } i = 1 N is the pixel coordinate of the normalized initial object. Then, we choose a center convex monotonically decreasing kernel function k ( x ) as the weight of the pixel to the object center position. We choose the Epanechnikov kernel as kernel function.
k ( x ) = { 1 2 c d 1 ( d + 2 ) ( 1 x 2 ) x 1 0 o t h e r w i s e ,
where c d is a constant. b ( x i * ) represents the pixel-quantized histogram level. The color histogram quantization level is g , and the object weighted color histogram M = { M j | j = 1 g } is represented as Equation (2).
M j = C i = 1 N k ( x * i 2 ) δ ( b ( x i * ) j ) ,
where C is the normalization constant. The weighted histogram of the candidate object is defined by the equation below:
I j = C h i = 1 N k ( x * i 2 ) δ ( b ( x i * ) j ) ,
where C h is the normalization constant.
The tracking method based on color features has the following advantages.
(1)
The amount of the calculation is small. The histogram is a reduced dimension representing the target image, which can reduce the computational complexity of pattern matching.
(2)
The histogram generated by a reasonable color model is insensitive to scale variation and rotation. The weighted color histogram also has great advantages for partial occlusion, especially for edge occlusion.
(3)
Histogram models are well-suited for modeling non-rigid objects. It is difficult to use shape features to represent a non-rigid object. However, the color distribution is still the same, no matter how the non-rigid object deforms.

2.2. Similarity Measurement

A similarity measurement is usually expressed in Hausdorff distance. For two finite point sets X = { x 1 , x 2 , } and Y = { y 1 , y 2 , } , the Hausdorff distance between sets X and Y is defined as the equation below:
d H ( X , Y ) = max { d ( X , Y ) , d ( X , Y ) } ,
where d ( X , Y ) = max x X   min y Y x y is a distance norm. We define the distance d Y ( x ) of point x to point set Y as the minimum distance of point y to each point in point set Y . This means d Y ( x ) = min y Y x y . Then, d Y ( X ) is the distance of each point in point set X to point set Y , and d H ( X , Y ) is the maximum value of d Y ( X ) . If the known object feature is X and the candidate object feature is Y , then d H ( X , Y ) can be used to measure the degree of similarity matching between X and Y .
The Hausdorff measurement has different representations depending on the different selected features. The Hausdorff measurement of the gray features mainly includes the mean square error evaluation function (MSE), the sum of absolute difference (SAD), and the normalized cross correlation evaluation function (NC). For the color features, the Bhattacharyya coefficient can be used to express the Hausdorff measurement based on the color histogram.
It is assumed that the normalized color histogram of the template image is M = { M j | j = 1 g } and the normalized color histogram of the image to be matched is I = { I j | j = 1 g } . g is the color quantization level. The Bhattacharyya coefficient indicates the similarity between M and I , which is defined by Equation (5):
ρ ( M , I ) = j = 1 g M j I j ,
where ρ ( M , I ) [ 0 , 1 ] and when M is the same to I , ρ ( M , I ) = 1 . The larger the value of ρ ( M , I ) , the more similar M is to I .

3. Improved Particle Swarm Optimization Method

3.1. Particle Swarm Optimization

Particle swarm optimization simulates the predation behavior of birds. It is based on the assumption that a flock of birds randomly searches for food when there is only one piece of food in the area. All of the birds do not know where the food is, but they know how far the current location is from food. Then, the best way to find food is to search for the area around the birds that are currently closest to the food. The particle swarm optimization learns from this model and uses it to solve optimization problems. In particle swarm optimization, the solution to each optimization problem is a bird in the search space, which is called the “particle”. Each particle has a fitness value determined by the objective function. The high fitness value indicates that the obtained solution is accurate. Each particle moves in the solution space whose direction and distance will be determined by a velocity. Then, the particles follow the movement of the current optimal particle in the solution space.
The mathematical equations of the particle swarm optimization algorithm are as follows [24]. In the n-dimensional search space, the ith particle position is presented as x i = ( x i 1 , x i 2 , x i n ) T and its velocity is presented as v i = ( v i 1 , v i 2 , v i n ) T . The individual best particle is presented as p b e s t i = ( p b e s t i 1 , p b e s t i 2 , p b e s t i n ) T and the population global best particle is presented as g b e s t = ( g b e s t 1 , g b e s t 2 , g b e s t n ) T . The position and velocity of the particles are updated according to the equations below.
v i d ( t + 1 ) = ω × v i d ( t ) + c 1 r a n d 1 ( p b e s t i d ( t ) x i d ( t ) ) + c 2 r a n d 2 ( g b e s t d ( t ) x i d ( t ) ) ,
x i d ( t + 1 ) = x i d ( t ) + v i d ( t + 1 ) ,
where d = 1 , 2 , n , i = 1 , 2 , P o p S i z e , P o p S i z e is the population size, c1 and c2 are acceleration constants, r a n d 1 and r a n d 2 are two distinct random values in [0, 1], t denotes the algebra of the current iteration, and ω is the inertia weight. The first part of the equation is the momentum of the particle, which can avoid the swing of the particle in the search direction. The second part of the equation indicates the natural tendency of particles to adapt to the environment. The third part of the equation means information sharing between the particle and neighboring groups [25].

3.2. Improved Particle Swarm Optimization Method

The research shows that the particle swarm optimization algorithm has an effective global convergence ability when the inertia weight is large, while the small inertia weight facilitates local convergence. Based on the important influence of inertia weight on the algorithm search behavior, many researchers improve the algorithm optimization performance by changing the inertia weight. At present, the improved method can be divided into a fixed or random inertia weight adjustment mechanism, the linear or non-linear inertia weight adjustment mechanism, and the adaptive inertia weight adjustment mechanism.
In order to further explore the effect of inertia weight on particle swarm optimization, the problem is simplified to a unimodal function optimization. We first define that the particle closest to the optimal solution in the particle swarm is the optimal particle and the other particles are suboptimal particles. It should be noted that the fitness value of suboptimal particles may exceed the fitness value of the optimal particle in the process of the particle swarm approaching the optimal solution. At this point, the suboptimal particle that has a better fitness value becomes the optimal particle. This situation constantly happens during the algorithm operation. In Figure 1, the yellow region represents the area where the suboptimal particles may appear next time. The red region indicates that the whole population can obtain a region closer to the optimal solution, which is called the lifting area. The closer the particle is to the optimal solution, the higher the fitness is. It is obvious that the size of the lifting region is determined by the distance of the optimal particle to the optimal solution.
For the first case shown in Figure 1a, the particle swarm is far from the optimal solution, but the particles are close to each other. When the particles are far from the optimal solution, it is necessary to iterate several times to get closer to the optimal solution. In this case, since the optimal particle needs a large velocity to approach the optimal solution, the value of inertia weight should be relatively large. The velocities of suboptimal particles are determined by their previous velocity and their distance to the optimal particle. Therefore, the suboptimal particles also need a large inertia weight value.
For the second case shown in Figure 1c, the optimal particle is close to the optimal solution and the suboptimal particles also move in the vicinity of the optimal solution. When the suboptimal particles approach the optimal solution at a high speed, they will rush though the lifting area and continue to oscillate in the yellow area, which means the particles will not enter the lifting area. In this case, in order to improve the accuracy of the solution, it is necessary to reduce the velocities of the particles, which means the particles will require a smaller inertia weight value to approach the optimal solution slowly.
Both of these situations may occur during the algorithm operation. Fixed or random inertia weight adjustment mechanisms are not appropriate. The method of adjusting the inertia weight linearly or non-linearly by the number of particle iterations is to use a moderate inertia weight to make the optimization achieve satisfactory results without taking full account of the different situations of each particle in each generation. The adaptive inertia weight adjustment mechanism adjusts the inertia weight according to the feedback parameter. Usually, the feedback parameter will use the fitness value or its derived value. However, the fitness value is not a reliable standard for evaluating inertia weights. When the particle has a high fitness value, it may require different values of inertia weight to adjust because of the different situations of the particle. When the fitness value of the particle is low, it may have different requirements for inertia weight.
In order to solve the above problems, this paper presents an improved inertial weight adjustment mechanism, which introduces the concept of the particle maturity rate as a description of particle states. When the particle maturity rate is relatively high, it indicates that the particles are clustered in a position far from the optimal solution and the whole population is slowly approaching the optimal solution. This is shown in Figure 1a. When the particle maturity rate is relatively low, it shows that the particles oscillate near the optimal solution. This is shown in Figure 1c.
First of all, we define coefficient σ i ( t ) as a value that can indicate the speed of particle maturity. The large coefficient value indicates that the particle matures quickly. σ max ( t ) is the maximum value of σ i ( t ) , i = 1 P o p s i z e .
σ i ( t ) = 1 p b e s t i ( t 1 ) 1 p b e s t i ( t )
The maturity rate of the ith particle at tth iteration is defined by the equation below:
M i ( t ) = { 1 p b e s t i ( t 1 ) σ max ( t ) p b e s t i ( t ) × σ max ( t ) σ i ( t ) > 1 ( p b e s t i ( t ) × σ i ( t ) 2 ) 2 σ i ( t ) 1 ,
where p b e s t i ( t ) is the individual optimal solution found by the ith particle at the tth iteration. Combined with the particle maturity rate formula, the maturity of the entire population can be described by Equation (10):
S m ( t ) = i = 1 P o p S i z e M i ( t ) P o p S i z e .
S m ( t ) [ 0 , 1 ] represents the maturity rate of the entire population. When S m ( t ) is small, it indicates that the particles are mostly in the first situation and, when S m ( t ) is large, it shows that most of the particles are in the second situation. Using S m ( t ) to adjust inertia weight, the status of each particle in each generation can be used as a basis for adjustment. In this paper, S m ( t ) is used to adjust inertia weight linearly and the formula is shown below:
ω ( t ) = ω min + S m ( t ) ( ω m a x ω m i n ) .
The inertia weight varies between the maximum and the minimum. ω max and ω min are selected to be [0.2, 0.8] according to experience. Because S m ( t ) [ 0 , 1 ] , the range of inertia weight is reasonable.

3.3. Improved PSO Object Tracking Method

In this paper, the improved particle swarm optimization object tracking method is combined with the concept of the particle maturity rate to calculate the maturity rate of the entire population by taking into full account the situation of each particle in each generation. The maturity rate of the entire population is used to adjust the inertia weight, which means the adjustment of the inertia weight is more reasonable and the algorithm efficiency is improved. The main steps of the algorithm are provided in Algorithm 1.
Algorithm 1. Improved particle swarm optimization object tracking method
1: Initial parameters of the particle swarm. Initial v i   and   x i .
2: Initialize the target box in the first frame. Determine the object feature description and similarity measure.
3: For frame = 2, 3, ... to the last frame.
4: For t = 1, 2, ... to the maximum number of iterations.
5: Calculate the v i according to Equation (1).
6: Calculate the x i according to Equation (2).
7: If fitness value of the current particle is better compared to individual optimal solution.
8: Update the individual optimal solution p b e s t i = x i .
9: End.
10: If fitness value of the current particle is better compared to global optimal solution.
11: Update global optimal solution, p b e s t i = x i .
12: End.
13: Adjust inertia weight according to Equation (6).
14: End.
15: Locate the target by using the maximum fitness value position.
16: End.

4. Experimental Results and Analysis

4.1. Object Tracking Effect

We performed the experiments on a PC with Intel i5-5200U CPU (2.2 GHz), 8 GB RAM memory, and Windows 7 operating system. The experiments in this paper were implemented in MATLAB 2016. We compared the performance of the improved algorithm with eleven state-of-the-art algorithms including TLD [26], Struck [27], VTD [28], CSK [29], CT [30], DFT [31], OAB [32], MIL [33], KCF [8], DSSL [9], and DLSSVM [34]. The population size in our algorithm was set to 100, the maximum number of iterations was 180, and the parameters were fixed during the whole experiment. For the other eleven algorithms in this paper, we used the code that they have already provided, including the best parameter values. We tested all relevant video sequences in OTB2013 and OTB2015, and some representative tracking results are shown as follows.

4.1.1. Non-Rigid Deformation

For the non-rigid deformation in the object tracking process, Dancer, Dancer2, and Skater were used as the representative test sequences, and the partial tracking results are shown in Figure 2. In the Dancer sequence, the TLD algorithm began to lose the target after the target was deformed for a period of time, which would not accurately track the target. In the Dancer2 sequence, when the object was non-rigidly deformed for a period of time, the TLD and CT algorithms showed tracking frame drift. In the Skater sequence, DFT and DSST algorithms failed to track the target after a period of time after target deformation. During the whole tracking process, our algorithm was more accurate and stable. The image single frame processing speed was better than 23 milliseconds. This can be attributed to the use of color features, which are not sensitive to appearance deformation.

4.1.2. Scale Variation

In Figure 3, Carscale, Human6, and Singer1 were used as the test sequences with scale variation during the object movement. In the Carscale sequence, there are not only obvious changes in scale, but there is partial occlusion. In the 152nd frame, when the target size varies greatly, the algorithm proposed in this paper started to show clear advantages. In the Human6 sequence, DFT and CT algorithms began to lose the object from the 191nd frame. In subsequent image sequences, the target scale changes more significantly and the proposed algorithm could always track the target steadily. In the Singer1 sequence, our algorithm and DSST algorithm showed satisfactory tracking performance compared to other algorithms. Throughout the tracking process, our tracking window could mostly contain the object and the single frame image processing speed was better than 36 milliseconds.

4.1.3. Rotation

Figure 4 shows the tracking results with rotation. Jump, Diving, and Skiing are classic target rotation sequences, and were used as the test image sequences. In the Jump sequence, the target starts rotating from frame 10 and the algorithm proposed in this paper could solve the rotation problem better than other algorithms. In the Diving sequence, starting from frame 76, other algorithms began to lose the target, in addition to the proposed algorithm. In the Skiing sequence, our algorithm and the DLSSVM algorithm showed good performance and achieved a stable tracking target in fast rotation. In the entire process of tracking, the proposed algorithm could stably lock the target. In this scenario, the single frame image processing speed was better than 40 milliseconds.

4.1.4. Occlusion

In order to fully reflect the advantage of the algorithm proposed in this paper in the case of the similarity function occurring at “multi-peak”, we used Suv, Tiger2, Human3, Basketball, and Walking2 as test sequences to track the occluded object. The tracking results are shown in Figure 5. In the Suv sequence, CT and DFT algorithms lost the target when the target passed through the blocking card while the tracking algorithm proposed in this paper could accurately and stably track the target. In the Tiger2 sequence, the object was repeatedly obscured by the green leaves and most of the tracking algorithms deviated from the object. The tracking algorithm proposed in this paper has obvious advantages, and the tracking box covered the object position accurately. In the Human3 sequence, the object is blocked completely for a short period of time. Starting from frame 53, tracking algorithms such as VTD, TLD, and Struck—which are traditionally well-behaved when dealing with occlusion—started to lose the object. However, the tracking algorithm proposed in this paper still stably locked the object. When the object appeared again in the image, the proposed tracking algorithm immediately identified the object to achieve the robustness of the entire tracking process. In the Basketball sequence, the object is heavily or even completely blocked. The algorithm proposed in this paper had good performance in managing the occlusion problem. In the Walking2 sequence, when the target is out of occlusion, only a few algorithms (e.g., our algorithm and the DLSSVM algorithm) could continue to track the target. It can be seen that the improved particle swarm optimization tracking algorithm has an effective anti-occlusion ability, which basically guarantees the real-time value.

4.2. Algorithm Efficiency Comparison

In order to show that the improved particle swarm optimization object tracking method proposed in this paper has fewer iterations and higher tracking efficiency than other particle swarm optimization object tracking methods, the five sets of image sequences in the previous section were compared experimentally using the nonlinear inertia weight adjustment mechanism, the adaptive inertia weight adjustment mechanism, and the improved inertial weight adjustment mechanism, respectively. Because the linear and nonlinear inertia weight adjustment mechanisms are relatively similar, this paper uses the non-linear adjustment mechanism as its representative. We applied different PSO weight adjustment mechanisms to complete 20 experiments for each group of image sequences and recorded the experimental data to calculate the average single frame. When the object similarity function value reached 0.98, the results are shown in Table 1.
Method 1 is a non-linear decreasing inertia weight adjustment method. Method 2 is an adaptive method that dynamically changes the value of the inertia weight. Method 3 is the improved inertial weight adjustment method proposed in this paper.
As can be seen from Table 1, for the Suv sequence, the three methods could basically guarantee robustness and real-time due to less obstruction. However, the computational efficiency of the improved particle swarm tracking algorithm was clearly higher than the nonlinear method and the adaptive method. For Tiger2 sequences, the computational efficiency of Method 1 and Method 2 were significantly reduced because of the more serious occlusion of image sequences. However, the improved method proposed in this paper could basically guarantee real-time tracking under partial occlusion, showing that the improved method has greatly enhanced algorithm efficiency. For Human3, Basketball, and Walking2 sequences, the efficiency of the improved method was obviously better than the non-linear method. Compared with the adaptive method, the improved method significantly reduced the number of iterations when the object was severely obstructed, and it improved the efficiency of the algorithm. Experimental results showed that the improved algorithm could accurately meet the algorithm requirements by adjusting the inertia weight according to the different states of each particle in each generation, which can effectively improve the computational efficiency of the algorithm.

4.3. Algorithm Effect Comparison

In this subsection, we evaluate the algorithm performance with one-pass evaluation (OPE), which is proposed in Reference [35]. We used two assessment criteria, including the success and precision plots. The precision plots are defined as the percentage of frames with the center position error less than the predefined threshold. The success plots represent the percentage of the frames in which the overlap rates of the tracking area and the boundary frame are larger than the threshold. For precision plots, the results at the error threshold of 20 were used for ranking. The area under the curve (AUC) of each success plot was used to rank the tracking algorithms.
Figure 6 shows the overall performance of the twelve trackers. It can be seen that our algorithm ranked first in success plots. In precision plots, our algorithm ranked second, which is only 4.4% worse than the first DLSSVM algorithm. The precision of our algorithm was improved by 9.9% compared to the top-level KCF algorithm in the precision plots. In the success plots, our algorithm achieved an AUC of 0.472 and the performance of our algorithm was equal to the top-ranked DSST algorithm in the related filtering algorithm. The precision and success plots show that our algorithm had superior tracking performance compared to 11 advanced tracking algorithms in terms of the video sequences in OTB2013 and OTB2015.
Figure 7 analyzes the performance of the twelve trackers in various tracking states. For all test video sequences, there were 48 groups belonging to object deformation, 50 groups belonging to object scale variation, 59 groups belonging to the out-of-plane rotation group, 48 groups belonging to the in-plane rotation group, and 54 groups belonging to object occlusion. It should be noted that one group of the image sequence may belong to different kinds of groups at the same time because the group of image sequences may appear in a variety of situations. For example, there is not only scale variation, but also a partial occlusion in the Carscale sequence. Figure 7 shows the precision plots and success plots, respectively, in various situations. Our algorithm stands out in every situation. For the deformation image sequence, our method ranked first in precision plots and success plots, and was significantly better than other methods. In the scale variation image sequence, our method ranked first, with 0.575 in precision plots. In success plots, our algorithm was only 1.7% worse than the DSSL algorithm, which achieved an AUC of 0.438. In the rotation image sequence, whether it is out-of-plane rotation or in-plane rotation, our algorithm performed significantly better than DSST, TLD, KCF, and other methods. In addition to out-of-plane rotation, the DLSSVM algorithm took first place, with an advantage of 1.7%. In the occlusion image sequence, our algorithms ranked after the DLSSVM algorithm both in precision plots and success plots. However, our algorithm still achieved a good score of 0.599 in precision plots. In success plots, our algorithm achieved an AUC of 0.438, which is obviously better than the other ten algorithms such as the DSST and KCF.
It can be seen that the particle swarm tracking algorithm with a color histogram feature has clear advantages in dealing with non-rigid deformation and rotation. This may be due to the fact that the weighted color histogram feature is insensitive to deformation and rotation. Our algorithm also had good performance in dealing with scale variation. However, our algorithm ranked after the DSST algorithm in success plots. As described in Section 1, the DSST algorithm is specifically designed to make effective improvements to scale variation, and thus has excellent performance in dealing with scale variation. Although our algorithm ranked behind the DSST algorithm, it is still superior to other advanced algorithms when dealing with scale variation issues. For the case of occlusion, the DLSSVM algorithm performed better than our algorithm in tracking effects. The DLSSVM algorithm is an improved machine-learning-based tracking algorithm, and has better performance than traditional machine-learning-based algorithms. However, the tracked structure is still complex. The target tracking is achieved through the cooperation of the detector and tracker. A large number of target samples are learned by using the classifier to improve the detection effect. When dealing with occlusion issues, the machine-learning-based tracking algorithm has significant advantages. Our algorithm uses the simplest tracking structure. Particle swarm optimization is used to optimize the target similarity function to achieve a sufficient tracking effect. By improving the inertia weight adjustment mechanism, we improved the efficiency of the algorithm’s operation. Therefore, compared with the DLSSVM algorithm with complex structure, our algorithm had difficulty surpassing the tracking effect, but it had clear advantages in tracking efficiency. Compared to the other ten algorithms, when the object was blocked, the improved particle swarm tracking algorithm could still fully exploit its own characteristics in dealing with the various occlusion situations, and could stably lock the object.

5. Conclusions

This paper presents a new object tracking method. A weighted color histogram normalized to a unit circle is used as the target feature description. Particle swarm optimization is used to optimize the object similarity function. When the target undergoes occlusion, the object similarity function will appear at the multi-peak situation and the multi-peak search ability of the particle swarm optimization algorithm is utilized to realize the accurate positioning in the occlusion situation. When the particle swarm optimization algorithm is applied to object tracking, the inertia weight adjustment mechanism has some limitations, and we present an improved method. We introduce the concept of particle maturity to achieve efficient optimization. The algorithm can adjust the inertia weight in time according to different conditions of each particle in each generation and greatly reduce the amount of computation. The experimental results show that the improved particle swarm optimization object tracking method achieves the same fitness with a smaller number of iterations. The proposed algorithm shows the most advanced performance in a wide range of scenarios.

Acknowledgments

We would like to extend our sincere gratitude to our partner, Jie Guo, for his careful revision and useful advice on this paper. This work is supported by National Nature Science Foundation of China (Grant No. 61705225).

Author Contributions

Siqiu Guo and Tao Zhang conceived and designed the algorithm and the experiments. Siqiu Guo, Yulong Song and Feng Qian were responsible for most of the MATLAB codes for the experiments. Siqiu Guo analyzed the experimental data. Siqiu Guo wrote the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Choi, Y.J.; Kim, Y.G. A Target Model Construction Algorithm for Robust Real-Time Mean-Shift Tracking. Sensors 2014, 14, 20736–20752. [Google Scholar] [CrossRef] [PubMed]
  2. Mao, H.W.; Abousleman, G.P.; Si, J. An integrated multitarget tracking system for interacting target scenarios. Airborne Intell. Surveill. Reconnaiss. (ISR) Syst. Appl. X 2013, 8713, 871314. [Google Scholar]
  3. Wang, X.; Li, T.; Sun, S.; Corchado, J.M. A Survey of Recent Advances in Particle Filters and Remaining Challenges for Multitarget Tracking. Sensors 2017, 17, 2707. [Google Scholar] [CrossRef] [PubMed]
  4. Kang, W.J.; Li, X.Y.; Li, S.Q.; Liu, G.L. Corrected Continuous Correlation Filter for Long-term Tracking. IEEE Access 2018. [Google Scholar] [CrossRef]
  5. Gundogdu, E.; Alatan, A.A. Good Features to Correlate for Visual Tracking. IEEE Trans. Image Proc. 2018. [Google Scholar] [CrossRef]
  6. Yuan, D.; Zhao, G.L.; Li, D.H.; He, Z.Y.; Luo, N. Visual object tracking based on particle filter re-detection. In Proceedings of the IEEE Conference on Security, Pattern Analysis, and Cybernetics (SPAC), Shenzhen, China, 15–17 December 2017. [Google Scholar]
  7. Bolme, D.S.; Beveridge, J.R.; Draper, B.A.; Lui, Y.M. Visual object tracking using adaptive correlation filters. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, 13–18 June 2010. [Google Scholar]
  8. Henriques, J.F.; Caseiro, R.; Martins, P.; Batista, J. High-Speed Tracking with Kernelized Correlation Filters. IEEE Trans. Pattern Anal. Mach. Intell. 2015, 37, 583–596. [Google Scholar] [CrossRef] [PubMed]
  9. Danelljan, M.; Häger, G.; Khan, F.S. Accurate Scale Estimation for Robust Visual Tracking. In Proceedings of the British Machine Vision Conference, Nottingham, UK, 1–5 September 2014. [Google Scholar]
  10. Ma, C.; Huang, J.B.; Yang, X.K.; Yang, M.H. Hierarchical Convolutional Features for Visual Tracking. In Proceedings of the IEEE Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015. [Google Scholar]
  11. Kennedy, J.; Eberhart, R. Particle swarm optimization. In Proceedings of the IEEE International Conference on Neural Networks, Perth, WA, Australia, 27 November–1 December 1995. [Google Scholar]
  12. Gali, V.; Gupta, N.; Gupta, R.A. Improved dynamic performance of shunt active power filter using particle swarm optimization. In Proceedings of the IEEE Conference on Intelligent Techniques in Control, Optimization and Signal Processing (INCOS), Srivilliputhur, India, 23–25 March 2017. [Google Scholar]
  13. Khalifa, M.H.; Anmar, M.; Ouarda, W.; Alimi, A.M. Particle swarm optimization for deep learning of convolution neural network. In Proceedings of the Sudan Conference on Computer Science and Information Technology (SCCSIT), Elnihood, Sudan, 17–19 November 2017. [Google Scholar]
  14. Gupta, A.; Gusain, K.; Kumar, D. Novel approach for incremental learning using ensemble of SVMs with particle swarm optimization. In Proceedings of the IEEE International Conference on Industrial and Information Systems (ICIIS), Roorkee, India, 3–4 December 2016. [Google Scholar]
  15. Shi, Y.; Eberhart, R. A modified particle swarm optimizer. In Proceedings of the IEEE International Conference on Evolutionary Computation Proceedings, Anchorage, AK, USA, 4–9 May 1998. [Google Scholar]
  16. Chatterjee, A.; Siarry, P. Nonlinear inertia weight variation for dynamic adaption in particle swarm optimization. Comp. Oper. Res. 2006, 33, 859–871. [Google Scholar] [CrossRef]
  17. Zhu, H.G.; Zheng, C.W.; Hu, X.H.; Li, X. Adaptive PSO using random inertia weight and its application in UAV path planning. In Proceedings of the Seventh International Symposium on Instrumentation and Control Technology: Measurement Theory and Systems and Aeronautical Equipment, Beijing, China, 13 October 2008. [Google Scholar]
  18. Arasomwan, M.A.; Adewumi, A.O. On adaptive chaotic inertia weights in Particle Swarm Optimization. In Proceedings of the IEEE Symposium on Swarm Intelligence (SIS), Singapore, 16–19 April 2013. [Google Scholar]
  19. Ahmed, W.; Jamil, O.M.; Shirazi, M.F.; Abbasi, M.H. PSO with Gompertz increasing inertia weight. In Proceedings of the IEEE Conference on Industrial Electronics and Applications (ICIEA), Melbourne, VIC, Australia, 19–21 June 2013. [Google Scholar]
  20. Swain, M.J.; Ballard, D.H. Color indexing. Int. J. Comput. Vis. 1991, 7, 11–32. [Google Scholar] [CrossRef]
  21. Zhang, D.; Zhang, J.; Xia, C. Multi-Complementary Model for Long-Term Tracking. Sensors 2018, 18, 527. [Google Scholar] [CrossRef] [PubMed]
  22. Ishii, I.; Tatebe, T.; Gu, Q.Y.; Takaki, T. Color-histogram-based tracking at 2000 fps. J. Electron. Imaging 2012, 21, 013010. [Google Scholar] [CrossRef]
  23. Luo, L.S.; Fan, X.S. Immune particle filter algorithm for target tracking based on histograms of color and oriented gradient. Opt. Sens. Imaging Technol. Appl. 2017, 10462, 104622Q. [Google Scholar] [CrossRef]
  24. Sun, Y.X. Analysis of the effects of the random weights of particle swarm optimization. In Proceedings of the IEEE International Conference on Advances in Computing and Communication Engineering (ICACCE), Durban, South Africa, 28–29 November 2016. [Google Scholar]
  25. Doctor, S.; Venayagamoorthy, G.K. Improving the performance of particle swarm optimization using adaptive critics designs. In Proceedings of the IEEE Conference on Swarm Intelligence Symposium, Pasadena, CA, USA, 8–10 June 2005. [Google Scholar]
  26. Kalal, Z.; Mikolajczyk, K.; Mata, J. Tracking-Learning-Detection. IEEE Trans. Pattern Anal. Mach. Intell. 2012, 34, 1409–1422. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Hare, S.; Saffari, A.; Torr, P.H.S. Struck: Structured Output Tracking with Kernels. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Barcelona, Spain, 6–13 November 2011. [Google Scholar]
  28. Kwon, J.; Lee, K.M. Visual tracking decomposition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), San Francisco, CA, USA, 13–18 June 2010. [Google Scholar]
  29. Henriques, J.F.; Caseiro, R.; Martins, P.; Batista, J. Exploiting the circulant structure of tracking-by-detection with kernels. In Proceedings of the European Conference on Computer Vision (ECCV), Florence, Italy, 7–13 October 2012. [Google Scholar]
  30. Zhang, K.H.; Zhang, L.; Yang, M.H. Real-Time Compressive Tracking. In Proceedings of the European Conference on Computer Vision (ECCV), Florence, Italy, 16–21 June 2012. [Google Scholar]
  31. Sevilla-Lara, L.; Learned-Miller, E. Distribution Fields for Tracking. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012. [Google Scholar]
  32. Grabner, H.; Grabner, M.; Bischof, H. Real-Time tracking via on-line boosting. In Proceedings of the British Machine Vision Conference (BMCV), Edinburgh, UK, 4–7 September 2006. [Google Scholar]
  33. Babenko, B.; Yang, M.H.; Belongie, S. Visual tracking with online multiple instance learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Miami, FL, USA, 20–25 June 2009. [Google Scholar]
  34. Ning, J.; Yang, J.; Jiang, S.; Zhang, L.; Yang, M.-H. Object tracking via dual linear structured svm and explicit feature map. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 4266–4274. [Google Scholar]
  35. Wu, Y.; Lim, J.; Yang, M.H. Online object tracking: A benchmark. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Portland, OR, USA, 23–28 June 2013. [Google Scholar]
Figure 1. The simplified particle motion model. (a) The first case, where the particle swarm is far from the optimal solution, but the particles are close to each other. (b) Another perspective of the first case. (c) The second case, where the optimal particle is close to the optimal solution and the suboptimal particles also move in the vicinity of the optimal solution. (d) Another perspective of the second case.
Figure 1. The simplified particle motion model. (a) The first case, where the particle swarm is far from the optimal solution, but the particles are close to each other. (b) Another perspective of the first case. (c) The second case, where the optimal particle is close to the optimal solution and the suboptimal particles also move in the vicinity of the optimal solution. (d) Another perspective of the second case.
Sensors 18 01292 g001
Figure 2. Tracking results of the proposed method and eleven state-of-the-art tracking methods on sequences Dancer, Dancer2, and Skater. In these sequences, there is a non-rigid deformation problem.
Figure 2. Tracking results of the proposed method and eleven state-of-the-art tracking methods on sequences Dancer, Dancer2, and Skater. In these sequences, there is a non-rigid deformation problem.
Sensors 18 01292 g002
Figure 3. Tracking results of the proposed method and eleven state-of-the-art tracking methods on sequences Carscale, Human6, and Singer1. In these sequences, there is a scale variation problem.
Figure 3. Tracking results of the proposed method and eleven state-of-the-art tracking methods on sequences Carscale, Human6, and Singer1. In these sequences, there is a scale variation problem.
Sensors 18 01292 g003
Figure 4. Tracking results of the proposed method and eleven state-of-the-art tracking methods on sequences Jump, Diving, and Skiing. In these sequences, there is a rotation problem.
Figure 4. Tracking results of the proposed method and eleven state-of-the-art tracking methods on sequences Jump, Diving, and Skiing. In these sequences, there is a rotation problem.
Sensors 18 01292 g004
Figure 5. Tracking results of the proposed method and eleven state-of-the-art tracking methods on sequences Suv, Tiger2, Human3, Basketball, and Walking2. In these sequences, the objects undergo occlusion.
Figure 5. Tracking results of the proposed method and eleven state-of-the-art tracking methods on sequences Suv, Tiger2, Human3, Basketball, and Walking2. In these sequences, the objects undergo occlusion.
Sensors 18 01292 g005
Figure 6. (a) Precision plots and (b) Success plots of one-pass evaluation (OPE) for 12 trackers. The legend reports the performance score for each tracker.
Figure 6. (a) Precision plots and (b) Success plots of one-pass evaluation (OPE) for 12 trackers. The legend reports the performance score for each tracker.
Sensors 18 01292 g006
Figure 7. The precision plots and the success plots of sequences with non-rigid deformation, scale variation, rotation, and occlusion. The performance score of each tracker is shown in the figure.
Figure 7. The precision plots and the success plots of sequences with non-rigid deformation, scale variation, rotation, and occlusion. The performance score of each tracker is shown in the figure.
Sensors 18 01292 g007aSensors 18 01292 g007b
Table 1. Algorithm time-consuming contrast (ms).
Table 1. Algorithm time-consuming contrast (ms).
AttributesMethod 1 Method 2Method 3
Suv65383
Tiger214010226
Human339025235
Basketball51043042
Walking236722239

Share and Cite

MDPI and ACS Style

Guo, S.; Zhang, T.; Song, Y.; Qian, F. Color Feature-Based Object Tracking through Particle Swarm Optimization with Improved Inertia Weight. Sensors 2018, 18, 1292. https://0-doi-org.brum.beds.ac.uk/10.3390/s18041292

AMA Style

Guo S, Zhang T, Song Y, Qian F. Color Feature-Based Object Tracking through Particle Swarm Optimization with Improved Inertia Weight. Sensors. 2018; 18(4):1292. https://0-doi-org.brum.beds.ac.uk/10.3390/s18041292

Chicago/Turabian Style

Guo, Siqiu, Tao Zhang, Yulong Song, and Feng Qian. 2018. "Color Feature-Based Object Tracking through Particle Swarm Optimization with Improved Inertia Weight" Sensors 18, no. 4: 1292. https://0-doi-org.brum.beds.ac.uk/10.3390/s18041292

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop