Article

A Detection Method of Operated Fake-Images Using Robust Hashing

Department of Computer Science, Tokyo Metropolitan University, 6-6 Asahigaoka, Tokyo 191-0065, Japan
* Author to whom correspondence should be addressed.
Submission received: 17 May 2021 / Revised: 29 July 2021 / Accepted: 30 July 2021 / Published: 5 August 2021
(This article belongs to the Special Issue Intelligent Media Processing)

Abstract

SNS providers are known to recompress and resize uploaded images, but most conventional methods for detecting fake/tampered images are not robust against such operations. In this paper, we propose a novel method for detecting fake images that is robust against distortion caused by image operations such as compression and resizing. We select a robust hashing method, originally designed to retrieve images similar to a query image, for fake-/tampered-image detection, and hash values extracted from both reference and query images are used to robustly detect fake images for the first time. When an original hash code computed from a reference image is available for comparison, the proposed method can detect fake images more robustly than conventional methods. One practical application of this method is to monitor images, including synthetic ones, sold by a company. In experiments, the proposed fake-image detection is demonstrated to outperform state-of-the-art methods on various datasets, including fake images generated with GANs.

1. Introduction

Recent rapid advances in image manipulation tools and deep image synthesis techniques have made generating fake images easy. In addition, with the spread of social networking services (SNS), the existence of fake images has become a major threat to the credibility of the international community. Accordingly, detecting fake/tampered images has become an urgent issue [1].
Most methods for detecting forgery assume that images are generated using specific manipulation techniques, so they detect unique traces left by those techniques, such as checkerboard artifacts [2,3,4,5]. In contrast, tampered images are usually uploaded to social networks to be shared with other people. SNS providers are known to resize and recompress uploaded images [6], and such operations by an SNS provider or a malicious person can damage or remove the unique features of tampered images. Accordingly, methods for detecting fake/tampered images generally suffer from the influence of these operations. However, conventional methods are not yet robust enough against the various types of content-preserving transforms applied without malice, such as resizing and compression.
Accordingly, the motivation of this work is to propose a novel method for robustly detecting fake/tampered images even when various operations such as compression and resizing are applied to query images. Figure 1 illustrates real and fake images in two senses, the narrow sense and the broad sense, where images in the broad sense include some noise caused by resizing and image compression. The proposed method aims to robustly detect fake images even under such image operations, which most SNS providers are well known to apply. In this paper, reference images correspond to real images in the narrow sense, so a query image is judged to be fake if it is not a real image in the broad sense.
In this paper, we investigate the possibility of developing a fake-image detection method based on robust hashing for image retrieval. Hash values for fake-image detection are required to be robust against a number of types of image operation, such as image compression and resizing. At the same time, hash values generated by a robust hashing method have to be sensitive to the manipulations used for generating fake images, such as copy-move and generative adversarial networks (GANs). Because of these requirements, we selected Li et al.'s method [7] and applied it to the proposed detection method. The proposed method is a one-class training method as defined by Verdoliva [1]. If there is no original hash code for comparison, the proposed method cannot be used for fake-image detection. In contrast, if an original hash code is available for comparison, it can detect fake images more robustly than conventional methods. In addition, the proposed method allows us to use any images, including ones generated with CNNs, as references (real images), to which other detection methods, such as noiseprint and CNN-based ones, cannot be applied.
One of the practical applications of this method is to monitor images, including synthetic ones, sold by a company. Companies that sell media, including synthetic media, have recently become widespread [8,9,10,11], so customers can easily obtain copyright-free images suitable for their purposes. However, in many cases, the manipulation of these images is limited by an acceptable use policy. The proposed method allows such companies to monitor whether the use of their images complies with the acceptable use policy, because an original hash code can be prepared for comparison and the method can be applied to any images, including synthetic ones. Accordingly, this paper can be positioned as detecting the unauthorized manipulation of existing images, including synthetic ones.
In an experiment, the proposed method with Li et al.'s hashing is demonstrated to outperform state-of-the-art methods on various datasets, including fake images generated with GANs.

2. Related Work

2.1. Fake-Image Generation

Some fake images are manually generated by using image editing tools such as Photoshop. Splicing, copy-move, and deletion are carried out with such a tool. Similarly, resizing, rotating, blurring, and changing the color of an image can be manually carried out.
In addition, recent rapid advances in deep image synthesis techniques have made it possible to automatically generate fake images. CycleGAN [12] and StarGAN [13] are typical image synthesis techniques with GANs. CycleGAN is a GAN that performs a one-to-one transformation, e.g., changing an apple to an orange, while StarGAN is a GAN that performs a many-to-many transformation, such as changing a person’s facial expression or hair color (see Figure 2). Furthermore, fake videos created by using deep learning are called deepfakes, and various tampering methods have emerged, such as those using autoencoders, Face2Face [14], and FaceSwap [15].
Real-world fake images can include alterations from multiple operations applied at the same time, such as image compression combined with resizing, even when the images are generated with GANs. Therefore, we have to take this into account when detecting real-world fake images.

2.2. Fake-Image Detection Methods

The detection of image tampering has a longer history than deep learning. Fragile watermarking [16], the detection of double JPEG compression with statistical methods [17,18], and the use of photo-response non-uniformity (PRNU) patterns unique to each camera [19,20] have been proposed to detect tampering. However, these conventional methods cannot distinguish fake images from images that have merely been processed without malicious intent, such as resized ones. In addition, most of these methods do not consider fake images generated with GANs.
With the development of deep learning, various deep-learning-based fake-image detection methods have been developed. These methods also assume that fake images have unique features caused by the manipulation tool used. Several methods use deep learning to detect fake images generated with an image editing tool such as Photoshop; some of them focus on detecting the boundary between tampered regions and the original image [21,22,23].
Most detection methods with deep learning have been proposed to detect fake images generated by using GANs. An image classifier trained only with ProGAN was shown to be effective in detecting images generated by other GAN models [24]. Other studies have focused on detecting checkerboard artifacts, which arise in two processes: the forward propagation of upsampling layers and the backpropagation of convolutional layers [25]. In reference [25], the spectrum of an image is used as the input in order to capture checkerboard artifacts.
A number of methods have also been investigated for detecting deepfake videos. Some attempt to detect failures that occur during the generation of fake videos, e.g., poorly generated eyes and teeth [26], abnormal blinking frequency [27], and inconsistent facial landmarks [28] or head poses [29]. However, all of these methods have been pointed out to lack robustness against differences between training datasets and test data [1]. In addition, conventional methods are not robust against images with combinations of multiple types of manipulation, e.g., resizing and deepfakes.

3. Proposed Method with Robust Hashing

3.1. Overview

Figure 3 shows an overview of the proposed method. In this framework, robust hash values are calculated from reference images by using a robust hashing method and stored in a database. A robust hash value is likewise calculated from a query image by using the same hashing method. The hash value of the query is compared with those stored in the database, and the query image is judged to be either real or fake in accordance with the distance between hash values.

3.2. Selection of Robust Hashing Methods

Hash values for fake-image detection are required to be robust against a number of types of image operation, such as image compression and resizing, since such operations do not change the content of images, although they reduce image quality. Therefore, we focus on robust hashing methods that aim to robustly retrieve images similar to a query image. In contrast, hash values generated by a robust hashing method have to be sensitive to the manipulations used for generating fake images, such as copy-move and GANs.
Under these requirements, various robust hashing methods [7,30,31,32] are compared in terms of sensitivity and robustness in Section 4.3, where Li et al.'s method [7] is demonstrated to have suitable performance for fake-image detection. Accordingly, we selected Li et al.'s method for fake-image detection.

3.3. Fake Detection with Robust Hashing

Li et al.'s method was designed to robustly retrieve similar images from large collections of images of different sizes, even in the presence of noise. It has the following properties.
(a) Resizing images to 128 × 128 pixels prior to feature extraction.
(b) Performing 5 × 5 Gaussian low-pass filtering with a standard deviation of one.
(c) Using features related to spatial and chromatic characteristics of images.
(d) Outputting a bit string with a length of 120 bits as a hash value.
Property (a) enhances robustness against image resizing, and property (b) reduces the influence of image compression. Property (c) makes hash values sensitive to malicious tampering, and property (d) allows us to efficiently compare hash values with those of reference images.
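As a rough illustration of properties (a) and (b), the following Python sketch, assuming OpenCV is available, reproduces only this front-end normalization; it is not Li et al.'s actual feature extraction, which follows these steps.

```python
import cv2

def preprocess(image_bgr):
    """Front-end normalization corresponding to properties (a) and (b):
    resize to 128 x 128 pixels, then apply a 5 x 5 Gaussian low-pass
    filter with standard deviation 1. The spatial/chromatic feature
    extraction of Li et al.'s method is not reproduced here."""
    resized = cv2.resize(image_bgr, (128, 128), interpolation=cv2.INTER_LINEAR)
    return cv2.GaussianBlur(resized, (5, 5), 1.0)
```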
In the method, the similarity between two images is evaluated in accordance with the Hamming distance between the hash string of a query image and that of each image in a database. Let vectors $u = \{u_1, u_2, \ldots, u_n\}$ and $q = \{q_1, q_2, \ldots, q_n\}$, with $u_i, q_i \in \{0, 1\}$, be the hash strings of reference image $U$ and query image $Q$, respectively. The Hamming distance $d_H(u, q)$ between $U$ and $Q$ is given by

$$d_H(u, q) = \sum_{i=1}^{n} \delta(u_i, q_i), \quad (1)$$

where

$$\delta(u_i, q_i) = \begin{cases} 0, & u_i = q_i \\ 1, & u_i \neq q_i. \end{cases} \quad (2)$$
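For 120-bit hash values stored as integers, Equations (1) and (2) reduce to an XOR followed by a popcount. A minimal Python sketch, assuming this integer representation:

```python
def hamming_distance(u: int, q: int) -> int:
    """Equations (1) and (2): XOR marks the positions where the two
    120-bit hash strings differ, and popcount counts them."""
    return (u ^ q).bit_count()  # Python 3.10+; otherwise bin(u ^ q).count("1")
```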
In this paper, to apply this similarity to fake-image detection, a threshold value $d$ is introduced as follows:

$$\begin{cases} Q \in \overline{U}, & \min\limits_{u \in U} d_H(u, q) < d \\ Q \notin \overline{U}, & \min\limits_{u \in U} d_H(u, q) \geq d, \end{cases} \quad (3)$$

where $U$ is the set of reference images, i.e., the set of real images in the narrow sense, and $\overline{U}$ is the corresponding set of real images in the broad sense, which does not include fake images. $Q$ is judged to be real or fake in accordance with Equation (3).
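A minimal sketch of the decision rule in Equation (3), assuming a database of precomputed reference hashes and the hamming_distance sketch above (the hash extraction itself, i.e., Li et al.'s method, is not reproduced here):

```python
def is_fake(query_hash: int, reference_hashes: list[int], d: int = 3) -> bool:
    """Equation (3): a query is judged fake when its nearest reference
    hash lies at Hamming distance >= d. The default d = 3 is the value
    selected later in Section 4.2 and depends on the dataset."""
    return min(hamming_distance(u, query_hash) for u in reference_hashes) >= d
```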
Note that real images are not limited to images captured by a camera under the proposed framework. Therefore, images generated by using GANs without malice can be used as reference (real) ones.

4. Experimental Results

The proposed fake-image detection method with robust hashing was experimentally evaluated in terms of accuracy and robustness against various types of image operation.

4.1. Experiment Setup

In the experiment, four fake-image datasets—Image Manipulation Dataset [33], UADFV [27], CycleGAN [12], and StarGAN [13]—were used, where each dataset consisted of pairs of a fake image and the corresponding original (real) one, as shown in Table 1 (see Figure 2 and Figure 4). For example, the Image Manipulation Dataset contains 48 real and 48 fake images, so it consists of 96 images in total. Fake images in the Image Manipulation Dataset were generated by using image manipulation tools without GANs, and those in the other datasets were prepared with GANs. Original images were used as reference ones, where a separate reference set was prepared for each experiment with a different fake-image dataset. In contrast, both original and fake images were used as query images, where JPEG compression with a quantization parameter of $Q_J = 80$ was applied to all query images, so real images used as queries included some compression noise.
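For illustration, this query degradation can be reproduced with Pillow as follows; the exact JPEG encoder used in the experiments is an assumption, and Pillow's quality parameter is taken as a stand-in for $Q_J$:

```python
from PIL import Image

def degrade_query(path_in: str, path_out: str, quality: int = 80) -> None:
    """Recompress an image as JPEG, mimicking the Q_J = 80 degradation
    applied to every query image in the experiments."""
    Image.open(path_in).convert("RGB").save(path_out, format="JPEG", quality=quality)
```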
The performance of fake-image detection was evaluated using two assessment criteria: average precision (AP) and F-score. AP is computed by summarizing the precision-recall curve as the weighted mean of precisions achieved at each threshold:

$$\mathrm{AP} = \sum_{j} (R_j - R_{j-1}) P_j, \quad (4)$$

where $P_j$ and $R_j$ are the precision and recall at the $j$th threshold. The F-score is given by

$$F\text{-}\mathrm{score} = \frac{2RP}{R + P}, \quad (5)$$

where $P$ and $R$ are the precision and recall at a selected threshold.
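A minimal sketch of both criteria, assuming binary labels (1 = fake) and a per-query score such as the minimum Hamming distance to the reference hashes (larger means more fake-like):

```python
import numpy as np

def average_precision(labels: np.ndarray, scores: np.ndarray) -> float:
    """Equation (4): weighted mean of precisions over recall steps,
    sweeping the decision threshold down the ranked scores."""
    order = np.argsort(-scores)                    # rank queries, most fake-like first
    labels = labels[order]
    tp = np.cumsum(labels)                         # true positives at each threshold
    precision = tp / np.arange(1, len(labels) + 1)
    recall = tp / labels.sum()
    recall_prev = np.concatenate(([0.0], recall[:-1]))
    return float(np.sum((recall - recall_prev) * precision))

def f_score(precision: float, recall: float) -> float:
    """Equation (5): F-score at a selected threshold."""
    return 2 * recall * precision / (recall + precision)
```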

4.2. Selection of Threshold Value d

An experiment was carried out to select the threshold value $d$ in Equation (3). In the experiment, equal error rate (EER) values were calculated on the datasets in Table 1, where the EER is the intersection of the false acceptance rate (FAR) and false rejection rate (FRR) curves. Figure 5a shows results on the Image Manipulation Dataset and UADFV, and Figure 5b shows results on CycleGAN and StarGAN. The manipulated areas in an image are larger for CycleGAN and StarGAN than for the Image Manipulation Dataset and UADFV, so the EER values for CycleGAN and StarGAN were obtained at a larger threshold value. In the experiment, $d = 3$ was selected as the threshold value in accordance with the EER performance in Figure 5a, although the best threshold value depended on the dataset.
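The EER computation used for this selection can be sketched as follows, assuming arrays holding the minimum Hamming distance of each real query (which should be accepted) and each fake query (which should be rejected):

```python
import numpy as np

def equal_error_rate(real_dists: np.ndarray, fake_dists: np.ndarray):
    """Sweep the threshold d over all possible Hamming distances of a
    120-bit hash. FAR is the fraction of fake queries accepted as real
    (distance < d); FRR is the fraction of real queries rejected
    (distance >= d). The EER lies where the two curves cross."""
    thresholds = np.arange(0, 121)
    far = np.array([(fake_dists < d).mean() for d in thresholds])
    frr = np.array([(real_dists >= d).mean() for d in thresholds])
    i = int(np.argmin(np.abs(far - frr)))
    return int(thresholds[i]), float((far[i] + frr[i]) / 2.0)
```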

4.3. Robust Hashing

Hash values for fake-image detection are required to be robust against a number of types of image operation, such as image compression. In addition, they have to be sensitive to the manipulations used for generating fake images, such as copy-move and GANs. From these requirements, we selected four candidate robust hashing methods [7,30,31,32] that are robust against image compression.
To compare the four methods, each was applied within the framework of the proposed method on two datasets: the Image Manipulation Dataset and UADFV. As shown in Table 2, Li et al.'s method was confirmed to have the highest F-score among the four methods. This means that Li et al.'s method was more sensitive than the others to the manipulations used for generating fake images. Accordingly, we selected Li et al.'s method for fake detection in this paper.

4.4. Suitability of Li et al.’s Method

A brief analysis of the suitability of Li et al.'s method as the hash function was presented in Section 3.3. That analysis is verified empirically here. Figure 6 shows F-score values on the Image Manipulation Dataset under various $Q_J$ values and resizing ratios. From the figure, the proposed method with Li et al.'s hashing still maintained a high F-score even when such image operations were applied to images. Properties (a) and (b) in Section 3.3 contributed to the robustness against these operations.
In Table 3, the importance of color information for fake-image detection was verified by using two image datasets: the first, referred to as RGB, was the Image Manipulation Dataset, and the second, called grayscale, consisted of grayscale images generated from the Image Manipulation Dataset. From the table, the use of color information was confirmed to achieve higher performance than the use of grayscale images. Li et al.'s method uses chroma as an image feature, so its hash values are sensitive to changes in color. Property (c) in Section 3.3 contributed to this sensitivity for fake-image detection.

4.5. Results without Additional Operation

In this experiment, the proposed method was compared with two state-of-the-art fake detection methods, Wang's method [24] and Xu's method [25], where query images from the datasets were used directly without any additional operation, although real query images were compressed by using JPEG compression with $Q_J = 80$. Wang's method was proposed for detecting images generated by various convolutional neural networks (CNNs), including GAN models, where the classifier was trained by using ProGAN.
Table 4 shows that the proposed method had higher accuracy than the other methods in terms of almost all criteria. In addition, the accuracy of the conventional methods decreased heavily on the Image Manipulation and UADFV datasets. The reason is that the conventional methods focus on detecting fake images generated by using CNNs: the Image Manipulation Dataset does not consist of images generated with GANs, and although UADFV consists of deepfake videos, the videos in the dataset are already affected by video compression.

4.6. Results with Additional Operation

Next, JPEG compression with $Q_J = 70$ and resizing with a scale factor of 0.5 were applied to all query images in the CycleGAN dataset. Copy-move or splicing was also applied to the fake query images. Therefore, when copy-move or splicing was applied, the manipulated fake queries included the influence of two types of manipulation at the same time, i.e., copy-move and CycleGAN or splicing and CycleGAN.
Table 5 shows the experimental results obtained with the additional manipulation, where 50 fake images generated using CycleGAN were used. The proposed method with $d = 7$ was confirmed to still maintain high accuracy even under the additional operations. In contrast, Wang's method suffered from the influence of the additional operations, in particular splicing and resizing. The reason is that the method assumes fake images are generated using CNNs and detects unique features caused by CNNs, whereas splicing and resizing are carried out without CNNs. In addition, both conventional methods had low performance on the Image Manipulation Dataset and UADFV, as demonstrated in Table 4.

4.7. Comparison with Noiseprint Algorithm

The proposed method is a one-class training method [1], so the noiseprint algorithm [34], which is also a one-class training method, was evaluated in terms of robustness against JPEG compression and resizing.
Figure 7 shows an example of the experimental results: Figure 7a shows an original (fake) image with a size of 3488 × 2616 pixels, its ground truth, and the heatmap calculated from the noiseprint of the original image, as in [34], where each color in the heatmap corresponds to a value as shown in Figure 7d. From Figure 7a, it was confirmed that the noiseprint algorithm worked well under this condition. Figure 7b,c show heatmaps calculated under JPEG compression and image resizing, respectively. From these figures, the heatmaps were affected by these operations; in particular, the noiseprint algorithm failed when resizing with a ratio of 0.5 was applied.
Other examples are shown in Figure 8, where real and fake images were selected from three datasets: Image Manipulation, UADFV, and StarGAN, with sizes of 3264 × 2448, 256 × 256, and 256 × 256 pixels, respectively. Although no additional operations were applied to the images, the noiseprint algorithm did not work well for fake-image detection. One of the reasons is that the images used for this experiment are smaller than the one in Figure 7. As described in [34], the noiseprint algorithm sometimes does not work well under certain conditions, such as small or dark images. In addition, a number of attack methods against the noiseprint algorithm have been proposed [35,36]. In contrast, the proposed method successfully detected the fake images shown in Figure 7 and Figure 8.
From the results shown in Table 5 and Figure 7, the state-of-the-art fake-image detection methods and the noiseprint algorithm are not robust enough against various types of content-preserving transformations, such as resizing and compression, although their applications differ from that of the proposed method. From the comparison, the proposed method was demonstrated to outperform the conventional methods in applications where an original hash code of a reference image is available for comparison.

4.8. Computational Complexity

To evaluate the computational complexity of the proposed method, we measured its execution time. The Image Manipulation Dataset [33] was used for the evaluation. The simulation was run on a PC with a 3.6 GHz processor and 16 GB of main memory (see Table 6). In the simulation, the tic and toc functions in MATLAB were used for measuring execution time.
The execution time for computing 48 hash values from 48 queries was 2.25 s in total. In contrast, the execution time for comparing 96 hash values from query images with 48 hash values from reference images was 0.04 s in total. Li et al.'s method outputs a bit string with a length of 120 bits as a hash value, so the comparison between hash values is carried out efficiently.
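For reference, an analogous measurement of the comparison step can be sketched in Python using the hamming_distance sketch from Section 3.3 (the paper itself used MATLAB's tic/toc; the random 120-bit stand-in hashes below are hypothetical):

```python
import random
import time

# Hypothetical stand-ins: random 120-bit values in place of real hashes,
# matching the 96-query / 48-reference comparison measured above.
query_hashes = [random.getrandbits(120) for _ in range(96)]
reference_hashes = [random.getrandbits(120) for _ in range(48)]

start = time.perf_counter()
for q in query_hashes:
    _ = min(hamming_distance(u, q) for u in reference_hashes)
print(f"comparison: {time.perf_counter() - start:.4f} s")
```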

5. Conclusions

In this paper, we proposed a novel fake-image detection method with robust hashing for the first time. Although various robust hashing methods have been proposed to retrieve images similar to a query, the robust hashing method proposed by Li et al. was selected for fake-image detection, considering both its strong robustness against image compression and resizing and its high sensitivity to the manipulations used for generating fake images. When there is an original hash code for comparison, the proposed method can detect fake/tampered images more robustly than conventional methods. In an experiment, the proposed method was demonstrated not only to outperform state-of-the-art methods but also to work well even when multiple operations are combined.

Author Contributions

Conceptualization, M.T. and H.K.; methodology, M.T.; software, M.T.; validation, M.T. and H.K.; formal analysis, M.T. and H.K.; investigation, M.T.; resources, H.K.; data curation, M.T.; writing—original draft preparation, M.T.; writing—review and editing, H.K.; visualization, M.T.; supervision, S.S. and H.K.; project administration, H.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Verdoliva, L. Media Forensics and DeepFakes: An Overview. IEEE J. Sel. Top. Signal Process. 2020, 14, 910–932.
2. Sugawara, Y.; Shiota, S.; Kiya, H. Super-Resolution Using Convolutional Neural Networks Without Any Checkerboard Artifacts. In Proceedings of the IEEE International Conference on Image Processing, Athens, Greece, 7–10 October 2018; pp. 66–70.
3. Sugawara, Y.; Shiota, S.; Kiya, H. Checkerboard artifacts free convolutional neural networks. APSIPA Trans. Signal Inf. Process. 2019, 8, e9.
4. Kinoshita, Y.; Kiya, H. Fixed Smooth Convolutional Layer for Avoiding Checkerboard Artifacts in CNNs. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Barcelona, Spain, 4–8 May 2020; pp. 3712–3716.
5. Osakabe, T.; Tanaka, M.; Kinoshita, Y.; Kiya, H. CycleGAN without checkerboard artifacts for counter-forensics of fake-image detection. Int. Workshop Adv. Imaging Technol. 2021, 11766, 51–55.
6. Chuman, T.; Iida, K.; Sirichotedumrong, W.; Kiya, H. Image Manipulation Specifications on Social Networking Services for Encryption-then-Compression Systems. IEICE Trans. Inf. Syst. 2019, 102, 11–18.
7. Li, Y.N.; Wang, P.; Su, Y.T. Robust Image Hashing Based on Selective Quaternion Invariance. IEEE Signal Process. Lett. 2015, 22, 2396–2400.
8. Adobe Stock, Inc. Stock Photos, Royalty-Free Images, Graphics, Vectors & Videos. Available online: https://stock.adobe.com/ (accessed on 12 July 2021).
9. Shutterstock, Inc. Stock Images, Photos, Vectors, Video, and Music. Available online: https://www.shutterstock.com/ (accessed on 12 July 2021).
10. Stock Images, Royalty-Free Pictures, Illustrations & Videos—iStock. Available online: https://www.istockphoto.com/ (accessed on 12 July 2021).
11. Generated Photos | Unique, Worry-Free Model Photos. Available online: https://generated.photos/# (accessed on 12 July 2021).
12. Zhu, J.Y.; Park, T.; Isola, P.; Efros, A.A. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017.
13. Choi, Y.; Choi, M.; Kim, M.; Ha, J.W.; Kim, S.; Choo, J. StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018.
14. Thies, J.; Zollhofer, M.; Stamminger, M.; Theobalt, C.; Niessner, M. Face2Face: Real-Time Face Capture and Reenactment of RGB Videos. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016.
15. Nirkin, Y.; Masi, I.; Tran Tuan, A.; Hassner, T.; Medioni, G. On Face Segmentation, Face Swapping, and Face Perception. In Proceedings of the IEEE International Conference on Automatic Face Gesture Recognition, Xi'an, China, 15–19 May 2018; pp. 98–105.
16. Ho, A.T.S.; Zhu, X.; Shen, J.; Marziliano, P. Fragile Watermarking Based on Encoding of the Zeroes of the z-Transform. IEEE Trans. Inf. Forensics Secur. 2008, 3, 567–569.
17. Gong, Z.; Niu, S.; Han, H. Tamper Detection Method for Clipped Double JPEG Compression Image. In Proceedings of the International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Adelaide, Australia, 23–25 September 2015; pp. 185–188.
18. Bianchi, T.; Piva, A. Detection of Nonaligned Double JPEG Compression Based on Integer Periodicity Maps. IEEE Trans. Inf. Forensics Secur. 2012, 7, 842–848.
19. Chen, M.; Fridrich, J.; Goljan, M.; Lukas, J. Determining Image Origin and Integrity Using Sensor Noise. IEEE Trans. Inf. Forensics Secur. 2008, 3, 74–90.
20. Chierchia, G.; Poggi, G.; Sansone, C.; Verdoliva, L. A Bayesian-MRF Approach for PRNU-Based Image Forgery Detection. IEEE Trans. Inf. Forensics Secur. 2014, 9, 554–567.
21. Rao, Y.; Ni, J. A deep learning approach to detection of splicing and copy-move forgeries in images. In Proceedings of the IEEE International Workshop on Information Forensics and Security, Abu Dhabi, United Arab Emirates, 4–7 December 2016; pp. 1–6.
22. Bappy, J.H.; Roy-Chowdhury, A.K.; Bunk, J.; Nataraj, L.; Manjunath, B.S. Exploiting Spatial Structure for Localizing Manipulated Image Regions. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017.
23. Zhou, P.; Han, X.; Morariu, V.I.; Davis, L.S. Learning Rich Features for Image Manipulation Detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018.
24. Wang, S.Y.; Wang, O.; Zhang, R.; Owens, A.; Efros, A.A. CNN-Generated Images Are Surprisingly Easy to Spot... for Now. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020.
25. Zhang, X.; Karaman, S.; Chang, S. Detecting and Simulating Artifacts in GAN Fake Images. In Proceedings of the IEEE International Workshop on Information Forensics and Security, Delft, The Netherlands, 9–12 December 2019; pp. 1–6.
26. Matern, F.; Riess, C.; Stamminger, M. Exploiting Visual Artifacts to Expose Deepfakes and Face Manipulations. In Proceedings of the IEEE Winter Applications of Computer Vision Workshops, Waikoloa Village, HI, USA, 7–11 January 2019; pp. 83–92.
27. Li, Y.; Chang, M.; Lyu, S. In Ictu Oculi: Exposing AI Created Fake Videos by Detecting Eye Blinking. In Proceedings of the IEEE International Workshop on Information Forensics and Security, Hong Kong, China, 11–13 December 2018; pp. 1–7.
28. Yang, X.; Li, Y.; Qi, H.; Lyu, S. Exposing GAN-Synthesized Faces Using Landmark Locations. In Proceedings of the ACM Workshop on Information Hiding and Multimedia Security, Paris, France, 3–5 July 2019; pp. 113–118.
29. Yang, X.; Li, Y.; Lyu, S. Exposing Deep Fakes Using Inconsistent Head Poses. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Brighton, UK, 12–17 May 2019; pp. 8261–8265.
30. Gong, Y.; Lazebnik, S.; Gordo, A.; Perronnin, F. Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 2916–2929.
31. Li, Y.; Wang, P. Robust image hashing based on low-rank and sparse decomposition. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, 20–25 March 2016; pp. 2154–2158.
32. Iida, K.; Kiya, H. Robust Image Identification with DC Coefficients for Double-Compressed JPEG Images. IEICE Trans. Inf. Syst. 2019, 102, 2–10.
33. Image Manipulation Dataset. Available online: https://www5.cs.fau.de/research/data/image-manipulation/ (accessed on 10 July 2021).
34. Cozzolino, D.; Verdoliva, L. Noiseprint: A CNN-based camera model fingerprint. arXiv 2018, arXiv:1808.08396.
35. Chen, C.; Xiong, Z.; Liu, X.; Wu, F. Camera Trace Erasing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 14–19 June 2020.
36. Chen, C.; Zhao, X.; Stamm, M.C. Generative Adversarial Attacks Against Deep-Learning-Based Camera Model Identification. IEEE Trans. Inf. Forensics Secur. 2019.
Figure 1. Real and fake images. (a) Real image in broad sense; (b) Fake image in broad sense.
Figure 2. An example of a fake image in the CycleGAN dataset.
Figure 3. An overview of the proposed method.
Figure 4. Examples of fake images in the datasets. (a) Image Manipulation Dataset; (b) UADFV; (c) StarGAN.
Figure 5. The selection of d. (a) Image Manipulation Dataset and UADFV; (b) CycleGAN and StarGAN.
Figure 6. Robustness against compression and resizing (Image Manipulation Dataset). (a) JPEG compression; (b) Resizing.
Figure 7. The robustness of the noiseprint algorithm against compression and resizing. (a) Without operation; (b) JPEG compression; (c) Resizing; (d) Color bar.
Figure 8. The robustness of the noiseprint algorithm without any additional operation. (a) Image Manipulation Dataset (fake); (b) Image Manipulation Dataset (real); (c) UADFV (fake); (d) UADFV (real); (e) StarGAN (fake); (f) StarGAN (real); (g) Color bar.
Table 1. Datasets used in the experiment.

Dataset                           Fake-Image Generation   No. of Real Images   No. of Fake Images
Image Manipulation Dataset [33]   copy-move               48                   48
UADFV [27]                        face swap               49                   49
CycleGAN [12]                     GAN                     1320                 1320
StarGAN [13]                      GAN                     1999                 1999
Table 2. Comparison of robust hashing methods (F-score).

Dataset                      Li et al.'s Method [7]   Modified Li's Method [31]   Gong's Method [30]   Iida's Method [32]
Image Manipulation Dataset   0.9412                   0.8348                      0.7500               0.768
UADFV                        0.8302                   0.6815                      0.6906               0.7934
Table 3. The influence of color information (Image Manipulation Dataset).

Dataset     F-Score
Grayscale   0.7869
RGB         0.9412
Table 4. Comparison with state-of-the-art methods.

                             Wang's Method [24]    Xu's Method [25]      Proposed
Dataset                      AP       F-Score      AP       F-Score      AP       F-Score
Image Manipulation Dataset   0.5185   0.0000       0.5035   0.5192       0.9760   0.9412
UADFV                        0.5707   0.0000       0.5105   0.6140       0.8801   0.8302
CycleGAN                     0.9768   0.7405       0.8752   0.7826       1.0000   0.9708
StarGAN                      0.9594   0.7418       0.4985   0.6269       1.0000   0.9973
Table 5. Comparison with state-of-the-art methods under additional operation (dataset: CycleGAN).

                       Wang's Method [24]    Xu's Method [25]      Proposed
Additional Operation   AP       F-Score      AP       F-Score      AP       F-Score
None                   0.9833   0.7654       0.9941   0.8801       0.9941   0.9800
JPEG (Q_J = 70)        0.9670   0.7407       0.8572   0.7040       0.9922   0.8667
Resize (0.5)           0.8264   0.3871       0.5637   0.6666       0.9793   0.5217
Copy-move              0.9781   0.6400       0.9798   0.8764       1.0000   1.0000
Splicing               0.9666   0.6923       0.9801   0.8666       0.9992   1.0000
Table 6. Machine specifications used for evaluating execution time.

Processor   Intel Core i7-7700, 3.60 GHz
Memory      16 GB
OS          Windows 10 Education Insider Preview
Software    MATLAB R2020a
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
