Discovering Transferable Forensic Features
for CNN-generated Images Detection
Abstract
Visual counterfeits (we refer to CNN-generated images as counterfeits throughout this paper) are increasingly causing an existential conundrum in mainstream media with the rapid evolution of neural image synthesis methods. Though detection of such counterfeits has been a taxing problem in the image forensics community, a recent class of forensic detectors – universal detectors – are surprisingly able to spot counterfeit images regardless of generator architectures, loss functions, training datasets, and resolutions [87]. This intriguing property suggests the possible existence of transferable forensic features (T-FF) in universal detectors. In this work, we conduct the first analytical study to discover and understand T-FF in universal detectors. Our contributions are two-fold: 1) we propose a novel forensic feature relevance statistic (FF-RS) to quantify and discover T-FF in universal detectors, and 2) our qualitative and quantitative investigations uncover an unexpected finding: color is a critical T-FF in universal detectors. Code and models are available at https://keshik6.github.io/transferable-forensic-features/
[Figure: image panels not recovered. Columns: ProGAN [37], StyleGAN2 [41], StyleGAN [40], BigGAN [10], CycleGAN [97], StarGAN [18], GauGAN [62].]
1 Introduction
Visual counterfeits are increasingly causing an existential conundrum in mainstream media [21, 70, 1, 26, 53, 61, 33, 32, 74].
With rapid improvements in CNN-based generative modelling [30, 39, 38, 94, 66, 19, 10, 62, 97, 45, 3, 81, 96, 82, 83, 48, 86, 44], detection of such counterfeits is becoming increasingly challenging and critical.
Nevertheless, a recent class of forensic detectors known as universal detectors are surprisingly able to spot counterfeits regardless of generator architectures, loss functions, datasets and resolutions without any extensive adaptation [87].
For example, the publicly released ResNet-50 [35] universal detector by Wang et al. [87], trained only on ProGAN [37] counterfeits, generalizes surprisingly well to counterfeits from unseen GANs including StyleGAN2 [41], StyleGAN [40], BigGAN [10], CycleGAN [97], StarGAN [18] and GauGAN [62].
This intriguing cross-model forensic transfer property suggests the existence of transferable forensic features (T-FF) in universal detectors.
1.1 Transferable Forensic Features (T-FF) in Universal Detectors
This work is motivated by a profound and challenging question: What transferable forensic features (T-FF) are used by universal detectors for counterfeit detection? A more elemental form of this question would be: given an image of a real car and a high-fidelity synthetic car generated by an unseen GAN (e.g., StyleGAN2 [41]), what T-FF does the universal detector use to accurately detect the synthetic car as counterfeit? Though Wang et al. [87] hypothesize that universal detectors may learn low-level CNN artifacts for detection, no qualitative or quantitative evidence is available in contemporary literature to understand T-FF in universal detectors. Our work takes the first step towards discovering and understanding T-FF in universal detectors for counterfeit detection. A foundational understanding of T-FF and their properties is of paramount importance to both image forensics research and image synthesis research. Understanding T-FF will make it possible to build robust forensic detectors and to devise techniques that improve image synthesis methods so that they avoid generating forensic footprints.
1.2 Our contributions
Our work conducts the first analytical study to discover and understand T-FF in universal detectors for counterfeit detection. We begin our study by comprehensively demonstrating that input-space attributions of universal detector decisions, computed with 2 popular algorithms, namely Guided-GradCAM [72] and LRP [5], are not informative for discovering T-FF. Next, we study the forensic feature space of universal detectors to discover T-FF. Investigating the feature space is an extremely daunting task due to the sheer number of feature maps present. For example, the ResNet-50 [35] architecture contains approximately 27K feature maps. To tackle this challenging task, we propose a novel forensic feature relevance statistic (FF-RS) to quantify and discover T-FF in universal detectors. Our proposed FF-RS ($\omega$) is a scalar which quantifies the ratio between the positive forensic relevance of a feature map and the total unsigned relevance of the entire layer that contains that feature map. Using our proposed FF-RS ($\omega$), we successfully discover T-FF in the publicly released ResNet-50 universal detector [87].
Next, to understand the discovered T-FF, we introduce a novel pixel-wise explanation method based on maximum spatial Layer-wise Relevance Propagation response (LRP-max). In particular, we visualize the pixel-wise explanations of each discovered T-FF in universal detectors independently using the LRP-max visualization method. A large-scale study on the visual interpretability of T-FF reveals that color information is critical for cross-model forensic transfer. Further large-scale quantitative investigations, using median counterfeit probability analysis and statistical tests on maximum spatial activation distributions based on color ablation, show that color is a critical T-FF in universal detectors. Our findings are intriguing and new to the research community, as many contemporary image forensics works focus on frequency discrepancies between real and counterfeit images [24, 25, 92, 12, 71, 42]. In summary, our contributions are as follows:
- We propose a novel forensic feature relevance statistic (FF-RS) to quantify and discover transferable forensic features (T-FF) in universal detectors for counterfeit detection.
- We qualitatively – using our proposed LRP-max visualization for feature map activations – and quantitatively – using median counterfeit probability analysis and statistical tests on maximum spatial activation distributions based on color ablation – show that color is a critical transferable forensic feature (T-FF) in universal detectors for counterfeit detection.
2 Related Work
Counterfeit detection.
Recent works have studied counterfeit detection both in the RGB domain [67, 54, 20, 92, 60, 85, 87] and in the frequency domain [25, 24, 12, 27, 51].
In particular, a notable number of works have proposed to use hand-crafted features for counterfeit detection [25, 24, 12, 60].
Using simple experiments, McCloskey et al. [56] showed that detection based on the frequency of over-exposed pixels can provide good discrimination between real images and counterfeits.
Li et al. observed disparities between GAN images and real images in the residual domain of the chrominance color components [46].
Some recent works have also proposed methods to detect counterfeits and attribute them to the generating architectures [91, 55].
Anomaly detection techniques leveraging pre-trained face recognition models have also been proposed [85].
Cross-model forensic transfer.
Most counterfeit detection works do not focus on cross-model forensic transfer.
Among the works that study forensic transfer, Cozzolino et al. [20] and Zhang et al. [92] observed that counterfeit detectors generalized poorly during cross-model forensic transfer.
To address this poor performance, Cozzolino et al. [20] proposed an autoencoder-based adaptation framework to improve cross-model forensic transfer.
The work by Wang et al. [87] was the first to show that counterfeit detectors – universal detectors – can generalize well during cross-model forensic transfer without any re-training / fine-tuning / adaptation on the target samples, suggesting the possible existence of transferable forensic features.
Furthermore, Chai et al. [11] showed that patch-based detectors with limited receptive fields often perform better at detecting unseen counterfeits compared to full-image based detectors.
Interpretability methods. A number of interpretability methods in machine learning aim to summarize the relations which a model has learnt as a whole, such as PCA and t-SNE [63, 52], or to explain single decisions of a neural network. The latter may follow different lines of questioning, such as identifying similar training samples in k-NN and prototype CNNs [49, 14], finding modified samples such as pertinent negatives [23], or model-based uncertainty estimates [29]. One class of algorithms aims at computing input-space attributions. This includes Shapley values [77, 50, 15], suitable for tabular data types, and methods for data types for which dropping a feature is not well defined, relying on modified gradients such as Guided Backprop [75], Layer-wise Relevance Propagation (LRP) [5], Guided-GradCAM [72], Full-Grad [76], and class-activation-mapping inspired research [22, 84, 36, 28, 59]. Bau et al. proposed frameworks for interpreting representations at the feature map level for classifiers [7] and GANs [8].
[Figure 2: images not recovered. Columns: ProGAN [37], StyleGAN2 [41], StyleGAN [40], BigGAN [10], CycleGAN [97]. Rows: input image; pixel-wise explanations of universal detector decisions [87] using Guided-GradCAM (GGC) [72] and LRP [5]; pixel-wise explanations of ImageNet classifier decisions using Guided-GradCAM (GGC) [72] and LRP [5].]
3 Dataset / Metrics
We use the ForenSynths dataset proposed by Wang et al. [87]. ForenSynths is the largest counterfeit benchmark dataset containing CNN-generated images from multiple generator architectures, datasets, loss functions and resolutions. In addition to ProGAN [37], we select 6 candidate GANs to comprehensively study cross-model forensic transfer in this work, namely StyleGAN2 [41], StyleGAN [40], BigGAN [10], CycleGAN [97], StarGAN [18] and GauGAN [62]. Following Wang et al. [87], we use AP (Average Precision) to measure cross-model forensic transfer of universal detectors. We also report the accuracies for real and counterfeit images separately, as we intend to understand counterfeit detection. For detector calibration, we follow [87] and use the oracle threshold obtained using the geometric mean of sensitivity and specificity.
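The calibration step above can be illustrated with a minimal sketch. It assumes that the oracle threshold is the candidate threshold maximizing the geometric mean of sensitivity and specificity on labelled test scores; the exhaustive search, the function name `oracle_threshold` and the toy scores are illustrative assumptions, not the calibration code of [87].

```python
import math

def oracle_threshold(scores, labels):
    """Return (threshold, g_mean) maximizing sqrt(sensitivity * specificity).

    scores: predicted counterfeit probabilities.
    labels: 1 = counterfeit, 0 = real.
    A sample is called counterfeit when score >= threshold.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    best_t, best_g = None, -1.0
    # Assumption: every observed score is tried as a candidate threshold.
    for t in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= t)
        tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < t)
        g = math.sqrt((tp / n_pos) * (tn / n_neg))
        if g > best_g:
            best_t, best_g = t, g
    return best_t, best_g

# Toy scores for illustration only (not results from the paper).
scores = [0.05, 0.10, 0.20, 0.30, 0.60, 0.35, 0.70, 0.80, 0.95]
labels = [0, 0, 0, 0, 0, 1, 1, 1, 1]
threshold, g_mean = oracle_threshold(scores, labels)
```

Because the threshold is chosen with access to the test labels, it is an oracle quantity: it describes the best achievable operating point rather than a deployable calibration.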
4 Discovering Transferable Forensic Features (T-FF)
4.1 Input-space attribution methods
Interpretable machine learning algorithms are useful exploratory tools to visualize neural networks’ decisions by input-space attribution [9, 72, 76, 22, 84, 36, 28, 59]. We start from the following question: Are interpretability methods suitable to discover T-FF in universal detectors?
We use 2 popular interpretability methods, namely Guided-GradCAM [72] and LRP [5], to analyse the pixel-wise explanations of universal detector decisions. These methods were chosen due to their relatively low amount of gradient shattering noise [6]. We show the pixel-wise explanation results of ResNet-50 universal detector [87] decisions for ProGAN [37] and 4 GANs not used for training – CycleGAN [97], StarGAN [18], BigGAN [10] and StyleGAN2 [41] – in Fig. 2. As one can observe in Fig. 2, pixel-wise explanations of universal detector decisions are not informative for discovering T-FF due to their focus on spatial localization. In particular, we are unable to discover any forensic footprints based on pixel-wise explanations of universal detector decisions. This is consistently seen across both Guided-GradCAM [72] and LRP [5]. We remark that these observations do not indicate failure modes of Guided-GradCAM [72] or LRP [5], but rather suggest that universal detectors are learning more complex T-FF that are not easily human-parsable.
4.2 Forensic Feature Space
Given that input-space attribution methods are not informative to discover T-FF, we study the feature space to discover T-FF in universal detectors for counterfeit detection.
Particularly, we ask the question: which feature maps in universal detectors are responsible for cross-model forensic transfer?
This is a challenging problem as it requires quantifying the importance of every feature map in universal detectors for counterfeit detection. The ResNet-50 universal detector [87] consists of approximately 27K intermediate feature maps.
Forensic feature relevance statistic (FF-RS). We propose a novel FF-RS ($\omega$) to quantify the relevance of every feature map in universal detectors for counterfeit detection. Specifically, for the feature map at a given layer and channel, $\omega$ computes the forensic relevance of this feature map for counterfeit detection. We describe the important design considerations and intuitions behind our proposed FF-RS ($\omega$) below and include the pseudocode in the FF-RS algorithm:
- We postulate the existence of a set of feature maps in universal detectors that are responsible for cross-model forensic transfer. In particular, we hypothesize that there is a set of common transferable forensic feature maps that mostly get activated when passing counterfeits from ProGAN [37] and unseen GANs.
- Our proposed FF-RS ($\omega$) is a scalar that quantifies the forensic relevance of every feature map. In particular, for a given feature map, $\omega$ quantifies the ratio between the positive forensic relevance of the feature map and the total unsigned forensic relevance of the entire layer that contains it. This is shown in Line 8 of the FF-RS algorithm. For the numerator we are only interested in positive relevance, and therefore use a max operation to select only positive relevance (identical to a ReLU operation).
- $\omega$ is calculated over a large number of counterfeit images and is bounded in $[0, 1]$: $\omega = 1$ indicates that the particular feature map is the most relevant forensic feature, and $\omega = 0$ indicates the opposite.
- Finally, we use $\omega$ to rank all the feature maps and identify the set of T-FF. We refer to this set as top-k in our experiments.
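The design considerations above can be sketched for a single layer as follows. This is a minimal illustration that assumes the signed relevance scores of the layer have already been computed (for example with LRP) and that per-image ratios are averaged over the counterfeit images; the function name `ff_rs`, the averaging step and the synthetic relevance tensor are assumptions, not the authors' algorithm.

```python
import numpy as np

def ff_rs(relevance):
    """Relevance statistic per feature map for one layer.

    relevance: array of shape (N, C, H, W) holding signed relevance scores
    of one layer for N counterfeit images.
    Returns an array of C values, each in [0, 1].
    """
    # Numerator: positive relevance of each feature map (max with 0 = ReLU).
    positive = np.maximum(relevance, 0.0).sum(axis=(2, 3))      # (N, C)
    # Denominator: total unsigned relevance of the whole layer, per image.
    total = np.abs(relevance).sum(axis=(1, 2, 3))                # (N,)
    ratio = positive / np.maximum(total, 1e-12)[:, None]         # (N, C)
    # Assumption: aggregate over images with a mean.
    return ratio.mean(axis=0)                                    # (C,)

# Synthetic relevance for illustration: channel 2 is made strongly positive.
rng = np.random.default_rng(0)
relevance = rng.normal(size=(8, 4, 5, 5))
relevance[:, 2] = np.abs(relevance[:, 2]) * 10.0
omega = ff_rs(relevance)
ranking = np.argsort(omega)[::-1]   # most relevant feature map first
```

Since the positive relevance of any single map cannot exceed the unsigned relevance of the whole layer, each value lies in [0, 1], which matches the bound stated in the list above.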
Experiments: Sensitivity assessments of T-FF discovered using the FF-RS algorithm. We perform rigorous sensitivity assessments using feature map dropout experiments to demonstrate that our proposed FF-RS ($\omega$) is able to quantify and discover T-FF. Feature map dropout suppresses (zeroes out) the resulting activations of the target feature maps. In particular, feature map dropout of T-FF should satisfy the following sensitivity conditions:
Test bed details. We use the ForenSynths test set [87]. $\omega$ is calculated using 1000 ProGAN [37] counterfeits (validation set). We use the following experiment codes:
- top-k: Set of T-FF discovered using FF-RS ($\omega$).
- random-k: Set of random feature maps used as a control experiment.
- low-k: Set of low-ranked feature maps corresponding to extremely small values of $\omega$.
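The three experiment codes and the dropout operation can be sketched as below. The sketch is simplified to a single layer with a flat vector of $\omega$ values; the helper names `select_sets` and `feature_map_dropout`, the seed and the toy values are hypothetical and only illustrate the procedure described above.

```python
import numpy as np

def select_sets(omega, k, seed=0):
    """Return index sets (top_k, random_k, low_k) from a vector of omega values."""
    order = np.argsort(omega)[::-1]          # most relevant feature map first
    top_k = order[:k]
    low_k = order[-k:]
    rng = np.random.default_rng(seed)
    random_k = rng.choice(omega.size, size=k, replace=False)  # control set
    return top_k, random_k, low_k

def feature_map_dropout(activations, channels):
    """Zero out the given channels of an (N, C, H, W) activation array."""
    out = activations.copy()
    out[:, channels] = 0.0
    return out

# Toy omega values for six feature maps (illustrative only).
omega = np.array([0.02, 0.40, 0.001, 0.25, 0.0, 0.10])
top_k, random_k, low_k = select_sets(omega, k=2)

acts = np.ones((3, 6, 4, 4))
dropped = feature_map_dropout(acts, top_k)
```

In a real detector the zeroing would be applied to intermediate activations during the forward pass, and AP and accuracies would then be re-measured for each of the three sets.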
Results. We show the results in Table 1 for ResNet-50 and EfficientNet-B0 universal detectors. We clearly observe that feature map dropout of the top-k features corresponding to T-FF satisfies both sensitivity conditions above, indicating that our proposed FF-RS ($\omega$) is able to quantify and discover transferable forensic features. We also observe that feature map dropout of low-k (low-ranked) forensic features has little / no effect on cross-model forensic transfer, which further adds merit to our proposed FF-RS ($\omega$).
5 Understanding Transferable Forensic Features (T-FF)
[Figure: image panels not recovered. Columns: ProGAN [37], StyleGAN2 [41], StyleGAN [40], BigGAN [10], CycleGAN [97], StarGAN [18], GauGAN [62].]
Given the successful discovery of T-FF using our proposed FF-RS ($\omega$), in this section we ask the following question: what counterfeit properties are detected by this set of T-FF? Though Wang et al. [87] hypothesize that universal detectors may learn low-level CNN artifacts for cross-model forensic transfer, no evidence is available as to which features in counterfeits are detected during cross-model forensic transfer.
5.1 LRP-max explanations of T-FF
We approach this problem from a visual interpretability perspective. In this section, we introduce a novel pixel-wise explanation method for feature map activations based on maximum spatial Layer-wise Relevance Propagation response (LRP-max). The idea behind LRP-max is to independently visualize which pixels in the input space correspond to maximum spatial relevance scores for each T-FF. In particular, instead of back-propagating from the detector logits, we back-propagate from the maximum spatial relevance neuron of each T-FF independently. LRP-max automatically extracts image regions for every T-FF and does not depend on external modules such as the segmentation used in [7, 8]. The pseudocode is included in the LRP-max algorithm.
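The selection step of LRP-max can be sketched as follows. The sketch only covers choosing the maximum spatial relevance neuron of one T-FF and cropping the resulting input region; the LRP backward pass itself is not implemented here and would come from an LRP implementation. The helper names `lrp_max_seed` and `relevant_region`, and the bounding-box cropping rule, are assumptions made for illustration.

```python
import numpy as np

def lrp_max_seed(relevance_map):
    """Pick the maximum spatial relevance neuron of one feature map.

    relevance_map: (H, W) relevance of a single T-FF for one image.
    Returns the neuron index and a one-hot style (H, W) array that keeps only
    that neuron's relevance; this array is the start of the backward pass.
    """
    idx = np.unravel_index(np.argmax(relevance_map), relevance_map.shape)
    seed = np.zeros_like(relevance_map)
    seed[idx] = relevance_map[idx]
    return idx, seed

def relevant_region(pixel_relevance, image):
    """Crop the bounding box of pixels with positive input-space relevance.

    Assumption: regions are extracted as a bounding box; pixel_relevance must
    contain at least one positive entry.
    """
    ys, xs = np.nonzero(pixel_relevance > 0)
    return image[ys.min():ys.max() + 1, xs.min():xs.max() + 1]

relevance_map = np.array([[0.1, -0.3, 0.0],
                          [0.2,  0.9, -0.1],
                          [0.0,  0.4,  0.3]])
idx, seed = lrp_max_seed(relevance_map)

# Stand-in for the input-space relevance an LRP backward pass would return.
pixel_relevance = np.zeros((6, 6))
pixel_relevance[2:4, 1:5] = 1.0
image = np.arange(36).reshape(6, 6)
patch = relevant_region(pixel_relevance, image)
```

Starting the backward pass from a single neuron rather than from the detector logits is what ties each visualization to one T-FF instead of to the overall decision.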
Color is a critical T-FF in universal detectors: LRP-max visualizations of T-FF uncover the unexpected observation that a substantial number of T-FF exhibit color-conditional activations. We show the LRP-max regions for ProGAN [37] and all unseen GANs [41, 40, 10, 97, 18, 62] for ResNet-50 and EfficientNet-B0 universal detectors in Fig. 1 and 3 respectively. As one can observe, the consistent color-conditional LRP-max response across all GANs for these T-FF clearly indicates that color is critical for cross-model forensic transfer in universal detectors. This is notably surprising and observed for the first time in transferable image forensics research. In the next section, we conduct quantitative studies to rigorously verify that color is a critical T-FF in universal detectors.
5.2 Color Ablation Studies
In this section, we conduct 2 quantitative studies to show that color is a critical transferable forensic feature in universal detectors. Our studies measure the sensitivity of universal detectors before and after color ablation.
Study 1. We investigate the change in probability distribution of universal detectors when removing color information in counterfeits during cross-model forensic transfer. We specifically study the change in median counterfeit probability when removing color information (median is not sensitive to outliers). The results for both ResNet-50 and EfficientNet-B0 universal detectors are shown in Fig. 4. As one can clearly observe, color ablation causes the median probability predicted by the universal detector to drop by more than 89% across all unseen GANs showing that color is a critical T-FF in universal detectors. This is observed in both ResNet-50 and EfficientNet-B0 universal detectors.
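Study 1 can be illustrated with a minimal sketch of color ablation and the median comparison. It assumes that color is removed by converting RGB to luma with ITU-R BT.601 weights and replicating the result over three channels; the conversion used in the study may differ. The function names and the probability values are illustrative only and are not results from the paper.

```python
import numpy as np

def to_grayscale(image):
    """Remove color from an (H, W, 3) RGB array, keeping three channels."""
    weights = np.array([0.299, 0.587, 0.114])   # assumed BT.601 luma weights
    luma = image @ weights                       # (H, W)
    return np.repeat(luma[..., None], 3, axis=2)

def median_drop_percent(p_baseline, p_grayscale):
    """Relative drop (in percent) of the median counterfeit probability."""
    m0 = np.median(p_baseline)
    m1 = np.median(p_grayscale)
    return 100.0 * (m0 - m1) / m0

rng = np.random.default_rng(0)
image = rng.random((4, 4, 3))
gray = to_grayscale(image)

# Made-up detector outputs for the same counterfeits before and after ablation.
p_baseline = np.array([0.90, 0.95, 0.80, 0.99, 0.20])
p_grayscale = np.array([0.05, 0.10, 0.09, 0.30, 0.01])
drop = median_drop_percent(p_baseline, p_grayscale)
```

The median is used rather than the mean so that a few outlying predictions, such as the 0.20 and 0.30 values in the toy arrays, do not dominate the comparison.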
[Figure: image panels not recovered. Columns: ProGAN [37], StyleGAN2 [41], StyleGAN [40], BigGAN [10], CycleGAN [97], StarGAN [18], GauGAN [62].]
Study 2. In this study, we measure the percentage of T-FF that are color-conditional. In particular, we conduct a statistical test to compare the maximum globally pooled spatial activation distributions of each T-FF before and after color ablation. The intuition is that, with color ablation, color-conditional T-FF will produce lower activations for the same sample, so we perform a hypothesis test to measure whether the maximum spatial activation distributions are statistically different before (Baseline) and after color ablation (Grayscale). We use Mood's median test (non-parametric, low-power) at a fixed significance level in our study. The pseudocode is shown in the median test algorithm. The results for ResNet-50 and EfficientNet-B0 universal detectors are shown in Table 2 (rows 1, 2). Our results show that a substantial percentage of T-FF in universal detectors are color-conditional, indicating that color is a critical T-FF. We also show the maximum spatial activation distributions for several color-conditional T-FF for ResNet-50 and EfficientNet-B0 universal detectors in Fig. 6. As one can observe, maximum spatial activations are suppressed for these T-FF across ProGAN [37] and all unseen GANs [41, 40, 10, 97, 18, 62] when removing color information. This clearly suggests that these T-FF are color-conditional.
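The hypothesis test in Study 2 can be sketched with a small two-sample Mood's median test. The version below is written from the textbook definition (a chi-square test on the 2x2 table of counts above and not above the grand median) without continuity correction, so its statistic can differ slightly from library implementations that apply a correction by default. The significance level `alpha` and the activation values are assumptions for illustration.

```python
import math
from statistics import median

def moods_median_test(a, b):
    """Two-sample Mood's median test; returns (chi2, p_value) with df = 1."""
    grand = median(list(a) + list(b))
    # 2x2 table: values above the grand median vs. the rest (ties count as low).
    a_hi = sum(x > grand for x in a)
    b_hi = sum(x > grand for x in b)
    a_lo, b_lo = len(a) - a_hi, len(b) - b_hi
    n = len(a) + len(b)
    row_hi, row_lo = a_hi + b_hi, a_lo + b_lo
    if row_hi == 0 or row_lo == 0:
        return 0.0, 1.0   # degenerate table: no evidence of a difference
    chi2 = 0.0
    for obs, row, col in ((a_hi, row_hi, len(a)), (a_lo, row_lo, len(a)),
                          (b_hi, row_hi, len(b)), (b_lo, row_lo, len(b))):
        expected = row * col / n
        chi2 += (obs - expected) ** 2 / expected
    # Survival function of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value

# Made-up maximum spatial activations of one T-FF for the same images.
baseline = [5.1, 4.8, 6.0, 5.5, 4.9, 5.7, 6.2, 5.3]
grayscale = [1.2, 0.9, 1.5, 2.0, 1.1, 0.8, 1.7, 1.3]
chi2, p_value = moods_median_test(baseline, grayscale)

alpha = 0.05   # assumed value; the level used in the study is not stated here
color_conditional = p_value < alpha
```

Repeating this test for every T-FF and counting the rejections gives the percentage of color-conditional T-FF that the study reports.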
[Figure: image panels not recovered. Columns: ProGAN [37], StyleGAN2 [41], StyleGAN [40], BigGAN [10], CycleGAN [97], StarGAN [18], GauGAN [62].]
[Figure: image panels not recovered. Columns: ProGAN [37], StyleGAN2 [41], StyleGAN [40], BigGAN [10], CycleGAN [97], StarGAN [18], GauGAN [62]. Rows: ResNet-50, EfficientNet-B0.]
6 Applications: Color-Robust (CR) Universal Detectors
Reliance on a substantial amount of color information for cross-model forensic transfer exposes universal detectors to attacks via color-ablated counterfeits. This is particularly unfavourable. In this section, we propose a data augmentation scheme to build Color-Robust (CR) universal detectors that do not substantially rely on color information for cross-model forensic transfer. The crux of the idea is to randomly remove color information from samples during training (both for real and counterfeit images). In particular, we perform random grayscaling during training with 50% probability to steer universal detectors towards learning T-FF that do not substantially rely on color information.
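The augmentation can be sketched in a few lines. The sketch assumes images are (H, W, 3) RGB arrays and that grayscaling uses ITU-R BT.601 luma weights; both the function name `random_grayscale` and these choices are illustrative, and the training pipeline in the paper may implement the same idea differently.

```python
import numpy as np

def random_grayscale(image, rng, p=0.5):
    """With probability p, remove color from an (H, W, 3) RGB array."""
    if rng.random() < p:
        luma = image @ np.array([0.299, 0.587, 0.114])   # assumed luma weights
        return np.repeat(luma[..., None], 3, axis=2)     # keep three channels
    return image

rng = np.random.default_rng(0)
image = rng.random((4, 4, 3))

# Apply the augmentation many times and measure how often color was removed.
outputs = [random_grayscale(image, rng) for _ in range(2000)]
is_gray = [bool(np.allclose(o[..., 0], o[..., 1]) and np.allclose(o[..., 1], o[..., 2]))
           for o in outputs]
gray_fraction = sum(is_gray) / len(is_gray)
```

Keeping three identical channels means the detector architecture is unchanged; only the information available in the input differs, and the same transform is applied to real and counterfeit samples alike.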
Results. Median probability analysis results for ResNet-50 and EfficientNet-B0 CR-universal detectors are shown in Fig. 5. We clearly observe that with our proposed data augmentation scheme, CR-universal detectors are more robust to color ablation during cross-model forensic transfer, indicating that they learn T-FF that do not substantially rely on color information. We further show the percentage of color-conditional T-FF in CR-ResNet-50 and CR-EfficientNet-B0 in Table 2 (rows 3, 4), quantitatively showing that CR-universal detectors learn a substantially lower percentage of color-conditional T-FF.
T-FF in CR-Universal Detectors. We further discover T-FF in CR-universal detectors using our proposed FF-RS ($\omega$). We show LRP-max visualizations of T-FF in CR-ResNet-50 in Supplementary Fig. H.1. These T-FF largely correspond to patterns / artifacts (e.g., wheels). We emphasize that our proposed method can identify different types of T-FF in addition to color.
7 Discussion and Conclusion
We conducted the first analytical study to discover and understand transferable forensic features (T-FF) in universal detectors. Our first set of investigations demonstrated that input-space attribution methods such as Guided-GradCAM [72] and LRP [5] are not informative to discover T-FF (Sec 4.1). In light of these observations, we studied the forensic feature space of universal detectors. In particular, we proposed a novel forensic feature relevance statistic (FF-RS) to quantify and discover T-FF in universal detectors. Rigorous sensitivity assessments using feature map dropout convincingly show that our proposed FF-RS ($\omega$) is able to successfully quantify and discover T-FF (Sec 4.2).
Further investigations on T-FF uncover an unexpected finding: color is a critical T-FF in universal detectors.
We show this critical finding qualitatively using our proposed LRP-max visualization of discovered T-FF (Sec 5.1).
Further, we validate this finding quantitatively using median counterfeit probability analysis and statistical tests on maximum spatial activation distributions of T-FF based on color ablation (Sec 5.2).
For example, we showed that a substantial percentage of T-FF (reported in Table 2) are color-conditional in the publicly released ResNet-50 universal detector [87].
Finally, we propose a simple data augmentation scheme to train Color-Robust (CR) universal detectors (Sec 6). We remark that color is not the only T-FF, but it is a critical T-FF in universal detectors.
We also discuss the computational complexity of FF-RS ($\omega$) and LRP-max in Supplementary B.
A natural question is why color is a critical T-FF. Though this is not a straightforward question to answer, we provide our perspective:
The color distribution of real images is non-uniform, and we hypothesize that most GANs struggle to capture the diverse, multi-modal color distribution of real images, e.g., low-density color regions.
This may result in noticeable discrepancies between real and GAN images (counterfeits) in the color space, which can be used as T-FF to detect counterfeits.
To conclude, through this work we discover and understand T-FF in universal detectors for counterfeit detection, and hope that our contributions will inspire further research in image forensics and image synthesis methods.
Limitations / Broader Impact.
With deepfakes-in-the-wild being generated using diverse techniques in addition to GAN-based methods, including shallow methods (e.g., Photoshop) and face-swapping frameworks (e.g., DeepFaceLab [64]), studying transferable forensic features in such synthesis methods is essential to build robust general-purpose image forensics detectors.
With the increasing use of machine learning methods to proliferate mis- and disinformation, we hope that our discovery of transferable forensic features can open up more research directions to combat visual disinformation.
Acknowledgements. This research is supported by the National Research Foundation, Singapore under its AI Singapore Programmes (AISG Award No.: AISG2-RP-2021-021; AISG Award No.: AISG-100E2018-005). This project is also supported by SUTD project PIE-SGP-AI-2018-01. Alexander Binder was supported by the SFI Visual Intelligence, project no. 309439 of the Research Council of Norway.
References
- [1] Synthetic media: How deepfakes could soon change our world, https://www.cbsnews.com/news/deepfake-artificial-intelligence-60-minutes-2022-07-31/
- [2] Abdollahzadeh, M., Malekzadeh, T., Cheung, N.M.: Revisit Multimodal Meta-Learning through the Lens of Multi-Task Learning. In: Thirty-Fifth Conference on Neural Information Processing Systems (2021), https://openreview.net/forum?id=V5prUHOrOP4
- [3] Arjovsky, M., Chintala, S., Bottou, L.: Wasserstein GAN (2017)
- [4] Arras, L., Osman, A., Müller, K.R., Samek, W.: Evaluating Recurrent Neural Network Explanations. In: ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP. pp. 113–126. ACL (2019)
- [5] Bach, S., Binder, A., Montavon, G., Klauschen, F., Müller, K.R., Samek, W.: On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PLoS ONE 10(7), e0130140 (2015)
- [6] Balduzzi, D., Frean, M., Leary, L., Lewis, J.P., Ma, K.W., McWilliams, B.: The Shattered Gradients Problem: If ResNets are the answer, then what is the question? In: International Conference on Machine Learning (ICML). vol. 70, pp. 342–350. PMLR (2017)
- [7] Bau, D., Zhou, B., Khosla, A., Oliva, A., Torralba, A.: Network dissection: Quantifying interpretability of deep visual representations. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 6541–6549 (2017)
- [8] Bau, D., Zhu, J.Y., Strobelt, H., Zhou, B., Tenenbaum, J.B., Freeman, W.T., Torralba, A.: GAN Dissection: Visualizing and Understanding Generative Adversarial Networks. In: International Conference on Learning Representations (2018)
- [9] Binder, A., Montavon, G., Lapuschkin, S., Müller, K.R., Samek, W.: Layer-Wise Relevance Propagation for Neural Networks with Local Renormalization Layers. In: Villa, A.E., Masulli, P., Pons Rivero, A.J. (eds.) Artificial Neural Networks and Machine Learning – ICANN 2016. pp. 63–71. Springer International Publishing, Cham (2016)
- [10] Brock, A., Donahue, J., Simonyan, K.: Large Scale GAN Training for High Fidelity Natural Image Synthesis. In: International Conference on Learning Representations (2019), https://openreview.net/forum?id=B1xsqj09Fm
- [11] Chai, L., Bau, D., Lim, S.N., Isola, P.: What makes fake images detectable? understanding properties that generalize. In: European conference on computer vision. pp. 103–120. Springer (2020)
- [12] Chandrasegaran, K., Tran, N.T., Cheung, N.M.: A Closer Look at Fourier Spectrum Discrepancies for CNN-Generated Images Detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 7200–7209 (June 2021)
- [13] Chandrasegaran, K., Tran, N.T., Zhao, Y., Cheung, N.M.: Revisiting Label Smoothing and Knowledge Distillation Compatibility: What was Missing? In: Chaudhuri, K., Jegelka, S., Song, L., Szepesvari, C., Niu, G., Sabato, S. (eds.) Proceedings of the 39th International Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 162, pp. 2890–2916. PMLR (17-23 Jul 2022)
- [14] Chen, C., Li, O., Tao, C., Barnett, A.J., Su, J., Rudin, C.: This Looks like That: Deep Learning for Interpretable Image Recognition. Curran Associates Inc., Red Hook, NY, USA (2019)
- [15] Chen, J., Song, L., Wainwright, M.J., Jordan, M.I.: L-Shapley and C-Shapley: Efficient Model Interpretation for Structured Data. In: International Conference on Learning Representations (2019), https://openreview.net/forum?id=S1E3Ko09F7
- [16] Chen, L., Zhang, Y., Song, Y., Liu, L., Wang, J.: Self-supervised Learning of Adversarial Examples: Towards Good Generalizations for DeepFake Detections. In: CVPR (2022)
- [17] Choi, K., Grover, A., Singh, T., Shu, R., Ermon, S.: Fair generative modeling via weak supervision. In: International Conference on Machine Learning. pp. 1887–1898. PMLR (2020)
- [18] Choi, Y., Choi, M., Kim, M., Ha, J.W., Kim, S., Choo, J.: StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2018)
- [19] Choi, Y., Uh, Y., Yoo, J., Ha, J.W.: StarGAN v2: Diverse Image Synthesis for Multiple Domains. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (June 2020)
- [20] Cozzolino, D., Thies, J., Rössler, A., Riess, C., Nießner, M., Verdoliva, L.: Forensictransfer: Weakly-supervised domain adaptation for forgery detection. arXiv preprint arXiv:1812.02510 (2018)
- [21] Dan, V., Paris, B., Donovan, J., Hameleers, M., Roozenbeek, J., van der Linden, S., von Sikorski, C.: Visual Mis- and Disinformation, Social Media, and Democracy. Journalism & Mass Communication Quarterly 98(3), 641–664 (2021). https://doi.org/10.1177/10776990211035395
- [22] Desai, S.S., Ramaswamy, H.G.: Ablation-CAM: Visual Explanations for Deep Convolutional Network via Gradient-free Localization. 2020 IEEE Winter Conference on Applications of Computer Vision (WACV) pp. 972–980 (2020)
- [23] Dhurandhar, A., Chen, P.Y., Luss, R., Tu, C.C., Ting, P., Shanmugam, K., Das, P.: Explanations Based on the Missing: Towards Contrastive Explanations with Pertinent Negatives. In: Proceedings of the 32nd International Conference on Neural Information Processing Systems. pp. 590–601. NIPS’18, Curran Associates Inc., Red Hook, NY, USA (2018)
- [24] Durall, R., Keuper, M., Keuper, J.: Watch Your Up-Convolution: CNN Based Generative Deep Neural Networks Are Failing to Reproduce Spectral Distributions. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (June 2020)
- [25] Dzanic, T., Shah, K., Witherden, F.: Fourier Spectrum Discrepancies in Deep Network Generated Images. In: Thirty-fourth Annual Conference on Neural Information Processing Systems (NeurIPS) (December 2020)
- [26] Foley, J.: 14 deepfake examples that terrified and amused the internet (Apr 2022), https://www.creativebloq.com/features/deepfake-examples
- [27] Frank, J., Eisenhofer, T., Schönherr, L., Fischer, A., Kolossa, D., Holz, T.: Leveraging frequency analysis for deep fake image recognition. In: International Conference on Machine Learning. pp. 3247–3258. PMLR (2020)
- [28] Fu, R., Hu, Q., Dong, X., Guo, Y., Gao, Y., Li, B.: Axiom-based Grad-CAM: Towards Accurate Visualization and Explanation of CNNs. In: BMVC (2020)
- [29] Gal, Y., Ghahramani, Z.: Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning. In: Balcan, M.F., Weinberger, K.Q. (eds.) Proceedings of The 33rd International Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 48, pp. 1050–1059. PMLR, New York, New York, USA (20–22 Jun 2016), https://proceedings.mlr.press/v48/gal16.html
- [30] Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y.: Generative Adversarial Nets. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems. vol. 27, pp. 2672–2680. Curran Associates, Inc. (2014), https://proceedings.neurips.cc/paper/2014/file/5ca3e9b122f61f8f06494c97b1afccf3-Paper.pdf
- [31] Haliassos, A., Vougioukas, K., Petridis, S., Pantic, M.: Lips don’t lie: A generalisable and robust approach to face forgery detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 5039–5049 (2021)
- [32] Hao, K., Heaven, W.D.: The year deepfakes went mainstream (Dec 2020), https://www.technologyreview.com/2020/12/24/1015380/best-ai-deepfakes-of-2020/
- [33] Harrison, E.: Shockingly realistic Tom Cruise deepfakes go viral on TikTok (Feb 2021), https://www.independent.co.uk/arts-entertainment/films/news/tom-cruise-deepfake-tiktok-video-b1808000.html
- [34] He, K., Fan, H., Wu, Y., Xie, S., Girshick, R.: Momentum contrast for unsupervised visual representation learning. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 9729–9738 (2020)
- [35] He, K., Zhang, X., Ren, S., Sun, J.: Deep Residual Learning for Image Recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (June 2016)
- [36] Jiang, P.T., Zhang, C.B., Hou, Q., Cheng, M.M., Wei, Y.: LayerCAM: Exploring Hierarchical Class Activation Maps For Localization. IEEE Transactions on Image Processing (2021)
- [37] Karras, T., Aila, T., Laine, S., Lehtinen, J.: Progressive Growing of GANs for Improved Quality, Stability, and Variation. In: International Conference on Learning Representations (2018), https://openreview.net/forum?id=Hk99zCeAb
- [38] Karras, T., Aittala, M., Hellsten, J., Laine, S., Lehtinen, J., Aila, T.: Training Generative Adversarial Networks with Limited Data. In: Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M.F., Lin, H. (eds.) Advances in Neural Information Processing Systems. vol. 33, pp. 12104–12114. Curran Associates, Inc. (2020), https://proceedings.neurips.cc/paper/2020/file/8d30aa96e72440759f74bd2306c1fa3d-Paper.pdf
- [39] Karras, T., Aittala, M., Laine, S., Härkönen, E., Hellsten, J., Lehtinen, J., Aila, T.: Alias-free generative adversarial networks. Advances in Neural Information Processing Systems 34, 852–863 (2021)
- [40] Karras, T., Laine, S., Aila, T.: A Style-Based Generator Architecture for Generative Adversarial Networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (June 2019)
- [41] Karras, T., Laine, S., Aittala, M., Hellsten, J., Lehtinen, J., Aila, T.: Analyzing and Improving the Image Quality of StyleGAN. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (June 2020)
- [42] Khayatkhoei, M., Elgammal, A.: Spatial Frequency Bias in Convolutional Generative Adversarial Networks (Oct 2020)
- [43] Kim, M., Tariq, S., Woo, S.S.: Fretal: Generalizing deepfake detection using knowledge distillation and representation learning. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 1001–1012 (2021)
- [44] Koh, J.Y., Lee, H., Yang, Y., Baldridge, J., Anderson, P.: Pathdreamer: A World Model for Indoor Navigation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). pp. 14738–14748 (October 2021)
- [45] Lee, K.S., Tran, N.T., Cheung, N.M.: Infomax-GAN: Improved adversarial image generation via information maximization and contrastive learning. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision. pp. 3942–3952 (2021)
- [46] Li, H., Li, B., Tan, S., Huang, J.: Identification of deep network generated images using disparities in color components. Signal Processing 174, 107616 (2020)
- [47] Li, L., Bao, J., Zhang, T., Yang, H., Chen, D., Wen, F., Guo, B.: Face X-ray for more general face forgery detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 5001–5010 (2020)
- [48] Lim, S.K., Loo, Y., Tran, N.T., Cheung, N.M., Roig, G., Elovici, Y.: DOPING: Generative Data Augmentation for Unsupervised Anomaly Detection with GAN. In: 18th IEEE International Conference on Data Mining, ICDM 2018. pp. 1122–1127. Institute of Electrical and Electronics Engineers Inc. (2018)
- [49] Lloyd, S.: Least squares quantization in PCM. IEEE Transactions on Information Theory 28(2), 129–137 (1982). https://doi.org/10.1109/TIT.1982.1056489
- [50] Lundberg, S.M., Lee, S.I.: A Unified Approach to Interpreting Model Predictions. In: Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R. (eds.) Advances in Neural Information Processing Systems. vol. 30. Curran Associates, Inc. (2017), https://proceedings.neurips.cc/paper/2017/file/8a20a8621978632d76c43dfd28b67767-Paper.pdf
- [51] Luo, Y., Zhang, Y., Yan, J., Liu, W.: Generalizing face forgery detection with high-frequency features. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 16317–16326 (2021)
- [52] van der Maaten, L., Hinton, G.: Visualizing Data using t-SNE. Journal of Machine Learning Research 9(86), 2579–2605 (2008), http://jmlr.org/papers/v9/vandermaaten08a.html
- [53] Mahmud, A.H.: Deep dive into deepfakes: Frighteningly real and sometimes used for the wrong things (Oct 2021), https://www.channelnewsasia.com/singapore/deepfakes-ai-security-threat-face-swapping-2252161
- [54] Marra, F., Gragnaniello, D., Cozzolino, D., Verdoliva, L.: Detection of GAN-Generated Fake Images over Social Networks. In: 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR). pp. 384–389 (2018). https://doi.org/10.1109/MIPR.2018.00084
- [55] Marra, F., Gragnaniello, D., Verdoliva, L., Poggi, G.: Do GANs leave artificial fingerprints? In: 2019 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR). pp. 506–511. IEEE (2019)
- [56] McCloskey, S., Albright, M.: Detecting GAN-generated imagery using saturation cues. In: 2019 IEEE international conference on image processing (ICIP). pp. 4584–4588. IEEE (2019)
- [57] Montavon, G., Binder, A., Lapuschkin, S., Samek, W., Müller, K.R.: Layer-Wise Relevance Propagation: An Overview, pp. 193–209. Springer International Publishing, Cham (2019). https://doi.org/10.1007/978-3-030-28954-6_10
- [58] Montavon, G., Lapuschkin, S., Binder, A., Samek, W., Müller, K.R.: Explaining nonlinear classification decisions with deep Taylor decomposition. Pattern Recognition 65, 211–222 (2017). https://doi.org/10.1016/j.patcog.2016.11.008, https://www.sciencedirect.com/science/article/pii/S0031320316303582
- [59] Muhammad, M.B., Yeasin, M.: Eigen-cam: Class activation map using principal components. In: 2020 International Joint Conference on Neural Networks (IJCNN). pp. 1–7. IEEE (2020)
- [60] Nataraj, L., Mohammed, T.M., Manjunath, B., Chandrasekaran, S., Flenner, A., Bappy, J.H., Roy-Chowdhury, A.K.: Detecting GAN generated fake images using co-occurrence matrices. Electronic Imaging 2019(5), 532–1 (2019)
- [61] CBS News: Synthetic media: How deepfakes could soon change our world (Oct 2021), https://www.cbsnews.com/news/deepfake-artificial-intelligence-60-minutes-2021-10-10/
- [62] Park, T., Liu, M.Y., Wang, T.C., Zhu, J.Y.: Semantic image synthesis with spatially-adaptive normalization. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 2337–2346 (2019)
- [63] Pearson, K.: LIII. On lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science 2(11), 559–572 (1901). https://doi.org/10.1080/14786440109462720
- [64] Perov, I., Gao, D., Chervoniy, N., Liu, K., Marangonda, S., Umé, C., Dpfks, M., Facenheim, C.S., RP, L., Jiang, J., et al.: DeepFaceLab: Integrated, flexible and extensible face-swapping framework. arXiv preprint arXiv:2005.05535 (2020)
- [65] Pörner, N., Schütze, H., Roth, B.: Evaluating neural network explanation methods using hybrid documents and morphosyntactic agreement. In: Annual Meeting of the Association for Computational Linguistics (ACL). pp. 340–350. ACL (2018)
- [66] Razavi, A., van den Oord, A., Vinyals, O.: Generating Diverse High-Fidelity Images with VQ-VAE-2. In: Wallach, H., Larochelle, H., Beygelzimer, A., d'Alché-Buc, F., Fox, E., Garnett, R. (eds.) Advances in Neural Information Processing Systems. vol. 32, pp. 14866–14876. Curran Associates, Inc. (2019), https://proceedings.neurips.cc/paper/2019/file/5f8e2fa1718d1bbcadf1cd9c7a54fb8c-Paper.pdf
- [67] Rossler, A., Cozzolino, D., Verdoliva, L., Riess, C., Thies, J., Nießner, M.: Faceforensics++: Learning to detect manipulated facial images. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. pp. 1–11 (2019)
- [68] Samek, W., Binder, A., Montavon, G., Lapuschkin, S., Müller, K.R.: Evaluating the Visualization of What a Deep Neural Network Has Learned. IEEE Transactions on Neural Networks and Learning Systems 28(11), 2660–2673 (2017)
- [69] Sattigeri, P., Hoffman, S.C., Chenthamarakshan, V., Varshney, K.R.: Fairness GAN: Generating datasets with fairness properties using a generative adversarial network. IBM Journal of Research and Development 63(4/5), 3:1–3:9 (2019). https://doi.org/10.1147/JRD.2019.2945519
- [70] Schick, N.: Deepfakes: The coming infocalypse. Grand Central Publishing (2021)
- [71] Schwarz, K., Liao, Y., Geiger, A.: On the Frequency Bias of Generative Models. Advances in Neural Information Processing Systems 34 (2021)
- [72] Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-cam: Visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE international conference on computer vision. pp. 618–626 (2017)
- [73] Shiohara, K., Yamasaki, T.: Detecting Deepfakes with Self-Blended Images. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 18720–18729 (2022)
- [74] Simonite, T.: What Happened to the Deepfake Threat to the Election? (Nov 2020), https://www.wired.com/story/what-happened-deepfake-threat-election/
- [75] Springenberg, J.T., Dosovitskiy, A., Brox, T., Riedmiller, M.A.: Striving for Simplicity: The All Convolutional Net. In: Bengio, Y., LeCun, Y. (eds.) 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Workshop Track Proceedings (2015), http://arxiv.org/abs/1412.6806
- [76] Srinivas, S., Fleuret, F.: Full-gradient representation for neural network visualization. Advances in neural information processing systems 32 (2019)
- [77] Strumbelj, E., Kononenko, I.: An Efficient Explanation of Individual Classifications Using Game Theory. J. Mach. Learn. Res. 11, 1–18 (Mar 2010)
- [78] Tan, M., Le, Q.: Efficientnet: Rethinking model scaling for convolutional neural networks. In: International Conference on Machine Learning. pp. 6105–6114. PMLR (2019)
- [79] Tan, S., Shen, Y., Zhou, B.: Improving the fairness of deep generative models without retraining. arXiv preprint arXiv:2012.04842 (2020)
- [80] Teo, C.T., Cheung, N.M.: Measuring fairness in generative models. arXiv preprint arXiv:2107.07754 (2021)
- [81] Tran, N.T., Bui, T.A., Cheung, N.: Dist-GAN: An Improved GAN Using Distance Constraints. In: ECCV (2018)
- [82] Tran, N.T., Tran, V.H., Nguyen, B.N., Yang, L., Cheung, N.M.: Self-supervised GAN: Analysis and Improvement with Multi-class Minimax Game. In: Wallach, H., Larochelle, H., Beygelzimer, A., d'Alché-Buc, F., Fox, E., Garnett, R. (eds.) Advances in Neural Information Processing Systems. vol. 32. Curran Associates, Inc. (2019), https://proceedings.neurips.cc/paper/2019/file/d04cb95ba2bea9fd2f0daa8945d70f11-Paper.pdf
- [83] Tran, N.T., Tran, V.H., Nguyen, N.B., Nguyen, T.K., Cheung, N.M.: On data augmentation for GAN training. IEEE Transactions on Image Processing 30, 1882–1897 (2021)
- [84] Wang, H., Wang, Z., Du, M., Yang, F., Zhang, Z., Ding, S., Mardziel, P., Hu, X.: Score-CAM: Score-weighted visual explanations for convolutional neural networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops. pp. 24–25 (2020)
- [85] Wang, R., Juefei-Xu, F., Ma, L., Xie, X., Huang, Y., Wang, J., Liu, Y.: FakeSpotter: a simple yet robust baseline for spotting AI-synthesized fake faces. In: Proceedings of the Twenty-Ninth International Conference on International Joint Conferences on Artificial Intelligence. pp. 3444–3451 (2021)
- [86] Wang, S.Y., Bau, D., Zhu, J.Y.: Rewriting Geometric Rules of a GAN. ACM Trans. Graph. 41(4) (Jul 2022). https://doi.org/10.1145/3528223.3530065
- [87] Wang, S.Y., Wang, O., Zhang, R., Owens, A., Efros, A.A.: CNN-Generated Images Are Surprisingly Easy to Spot… for Now. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (June 2020)
- [88] Xu, D., Yuan, S., Zhang, L., Wu, X.: FairGAN: Fairness-aware generative adversarial networks. In: 2018 IEEE International Conference on Big Data (Big Data). pp. 570–575. IEEE (2018)
- [89] Yeom, S.K., Seegerer, P., Lapuschkin, S., Binder, A., Wiedemann, S., Müller, K.R., Samek, W.: Pruning by explaining: A novel criterion for deep neural network pruning. Pattern Recognition 115, 107899 (2021). https://doi.org/10.1016/j.patcog.2021.107899, https://www.sciencedirect.com/science/article/pii/S0031320321000868
- [90] Yu, F., Zhang, Y., Song, S., Seff, A., Xiao, J.: LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop. arXiv preprint arXiv:1506.03365 (2015)
- [91] Yu, N., Davis, L.S., Fritz, M.: Attributing fake images to GANs: Learning and analyzing GAN fingerprints. In: Proceedings of the IEEE/CVF international conference on computer vision. pp. 7556–7566 (2019)
- [92] Zhang, X., Karaman, S., Chang, S.: Detecting and Simulating Artifacts in GAN Fake Images. In: 2019 IEEE International Workshop on Information Forensics and Security (WIFS). pp. 1–6 (2019). https://doi.org/10.1109/WIFS47025.2019.9035107
- [93] Zhao, H., Zhou, W., Chen, D., Wei, T., Zhang, W., Yu, N.: Multi-attentional deepfake detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 2185–2194 (2021)
- [94] Zhao, S., Liu, Z., Lin, J., Zhu, J.Y., Han, S.: Differentiable Augmentation for Data-Efficient GAN Training. In: Conference on Neural Information Processing Systems (NeurIPS) (2020)
- [95] Zhao, T., Xu, X., Xu, M., Ding, H., Xiong, Y., Xia, W.: Learning self-consistency for deepfake detection. In: Proceedings of the IEEE/CVF international conference on computer vision. pp. 15023–15033 (2021)
- [96] Zhao, Y., Ding, H., Huang, H., Cheung, N.M.: A Closer Look at Few-shot Image Generation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 9140–9150 (2022)
- [97] Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE international conference on computer vision. pp. 2223–2232 (2017)
Supplementary Materials
Contents
This Supplementary provides additional experiments, analysis, discussion and code / reproducibility details to further support our findings. The Supplementary materials are organized as follows:
- Section 0.A: A brief overview of the LRP-algorithm used
- Section 0.B: Computational complexity of FF-RS / LRP-max
- Section 0.C: Non color-conditional T-FF
- Section 0.D: Hyper-parameter k in top-k for T-FF
- Section 0.E: Cross-model forensic transfer using BigGAN [10] pre-training dataset
- Section 0.F: Is the performance degradation in universal detectors due to unseen corruptions (OOD)?
- Section 0.G: Color-conditional T-FF (Additional Results)
- Section 0.H: CR-Universal Detectors (Additional Results)
- Section 0.I: Pixel-wise explanations are not informative to discover T-FF (Additional Results)
- Section 0.J: Research Reproducibility / Code Details
- Section 0.K: Future Work: Can we identify globally relevant channels for counterfeit detection in a Generator?
Appendix 0.A A brief overview of the LRP-algorithm used
Layer-wise relevance propagation (LRP) [5] is a modified-gradient type algorithm for backward passes in neural networks and other models. LRP replaces the partial derivatives that usually flow back along the edges of the computational graph with terms derived from Taylor decompositions of single layers [58] of the network. While the $\epsilon$-LRP rule is similar to gradient-times-input, other rules such as the $\beta$-rule [57] produce explanations that exhibit visually low noise and, owing to their normalization properties, are robust to the gradient shattering effects [6] common in deep neural networks. Consider a neuron $y$ with inputs $x_i$, weights $w_i$, and a relevance score $R_y$ already computed for its output $y$. The relevance score is the LRP analogue of the total derivative in conventional backpropagation started at the output logits. The relevance score $R_i$ for the input $x_i$ according to the $\beta = 0$-rule is then given as

$$R_i = R_y \, \frac{(w_i x_i)_+}{\sum_{j} (w_j x_j)_+} \tag{1}$$

where $(\cdot)_+$ is the positive part. The fraction measures the proportion of the positive part of the weighted input for input neuron $i$ relative to the positive weighted inputs from all inputs used to compute the value of neuron $y$. The rule therefore redistributes relevance from an output to its inputs in proportion to this fraction and to the relevance of the output neuron. We used the $\beta = 0$-rule for all convolution layers and the $\epsilon$-rule for the top-most fully connected layer. Before applying LRP, we fuse batchnorm layers into convolution layers and reset the batchnorm layers, so that backpropagation through the reset batchnorm layers is the identity. The base LRP algorithm is implemented in PyTorch as custom static autograd functions. For convolution layers, this yields relevance scores of shape $(N, C, H, W)$ (batch, channels, height, width) in the gradient field.
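The positive-part redistribution of Eq. (1) can be illustrated for a single neuron. The following is a minimal sketch in plain Python, not the released PyTorch implementation; the function name `lrp_positive_rule`, the `eps` guard, and the toy values are assumptions made for illustration only.

```python
def lrp_positive_rule(inputs, weights, relevance_out, eps=1e-12):
    """Redistribute the relevance of one output neuron onto its inputs.

    Each input receives a share of `relevance_out` proportional to the
    positive part of its weighted contribution w_i * x_i.
    """
    # Positive part of every weighted input: max(w_i * x_i, 0).
    positive = [max(w * x, 0.0) for w, x in zip(weights, inputs)]
    total = sum(positive)
    if total <= eps:
        # No positive contribution at all: return zero relevance
        # (a convention chosen for this sketch only).
        return [0.0 for _ in positive]
    return [relevance_out * p / total for p in positive]


# Toy values (hypothetical): weighted inputs are 0.5, -2.0 and 1.5.
inputs = [1.0, 2.0, -1.0]
weights = [0.5, -1.0, -1.5]
scores = lrp_positive_rule(inputs, weights, relevance_out=2.0)
```

In this toy case the negative contribution receives no relevance, and the two positive contributions split the output relevance in the ratio 0.5 : 1.5, so the input relevances sum to the output relevance.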
LRP scores computed in the input space of neural networks have been shown to perform well on metrics regarding the ordering of input space regions according to the computed explanation scores and the correlation of this ordering to changes in model output logits [68, 65, 4] when modifying the highest scoring regions.
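This appendix states that batchnorm layers are fused into the convolution layers before LRP is applied. A minimal sketch of such a folding for a single output channel is given below, assuming the standard inference-time batchnorm formula with running statistics. The helper names (`fuse_batchnorm`, `affine`) and all numbers are hypothetical and are not taken from the released code.

```python
import math


def fuse_batchnorm(weight, bias, gamma, beta, mean, var, eps=1e-5):
    """Fold y = gamma * (z - mean) / sqrt(var + eps) + beta into z = w.x + b.

    `weight` is the flattened kernel of one output channel; the remaining
    arguments are that channel's bias and batchnorm parameters.
    """
    scale = gamma / math.sqrt(var + eps)
    fused_weight = [w * scale for w in weight]
    fused_bias = (bias - mean) * scale + beta
    return fused_weight, fused_bias


def affine(weight, bias, x):
    """Weighted sum plus bias, i.e. one output position of a convolution."""
    return sum(w * v for w, v in zip(weight, x)) + bias


# Toy check (hypothetical values): both paths must give the same output.
x = [1.0, -2.0, 3.0]
weight, bias = [0.2, 0.4, -0.1], 0.3
gamma, beta, mean, var = 1.5, -0.2, 0.1, 4.0

z = affine(weight, bias, x)
reference = gamma * (z - mean) / math.sqrt(var + 1e-5) + beta
fused_weight, fused_bias = fuse_batchnorm(weight, bias, gamma, beta, mean, var)
fused = affine(fused_weight, fused_bias, x)
```

After folding, the batchnorm layer can be replaced by an identity, which is consistent with the identity backward pass described for the reset batchnorm layers.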
Appendix 0.B Computational Complexity of FF-RS / LRP-max
Both FF-RS and LRP-max require an additional forward and backward pass during computation. We emphasize that FF-RS and LRP-max are not used during training, and are only used for analysis / interpretability. Therefore, the computational overhead is not substantial. All our experiments were performed using a single Nvidia RTX 3090 GPU.
Appendix 0.C Non Color-conditional T-FF
There are a few T-FF that are not color-conditional. In this section, we show non color-conditional T-FF. We show LRP-max response image regions for ResNet-50 and EfficientNet-B0 in Fig. 0.C.1 and 0.C.3 respectively. We further show the maximum spatial activation distributions before and after color ablation for ResNet-50 and EfficientNet-B0 in Fig. 0.C.2 and 0.C.4 respectively. As one can observe using LRP-max response image regions, these non color-conditional T-FF contain frequency / texture artifacts. The maximum spatial activation distributions clearly show that these non color-conditional T-FF produce identical / similar distributions before and after color ablation.
[Figures 0.C.1–0.C.4: images omitted. Columns in each figure: ProGAN [37] | StyleGAN2 [41] | StyleGAN [40] | BigGAN [10] | CycleGAN [97] | StarGAN [18] | GauGAN [62]]
Appendix 0.D Hyper-parameter k in top-k for T-FF
In this section, we include more discussion regarding the hyper-parameter k in top-k. We show that as we increase k, AP and GAN detection accuracies drop across ProGAN [37] and all unseen GANs [41, 40, 10, 97, 18, 62]. For our analysis, we identify the smallest k with a substantial drop in cross-model forensic transfer as indicated by AP and GAN detection accuracies. The results for ResNet-50 and EfficientNet-B0 detectors are shown in Table 0.D.1.
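The selection criterion above can be sketched as a simple search over candidate values of k. This is an illustrative sketch only: the function name, the threshold `min_drop` used to decide what counts as a "substantial" drop, and the toy accuracies are all hypothetical and are not values from Table 0.D.1.

```python
def smallest_k_with_drop(baseline, metric_by_k, min_drop):
    """Return the smallest k whose metric falls at least `min_drop` below baseline.

    `metric_by_k` maps each candidate k to the metric (e.g. AP or GAN
    detection accuracy) measured with that k. Returns None if no
    candidate reaches the required drop.
    """
    for k in sorted(metric_by_k):
        if baseline - metric_by_k[k] >= min_drop:
            return k
    return None


# Made-up accuracies for illustration; not results from the paper.
toy_accuracy = {1: 0.98, 2: 0.95, 4: 0.60, 8: 0.20}
chosen_k = smallest_k_with_drop(baseline=0.99, metric_by_k=toy_accuracy, min_drop=0.30)
```

With these toy numbers the search returns k = 4, the first candidate whose accuracy is at least 0.30 below the baseline.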
Appendix 0.E Cross-model forensic transfer using BigGAN [10] pre-training dataset
In this section, we show that color is a critical T-FF using an additional training dataset. We use a BigGAN real / fake dataset with 1.04M images as a second dataset to train universal detectors following Wang et al. [87] and verify our findings. We remark that ForenSynths [87] uses a ProGAN real / fake dataset. We perform large-scale experiments using the EfficientNet-B0 universal detector. We report median counterfeit probability results for all 7 GANs [37, 41, 40, 10, 97, 18, 62] in Fig. 0.E.1. On this second dataset, color ablation again causes the counterfeit probability to drop for all unseen GANs. These results further support that color is a critical T-FF in universal detectors for counterfeit detection.
[Figure 0.E.1: image omitted. Columns: BigGAN [10] | ProGAN [37] | StyleGAN2 [41] | StyleGAN [40] | CycleGAN [97] | StarGAN [18] | GauGAN [62]]
Appendix 0.F Is the performance degradation in universal detectors due to unseen corruptions (OOD)?
We remark that some performance degradation is due to CNNs’ poor generalization to unseen corruptions / OOD inputs (grayscale). Here we show, however, that a significant amount of the degradation arises because color is a critical transferable forensic feature (T-FF) in the universal detector, so that ablating color (i.e., converting to grayscale) degrades performance significantly. Specifically, we perform an experiment using the official EfficientNet-B0 ImageNet classifier (architecture identical to our universal detector) under the Grayscale (OOD) setup. We measure the median probability of the correct class before and after Grayscale (OOD) and observe only a 17% drop due to Grayscale. Compared with this within-model OOD setup, the median probability drop during cross-model forensic transfer is much larger: 89% (ProGAN pre-training, Fig. 4) and 58% (BigGAN pre-training, Fig. 0.E.1) for the EfficientNet-B0 universal detector. This shows that color is more critical in forensic transfer than in within-model OOD setups. See row 1, col 1 in Fig. 4 and col 1 in Fig. 0.E.1 to verify that the median probability does not drop much under Grayscale (OOD) for the GAN used to train the universal detectors.
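The comparison above rests on two simple operations: removing color from an image and measuring how much the median predicted probability falls. A minimal sketch of both follows. It assumes ITU-R BT.601 luma weights for the grayscale conversion and a drop reported relative to the pre-ablation median; neither choice is confirmed by the text, and the probabilities are made up for illustration.

```python
from statistics import median


def to_grayscale(pixel):
    """Replace an (R, G, B) pixel by its luma, replicated over three channels.

    The BT.601 weights used here are an assumption of this sketch.
    """
    r, g, b = pixel
    y = 0.299 * r + 0.587 * g + 0.114 * b
    return (y, y, y)


def median_drop_percent(before, after):
    """Relative drop (in percent) of the median probability after ablation."""
    m_before = median(before)
    m_after = median(after)
    return 100.0 * (m_before - m_after) / m_before


# Made-up probabilities for illustration; not measurements from the paper.
prob_color = [0.9, 1.0, 0.8]
prob_gray = [0.1, 0.2, 0.05]
drop = median_drop_percent(prob_color, prob_gray)
gray_pixel = to_grayscale((255, 255, 255))
```

Keeping three identical channels after the conversion lets the ablated image be fed to a detector that expects RGB input without changing its first layer.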
Appendix 0.G Color-conditional T-FF (Additional Results)
In this section, we show more color-conditional T-FF to support our finding that color is a critical T-FF. We show LRP-max response image regions for ResNet-50 and EfficientNet-B0 in Fig. 0.G.1 and 0.G.3 respectively. We further show the maximum spatial activation distributions before and after color ablation for these color-conditional T-FF in Fig. 0.G.2 (ResNet-50) and Fig. 0.G.4 (EfficientNet-B0).
[Figures 0.G.1–0.G.4: images omitted. Columns in each figure: ProGAN [37] | StyleGAN2 [41] | StyleGAN [40] | BigGAN [10] | CycleGAN [97] | StarGAN [18] | GauGAN [62]]
Appendix 0.H CR-Universal Detectors (Additional Results)
We show the AP, real and GAN detection accuracies for the universal detectors in Table 0.H.1 and for the CR-universal detectors trained using our proposed data augmentation scheme in Table 0.H.2. As one can observe, our proposed CR-universal detectors are more robust to attacks using color-ablated counterfeits than the original detectors proposed by Wang et al. [87].
[Tables 0.H.1 and 0.H.2: values omitted. Columns: ProGAN [37] | StyleGAN2 [41] | StyleGAN [40] | BigGAN [10] | CycleGAN [97] | StarGAN [18] | GauGAN [62]]
Appendix 0.I Pixel-wise explanations are not informative to discover T-FF (Additional Results)
In this section, we show additional results to demonstrate that direct pixel-wise explanations of universal detector decisions are not informative to discover T-FF. As in the main paper, we use two popular interpretation methods, namely Guided-GradCAM [72] and LRP [9], to analyse the pixel-wise explanations of universal detector decisions. We show additional results for the ResNet-50 detector in Fig. 0.I.1. We also show results for EfficientNet-B0 in Fig. 0.I.2 and 0.I.3. As one can observe from Fig. 0.I.1, 0.I.2 and 0.I.3, pixel-wise explanations of universal detector decisions are not informative to discover T-FF due to their focus on spatial localization.
[Figures 0.I.1–0.I.3: images omitted. Columns in each figure: ProGAN [37] | StyleGAN2 [41] | StyleGAN [40] | BigGAN [10] | CycleGAN [97]. Rows in each figure: input image; pixel-wise explanations of universal detector decisions [87] using Guided-GradCAM (GGC) [72] and LRP [5]; pixel-wise explanations of ImageNet classifier decisions using Guided-GradCAM (GGC) [72] and LRP [5].]
Appendix 0.J Research Reproducibility / Code Details
Code: PyTorch code is available at https://keshik6.github.io/transferable-forensic-features/. Refer to the README for step-by-step instructions. The codebase is clearly documented. The code is structured as follows:
- lrp/: Base PyTorch module containing LRP implementations for ResNet and EfficientNet architectures. This includes all PyTorch wrappers.
- fmap_ranking/: PyTorch module to calculate FF-RS for counterfeit detection.
- sensitivity_assessment/: PyTorch module to perform sensitivity assessments for T-FF and color ablation.
- patch_extraction/: PyTorch module to extract LRP-max response image regions for every T-FF.
- activation_histograms/: PyTorch module to calculate maximum spatial activation for images for every T-FF.
- utils/: Contains all utilities, helper functions and plotting functions.
Pre-trained models: All pretrained models can be found at https://keshik6.github.io/transferable-forensic-features/. We provide both ResNet-50 and EfficientNet-B0 pretrained universal detectors. We also include CR-universal detector models. All claims reported in the main paper and the Supplementary can be reproduced using these checkpoints.
Docker information: For training / analysis in containerised environments (HPC, super-computing clusters), please use the nvcr.io/nvidia/pytorch:20.12-py3 container.
Experiment details and hyper-parameters: For training universal detectors, we use the exact setup proposed in [87], with the Adam optimizer and a batch size of 64; the optimizer coefficients and the initial learning rate follow [87]. For data augmentation, we also use the exact setup proposed in [87], which includes random cropping (224x224), random horizontal flip and 50% JPEG + blurring. All experiments were repeated 3 times. For LRP, we use the rules described in Appendix 0.A. For statistical tests, we use Mood’s median test.
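For readers unfamiliar with Mood’s median test, a minimal two-sample sketch in plain Python follows. It builds the 2x2 table of counts above / not above the pooled median and applies a chi-square test with one degree of freedom, without continuity correction. The function name, the tie handling (ties counted as "not above"), the toy samples, and the threshold `alpha` are assumptions of this sketch and do not reproduce the exact procedure or significance level used in the experiments.

```python
import math


def moods_median_test(sample_a, sample_b):
    """Two-sample Mood's median test; returns (chi2 statistic, p-value)."""
    pooled = sorted(list(sample_a) + list(sample_b))
    n = len(pooled)
    mid = n // 2
    grand_median = pooled[mid] if n % 2 else 0.5 * (pooled[mid - 1] + pooled[mid])

    # Count the values strictly above the pooled median in each sample.
    a_above = sum(v > grand_median for v in sample_a)
    b_above = sum(v > grand_median for v in sample_b)
    table = [
        [a_above, b_above],
        [len(sample_a) - a_above, len(sample_b) - b_above],
    ]
    row_totals = [sum(row) for row in table]
    col_totals = [len(sample_a), len(sample_b)]
    if 0 in row_totals or 0 in col_totals:
        # Degenerate table: no evidence of different medians.
        return 0.0, 1.0

    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    # Survival function of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Made-up activation values for illustration; not data from the paper.
before = [0.9, 0.8, 0.95, 0.85, 0.7]
after = [0.1, 0.2, 0.15, 0.3, 0.25]
chi2, p_value = moods_median_test(before, after)
alpha = 0.05  # placeholder threshold for this sketch only
medians_differ = p_value < alpha
```

In practice a library routine such as `scipy.stats.median_test` may be preferable, since it handles more than two samples and offers tie-handling and continuity-correction options.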
Appendix 0.K Future Work: Can we identify globally relevant channels for counterfeit detection in a Generator?
This section serves to motivate future directions from an image synthesis perspective. Particularly, we ask the question as to whether it’s possible to identify feature maps in GANs that are responsible for generating forensic features that are detected by universal detectors.
In this section, we show preliminary results suggesting that it is possible to identify such globally relevant channels in a generator. In particular, we propagate LRP all the way into the generator to identify the highest-scoring GAN channels that are responsible for counterfeit detection (i.e., in the computational graph, the image is generated by a pre-trained ProGAN [37] model). We show that ablating these top-scoring GAN channels results in a large drop in the probability predicted by the universal detector (we use the publicly released ResNet-50 in this experiment). Fig. 0.K.1 shows that propagating LRP into the generator is able to identify the globally top-5% relevant channels across images. The box plot (A) shows a strong decrease after ablating these high-scoring GAN channels (though the ablated GAN samples have poor visual quality). This can be compared to (B), where 5% of randomly selected GAN channels are ablated, which results in only a very small decrease in counterfeit detection scores. These results show promising directions for understanding image synthesis methods, and we hope to explore this area in future work. We also hope to explore the properties of fair generative models [88, 69, 80, 17, 79], GANs / detectors trained using different techniques (regularization, knowledge transfer, pruning, few-shot learning, self-supervised learning) [13, 2, 89, 34] and face-forgery detectors [43, 16, 73, 93, 95, 47, 31].
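The comparison between panels (A) and (B) can be pictured as two ways of choosing which generator channels to ablate. The sketch below selects the top 5% of channels by a relevance score, draws a random 5% as the baseline, and ablates the chosen channels by zeroing them. Zeroing as the ablation operation, the helper names, and the toy scores are assumptions of this sketch; the actual experiment operates on a pre-trained ProGAN.

```python
import random


def top_fraction(scores, fraction):
    """Indices of the highest-scoring `fraction` of channels (at least one)."""
    k = max(1, round(fraction * len(scores)))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


def random_fraction(num_channels, fraction, seed=0):
    """Indices of a uniformly random `fraction` of channels (baseline)."""
    k = max(1, round(fraction * num_channels))
    return sorted(random.Random(seed).sample(range(num_channels), k))


def ablate(activations, channels):
    """Zero the listed channels; `activations` is a list of per-channel lists."""
    drop = set(channels)
    return [
        [0.0] * len(values) if index in drop else list(values)
        for index, values in enumerate(activations)
    ]


# Hypothetical relevance scores for 40 channels.
scores = [0.01 * i for i in range(40)]
relevant = top_fraction(scores, 0.05)
baseline = random_fraction(len(scores), 0.05)

toy_activations = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
ablated = ablate(toy_activations, [1])
```

Using the same number of channels for both selections keeps the comparison fair: any difference in the detector's output is then attributable to which channels were removed rather than to how many.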