A comprehensive evaluation of the models' performance reveals distinct trends in loss and accuracy, summarized in Table 1. The custom model, built with separable convolution layers, attains the highest validation and training accuracy among the models assessed. It notably surpasses both InceptionNetV3 and ResNet50V2, achieving a validation accuracy that stands out significantly, as shown in Fig. 6. These results are mirrored in its training accuracy, where the proposed customized model displays robust learning capability.
Table 1
Accuracy and Loss of Deep Learning Models
| Model | Validation Accuracy | Training Accuracy | Training Loss | Validation Loss |
|---|---|---|---|---|
| Custom model | 0.9000 | 0.9688 | 0.0137 | 0.1043 |
| InceptionNetV3 | 0.7148 | 0.9240 | 0.5184 | 0.4958 |
| ResNet50V2 | 0.6716 | 0.7196 | 0.5068 | 0.5854 |
| MobileNetV2 | 0.7727 | 0.9026 | 0.5718 | 0.6068 |
Furthermore, the custom model excels in minimizing both training and validation loss, demonstrating its effectiveness in image tampering detection. In contrast, InceptionNetV3, though renowned for its feature extraction prowess [6], achieves only 71.48% validation accuracy, as shown in Fig. 3. This is underscored by its comparatively high training and validation losses of 0.5184 and 0.4958, respectively, indicating difficulty in fitting the task precisely. The ResNet50V2 model, known for its depth and skip connections [5], reaches 67.16% validation accuracy, further emphasizing the advantages of customization, as shown in Fig. 4. Its training accuracy falls short of expectations, and its training and validation losses remain relatively high, indicating a challenge in capturing the intricate features essential for tampering detection.
MobileNetV2 [7], shown in Fig. 5, achieves competitive validation and training accuracies of 77.27% and 90.26%, respectively, but lags slightly in validation loss, which may be attributed to its inherent trade-offs in computational efficiency. Nonetheless, the findings underline the superior performance of the proposed custom model, which successfully combines efficient feature learning with faster convergence, showcasing its potential as a promising solution for image tampering detection.
In the evaluation of image tampering detection models, precision, recall, and F1 score serve as crucial performance metrics. Here, the results highlight the exceptional performance of the custom CNN, which achieves a harmonious balance between precision and recall. The custom model attained an F1 score of 0.9622, indicating its proficiency both in identifying tampered regions within images and in minimizing false positives, as reported in Table 2.
Table 2
Precision, Recall and F1 Score of Deep Learning Models

| Model | Precision | Recall | F1 Score |
|---|---|---|---|
| InceptionNetV3 | 0.8586 | 0.8541 | 0.8538 |
| ResNet50V2 | 0.7409 | 0.7263 | 0.7278 |
| MobileNetV2 | 0.8153 | 0.8158 | 0.8154 |
| Custom model | 0.9562 | 0.9683 | 0.9622 |
The proposed customized model demonstrates a strong F1 score of 0.9622, indicating well-balanced precision and recall. In comparison, InceptionNetV3 achieves an F1 score of 0.8538, ResNet50V2 records 0.7278, and MobileNetV2 attains 0.8154. These scores capture the trade-off between precision and recall for each model and offer insight into their respective strengths in identifying tampered regions within images, positioning the custom model as an asset in practical applications. The analysis also makes evident that training time significantly affects each model's practical utility. As shown in Fig. 7, the proposed CNN outshines the competition with a training time of 573.4 seconds for 20 epochs, a substantial improvement over the other models. In comparison, MobileNetV2 required 2250.5 seconds, InceptionNetV3 took 2413.5 seconds, and ResNet50V2 demanded 2220.5 seconds for the same number of epochs. Particularly striking is that the proposed model, despite having a similar number of parameters to MobileNetV2, reduced the training time by nearly 75%.
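As a quick consistency check, the reported F1 scores follow the harmonic-mean relation F1 = 2PR/(P + R). The sketch below, in plain Python, reproduces the custom model's F1 from its precision and recall in Table 2 (small deviations for the other models would be expected if the reported figures are macro-averages over classes):

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)

# Precision and recall for the custom model, taken from Table 2
precision, recall = 0.9562, 0.9683
f1 = f1_score(precision, recall)
print(round(f1, 4))  # 0.9622, matching the reported F1 score
```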
This reduction in training time showcases the efficiency of the proposed customized CNN, thanks in part to the separable convolution layers, which expedite the learning process and enhance overall training efficiency. An evaluation of the model parameters further underscores the efficiency and compactness of the proposed architecture. With a lean parameter count of 5,450,988, the custom model strikes an adept balance between complexity and performance, as shown in Fig. 8. By contrast, InceptionNetV3 and ResNet50V2 carry substantially larger parameter counts of 25,878,802 and 27,908,998, respectively, while MobileNetV2 is comparable at 5,047,298. This discrepancy in parameter sizes emphasizes the streamlined architecture of the custom model, demonstrating that strong performance need not require an excessive number of parameters. The judicious management of parameters not only contributes to computational efficiency but also underscores the model's potential for applications where resource constraints are a critical consideration.
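The parameter savings that separable convolutions provide can be illustrated with a back-of-the-envelope count. The layer sizes below are illustrative only, not the actual architecture:

```python
def conv2d_params(k: int, c_in: int, c_out: int) -> int:
    """Weights in a standard k x k convolution (bias omitted)."""
    return k * k * c_in * c_out

def separable_conv2d_params(k: int, c_in: int, c_out: int) -> int:
    """Depthwise k x k convolution followed by a 1x1 pointwise convolution."""
    depthwise = k * k * c_in
    pointwise = c_in * c_out
    return depthwise + pointwise

# Illustrative layer: 3x3 kernel mapping 128 channels to 256
std = conv2d_params(3, 128, 256)            # 294912
sep = separable_conv2d_params(3, 128, 256)  # 33920
print(std, sep, round(std / sep, 1))        # 294912 33920 8.7
```

The roughly 8.7x reduction at this layer size is consistent with the custom model's low overall parameter count relative to conventional convolutional backbones.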
4.1 Hyperparameter Optimization
Hyperparameter optimization is a critical component of the proposed research methodology. To ensure optimal fine-tuning of the custom CNN and the pretrained models (ResNet50V2, InceptionNetV3, and MobileNetV2) for image tampering detection, an extensive and systematic hyperparameter search was conducted. The search involved running hundreds of training sessions, each characterized by a unique combination of hyperparameters. Key parameters subjected to this optimization included learning rates, weight decay, dropout rates, and architecture-specific settings. Each of these hyperparameters plays a crucial role in the performance and behavior of a neural network, so determining the most suitable values was paramount. Systematically exploring the hyperparameter space provided insight into the intricate interplay between these parameters and the models' ability to generalize and adapt effectively to the task. This comprehensive approach helped ensure that each model reached its full potential; it not only enhanced the models' performance but also contributed to the reliability and robustness of the comparative analysis, providing a solid foundation for the findings and conclusions.
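The kind of systematic search described above can be sketched as a simple grid enumeration. The value ranges below are hypothetical, since the exact values explored are not listed in the text:

```python
from itertools import product

# Hypothetical search space -- the actual ranges used are not specified
learning_rates = [1e-2, 1e-3, 1e-4]
weight_decays = [0.0, 1e-4, 1e-5]
dropout_rates = [0.2, 0.3, 0.5]

search_space = list(product(learning_rates, weight_decays, dropout_rates))
print(len(search_space))  # 27 candidate configurations

def run_trial(lr, wd, dropout):
    """Placeholder for one training session; would return validation accuracy."""
    ...

# Selecting the best configuration would then be:
# best = max(search_space, key=lambda cfg: run_trial(*cfg))
```

In practice each configuration corresponds to one full training session, which is why the search described above required hundreds of runs.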
4.2 Optimization Techniques
In the proposed work, the choice of optimization techniques played a crucial role in fine-tuning the CNN architectures for image tampering detection. The Adam optimizer was used, renowned for its efficacy in training deep neural networks. Adam combines the benefits of momentum-based updates with adaptive, per-parameter learning rates, dynamically adjusting each parameter's step size for accelerated convergence and more effective training. To further improve training and keep the models from converging prematurely or getting stuck in local minima, a learning rate scheduling mechanism was implemented. Learning rate scheduling dynamically adjusts the learning rate during training: the mechanism monitored the model's performance and, upon detecting a plateau in accuracy, automatically decreased the learning rate.
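The plateau-based schedule described above behaves like the reduce-on-plateau callbacks found in common deep learning frameworks. Its core logic can be sketched in plain Python; the factor and patience values here are illustrative, not the ones used in the study:

```python
class PlateauScheduler:
    """Halve the learning rate when monitored accuracy stops improving."""

    def __init__(self, lr: float, factor: float = 0.5, patience: int = 3):
        self.lr, self.factor, self.patience = lr, factor, patience
        self.best, self.wait = float("-inf"), 0

    def step(self, val_accuracy: float) -> float:
        if val_accuracy > self.best:
            self.best, self.wait = val_accuracy, 0  # improvement: reset counter
        else:
            self.wait += 1
            if self.wait >= self.patience:
                self.lr *= self.factor  # plateau detected: decay the rate
                self.wait = 0
        return self.lr

sched = PlateauScheduler(lr=1e-3)
for acc in [0.70, 0.75, 0.76, 0.76, 0.76, 0.76]:  # accuracy plateaus
    lr = sched.step(acc)
print(lr)  # 0.0005 after three epochs without improvement
```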
4.3 Addressing Overfitting
Overfitting, a common hurdle in deep learning, occurs when a model tailors itself too closely to the training data, absorbing noise and irrelevant patterns that hinder its ability to generalize to unseen data. To tackle this issue, several strategies were employed to improve the models' performance on data they had not encountered before.

First, L1 regularization was applied, a vital tool in the arsenal against overfitting. L1 regularization adds a penalty proportional to the magnitude of the network's weights, encouraging sparsity and a more parsimonious set of features. In doing so, it steers the model toward the most relevant and informative features while discouraging overemphasis on noise and irrelevant details in the training data. This played a significant role in enhancing the model's performance on unseen data, a critical factor in image tampering detection, where adaptability and accuracy are paramount.

Second, dropout layers were incorporated at strategic points in the network architecture. These layers help prevent overfitting by randomly deactivating a fraction of neurons during training; the resulting randomness introduces a degree of uncertainty into the learning process and discourages the model from becoming overly reliant on specific features or neurons.

Finally, batch normalization [14] was a key component of the strategy to combat overfitting. Applied to standardize the input to each layer during training, it mitigates internal covariate shift [19], the phenomenon in which the distribution of layer inputs changes as training progresses.
Batch normalization also aided the models' generalization by ensuring that the learning process was not hindered by abrupt shifts in data distributions. It contributed to the overall stability and robustness of the models, which is paramount in the context of image tampering detection, where detecting subtle variations and manipulations is essential.
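The standardization step that batch normalization applies to each layer's inputs can be sketched with the standard library (omitting the learnable scale and shift parameters, gamma and beta, for brevity; a single feature column is shown and the sample values are illustrative):

```python
from statistics import fmean, pstdev

def batch_normalize(values: list[float], eps: float = 1e-5) -> list[float]:
    """Standardize one feature across the batch to zero mean, unit variance."""
    mean = fmean(values)
    # eps guards against division by zero for constant features
    std = (pstdev(values) ** 2 + eps) ** 0.5
    return [(v - mean) / std for v in values]

feature = [4.2, 5.9, 7.1, 3.3, 6.0, 5.5]  # one feature column across a batch
normed = batch_normalize(feature)
# After normalization the mean is ~0 and the standard deviation is ~1,
# regardless of the original scale and offset of the inputs.
```

Because each mini-batch is re-centered and re-scaled this way, downstream layers see inputs with a stable distribution, which is the mechanism behind the stability benefits described above.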