This section surveys the mechanisms proposed to detect tampering in videos recorded by CCTV cameras. Table 3 summarises the techniques and approaches proposed so far for detecting tampering in surveillance videos.
Table 3: Surveillance Videos Forgery Detection Approaches

Approaches and Techniques for Forgery Detection in Surveillance Videos:
- Sensor Pattern Noise Technique
- Gaussian Distribution
- Residual Gradient and Optical Flow Gradient
- Residual Frames
- Optical Flow Gradient and Residual Analysis
- Feature Extraction
- WiFi Signals
- Temporal Domain
- Capsule Network
- Secure-Pose
- Similarity Analysis
- Deep Learning
- Radio-Frequency (RF) Signal
4.1 Sensor Pattern Noise Technique
Sensor Pattern Noise (SPN) [93] and resampling estimation [94] techniques have been proposed to identify forgeries in surveillance footage. Minimum Average Correlation Energy - Mellin Radial Harmonic (MACE-MRH) correlation filters can detect upscale-crop, partial manipulation, and video alteration forgeries by exploiting their invariance and scaling tolerance. This approach is also used to identify the source camera: in the first stage, the source camera of a given video is recognised; in the second stage, the scaling factor and correlation coefficient are used to identify tampering in the video. The method performed markedly better on videos of static scenes, and it produced significantly superior results compared to Chen's method [95] (i.e., 15% higher accuracy, particularly when the scaling factor for infrared video is 1.8) [96].
The previously mentioned approach has been improved by exploiting the scaling tolerance of a Minimum Average Correlation Energy - Mellin Radial Harmonic (MACE-MRH) correlation filter to reliably reveal video upscale-crop forgery and recognise partially altered regions. Since resampling introduces specific statistical correlations into the content, its presence can be determined by checking for these correlations. The Sensor Pattern Noise (SPN) [97] was used as a forensic feature, and the differences between the reference SPN and the SPN of upscaled frames were examined in terms of their correlation characteristics. The approach was evaluated on a total of 1920 fabricated sequences constructed from 120 self-recorded RGB and infrared H.264-encoded test videos. As long as the scale and quality parameters were regularly checked and adjusted, this method achieved a True Negative Rate (TNR) of 100% and a True Positive Rate (TPR) greater than 98%. For partial-manipulation detection, a detection accuracy of 100% for dynamic-scene videos and 94.2% to 100% for static-scene videos was recorded for region sizes between 100 and 150 square pixels. The technique proved reliable for compressed videos in addition to RGB and infrared videos, and it works with videos of both dynamic and static scenes captured by moving and stationary cameras [98].
4.2 Gaussian Distribution
In the optical-flow-based forgery detection approach, the probability distributions of optical-flow variations in unaltered surveillance videos were modelled with a Gaussian distribution. Any irregularity in the flow fluctuations was treated as an anomaly, and a statistical inference test (Grubbs' test) was used to assign an anomaly score to the optical-flow patterns of each test video; the score reflects the degree to which the pattern behaves anomalously. Finally, to detect inter-frame forgeries, three cut-off levels (one each for frame insertion, frame deletion, and frame duplication) were applied to the anomaly score to flag abnormalities. The technique was assessed on 160 test clips, all produced from two original MPEG-2-encoded videos taken from TRECVID's [99] surveillance event detection data set. The detection accuracies for frame deletion, insertion, and duplication were 75%, 85%, and 82.5%, respectively, and the reported forgery-localisation accuracies were 96.9%, 100%, and 86.2%, respectively [100].
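The anomaly-scoring step can be sketched as follows. The flow-variation series, the 2.0 cut-off, and the function name are illustrative assumptions, not the authors' implementation:

```python
import numpy as np

def grubbs_score(values):
    """Grubbs' test statistic: the largest absolute deviation from the
    mean, measured in units of the sample standard deviation."""
    values = np.asarray(values, dtype=float)
    return np.max(np.abs(values - values.mean())) / values.std(ddof=1)

# Hypothetical optical-flow variation series: smooth for a genuine clip,
# with one spike where frames were deleted (values are illustrative).
genuine  = [1.00, 1.10, 0.90, 1.05, 1.00, 0.95, 1.02]
tampered = [1.00, 1.10, 0.90, 1.05, 5.00, 1.00, 0.95]

CUTOFF = 2.0  # assumed threshold; the paper uses one cut-off per forgery type
print(grubbs_score(genuine) > CUTOFF)   # False
print(grubbs_score(tampered) > CUTOFF)  # True
```

In the paper's setting, the series would come from frame-to-frame optical-flow sums, and the three cut-offs would be tuned per forgery type.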
4.3 Residual Gradient and Optical Flow Gradient
For H.264- and MPEG-2-encoded videos, a detection technique for inter-frame forgeries employing the prediction residual gradient and optical flow gradient has been presented. A hybrid technique based on motion and brightness-gradient characteristics identifies forgeries by detecting variations between neighbouring frames, notably for hand-held mobile recordings and surveillance footage. Using the spike count, regardless of the number of frames in the video, the proposed technique automatically detects video manipulation and achieved an accuracy of 83% [101].
4.4 Residual Frames
For the detection and localisation of inter-frame duplication in digital video, another approach based on residual frames has been developed. To detect and locate frame-duplication forgeries, the entropy of the DCT coefficients of each residual frame, together with its standard deviation, is computed as a feature, and the similarity between pairs of feature vectors is assessed. The efficacy of this method was tested using positive predictive value (PPV), true positive rate (TPR), and F1 score. The technique detects inter-frame duplication tampering in an extremely short time, obtaining PPV: 98%, TPR: 99%, F1: 98% on the SULFA dataset and PPV: 97%, TPR: 98%, F1: 97% on the VIRAT dataset [102].
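A minimal sketch of this residual-frame pipeline follows. It assumes the feature vector pairs the DCT-coefficient entropy with the residual's standard deviation and that similarity is cosine-based; the paper's exact feature construction may differ, and all names are illustrative:

```python
import numpy as np

def dct2(x):
    """Orthonormal 2-D DCT-II of a square block via matrix multiplication."""
    n = x.shape[0]
    k = np.arange(n)[:, None]
    m = np.arange(n)[None, :]
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * m + 1) * k / (2 * n))
    c[0, :] = 1.0 / np.sqrt(n)
    return c @ x @ c.T

def residual_feature(prev_frame, frame, bins=32):
    """Feature vector for one residual frame: [DCT-coefficient entropy, std]."""
    residual = frame.astype(float) - prev_frame.astype(float)
    coeffs = np.abs(dct2(residual)).ravel()
    hist, _ = np.histogram(coeffs, bins=bins)
    p = hist[hist > 0] / hist.sum()
    entropy = -np.sum(p * np.log2(p))
    return np.array([entropy, residual.std()])

def cosine_similarity(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Duplicated frame pairs yield (near-)identical residuals, hence similarity ~1.
rng = np.random.default_rng(0)
f0 = rng.random((64, 64))
step = rng.random((64, 64))
f1, f2 = f0 + step, f0 + 2 * step  # f2 repeats f1's motion pattern
s = cosine_similarity(residual_feature(f0, f1), residual_feature(f1, f2))
print(round(s, 4))  # 1.0
```

A real detector would slide this comparison over all residual-frame pairs and flag runs of near-1 similarity as duplicated segments.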
4.5 Optical Flow Gradient and Residual Analysis
A forgery detection technique based on optical-flow-gradient characteristics and prediction-residual analysis has been described. The approach can detect and localise frame deletion, insertion, and duplication. When a video is altered, the temporal correlations between neighbouring frames are broken, which is what the researchers evaluate. A window-based scheme is used to locate the forgery. The method is optimised for the H.264 and MPEG-2 codecs and is 83% accurate for both slow- and fast-motion video [103].
4.6 Feature Extraction
A feature extraction and novel-point localisation technique has been proposed. In the feature-extraction phase, the 2-D phase congruency of each frame was computed, as it is a desirable image property, and the correlation between adjacent frames was then determined. In the second step, abnormal points were identified using a clustering technique (k-means), dividing the points into normal and abnormal groups. The average accuracy is 97.08% on the first dataset [104] and 93.13% on the second dataset [105] [106].
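The clustering step can be illustrated with a tiny two-cluster k-means over per-frame correlation scores; the scores and helper names below are hypothetical:

```python
import numpy as np

def kmeans_two_clusters(values, iters=20):
    """Minimal 1-D k-means with k=2: splits points into 'normal' and
    'abnormal' clusters, as the paper's second step does."""
    values = np.asarray(values, dtype=float)
    centers = np.array([values.min(), values.max()])
    labels = np.zeros(len(values), dtype=int)
    for _ in range(iters):
        labels = np.abs(values[:, None] - centers[None, :]).argmin(axis=1)
        for k in (0, 1):
            if np.any(labels == k):
                centers[k] = values[labels == k].mean()
    return labels

# Hypothetical inter-frame phase-congruency correlations: a sharp dip
# marks the abnormal point where frames were tampered with.
corr = [0.95, 0.96, 0.94, 0.97, 0.31, 0.95, 0.96]
labels = kmeans_two_clusters(corr)
print(labels.tolist())  # [1, 1, 1, 1, 0, 1, 1] — index 4 isolated as abnormal
```

In practice a library implementation (e.g. scikit-learn's KMeans) would be used; the hand-rolled loop is only to make the grouping explicit.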
4.7 WiFi Signals
Wi-Fi signals have been shown to be useful for revealing video-looping attacks on surveillance systems. Earlier work relies on handcrafted event-level timing and frequency features from time-series Wi-Fi and camera data, resulting in slow reaction times and an inability to perform fine-grained forgery localisation; consequently, none of the existing solutions simultaneously meet the real-time and fine-grained requirements of forgery detection and localisation in video surveillance systems. SurFi analyses event-level timing information from Wi-Fi and camera data to detect camera-looping attacks. It utilises existing Wi-Fi infrastructure (requiring no additional hardware or deployment cost) to extract channel state information (CSI), which is then analysed and correlated with the video signal to identify discrepancies. SurFi can identify attacks with up to 95.1% accuracy [107].
4.8 Temporal Domain
Another approach for detecting inter-frame forgery (i.e., frame deletion, insertion, and shuffling), in which the manipulation takes place in the temporal domain, has been presented. It uses the universal image quality index (UQI) of temporal averages (TP) of non-overlapping neighbouring frame subsequences to detect illegitimate actions in an exceptionally short amount of time. Individual frames are collected from the security camera's directly captured footage, and the TP of each subsequence is computed. Owing to the consistency and regularity of the video, the UQI of every two adjacent TP images is used to flag unusual activity as a forgery candidate: if frames have been deleted, inserted, or shuffled, the similarity decreases, and the Q values at the boundary of the doctored clip are lower than those of other clips. Finally, the minimum Q value among the frames of candidate TPs and their neighbours is used to locate the inter-frame attacks. For frame deletion, the UQI method achieves a Precision, Recall, and F1 score of 0.98, 0.99, and 0.98, respectively; for frame insertion, 0.99, 0.99, and 0.99; and for frame shuffling, 0.96, 0.97, and 0.96. On all three evaluation criteria, the method outperformed the techniques in [108]–[111] in Precision, Recall, and F1 score. Moreover, it has the shortest execution time among them because it compares the temporal averages of non-overlapping frame subsequences rather than examining each frame individually [112].
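The UQI comparison of adjacent temporal averages can be sketched as below. The frame shapes, subsequence length, and synthetic "splice" are illustrative assumptions; a real pipeline would read frames from the camera feed:

```python
import numpy as np

def temporal_average(frames):
    """TP image: pixel-wise mean over a subsequence of frames."""
    return np.mean(np.stack(frames), axis=0)

def uqi(x, y):
    """Universal image quality index Q of two images (1.0 = identical)."""
    x, y = x.astype(float).ravel(), y.astype(float).ravel()
    mx, my = x.mean(), y.mean()
    vx, vy = x.var(), y.var()
    cov = np.mean((x - mx) * (y - my))
    return (4 * cov * mx * my) / ((vx + vy) * (mx ** 2 + my ** 2))

rng = np.random.default_rng(1)
background = rng.random((32, 32)) + 1.0  # static scene
clip = [background + 0.01 * rng.standard_normal((32, 32)) for _ in range(8)]
tp_a = temporal_average(clip[:4])
tp_b = temporal_average(clip[4:])
tp_c = tp_b + 3.0  # hypothetical splice: content with different luminance

print(uqi(tp_a, tp_b) > 0.99)             # True: genuine adjacent TPs agree
print(uqi(tp_a, tp_b) > uqi(tp_a, tp_c))  # True: the splice lowers Q
```

The detector would scan all adjacent TP pairs and localise the attack at the subsequence whose Q value drops below its neighbours.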
To detect frame duplication, the temporal average of each shot was employed instead of all frames. Grey-level co-occurrence matrix (GLCM) features were extracted as feature vectors, and the similarity between adjacent vectors was used to detect duplication. Despite the inclusion of post-processing operations with high false positives caused by weak boundaries of duplicated frames, the technique obtained an accuracy rate of 95% to 99% with a low running time. Without post-processing, the accuracy rates for frame duplication with shuffling (FDS) and frame duplication (FD) were 94% and 99%, respectively. The technique was evaluated on the SULFA [105] and LASIESTA datasets [113].
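A hand-rolled sketch of the GLCM feature step follows, restricted to the horizontal distance-1 offset (a full implementation would use several offsets, e.g. scikit-image's graycomatrix). The quantisation level and the contrast/energy/homogeneity feature set are assumptions:

```python
import numpy as np

def glcm(image, levels=8):
    """Normalised grey-level co-occurrence matrix for the (0 deg, distance-1)
    horizontal offset of a quantised image."""
    q = np.floor(image / image.max() * (levels - 1)).astype(int)
    m = np.zeros((levels, levels))
    for a, b in zip(q[:, :-1].ravel(), q[:, 1:].ravel()):
        m[a, b] += 1
    return m / m.sum()

def glcm_features(m):
    """Contrast, energy, and homogeneity of a GLCM."""
    i, j = np.indices(m.shape)
    contrast = np.sum(m * (i - j) ** 2)
    energy = np.sum(m ** 2)
    homogeneity = np.sum(m / (1.0 + np.abs(i - j)))
    return np.array([contrast, energy, homogeneity])

rng = np.random.default_rng(2)
shot_avg = rng.random((32, 32)) * 255  # temporal average of one shot
dup_avg = shot_avg.copy()              # duplicated shot: identical average
f1, f2 = glcm_features(glcm(shot_avg)), glcm_features(glcm(dup_avg))
print(np.allclose(f1, f2))  # True: matching features expose the duplicate
```

Adjacent shots whose feature vectors are (near-)identical are flagged as duplication candidates.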
A technique for identifying video tampering based on sensor pattern noise in video frames has been presented. Noise patterns were obtained by denoising the video frames and then averaged to estimate the sensor noise pattern. The sensor noise patterns were analysed using a locally adaptive Discrete Cosine Transform (DCT). To determine whether a video was genuine or forged, the correlation of the noise residues across video frames was calculated. The method was evaluated on a dataset of noise patterns and yielded satisfactory results, although the findings depend on the physical specifications of the source device. The model's accuracy is 96.6% [114].
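The noise-residual correlation check can be sketched as follows, with a crude mean filter standing in for the paper's locally adaptive DCT-based denoising; the simulated sensor pattern and all thresholds are illustrative:

```python
import numpy as np

def denoise(frame, k=3):
    """Crude k-by-k mean filter; a stand-in for the paper's locally
    adaptive DCT-based denoising."""
    pad = k // 2
    p = np.pad(frame.astype(float), pad, mode="edge")
    out = np.empty_like(frame, dtype=float)
    for i in range(frame.shape[0]):
        for j in range(frame.shape[1]):
            out[i, j] = p[i:i + k, j:j + k].mean()
    return out

def noise_residual(frame):
    return frame.astype(float) - denoise(frame)

def correlation(a, b):
    a = a.ravel() - a.mean()
    b = b.ravel() - b.mean()
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

rng = np.random.default_rng(3)
spn = rng.standard_normal((24, 24))  # simulated sensor pattern noise
frames = [rng.random((24, 24)) * 10 + spn for _ in range(5)]
reference = np.mean([noise_residual(f) for f in frames], axis=0)

own = correlation(noise_residual(frames[0]), reference)      # same camera
foreign = correlation(noise_residual(rng.random((24, 24)) * 10), reference)
print(own > foreign)  # True: low correlation suggests foreign/tampered frames
```

A frame whose residual correlates poorly with the camera's reference pattern is treated as potentially inserted from another source.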
4.9 Capsule Network
Based on capsule networks, a new digital forensic method for identifying object-based forgery in surveillance recordings has been developed. Intra-frame and inter-frame statistical features of the video sequence, derived from the motion residual computed for each frame, serve as input to the capsule network. The experimental results demonstrate that the proposed method achieves strong performance in Video Detection Accuracy (VDA), Authentic Frame Detection Accuracy (AFDA), Forged Frame Detection Accuracy (FFDA), and Double-compressed Frame Detection Accuracy (DFDA) across different bit rates and dataset resolutions, regardless of the group-of-pictures length and degree of video compression. With a 3 Mbps bit rate and 1280×720 resolution, for example, VDA: 100%, AFDA: 99.30%, DFDA: 97.94%, and FFDA: 84.97%. For a 1.5 Mbps bit rate and 1280×720 resolution, VDA is 99.99%, AFDA is 98.64%, DFDA is 96.12%, and FFDA is 81.05%. For a 3 Mbps bit rate and 640×360 resolution, the accuracies are VDA: 100%, AFDA: 98.95%, DFDA: 97.49%, and FFDA: 84.56% [115]. The reported VDA, DFDA, and FFDA results are the best compared to [116] and [117].
4.10 Secure-Pose
Secure-Pose, a novel cross-modal system that identifies and localises forgery attacks in each frame of live surveillance video, has been implemented. The authors generated their own dataset by collecting multimodal data over half an hour. For intra-frame attacks, Faster R-CNN is used to detect and cut out a human object before replacing it with the corresponding blank background segment. On their test data, the system achieved a forgery detection accuracy of 95% [118].
4.11 Similarity Analysis
The AIFDT-SV-BAS approach identifies inter-frame manipulation through a similarity analysis that is unaffected by whether the video contains a single scene or many. The method first examines the suspicious video for scene transitions; whenever the scene changes, it splits the video into multiple shots, which are then fed into a passive-blind technique based on similarity analysis [73]. If there is no scene change, the video is not split. Primarily, the histogram difference between two consecutive frames in the HSV colour space is used to detect forgeries; in addition, H-S and S-V colour histograms can identify further variations. AIFDT-SV-BAS was assessed on the CASIA 2 and NC 16 datasets using precision, recall, and accuracy metrics. Thanks to the scene-change recognition and video segmentation performed before checking for forgery, it significantly outperformed the benchmark [73], with a precision of 98.07%, a recall of 100%, and an accuracy of 99.1% [119].
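The consecutive-frame HSV histogram check can be sketched as follows, using Python's standard-library colorsys for the RGB-to-HSV conversion. The distance measure, bin count, and threshold are assumptions, not the paper's exact choices:

```python
import colorsys
import numpy as np

def hsv_histograms(frame_rgb, bins=16):
    """Concatenated H, S, V histograms of an RGB frame with values in [0, 1]."""
    hsv = np.array([colorsys.rgb_to_hsv(*px) for px in frame_rgb.reshape(-1, 3)])
    return np.concatenate(
        [np.histogram(hsv[:, c], bins=bins, range=(0.0, 1.0))[0] for c in range(3)]
    ).astype(float)

def histogram_difference(h1, h2):
    """Normalised L1 distance between consecutive-frame histograms."""
    return np.abs(h1 - h2).sum() / h1.sum()

rng = np.random.default_rng(4)
frame_a = rng.random((16, 16, 3))
frame_b = np.clip(frame_a + 0.005 * rng.standard_normal(frame_a.shape), 0, 1)
frame_c = np.zeros((16, 16, 3))  # abrupt change: solid-colour frame
frame_c[..., 0], frame_c[..., 1] = 0.9, 0.4

THRESHOLD = 0.5  # assumed scene-change / tamper threshold
print(histogram_difference(hsv_histograms(frame_a), hsv_histograms(frame_b)) < THRESHOLD)  # True
print(histogram_difference(hsv_histograms(frame_a), hsv_histograms(frame_c)) > THRESHOLD)  # True
```

Pairs whose difference exceeds the threshold mark either a legitimate scene transition (triggering shot splitting) or a forgery candidate for the subsequent similarity analysis.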
4.12 Deep Learning
A system for identifying inter-frame forgeries has been created that segments a video into shots and fuses spatial and temporal information to generate a single image per shot. A pre-trained 2D-CNN model is used for effective extraction of spatiotemporal features, and the structural similarity index (SSIM) is then used to construct deep-learning video features. Finally, a 2D-CNN and an RBF Multiclass Support Vector Machine (RBF-MSVM) detect temporal manipulation in the video. To evaluate inter-frame forgery detection, a dataset of 13,135 videos containing three types of forged videos under different conditions was created from original videos in the VIRAT, SULFA, LASIESTA, and IVY datasets. The method achieved TPRs of 0.987, 0.999, and 0.985 for frame deletion, insertion, and duplication, respectively [120].
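The SSIM feature at the core of this pipeline can be illustrated with a single-window (global) SSIM, a simplified form of the usual windowed index; the synthetic frames and constants are illustrative:

```python
import numpy as np

def ssim_global(x, y, dynamic_range=255.0):
    """Global (single-window) structural similarity index of two images."""
    c1 = (0.01 * dynamic_range) ** 2  # standard SSIM stabilising constants
    c2 = (0.03 * dynamic_range) ** 2
    x, y = x.astype(float), y.astype(float)
    mx, my = x.mean(), y.mean()
    vx, vy = x.var(), y.var()
    cov = np.mean((x - mx) * (y - my))
    return ((2 * mx * my + c1) * (2 * cov + c2)) / (
        (mx ** 2 + my ** 2 + c1) * (vx + vy + c2))

rng = np.random.default_rng(5)
frame = rng.integers(0, 256, size=(32, 32)).astype(float)
next_frame = np.clip(frame + rng.normal(0, 2, frame.shape), 0, 255)  # smooth motion
unrelated = rng.integers(0, 256, size=(32, 32)).astype(float)        # inserted frame

print(round(ssim_global(frame, frame), 6))                              # 1.0
print(ssim_global(frame, next_frame) > ssim_global(frame, unrelated))   # True
```

In the surveyed system, per-frame SSIM values over a shot feed the 2D-CNN and RBF-MSVM classifiers; a sudden SSIM drop between adjacent frames is the cue for insertion or deletion.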
4.13 Radio-Frequency (RF) Signals
Learning-based algorithms have been designed to detect video forgery attacks using radio-frequency (RF) signals, as an extended version of Secure-Pose [118]. The Secure-Pose method identifies camera-looping attacks by analysing event-level timing and frequency data derived from coexisting Wi-Fi and camera data; however, it cannot provide fast identification and precise localisation of forgeries. The enhanced RF-based approach identifies anomalous objects with a detection accuracy of 98.7% and correctly localises them during playback and tampering attacks [121].