Video Video is an Electronics, electronic medium for the recording, copying, playback, broadcasting, and display of moving picture, moving image, visual Media (communication), media. Video was first developed for mechanical television systems, whi ...

copy detection is the process of detecting illegally copied video s by analyzing them and comparing them to original content. The goal of this process is to protect a video creator's intellectual property.

History

Indyk et al. produced a video copy detection theory based on the length of the film; however, it worked only for whole films without modifications. When applied to short clips of a video, Idynk et al.'s technique does not detect that the clip is a copy. Later, Oostveen et al. introduced the concept of a ''fingerprint'', or ''hash function'', that creates a unique signature of the video based on its contents. This fingerprint is based on the length of the video and the brightness, as determined by splitting it into a grid. The fingerprint cannot be used to recreate the original video because it describes only certain features of its respective video. Some time ago, B.Coskun et al. presented two robust algorithms based on

discrete cosine transform A discrete cosine transform (DCT) expresses a finite sequence of data points in terms of a sum of cosine functions oscillating at different frequency, frequencies. The DCT, first proposed by Nasir Ahmed (engineer), Nasir Ahmed in 1972, is a widely ...

. Hampapur and Balle created an algorithm creating a global description of a piece of video based on the video's motion, color, space, and length. To look at the color levels of the image was thought, and for this reason, Li et al. created an algorithm that examines the colors of a clip by creating a binary signature get from the histogram of every frame. This algorithm, however, returns inconsistent results in cases in which a

logo A logo (abbreviation of logotype; ) is a graphic mark, emblem, or symbol used to aid and promote public identification and recognition. It may be of an abstract or figurative design or include the text of the name that it represents, as in ...

is added to the video, because the insertion of the logo's color elements adds false information that can confuse the system.

Techniques

Watermarks

Watermark A watermark is an identifying image or pattern in paper that appears as various shades of lightness/darkness when viewed by transmitted light (or when viewed by reflected light, atop a dark background), caused by thickness or density variations i ...

s are used to introduce an invisible signal into a video to ease the detection of illegal copies. This technique is widely used by

photographers A photographer (the Greek φῶς (''phos''), meaning "light", and γραφή (''graphê''), meaning "drawing, writing", together meaning "drawing with light") is a person who uses a camera to make photographs. Duties and types of photograp ...

. Placing a watermark on a video such that it is easily seen by an audience allows the content creator to detect easily whether the image has been copied. The limitation of watermarks is that if the original image is not watermarked, then it is not possible to know whether other images are copies.

Content-based signature

In this technique, a unique signature is created for the video on the basis of the video's content. Various video copy detection

algorithms In mathematics and computer science, an algorithm () is a finite sequence of mathematically rigorous instructions, typically used to solve a class of specific problems or to perform a computation. Algorithms are used as specifications for per ...

exist that use features of the video's content to assign the video a unique videohash. The fingerprint can be compared with other videohashes in a

database In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and a ...

. This type of algorithm has a significant problem: if various aspects of the videos' contents are similar, it is difficult for an algorithm to determine whether the video in question is a copy of the original or merely similar to it. In such a case (e.g., two distinct

news broadcast News broadcasting is the medium of broadcasting various news events and other information via television, radio, or the internet in the field of broadcast journalism. The content is usually either video production, produced local programming ...

s), the algorithm can return that the video in question is a copy as the news broadcast often involve similar kind of banner and presenter often sit in a similar position. Videos with very minimal changes in frames with respect to time are more vulnerable to hash collision.

Algorithms

The following are some algorithms and techniques proposed for video copy detection.

Global Descriptors

Global temporal descriptor

In this algorithm, a ''global intensity'' is defined as the sum of all intensities of all pixels weighted along all the video. Thus, an identity for a video sample can be constructed on the basis of the length of the video and the pixel intensities throughout. The global intensity ''a(t)'' is defined as:

a(t)=\sum_^N K(i)(I(i,t-1))^2

Where ''k'' is the weighting of the image, ''I'' is the image, and ''N'' is the number of pixels in the image.

Global ordinal measurement descriptor

In this algorithm, the video is divided in ''N'' blocks, sorted by

gray level In digital photography, computer-generated imagery, and colorimetry, a greyscale (more common in Commonwealth English) or grayscale (more common in American English) image is one in which the value of each pixel is a single sample represent ...

. Then it's possible to create a

vector Vector most often refers to: * Euclidean vector, a quantity with a magnitude and a direction * Disease vector, an agent that carries and transmits an infectious pathogen into another living organism Vector may also refer to: Mathematics a ...

describing the average gray level of each block. With these average levels it is possible to create a new vector ''S(t)'', the video's signature:

S(t)=(r_1, r_2, \cdots ,r_N)

To compare two videos, the algorithm defines a ''D(t)'' representing the similarity between both.

D(t)=\frac \sum_^ \beginR(i)-C(i) \end

The value returned by ''D(t)'' helps determine whether the video in question is a copy.

Ordinal and Temporal Descriptors

This technique was proposed by L.Chen and F. Stentiford. A measurement of dissimilarity is made by combining the two aforementioned algorithms, Global temporal descriptors and Global ordinal measurement descriptors, in

time and space In physics, spacetime, also called the space-time continuum, is a mathematical model that fuses the three dimensions of space and the one dimension of time into a single four-dimensional continuum. Spacetime diagrams are useful in visualizin ...

TMK+PDQF

In 2019, Facebook open sourced TMK+PDQF, part of a suite of tools used at Facebook to detect harmful content. It generates a signature of a whole video, and can easily handle changes in format or added watermarks, but is less tolerant of cropping or clipping.

Local Descriptors

AJ

Described by A. Joly et al., this algorithm is an improvement of Harris' Interest Points detector. This technique suggests that in many videos a significant number of frames are almost identical, so it is more efficient to test not every frame but just those depicting a significant amount of motion.

ViCopT

''ViCopT'' uses the interest points from each image to define a signature of the whole video. In every image, the algorithms identifies and defines two parts: the ''background'', a set of static elements along a temporal sequence, and the ''motion'', persistent points changing positions throughout the video.

Space Time Interest Points (STIP)

This algorithm was developed by I. Laptev and T.Lindeberg. It uses the interest points technique along the space and time to define the video signature, and creates a 34th-

dimension In physics and mathematics, the dimension of a mathematical space (or object) is informally defined as the minimum number of coordinates needed to specify any point within it. Thus, a line has a dimension of one (1D) because only one coo ...

vector that stores this signature.

Algorithm showcasing

There exist algorithms for video copy detection that are in use today. In 2007, there was an evaluation showcase known as the Multimedia Understanding Through Semantics, Computation and Learning (MUSCLE), which tested video copy detection algorithms on various video samples ranging from home video recordings to TV show segments ranging from one minute to one hour in length.

References

MUSCLE (Multimedia Understanding through Semantics, Computation and Learning)

* {{in lang, en Multimedia Video