Theraography

From Wikipedia, the free encyclopedia

Theraography is a content recognition technology based on the extraction of an information summary from a multimedia content. It can be applied to a variety of mutlimedia file formats to generate a digital "fingerprint" based on file content. Specific algorithms are used depending on the media type - audio, video, graphics, text, etc. - but with a common representation, simplifying fingerprint database management. Where files contain multiple different media types (for example audio+video, or text+graphics) each media type is analysed using the appropriate algorithms and multiple fingerprints are generated. Content fingerprints are independent of file types and encoding - a fingerprint generated from a JPEG image, for example, will recognise the same image in another graphics format.

Fingerprints extracted by theraography allow to recognize contents which are slightly different from the original one. Such fingerprints are called soft fingerprints. Examples of hard fingerprints are hashcode (or Hash function), for which even a one bit modification of the original content produces different fingerprint and exclude all recognition process.

In this way, a document is represented by one or more fingerprints which allow all or parts of the content to be traced even after substantial modifications - cut and paste into other documents, changes of format or coding, additions or deletions. Advestigo's technology looks for any single point of similarity in the content, rather than calculating a "similarity factor". A copy is defined by the existence of common characteristics, rather than the degree to which the original may have been altered. Because the technology relies on fingerprinting it can be applied to any digital content, including all pre-existing data. To identify if a suspect document is a copy of, or contains content copied from, a reference document, all that is needed is to generate the fingerprints for the original and the suspect and to compare the fingerprints. Advestigo's technology can also be used to generate fingerprint databases of original works, to simplify and accelerate tracking of the dissemination - legitimate or otherwise - of content held in online databases or libraries.

Theraography is used in various fields : security issues, content recognition, police investigation, DRM, ...

Theraography is not Watermarking, original documents are not modified.

Main characteristics of Theraography : - Can be applied immediately to existing content - Does not alter or "pollute" existing data in any way - Cannot be masked or removed (it's not in the content...) - Can recognise re-created content - retyping, loop recording, camcorder capture... - No Encapsulation - Based on content, not on file