Screenshot: demonstration of PhotoRec running in GNU/Linux
Developer(s) | Christophe Grenier |
Stable release | 6.13 / November 15, 2011 |
Development status | Active |
Written in | C |
Operating system | Cross-platform |
Platform | CLI |
Type | Data recovery |
License | GPL (free software) |
Website | www.cgsecurity.org/wiki/PhotoRec |
PhotoRec is a file-carving data recovery tool designed to recover lost files from digital camera memory (CompactFlash, Memory Stick, Secure Digital, SmartMedia, Microdrive, MMC, USB flash drives, etc.), hard disks and CD-ROMs. It recovers most common photo formats, including JPEG, as well as audio files such as MP3, document formats such as OpenDocument, Microsoft Office, PDF and HTML, and archive formats such as ZIP. [1]
PhotoRec does not attempt to write to the damaged media the user is about to recover from. Recovered files are instead written to the directory from which PhotoRec is run, or to any other directory the user chooses. It can be used for data recovery or in a digital forensics context. [2] [3] [4] PhotoRec is shipped with TestDisk. [5]
PhotoRec is compatible with:
The FAT, NTFS and ext2/ext3/ext4 filesystems store files in data blocks (called clusters under Windows). The block or cluster size is fixed at a constant number of sectors when the filesystem is formatted. In general, most operating systems try to store data contiguously so as to minimize fragmentation. Because the seek time of mechanical drives is significant when reading from or writing to a hard disk, keeping fragmentation to a minimum is important.
When a file is deleted, the meta-information about the file (filename, date/time, size, location of the first data block/cluster, etc.) is lost. For example, in an ext3/ext4 filesystem the names of deleted files are still present, but the location of the first data block is removed. The data itself remains on the filesystem until some or all of it is overwritten by new file data.
To recover these "lost" files, PhotoRec first tries to find the data block (or cluster) size. If the filesystem is not corrupted, this value can be read from the superblock (ext2/ext3/ext4) or the volume boot record (FAT, NTFS). Otherwise, PhotoRec reads the media sector by sector, searching for the first ten files, and calculates the block/cluster size from their locations. Once the block size is known, PhotoRec reads the media block by block (or cluster by cluster). Each block is checked against a signature database, which ships with the program and has covered a growing number of file types since PhotoRec's first version. This is a common data recovery method called file carving.
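The fallback step, deriving the block size from where the first files are found, can be illustrated with a minimal sketch. It assumes 512-byte sectors and takes the greatest common divisor of the distances between file starts; this is a simplification chosen for illustration, not necessarily the heuristic PhotoRec itself uses, and the name `estimate_block_size` is hypothetical.

```python
from functools import reduce
from math import gcd

SECTOR_SIZE = 512  # assumed sector size in bytes


def estimate_block_size(header_sectors, sector_size=SECTOR_SIZE):
    """Guess the block size from the sector numbers where file headers were found.

    Files start on block boundaries, so the distance between any two file
    starts is a whole number of blocks. The GCD of those distances is the
    largest block size consistent with every observed start.
    """
    if len(header_sectors) < 2:
        return sector_size  # not enough evidence; fall back to one sector
    first = header_sectors[0]
    distances = [sector - first for sector in header_sectors[1:]]
    sectors_per_block = reduce(gcd, distances)
    # Guard against a GCD of 0 when every start is the same sector.
    return max(sectors_per_block, 1) * sector_size


# File headers seen at these sectors are all 8 sectors (or a multiple) apart.
print(estimate_block_size([2056, 2064, 2096, 2160, 2200]))
```

With only a handful of samples the estimate can be too large (for example, if every file found happens to start on a 16-sector boundary of an 8-sector filesystem), which is one reason to collect several file starts before deciding.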
For example, PhotoRec identifies a JPEG file when a block begins with one of the byte sequences 0xff 0xd8 0xff 0xe0, 0xff 0xd8 0xff 0xe1 or 0xff 0xd8 0xff 0xfe.
If PhotoRec has already started to recover a file, it stops its recovery, checks the consistency of the file when possible and starts to save the new file (which it determined from the signature it found).
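The signature check and the stop-and-start behaviour can be sketched as follows. This is an illustrative outline only: it knows a single file type, uses three commonly cited JPEG prefixes (an assumption here, not a copy of PhotoRec's signature database), and omits the consistency check. The names `looks_like_jpeg` and `carve` are hypothetical.

```python
# Byte prefixes treated as the start of a JPEG file (assumed for this sketch).
JPEG_SIGNATURES = (
    b"\xff\xd8\xff\xe0",
    b"\xff\xd8\xff\xe1",
    b"\xff\xd8\xff\xfe",
)


def looks_like_jpeg(block):
    """Return True if the block begins with one of the known JPEG prefixes."""
    return block.startswith(JPEG_SIGNATURES)


def carve(image, block_size):
    """Walk raw bytes block by block, starting a new file at each signature."""
    files = []
    current = None  # bytes of the file being recovered, if any
    for offset in range(0, len(image), block_size):
        block = image[offset:offset + block_size]
        if looks_like_jpeg(block):
            if current is not None:
                files.append(bytes(current))  # close the file in progress
            current = bytearray()  # begin saving the newly detected file
        if current is not None:
            current += block  # blocks before the first signature are ignored
    if current is not None:
        files.append(bytes(current))
    return files
```

Because only block starts are tested, a signature that appears in the middle of a block is ignored, which is why knowing the correct block size matters before carving begins.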
If the data is not fragmented, the recovered file should be either identical in size to the original file or larger. In some cases, PhotoRec can learn the original file size from the file header, so the recovered file is truncated to the correct size. If, however, the recovered file ends up being smaller than its header specifies, it is discarded. Some files, such as MP3 files, are data streams. In this case, PhotoRec parses the recovered data and stops the recovery when the stream ends.
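The truncate-or-discard rule can be shown with a short sketch. It uses the BMP format purely as an illustration, because a BMP header stores the total file size as a 4-byte little-endian integer at offset 2; whether and how PhotoRec applies the rule to any particular format is not claimed here. The names `bmp_declared_size` and `finalize` are hypothetical.

```python
import struct


def bmp_declared_size(data):
    """Read the file size a BMP header declares, or None if not a BMP header."""
    if len(data) < 6 or data[:2] != b"BM":
        return None
    # Bytes 2..5 hold the total file size, little-endian unsigned 32-bit.
    return struct.unpack_from("<I", data, 2)[0]


def finalize(recovered, expected_size=None):
    """Return the bytes to keep, or None if the file should be discarded."""
    if expected_size is None:
        return recovered  # size unknown: keep everything that was carved
    if len(recovered) < expected_size:
        return None  # shorter than the header says: incomplete, discard
    return recovered[:expected_size]  # drop the padding carved past the end
```

A file carved in whole blocks usually carries trailing bytes from the last block, so truncating to the declared size restores the original length when the header provides it.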
When a file is recovered successfully, PhotoRec checks the previous data blocks to see whether a file signature was found but the file could not be recovered (i.e., the file was too small), and it tries again. This way, some fragmented files can be successfully recovered. [6]
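One way to picture this retry is that the blocks claimed by the successfully recovered file are skipped, so the earlier file's fragments become adjacent again. The sketch below is a hedged illustration of that idea under simplifying assumptions (the number of blocks needed is known in advance, and fragments appear in order); it is not PhotoRec's actual implementation, and `retry_skipping_used` is a hypothetical name.

```python
def retry_skipping_used(blocks, start, used, blocks_needed):
    """Collect `blocks_needed` blocks from index `start`, skipping used ones.

    `used` is the set of block indices already assigned to files that were
    recovered successfully. Returns the joined bytes, or None if the media
    runs out before enough free blocks are found.
    """
    picked = []
    for index in range(start, len(blocks)):
        if index in used:
            continue  # this block belongs to another recovered file
        picked.append(blocks[index])
        if len(picked) == blocks_needed:
            return b"".join(picked)
    return None


# File A is split around file B: A1, B1, B2, A2.
blocks = [b"A1", b"B1", b"B2", b"A2"]
# After B (blocks 1 and 2) is recovered, retry A from block 0.
print(retry_skipping_used(blocks, 0, {1, 2}, 2))
```

This only helps when the intervening data belongs to files that were themselves recovered; fragments separated by unidentified data remain out of reach for this approach.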
TestDisk and PhotoRec were downloaded more than 150,000 times from the primary website in July 2008. The utilities are in fact more widely used than this figure suggests, as they are also included on various GNU/Linux Live CDs:
They are also packaged for numerous GNU/Linux based distributions: