Pdf-parser

pdf-parser
Original author(s): Didier Stevens
Initial release: May 2, 2008
Stable release: 0.4.3 / September 18, 2013
Development status: Active
Written in: Python
Operating system: Multiplatform, including smartphones
Type: PDF software
License: Public domain
Website: pdf-parser

pdf-parser is a command-line program that parses and analyses PDF documents. It can extract raw data from PDF documents, such as compressed images, and it can handle malicious PDF documents that use obfuscation features of the PDF language.[1]
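
The two ideas in this paragraph, undoing name obfuscation and recovering compressed stream data, can be illustrated with a minimal Python sketch. It is not taken from pdf-parser's source: the helper names `decode_pdf_name` and `inflate_stream` and the sample bytes are hypothetical. The sketch relies only on two properties of the PDF format: a name may write any character as a `#xx` hexadecimal escape, and streams using the FlateDecode filter are zlib-compressed.

```python
import re
import zlib

# Hypothetical helper: undo #xx hex escapes inside a PDF name.
def decode_pdf_name(name: bytes) -> bytes:
    """Replace every #xx escape with the single byte it encodes."""
    return re.sub(
        rb"#([0-9A-Fa-f]{2})",
        lambda m: bytes([int(m.group(1), 16)]),
        name,
    )

# Hypothetical helper: decompress the body of a FlateDecode stream.
def inflate_stream(raw: bytes) -> bytes:
    """FlateDecode data is zlib-compressed, so zlib can inflate it."""
    return zlib.decompress(raw)

# Made-up sample data for illustration only.
obfuscated_name = b"/J#61vaScr#69pt"
print(decode_pdf_name(obfuscated_name))   # b'/JavaScript'

stream_body = zlib.compress(b"raw image bytes")
print(inflate_stream(stream_body))        # b'raw image bytes'
```

A real analysis tool also has to handle other stream filters and chains of several filters, which this sketch leaves out.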

The tool can also be used to extract data from damaged or corrupt PDF documents.
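
One common way to read a damaged PDF is to ignore the cross-reference table, which is often the part that is broken, and scan the raw bytes for `obj` ... `endobj` pairs instead. The sketch below shows that general technique under simple assumptions. Whether pdf-parser works exactly this way is not stated here, and the name `scan_objects` and the sample data are hypothetical.

```python
import re

# Matches "<number> <generation> obj ... endobj" anywhere in the file.
OBJ_RE = re.compile(rb"(\d+)\s+(\d+)\s+obj\b(.*?)endobj", re.DOTALL)

def scan_objects(data: bytes):
    """Return (object number, generation, body) for each object found.

    The cross-reference table is never consulted, so objects can still be
    found when it is missing or wrong. Limitation of this sketch: a stream
    whose binary data happens to contain b"endobj" is cut short.
    """
    return [
        (int(m.group(1)), int(m.group(2)), m.group(3).strip())
        for m in OBJ_RE.finditer(data)
    ]

# Made-up fragment with junk between objects and no xref table or trailer.
sample = (
    b"%PDF-1.4\n"
    b"1 0 obj\n<< /Type /Catalog >>\nendobj\n"
    b"garbage bytes\n"
    b"2 0 obj\n(hi)\nendobj\n"
)

for number, generation, body in scan_objects(sample):
    print(number, generation, body)
```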

References

  1. PDF Babushka by Bojan Zdrnja, Internet Storm Center, January 14, 2010