Apache PDFBox

PDFBox
Developer(s): Apache Software Foundation
Stable release: 2.0.6 / May 15, 2017
Written in: Java
Operating system: Cross-platform
Type: Portable Document Format (PDF)
License: Apache License 2.0
Website: https://pdfbox.apache.org

Apache PDFBox is an open-source, pure-Java library for creating, rendering, printing, splitting, merging, altering, and verifying PDF files, and for extracting their text and metadata.

Open Hub reports over 11,000 commits (since the project's start at Apache) by 18 contributors, representing more than 140,000 lines of code. It characterizes PDFBox as a well-established, mature codebase maintained by an average-sized development team, with decreasing year-over-year commits. Under the COCOMO model, the codebase represents an estimated 36 person-years of effort.[1]
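The 36 person-year figure can be roughly reproduced from the line count. The sketch below assumes the Basic COCOMO "organic mode" formula, effort = 2.4 × KLOC^1.05 person-months; the article does not state which COCOMO variant or coefficients Open Hub applies, so these are an assumption, and the class and method names are illustrative only.

```java
public class CocomoEstimate {
    // Basic COCOMO organic-mode coefficients (Boehm, 1981).
    // Assumption: the source does not say which coefficients Open Hub uses.
    static final double A = 2.4;
    static final double B = 1.05;

    /** Estimated effort in person-months for a codebase of the given size in KLOC. */
    public static double personMonths(double kloc) {
        return A * Math.pow(kloc, B);
    }

    /** Estimated effort in person-years (12 person-months per person-year). */
    public static double personYears(double kloc) {
        return personMonths(kloc) / 12.0;
    }

    public static void main(String[] args) {
        // "More than 140,000 lines of code", taken here as exactly 140 KLOC.
        double kloc = 140.0;
        System.out.printf("%.1f person-years%n", personYears(kloc));
    }
}
```

With 140 KLOC as input, the estimate comes out close to 36 person-years, consistent with the figure quoted above.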

Structure

Apache PDFBox has these components:

History

PDFBox was started in 2002 on SourceForge by Ben Litchfield, who wanted to extract text from PDF files for Lucene.[2] It became an Apache Incubator project in 2008 and an Apache top-level project in 2009.[3]

Preflight was originally named PaDaF. It was developed by Atos Worldline and donated to the project in 2011.[4]

In February 2015, Apache PDFBox was named an Open Source Partner Organization of the PDF Association.[5]

References

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.