Indexing PDF Documents without file size creep

IP.com Disclosure Number: IPCOM000012579D
Original Publication Date: 2003-May-16
Included in the Prior Art Database: 2003-May-16

Publishing Venue

IBM

Abstract

Indexing Adobe Portable Document Format (PDF) documents for database archival and retrieval produces documents that are 10 to 20 times larger than the original Adobe PDF document supplied by the customer. The cause of this growth is the process used to index the files. The process described in this disclosure allows Adobe PDF documents to be indexed while keeping the original file size, or even while producing indexed documents that are smaller than the original. This disclosure assumes the reader has a working knowledge of the Adobe Acrobat PDF development library.