octoleo/qpdf - qpdf - Vast Development Method

mirror of https://github.com/qpdf/qpdf.git synced 2024-12-23 03:18:59 +00:00

Author	SHA1	Message	Date
Jay Berkenbilt	b389268f16	Better handle split content streams (fixes #73 ) When parsing content streams, allow content to be split arbitrarily across stream boundaries.	2017-07-29 12:19:04 -04:00
Jay Berkenbilt	4647acbe3c	Clarify documentation on copyForeignObject (fixes #69 ) Be explicit about the need to keep the source QPDF object around.	2017-07-29 12:19:04 -04:00
Jay Berkenbilt	3a1ff5ded9	Add option to preserve unreferenced objects	2017-07-28 19:19:11 -04:00
Jay Berkenbilt	7f8892525f	Add precheck streams capability When requested, QPDFWriter will do more aggress prechecking of streams to make sure it can actually succeed in decoding them before attempting to do so. This will allow preservation of raw data even when the raw data is corrupted relative to the specified filters.	2017-07-27 23:42:27 -04:00
Jay Berkenbilt	a4fd4b91c6	Convert stream filtering errors to warnings	2017-07-27 18:43:07 -04:00
Jay Berkenbilt	40f00122b8	Convert object parsing errors to warnings QPDFObjectHandle::parseInternal now issues warnings instead of throwing exceptions for all error conditions that it finds (except internal logic errors) and has stronger recovery for things like invalid tokens and malformed dictionaries. This should improve qpdf's ability to recover from a wide range of broken files that currently cause it to fail.	2017-07-27 18:20:31 -04:00
Jay Berkenbilt	dd8dad74f4	Move lexer helper functions to QUtil	2017-07-27 13:59:56 -04:00
Jay Berkenbilt	701b518d5c	Detect recursion loops resolving objects (fixes #51 ) During parsing of an object, sometimes parts of the object have to be resolved. An example is stream lengths. If such an object directly or indirectly points to the object being parsed, it can cause an infinite loop. Guard against all cases of re-entrant resolution of objects.	2017-07-26 06:24:07 -04:00
Jay Berkenbilt	315092dd98	Avoid xref reconstruction infinite loop (fixes #100 ) This is CVE-2017-9209.	2017-07-26 06:24:07 -04:00
Jay Berkenbilt	bd6c845619	Fix typo in comment	2017-07-26 06:24:07 -04:00
Thorsten Schöning	7c08aa4280	Include QPDFExc.hh for use in std::list	2016-01-24 12:07:03 -05:00
Thorsten Schöning	e0201c12cc	Include QPDFObjectHandle for use in std::list QPDFObjectHandle was used as forward declaration, but C++-Builder 10 Seattle can't use it in std::list in such cases because the type is undefined.	2016-01-24 12:04:25 -05:00
Jay Berkenbilt	e0e9d64674	Remove some ABI compatibility private methods Since we have to bump soname, remove some private methods that were just there for binary compatibility	2015-11-10 12:22:40 -05:00
Jay Berkenbilt	0496ab1a6e	Fix spelling errors	2015-10-31 18:56:43 -04:00
Jay Berkenbilt	b8bdef0ad1	Implement deterministic ID For non-encrypted files, determinstic ID generation uses file contents instead of timestamp and file name. At a small runtime cost, this enables generation of the same /ID if the same inputs are converted in the same way multiple times.	2015-10-31 18:56:42 -04:00
Jay Berkenbilt	f77acbdbba	Copyright 2015	2015-05-24 17:26:49 -04:00
Jay Berkenbilt	857bb208d3	include time.h in QUtil.hh QUtil.hh needs time.h to get time_t on some platforms. Thanks Peter Korsgaard <peter@korsgaard.com>	2015-05-24 16:26:05 -04:00
Jay Berkenbilt	a11549a566	Detect loops in /Pages structure Pushing inherited objects to pages and getting all pages were both prone to stack overflow infinite loops if there were loops in the Pages dictionary. There is a general weakness in the code in that any part of the code that traverses the Pages structure would be prone to this and would have to implement its own loop detection. A more robust fix may provide some general method for handling the Pages structure, but it's probably not worth doing. Note: addition of *Internal2 private functions was done rather than changing signatures of existing methods to avoid breaking compatibility.	2015-02-21 19:47:11 -05:00
Jay Berkenbilt	225b018290	Update Copyright to 2014	2014-01-14 15:40:02 -05:00
Jay Berkenbilt	235d8f28f8	Increase random data provider support Add a method to get the current random data provider, and document and test the method for resetting it.	2013-12-16 16:21:28 -05:00
Jay Berkenbilt	5e3bad2f86	Refactor random data generation Add new RandomDataProvider object and implement existing random number generation in terms of that. This enables end users to supply their own random data providers.	2013-12-14 15:17:35 -05:00
Jay Berkenbilt	f010e07c0c	Add missing #include of <string>	2013-10-28 20:59:58 -04:00
Jay Berkenbilt	4229457068	Security: use a secure random number generator If not available, give an error. The user may also configure qpdf to use an insecure random number generator.	2013-10-18 10:45:12 -04:00
Jay Berkenbilt	cee2592ed1	Change API/ABI and withdraw 4.2.0 4.2.0 was binary incompatible in spite of there being no deletions or changes to any public methods. As such, we have to bump the ABI and are fixing some API breakage while we're at it. Previous 4.3.0 target is now 5.1.0.	2013-07-10 11:30:13 -04:00
Jay Berkenbilt	a3576a7359	Bug fix: handle generation > 0 when generating object streams Rework QPDFWriter to always track old object IDs and QPDFObjGen instead of int, thus not discarding the generation number. Switch to QPDF::getCompressibleObjGen() to properly handle the case of an old object eligible for compression that has a generation of other than zero.	2013-06-14 14:58:09 -04:00
Jay Berkenbilt	96eb965115	Use QPDFObjectHandle::getObjGen() where appropriate In internal code and examples, replace calls to getObjectID() and getGeneration() with calls to getObjGen() where possible.	2013-06-14 14:58:09 -04:00
Jay Berkenbilt	5039da0b91	Add QPDFObjectHandle::getObjGen() This is safer than getObjectID() and getGeneration() for many uses.	2013-06-14 14:58:09 -04:00
Jay Berkenbilt	d88231e01e	Promote QPDF::ObjGen to top-level object QPDFObjGen	2013-06-14 14:58:08 -04:00
Jay Berkenbilt	3803e9cc4a	Export terminateParsing in the DLL Windows fix: QPDFObject::ParserCallbacks::terminateParsing() was not declared with QPDF_DLL.	2013-03-11 12:37:32 -04:00
Jay Berkenbilt	9d4f52c014	Clarify documentation on encrypted files Explicitly state how QPDF handles empty passwords when writing files. Apparently some libraries treat the empty string as the owner password as an instruction to generate a random password.	2013-03-11 12:37:32 -04:00
Jay Berkenbilt	29f5830325	Fix getTypeCode and getTypeName work for indirect objects Remove const qualifier from getTypeCode and get getTypeName methods of QPDFObjectHandle, make them work properly for indirect objects, and exercise them much better in the test suite.	2013-03-05 13:35:46 -05:00
Jay Berkenbilt	119f2a4b68	Add method to terminate content stream parsing	2013-03-05 13:35:46 -05:00
Jay Berkenbilt	ac4deac187	Call QUtil::safe_fopen in place of fopen fopen was previuosly called wrapped by QUtil::fopen_wrapper, but QUtil::safe_fopen does this itself, which is less cumbersome.	2013-03-05 13:35:46 -05:00
Jay Berkenbilt	a51ae10b8d	Remove all calls to sprintf	2013-03-05 13:35:46 -05:00
Jay Berkenbilt	30027481f7	Remove all old-style casts from C++ code	2013-03-04 16:45:16 -05:00
Jay Berkenbilt	32b62035ce	Replace many calls to sprintf with QUtil::hex_encode Add QUtil::hex_encode to encode binary data has a hexadecimal string, and use it in place of sprintf where possible.	2013-03-04 16:45:15 -05:00
Jay Berkenbilt	bfda717749	Cosmetic changes to be closer to Adobe terminology Change object type Keyword to Operator, and place the order of the object types in object_type_e in the same order as they are mentioned in the PDF specification. Note that this change only breaks backward compatibility with code that has not yet been released.	2013-01-23 09:38:05 -05:00
Jay Berkenbilt	913eb5ac35	Add getTypeCode() and getTypeName() Add virtual methods to QPDFObject, wrappers to QPDFObjectHandle, and implementations to all the QPDF_Object types.	2013-01-22 10:01:45 -05:00
Jay Berkenbilt	f81152311e	Add QPDFObjectHandle::parseContentStream method This method allows parsing of the PDF objects in a content stream or array of content streams.	2013-01-20 15:35:39 -05:00
Jay Berkenbilt	1d88955fa6	Added new QPDFObjectHandle types Keyword and InlineImage These object types are to facilitate content stream parsing.	2013-01-20 15:35:39 -05:00
Jay Berkenbilt	a04a835849	Clarify methods to get user password With newer encryption formats, it is no longer possible to recover the user password using the owner password.	2013-01-03 20:45:53 -05:00
Jay Berkenbilt	f8306913ba	Update "C" API with functions for new features	2012-12-31 10:32:32 -05:00
Jay Berkenbilt	9eb5982fa3	Avoid modifying trailer when writing When preparing the trailer for writing to the new file, trim a copy of the trailer instead of the original file's trailer.	2012-12-31 10:32:32 -05:00
Jay Berkenbilt	8843e499b8	Update copyright year to 2013 Also add copyright notice to a few public headers that were missing one.	2012-12-31 10:32:32 -05:00
Jay Berkenbilt	4237a29c94	Refactor Dictionary writing code Original code was written before we could shallow copy objects, so all the filtering was done by suppressing the output of certain keys and replacing them with other keys. Now we can simplify the code greatly by modifying shallow copies of dictionaries in place.	2012-12-31 10:32:32 -05:00
Jay Berkenbilt	e57c25814e	Support for encryption with /V=5 and /R=5 and /R=6 Read and write support is implemented for /V=5 with /R=5 as well as /R=6. /R=5 is the deprecated encryption method used by Acrobat IX. /R=6 is the encryption method used by PDF 2.0 from ISO 32000-2.	2012-12-31 10:32:32 -05:00
Jay Berkenbilt	93ac1695a4	Support files with only attachments encrypted Test cases added in a future commit since they depend on /R=6 support.	2012-12-31 10:32:32 -05:00
Jay Berkenbilt	4eccb9d87b	Add random number functions to QUtil	2012-12-31 10:32:32 -05:00
Jay Berkenbilt	8f5de08c2a	Comment about non-const Pipeline data	2012-12-31 10:32:31 -05:00
Jay Berkenbilt	774584163f	Add ExtensionLevel support to version handling All version operations are now fully aware of extension levels.	2012-12-31 05:36:50 -05:00
Jay Berkenbilt	3101955ac0	Add V5 parameters to EncryptionData	2012-12-31 05:36:50 -05:00
Jay Berkenbilt	68447bb556	change EncryptionData	2012-12-31 05:36:50 -05:00
Jay Berkenbilt	04c203ae06	Eliminate flattenScalarReferences	2012-12-31 05:36:48 -05:00
Jay Berkenbilt	041397fdab	Allow reading from InputSource and writing to Pipeline Allowing users to subclass InputSource and Pipeline to read and write from/to arbitrary sources provides the maximum flexibility for users who want to read and write from other than files or memory.	2012-09-23 17:42:26 -04:00
Jay Berkenbilt	c1627d0438	Add QPDFWriter::setExtraHeaderText	2012-09-06 15:31:12 -04:00
Jay Berkenbilt	137dc7acb9	Refactor: move resolution of literal to its own method	2012-08-11 09:22:59 -04:00
Jay Berkenbilt	32051283b9	Fix spelling errors	2012-07-29 14:44:12 -04:00
Jay Berkenbilt	f83bddf882	Update copyright to 2012	2012-07-28 22:03:36 -04:00
Tobias Hoffmann	9c00874e77	added QPDFObjectHandle::replaceStreamData(std::string data).	2012-07-25 03:02:46 +02:00
Jay Berkenbilt	316328704b	Windows compilation fixes	2012-07-21 20:51:56 -04:00
Jay Berkenbilt	6bbea4baa0	Implement QPDFObjectHandle::parse Move object parsing code from QPDF to QPDFObjectHandle and parameterize the parts of it that are specific to a QPDF object. Provide a version that can't handle indirect objects and that can be called on an arbitrary string. A side effect of this change is that the offset used when reporting invalid stream length has changed, but since the new value seems like a better value than the old one, the test suite has been updated rather than making the code backward compatible. This only effects the offset reported for invalid streams that lack /Length or have an invalid /Length key. Updated some test code and exmaples to use QPDFObjectHandle::parse. Supporting changes include adding a BufferInputSource constructor that takes a string.	2012-07-21 09:06:10 -04:00
Jay Berkenbilt	f3e267fce2	Move readToken from QPDF to QPDFTokenizer	2012-07-21 09:06:10 -04:00
Jay Berkenbilt	15eaed5c52	Refactor: pull *InputSource out of QPDF InputSource, FileInputSource, and BufferInputSource are now top-level classes instead of privately nested inside QPDF.	2012-07-21 09:06:06 -04:00
Jay Berkenbilt	a101533e0a	Add command line option to copy encryption from other file Add --copy-encryption and --encryption-file-password options to qpdf. Also strengthen test suite for copying encryption. The strengthened test suite would have caught the failure to preserve AES and the failure to update the file version, which was invalidating the encrypted data.	2012-07-15 21:15:24 -04:00
Jay Berkenbilt	0575d77d77	Add public QPDFWriter::copyEncryptionParameters Method to copy encryption parameters from another file. Adapted from existing code to copy encryption parameters from the original file.	2012-07-14 09:14:41 -04:00
Jay Berkenbilt	11b194a1d0	Update getPageImages() comment to mention pushInheritedAttributesToPage()	2012-07-11 15:56:50 -04:00
Jay Berkenbilt	e7b8f297ba	Support copying objects from another QPDF object This includes QPDF::copyForeignObject and supporting foreign objects as arguments to addPage*.	2012-07-11 15:54:33 -04:00
Jay Berkenbilt	8a217eb3a2	Add concept of reserved objects QPDFObjectHandle::{new,is,assert}Reserved, QPDF::replaceReserved provide a mechanism to add objects to a PDF file when there are circular references. This is a prerequisite to copying objects from one PDF to another.	2012-07-10 23:34:32 -04:00
Tobias Hoffmann	8720446b23	Added assertNumber and assertScalar to QPDFObjectHandle	2012-07-07 18:55:08 -04:00
Tobias Hoffmann	a8266ccb0e	Added public assert{Type} methods to QPDFObjectHandle	2012-07-07 18:53:38 -04:00
Tobias Hoffmann	39bbaa86e3	Build this->all_pages while traversing with pushInheritedAttributesToPage	2012-07-07 17:45:10 -04:00
Jay Berkenbilt	e2dedde4bd	Don't require stream data provider to know length in advance Breaking API change: length parameter has disappeared from the StreamDataProvider version of QPDFObjectHandle::replaceStreamData since it is no longer necessary to compute it in advance. This breaking change is justified by the fact that removing the length parameter provides the caller an opportunity to simplify the calling code.	2012-07-07 17:33:45 -04:00
Jay Berkenbilt	8705e2e8fc	Add QPDFWriter method to output to FILE*	2012-07-05 21:24:04 -04:00
Tobias Hoffmann	abb53ac369	Limited inheritance to the attributes explicitly listed in the PDF spec Previous versions of qpdf incorrectly passed arbitrary objects from /Pages objects down to individual pages in direct contradition with the PDF specification. These are now left in /Pages. When intermediate /Pages nodes are being discarded as when the /Pages tree is being flattened, a warning is issued when unknown keys are encountered.	2012-07-04 23:04:55 -04:00
Tobias Hoffmann	7770a1b036	Added public method QPDF::pushInheritedAttributesToPage Refactored optimizePagesTree to pushInheritedAttributesToPage and made public	2012-07-04 16:24:03 -04:00
Tobias Hoffmann	235188df85	Fixed wording error in comment	2012-07-04 14:51:53 -04:00
Jay Berkenbilt	5f59c32f87	Add a few minor enhancements to recent work Test coverage case for new newStream method Expose decimal_places argument for double-based newReal All enhancements suggested by Tobias.	2012-06-27 10:43:27 -04:00
Tobias Hoffmann	f07e3370f0	Add Pl_Concatenate filter	2012-06-27 10:20:38 -04:00
Tobias Hoffmann	43c404b45a	Add QPDFObjectHandle::newStream(QPDF *, std::string const&) This makes the code simpler than having to create a buffer of a fixed size and copy the string to it.	2012-06-27 10:19:57 -04:00
Tobias Hoffmann	75054c0b94	Add QPDFObjectHandle::newReal(double)	2012-06-27 10:19:01 -04:00
Jay Berkenbilt	2266c6232b	Rework InputSource::readLine to make it much more efficient This rework makes xref reconstruction run much faster and use much less memory.	2012-06-27 06:48:06 -04:00
Jay Berkenbilt	736bafbb9c	Rename seek functions in QUtil	2012-06-26 23:10:10 -04:00
Jay Berkenbilt	8318d81ada	Fix and test support for files >= 4 GB	2012-06-24 15:56:50 -04:00
Jay Berkenbilt	781c313058	Change QPDF_Integer from int to long long This makes it possible to store offsets that are larger than 2 GB in the trailer dictionary.	2012-06-24 15:20:01 -04:00
Jay Berkenbilt	4f305488d8	Improve the FILE* version of QPDF::processFile	2012-06-23 18:23:06 -04:00
Jay Berkenbilt	ffb96ee17e	Add pdf-from-scratch example	2012-06-23 09:05:06 -04:00
Jay Berkenbilt	b6bdc0f595	Add factory methods for creating empty arrays and dictionaries. Also updated pdf_from_scratch test driver to use the new factories, and made some cosmetic improvements and documentation updates for the emptyPDF() method.	2012-06-22 09:46:33 -04:00
Jay Berkenbilt	a0768e4190	Add QPDF::emptyPDF() and pdf_from_scratch test code	2012-06-21 23:09:05 -04:00
Jay Berkenbilt	81e8752362	Use qpdf_offset_t in place of off_t in public APIs. off_t is used internally only when needed to talk to standard libraries. This requires that the "long long" type be supported by the compiler.	2012-06-21 21:23:24 -04:00
Jay Berkenbilt	d1ebe30ff6	Add QPDFObjectHandle::shallowCopy()	2012-06-21 16:15:09 -04:00
Jay Berkenbilt	eb802cfa8c	Implement page manipulation APIs	2012-06-21 15:01:02 -04:00
Jay Berkenbilt	df493c352f	Refactor optimizePagesTree Split optimizePagesTree into a simpler top-level routine and a recursive internal routine.	2012-06-21 15:01:02 -04:00
Tobias Hoffmann	5d3f93be29	Added first version of pages API.	2012-06-21 15:01:02 -04:00
Tobias Hoffmann	405a549f8c	Make QPDFObjectHandle::assertPageObject() public. The method is helpful in other places, like the upcoming QPDF::addPage, too.	2012-06-21 15:01:02 -04:00
Tobias Hoffmann	47a846a7e0	Added method to clear pages cache.	2012-06-21 15:01:02 -04:00
Jay Berkenbilt	f59ff6fcc2	fix include order for off_t	2012-06-21 14:11:22 -04:00
Jay Berkenbilt	bc1c4bb578	Add QPDF::processFile that takes an open FILE*	2012-06-21 08:00:35 -04:00
Tobias Hoffmann	db7474e0fa	Added additional array mutators Added methods to append to arrays, insert items into arrays, and replace array contents with a vector of items.	2012-06-20 15:29:44 -04:00
Jay Berkenbilt	5d4cad9c02	ABI change: fix use of off_t, size_t, and integer types Significantly improve the code's use of off_t for file offsets, size_t for memory sizes, and integer types in cases where there has to be compatibility with external interfaces. Rework sections of the code that would have prevented qpdf from working on files larger than 2 (or maybe 4) GB in size.	2012-06-20 15:20:26 -04:00
Jay Berkenbilt	b856379370	Portability issues: off_t, unlink New header qpdf/Types.h attempts to make sure size_t and off_t are defined on any platform and in a way that would work with large file support. Additionally, missing header files are included to get unlink.	2012-06-20 15:18:14 -04:00

1 2 3 4 5

224 Commits