octoleo/qpdf - qpdf - Vast Development Method

mirror of https://github.com/qpdf/qpdf.git synced 2024-11-11 23:45:47 +00:00

Author	SHA1	Message	Date
Jay Berkenbilt	ddd78c1b7f	Fix QPDFObjectHandle::shallowCopy It's not really a shallow copy. It just doesn't cross indirect object boundaries. The old implementation had a bug that would cause multiple shallow copies of the same object to share memory, which was not the intention.	2018-06-21 20:34:45 -04:00
Jay Berkenbilt	397b097c46	Allow setting a form field's value	2018-06-21 15:57:13 -04:00
Jay Berkenbilt	952a665a4e	Better support for creating Unicode strings	2018-06-21 15:57:13 -04:00
Jay Berkenbilt	e44c395c51	QUtil::toUTF16	2018-06-21 15:57:13 -04:00
Jay Berkenbilt	0b05111db8	Implement helper class for interactive forms	2018-06-21 15:57:13 -04:00
Jay Berkenbilt	2e7ee23bf6	Add QPDFPageDocumentHelper and QPDFPageObjectHelper This is the beginning of higher-level API support using helper classes. The goal is to be able to add more helpers without continuing to pollute QPDF's and QPDFObjectHandle's public interfaces.	2018-06-21 15:57:13 -04:00
Jay Berkenbilt	4cded10821	Add QPDFObjectHandle::Rectangle type Provide a convenient way of accessing rectangles.	2018-06-21 15:57:13 -04:00
Jay Berkenbilt	e4e2e26d99	Properly handle pages with no contents (fixes #194 ) Remove calls to assertPageObject(). All cases in the library that called assertPageObject() work fine if you don't call assertPageObject() because nothing assumes anything that was being checked by that call. Removing the calls enables more files to be successfully processed.	2018-03-06 11:34:07 -05:00
Jay Berkenbilt	4bb3046f0b	Properly handle strings with PDF Doc Encoding (fixes #179 ) The QPDF_String::getUTF8Val() method was not treating strings that weren't explicitly Unicode as PDF Doc Encoded. This only affects characters in the range 0x80 through 0xa0.	2018-02-18 21:06:27 -05:00
Jay Berkenbilt	2780a1871d	Add C API for checking PDF files	2018-02-18 21:06:27 -05:00
Jay Berkenbilt	d0e99f195a	More robust handling of type errors Give objects descriptions and context so it is possible to issue warnings instead of fatal errors for attempts to access objects of the wrong type.	2018-02-18 21:06:27 -05:00
Jay Berkenbilt	21b7481b0e	Push members of QPDFObjectHandle into a Members object As in other cases, this is to enable adding new member variables in the future without breaking ABI compatibility.	2018-02-18 21:06:27 -05:00
Jay Berkenbilt	e410b0fe0d	Simplify TokenFilter interface Expose Pl_QPDFTokenizer, and have it do more of the work of managing the token filter's pipeline.	2018-02-18 21:05:47 -05:00
Jay Berkenbilt	1fdd86a049	Move Pl_QPDFTokenizer to public interface	2018-02-18 21:05:47 -05:00
Jay Berkenbilt	5708b5d0aa	Add additional interface for filtering page contents	2018-02-18 21:05:47 -05:00
Jay Berkenbilt	9910104442	Implement TokenFilter and refactor Pl_QPDFTokenizer Implement a TokenFilter class and refactor Pl_QPDFTokenizer to use a TokenFilter class called ContentNormalizer. Pl_QPDFTokenizer is now a general filter that passes data through a TokenFilter.	2018-02-18 21:05:46 -05:00
Jay Berkenbilt	b8723e97f4	Add coalesce contents capability	2018-02-18 21:05:46 -05:00
Jay Berkenbilt	fcd611b61e	Refactor parseContentStream	2018-02-18 21:05:46 -05:00
Jay Berkenbilt	05ff619b09	Remove redundant method Remove a redundant method that was equal to another one with additional arguments. This breaks binary compatibility, but there are other ABI breaking changes in the upcoming release, so now is the time to do it.	2018-02-18 21:05:46 -05:00
Jay Berkenbilt	fefe25030e	Inline image token type	2018-02-18 21:05:46 -05:00
Jay Berkenbilt	2699ecf13e	Push QPDFTokenizer members into a nested structure This is for protection against future ABI breaking changes.	2018-02-18 21:05:46 -05:00
Jay Berkenbilt	d97474868d	Lexer enhancements: EOF, comment, space Significant enhancements to the lexer to improve EOF handling and to support comments and spaces as tokens. Various other minor issues were fixed as well.	2018-02-18 20:18:40 -05:00
Jay Berkenbilt	ebd5ed63de	Add option to save pass 1 of lineariziation This is useful only for debugging the linearization code.	2018-02-18 20:18:40 -05:00
Jay Berkenbilt	e3167c1a60	Fix linearization for files with nonstandard ID length	2018-02-04 18:16:23 -05:00
Jay Berkenbilt	aa2cfad61a	Clarify some comments	2018-01-28 18:29:47 -05:00
Jay Berkenbilt	569d74d36b	Allow raw encryption key to be specified Add options to enable the raw encryption key to be directly shown or specified. Thanks to Didier Stevens <didier.stevens@gmail.com> for the idea and contribution of one implementation of this idea.	2018-01-14 10:21:05 -05:00
Jay Berkenbilt	3e306ae64c	Add QUtil::hex_decode	2018-01-14 09:04:13 -05:00
Jay Berkenbilt	68572df2bf	Update copyright to 2018	2018-01-13 20:25:58 -05:00
Jay Berkenbilt	07c8bb2843	Additionally license under Apache License version 2.0 The Apache License version 2.0 is now the primary license for qpdf. However, users may, at their option, continue to use Artistic version 2.0.	2017-09-14 12:59:25 -04:00
Jay Berkenbilt	d31a7b76e7	Improve message for stream decoding error Tweak the message so that we inform the user that we are mitigating data loss.	2017-09-12 16:03:48 -04:00
Jay Berkenbilt	eaacf94005	Update C API with new QPDFWriter methods	2017-09-12 14:30:39 -04:00
Jay Berkenbilt	6d46346eb9	Detect integer overflow/underflow	2017-08-29 12:28:32 -04:00
Jay Berkenbilt	e999bbae43	Fix memory leak with bad jpeg data	2017-08-28 22:16:45 -04:00
Jay Berkenbilt	c6872d2c70	Clean up circular references in QPDF_Stream	2017-08-28 22:16:31 -04:00
Jay Berkenbilt	728dc9e6d8	Fix error caught by clang	2017-08-26 21:51:17 -04:00
Jay Berkenbilt	ad527a64f9	Parse iteratively to avoid stack overflow (fixes #146 )	2017-08-25 21:56:45 -04:00
Jay Berkenbilt	e452d9dca6	Spell check	2017-08-22 14:22:20 -04:00
Jay Berkenbilt	fabff0f3ec	Limit token length during xref recovery While scanning the file looking for objects, limit the length of tokens we allow. This prevents us from getting caught up in reading a file character by character while digging through large streams.	2017-08-22 14:13:10 -04:00
Jay Berkenbilt	ce435222b2	Push QPDFWriter member variables into a nested class	2017-08-21 22:04:07 -04:00
Jay Berkenbilt	a8c93bd324	Push QPDF member variables into a nested class Pushing member variables into a nested class enables addition of new member variables without breaking binary compatibility.	2017-08-21 21:35:11 -04:00
Jay Berkenbilt	8288a4eb3a	Update copyright to 2017	2017-08-21 21:18:47 -04:00
Jay Berkenbilt	8ab52fa558	Combine writePCLm with writeStandard Reduce code duplication	2017-08-21 21:05:48 -04:00
Jay Berkenbilt	9f60a864a0	Combine PCLm header into writeHeader	2017-08-21 21:05:47 -04:00
Jay Berkenbilt	4b908ade70	Update header documentation and ChangeLog entry for PCLm	2017-08-21 21:05:44 -04:00
Sahil Arora	b19210fa7d	QPDFWriter: Add setPCLm() and writePCLm() methods * Add support for PCLm using setPCLm() and writePCLm() methods in QPDFWriter.hh and QPDFWriter.cc * Add a function writePCLmHeader() for PCLm header in QPDFWriter	2017-08-21 18:55:02 -04:00
Jay Berkenbilt	ddc6cf0cf6	Precheck streams by default There is no need for a --precheck-streams option. We can do the precheck without imposing any penalty, only re-encoding the stream if it fails the first time.	2017-08-21 17:44:22 -04:00
Jay Berkenbilt	9744414c66	Enable finer grained control of stream decoding This commit adds several API methods that enable control over which types of filters QPDF will attempt to decode. It also adds support for /RunLengthDecode and /DCTDecode filters for both encoding and decoding.	2017-08-21 17:44:22 -04:00
Jay Berkenbilt	ae90d2c485	Implement Pl_DCT pipeline Additional testing is added in later commits to be supported by additional changes in the library.	2017-08-21 17:44:02 -04:00
Jay Berkenbilt	2d2f619665	Implement Pl_RunLength pipeline	2017-08-19 14:50:55 -04:00
Jay Berkenbilt	cfa2eb97fb	Add page rotation (fixes #132 )	2017-08-12 22:57:38 -04:00
Jay Berkenbilt	30f109e244	Read xref table without PCRE Also accept more errors than before.	2017-08-10 21:30:32 -04:00
Jay Berkenbilt	ca5b1d267a	Improve stream length recovery Eliminate PCRE and find endobj not preceded by endstream. Be more lax about placement of endstream and endobj.	2017-08-10 21:30:32 -04:00
Jay Berkenbilt	03aa9679ac	Find starxref without PCRE	2017-08-10 21:30:32 -04:00
Jay Berkenbilt	1765c6ec20	Find header without PCRE	2017-08-10 21:30:32 -04:00
Jay Berkenbilt	296b679d6e	Implement findFirst and findLast in InputSource Preparing to refactor some pattern searching code to use these instead of their own memchr loops. This should simplify the code that replaces PCRE.	2017-08-10 21:30:32 -04:00
Jay Berkenbilt	ef8ae5449d	Allow QPDFTokenizer::readToken to return bad tokens Sometimes we want to ignore bad tokens rather than having them throw an exception. A coverage case is commented out here and added in a later commit.	2017-08-10 19:01:41 -04:00
Jay Berkenbilt	c5dc6d8067	Remove unused PointerHolder interface Also fix a bug resulting from incorrect use of PointerHolder because of this unused parameter.	2017-08-10 19:01:38 -04:00
Jay Berkenbilt	8fe261d8b4	QUtil::strcasecmp	2017-08-05 10:22:33 -04:00
Jay Berkenbilt	2d5b854468	Allow reading command-line args from files (fixes #16 )	2017-07-29 22:23:21 -04:00
Jay Berkenbilt	5993c3e83c	Detect input file = output file (fixes #29 )	2017-07-29 20:58:01 -04:00
Jay Berkenbilt	f37d399d82	Add newline-before-endstream option (fixes #103 )	2017-07-29 12:21:38 -04:00
Jay Berkenbilt	b389268f16	Better handle split content streams (fixes #73 ) When parsing content streams, allow content to be split arbitrarily across stream boundaries.	2017-07-29 12:19:04 -04:00
Jay Berkenbilt	4647acbe3c	Clarify documentation on copyForeignObject (fixes #69 ) Be explicit about the need to keep the source QPDF object around.	2017-07-29 12:19:04 -04:00
Jay Berkenbilt	3a1ff5ded9	Add option to preserve unreferenced objects	2017-07-28 19:19:11 -04:00
Jay Berkenbilt	7f8892525f	Add precheck streams capability When requested, QPDFWriter will do more aggress prechecking of streams to make sure it can actually succeed in decoding them before attempting to do so. This will allow preservation of raw data even when the raw data is corrupted relative to the specified filters.	2017-07-27 23:42:27 -04:00
Jay Berkenbilt	a4fd4b91c6	Convert stream filtering errors to warnings	2017-07-27 18:43:07 -04:00
Jay Berkenbilt	40f00122b8	Convert object parsing errors to warnings QPDFObjectHandle::parseInternal now issues warnings instead of throwing exceptions for all error conditions that it finds (except internal logic errors) and has stronger recovery for things like invalid tokens and malformed dictionaries. This should improve qpdf's ability to recover from a wide range of broken files that currently cause it to fail.	2017-07-27 18:20:31 -04:00
Jay Berkenbilt	dd8dad74f4	Move lexer helper functions to QUtil	2017-07-27 13:59:56 -04:00
Jay Berkenbilt	701b518d5c	Detect recursion loops resolving objects (fixes #51 ) During parsing of an object, sometimes parts of the object have to be resolved. An example is stream lengths. If such an object directly or indirectly points to the object being parsed, it can cause an infinite loop. Guard against all cases of re-entrant resolution of objects.	2017-07-26 06:24:07 -04:00
Jay Berkenbilt	315092dd98	Avoid xref reconstruction infinite loop (fixes #100 ) This is CVE-2017-9209.	2017-07-26 06:24:07 -04:00
Jay Berkenbilt	bd6c845619	Fix typo in comment	2017-07-26 06:24:07 -04:00
Thorsten Schöning	7c08aa4280	Include QPDFExc.hh for use in std::list	2016-01-24 12:07:03 -05:00
Thorsten Schöning	e0201c12cc	Include QPDFObjectHandle for use in std::list QPDFObjectHandle was used as forward declaration, but C++-Builder 10 Seattle can't use it in std::list in such cases because the type is undefined.	2016-01-24 12:04:25 -05:00
Jay Berkenbilt	e0e9d64674	Remove some ABI compatibility private methods Since we have to bump soname, remove some private methods that were just there for binary compatibility	2015-11-10 12:22:40 -05:00
Jay Berkenbilt	0496ab1a6e	Fix spelling errors	2015-10-31 18:56:43 -04:00
Jay Berkenbilt	b8bdef0ad1	Implement deterministic ID For non-encrypted files, determinstic ID generation uses file contents instead of timestamp and file name. At a small runtime cost, this enables generation of the same /ID if the same inputs are converted in the same way multiple times.	2015-10-31 18:56:42 -04:00
Jay Berkenbilt	f77acbdbba	Copyright 2015	2015-05-24 17:26:49 -04:00
Jay Berkenbilt	857bb208d3	include time.h in QUtil.hh QUtil.hh needs time.h to get time_t on some platforms. Thanks Peter Korsgaard <peter@korsgaard.com>	2015-05-24 16:26:05 -04:00
Jay Berkenbilt	a11549a566	Detect loops in /Pages structure Pushing inherited objects to pages and getting all pages were both prone to stack overflow infinite loops if there were loops in the Pages dictionary. There is a general weakness in the code in that any part of the code that traverses the Pages structure would be prone to this and would have to implement its own loop detection. A more robust fix may provide some general method for handling the Pages structure, but it's probably not worth doing. Note: addition of *Internal2 private functions was done rather than changing signatures of existing methods to avoid breaking compatibility.	2015-02-21 19:47:11 -05:00
Jay Berkenbilt	225b018290	Update Copyright to 2014	2014-01-14 15:40:02 -05:00
Jay Berkenbilt	235d8f28f8	Increase random data provider support Add a method to get the current random data provider, and document and test the method for resetting it.	2013-12-16 16:21:28 -05:00
Jay Berkenbilt	5e3bad2f86	Refactor random data generation Add new RandomDataProvider object and implement existing random number generation in terms of that. This enables end users to supply their own random data providers.	2013-12-14 15:17:35 -05:00
Jay Berkenbilt	f010e07c0c	Add missing #include of <string>	2013-10-28 20:59:58 -04:00
Jay Berkenbilt	4229457068	Security: use a secure random number generator If not available, give an error. The user may also configure qpdf to use an insecure random number generator.	2013-10-18 10:45:12 -04:00
Jay Berkenbilt	cee2592ed1	Change API/ABI and withdraw 4.2.0 4.2.0 was binary incompatible in spite of there being no deletions or changes to any public methods. As such, we have to bump the ABI and are fixing some API breakage while we're at it. Previous 4.3.0 target is now 5.1.0.	2013-07-10 11:30:13 -04:00
Jay Berkenbilt	a3576a7359	Bug fix: handle generation > 0 when generating object streams Rework QPDFWriter to always track old object IDs and QPDFObjGen instead of int, thus not discarding the generation number. Switch to QPDF::getCompressibleObjGen() to properly handle the case of an old object eligible for compression that has a generation of other than zero.	2013-06-14 14:58:09 -04:00
Jay Berkenbilt	96eb965115	Use QPDFObjectHandle::getObjGen() where appropriate In internal code and examples, replace calls to getObjectID() and getGeneration() with calls to getObjGen() where possible.	2013-06-14 14:58:09 -04:00
Jay Berkenbilt	5039da0b91	Add QPDFObjectHandle::getObjGen() This is safer than getObjectID() and getGeneration() for many uses.	2013-06-14 14:58:09 -04:00
Jay Berkenbilt	d88231e01e	Promote QPDF::ObjGen to top-level object QPDFObjGen	2013-06-14 14:58:08 -04:00
Jay Berkenbilt	3803e9cc4a	Export terminateParsing in the DLL Windows fix: QPDFObject::ParserCallbacks::terminateParsing() was not declared with QPDF_DLL.	2013-03-11 12:37:32 -04:00
Jay Berkenbilt	9d4f52c014	Clarify documentation on encrypted files Explicitly state how QPDF handles empty passwords when writing files. Apparently some libraries treat the empty string as the owner password as an instruction to generate a random password.	2013-03-11 12:37:32 -04:00
Jay Berkenbilt	29f5830325	Fix getTypeCode and getTypeName work for indirect objects Remove const qualifier from getTypeCode and get getTypeName methods of QPDFObjectHandle, make them work properly for indirect objects, and exercise them much better in the test suite.	2013-03-05 13:35:46 -05:00
Jay Berkenbilt	119f2a4b68	Add method to terminate content stream parsing	2013-03-05 13:35:46 -05:00
Jay Berkenbilt	ac4deac187	Call QUtil::safe_fopen in place of fopen fopen was previuosly called wrapped by QUtil::fopen_wrapper, but QUtil::safe_fopen does this itself, which is less cumbersome.	2013-03-05 13:35:46 -05:00
Jay Berkenbilt	a51ae10b8d	Remove all calls to sprintf	2013-03-05 13:35:46 -05:00
Jay Berkenbilt	30027481f7	Remove all old-style casts from C++ code	2013-03-04 16:45:16 -05:00
Jay Berkenbilt	32b62035ce	Replace many calls to sprintf with QUtil::hex_encode Add QUtil::hex_encode to encode binary data has a hexadecimal string, and use it in place of sprintf where possible.	2013-03-04 16:45:15 -05:00
Jay Berkenbilt	bfda717749	Cosmetic changes to be closer to Adobe terminology Change object type Keyword to Operator, and place the order of the object types in object_type_e in the same order as they are mentioned in the PDF specification. Note that this change only breaks backward compatibility with code that has not yet been released.	2013-01-23 09:38:05 -05:00
Jay Berkenbilt	913eb5ac35	Add getTypeCode() and getTypeName() Add virtual methods to QPDFObject, wrappers to QPDFObjectHandle, and implementations to all the QPDF_Object types.	2013-01-22 10:01:45 -05:00
Jay Berkenbilt	f81152311e	Add QPDFObjectHandle::parseContentStream method This method allows parsing of the PDF objects in a content stream or array of content streams.	2013-01-20 15:35:39 -05:00

1 2 3 4 5 ...

285 Commits