octoleo/qpdf - qpdf - Vast Development Method

mirror of https://github.com/qpdf/qpdf.git synced 2024-12-22 19:08:59 +00:00

Author	SHA1	Message	Date
Jay Berkenbilt	2712869cf9	Fix logic for when to compress object and xref streams (fixes #271 )	2019-01-28 21:43:06 -05:00
Jay Berkenbilt	52f9d326a5	Resolve duplicated page objects (fixes #268 ) When linearizing a file or getting the list of all pages in a file, detect if the pages tree contains a duplicated page object and, if so, shallow copy it. This makes it possible to have a one to one mapping of page positions to page objects.	2019-01-28 20:29:58 -05:00
Jay Berkenbilt	426434c772	Add --overlay and --underlay to qpdf CLI (fixes #207 )	2019-01-27 09:30:13 -05:00
Jay Berkenbilt	2d1db06042	Example of form XObject, page overlay	2019-01-27 07:50:30 -05:00
Jay Berkenbilt	623f5b664e	Convert pages to form XObjects Support conversion of pages to form XObjects and placement of form XObjects on pages.	2019-01-27 07:50:30 -05:00
Jay Berkenbilt	8cb245739c	Add QPDFObjectHandle::getUniqueResourceName	2019-01-27 07:50:30 -05:00
Jay Berkenbilt	009767d97a	Handle inheritable page attributes Add getAttribute for handling inheritable page attributes, and fix getPageImages and annotation flattening code to use it.	2019-01-25 22:30:05 -05:00
Jay Berkenbilt	2d32f4db8f	Handle fallback font size in text appearances If we end up using our fallback font size when generating appearances for text fields, reflect that in the Tf operator used in the appearance stream.	2019-01-21 07:38:21 -05:00
Jay Berkenbilt	930eade6d3	Fix omissions in text appearance generation When generating appearance streams for variable text annotations, properly handle the cases of there being no appearance dictionary, no appearance stream, or an appearance stream with no BMC..EMC marker.	2019-01-20 23:05:58 -05:00
Jay Berkenbilt	65ef0bf313	When flattening, remove annotations with no appearance stream With the exception of form field annotations when /NeedAppearances is true, remove annotations that don't have appearance streams when flattening. There is no reason to keep these when flattening since they are invisible. This may include unchecked checkboxes, unshown popup windows, etc.	2019-01-20 23:05:58 -05:00
Jay Berkenbilt	c2030d1f33	Implement password recovery suppression and password mode (fixes #215 ) Allow fine control over how passwords are encoded for writing, and allow password for reading to be given as a hexadecimal encoded string. Allow suppression of password recovery as a means to ensure that the password you specify is actually the right one.	2019-01-19 10:14:07 -05:00
Jay Berkenbilt	392f2ece51	Try passwords with different string encodings	2019-01-19 10:10:58 -05:00
Jay Berkenbilt	e87d149918	Add QUtil::possible_repaired_encodings	2019-01-17 11:43:56 -05:00
Jay Berkenbilt	966429e718	Update CLI and manual for new encryption granularity (fixes #214 )	2019-01-17 11:43:56 -05:00
Jay Berkenbilt	6ec22f117d	Modernize encryption API for more granularity Setting encryption permissions for R >= 3 set permission bits in groups corresponding to menu options in Acrobat 5. The new API allows the bits to be set individually.	2019-01-17 11:43:56 -05:00
Jay Berkenbilt	4630377731	Add status-reporting transcoders to QUtil	2019-01-17 11:43:56 -05:00
Jay Berkenbilt	8f389f14c0	QUtil::analyze_encoding	2019-01-17 11:43:56 -05:00
Jay Berkenbilt	e09ae710dc	Add tests for shared font/xobject The tests are in a separate commit so the bug-fix commit can be taken as a patch for older versions.	2019-01-17 09:44:29 -05:00
Jay Berkenbilt	654c0e8caf	Allow adding the same page more than once in --pages (fixes #272 )	2019-01-12 10:01:47 -05:00
Jay Berkenbilt	53d8e916b7	Interpret . in --pages as a shortcut for the primary file	2019-01-12 09:59:03 -05:00
Jay Berkenbilt	4ecd1df6f2	Add configure option AVOID_WINDOWS_HANDLE If set, we avoid using Windows I/O HANDLE, which is disallowed in some versions of the Windows SDK, such as for Windows phones. QUtil::same_file will always return false in this case. Only applies to Windows builds.	2019-01-10 22:35:08 -05:00
Jay Berkenbilt	d24a120c7f	Add QPDF::setImmediateCopyFrom	2019-01-10 22:35:08 -05:00
Jay Berkenbilt	1dc235e56d	Add completion files for packagers	2019-01-07 19:56:46 -05:00
Jay Berkenbilt	2d0336d862	Add --disable-check-autofiles to configure	2019-01-07 19:56:36 -05:00
Jay Berkenbilt	8f6f7cec50	Prepare 8.3.0 release	2019-01-07 11:16:54 -05:00
Jay Berkenbilt	74bef044cc	Update release notes for 8.3.0	2019-01-07 11:16:54 -05:00
Jay Berkenbilt	fddbcab0e7	Mostly don't require original QPDF for copyForeignObject (fixes #219 ) The original QPDF is only required now when the source QPDFObjectHandle is a stream that gets its stream data from a QPDFObjectHandle::StreamDataProvider.	2019-01-07 00:11:15 -05:00
Jay Berkenbilt	a70fbaaf50	Honor other base encodings when generating appearances	2019-01-05 23:01:59 -05:00
Jay Berkenbilt	b341d742db	Add WinAnsi and MacRoman encoding	2019-01-05 23:01:44 -05:00
Jay Berkenbilt	089ce5902e	Move utf8_to_utf16 into QUtil	2019-01-05 22:59:27 -05:00
Jay Berkenbilt	ee2aad4381	Add CLI flags for image optimization	2019-01-04 21:33:14 -05:00
Jay Berkenbilt	7b6ab900dc	Support page collation with --collate (fixes #259 )	2019-01-04 15:13:02 -05:00
Jay Berkenbilt	16fd6e64f9	Add QPDFWriter::getFinalVersion (fixes #266 )	2019-01-04 12:37:22 -05:00
Jay Berkenbilt	837dcf8fc2	Don't call assert while checking linearization data (fixes #209 , #231 ) Instead of calling assert for problems found during checking linearization data, throw an exception which is later caught and issued as an error. Ideally we would handle errors more robustly, but this is still a significant improvement.	2019-01-04 11:55:42 -05:00
Jay Berkenbilt	a01359189b	Fix dangling references (fixes #240 ) On certain operations, such as iterating through all objects and adding new indirect objects, walk through the entire object structure and explicitly resolve any indirect references to non-existent objects. That prevents new objects from springing into existence and causing the previously dangling references to point to them.	2019-01-04 10:29:29 -05:00
Jay Berkenbilt	158156d506	Add basic appearance stream generation	2019-01-04 08:00:19 -05:00
Jay Berkenbilt	02281632cc	Add QUtil::utf8_to_ascii	2019-01-03 23:18:13 -05:00
Jay Berkenbilt	ca94ac68d9	Honor flags when flattening annotations	2019-01-03 11:59:55 -05:00
Jay Berkenbilt	06d6438ddf	Minor fixes	2019-01-03 09:17:43 -05:00
Jay Berkenbilt	f78ea057ca	Switch annotation flattening to use the form xobjects Instead of directly putting the contents of the annotation appearance streams into the page's content stream, add commands to render the form xobjects directly. This is a more robust way to do it than the original solution as it works properly with patterns and avoids problems with resource name clashes between the pages and the form xobjects.	2019-01-02 21:49:47 -05:00
Jay Berkenbilt	3b8ce4f12a	Annotation flattening including form fields Flatten annotations by integrating their appearance streams into the content stream of the containing page. In the case of form fields, only flatten if /NeedAppearances is false (or equivalently absent). If flattening form fields, also remove /AcroForm from the document catalog.	2019-01-01 08:14:15 -05:00
Jay Berkenbilt	95d6b17a89	Add QPDFObjectHandle::mergeDictionary()	2019-01-01 08:12:56 -05:00
Jay Berkenbilt	5059ec0d35	Add Matrix class under QPDFObjectHandle	2018-12-31 23:02:43 -05:00
Jay Berkenbilt	6048c6e2f0	Don't crash on @file when file doesn't exist (fixes #265 ) When @file is used and file doesn't exist, just treat it as a normal argument.	2018-12-23 11:46:56 -05:00
Jay Berkenbilt	64c1579544	Support zsh completion	2018-12-23 11:21:59 -05:00
Jay Berkenbilt	24aeb9ae22	Document json support	2018-12-22 14:05:01 -05:00
Jay Berkenbilt	bb89382f93	Allow --show-object=trailer	2018-12-21 19:11:57 -05:00
Jay Berkenbilt	dd1aca552c	Support bash completion using complete -C	2018-12-21 19:11:57 -05:00
Jay Berkenbilt	313ba08126	Preserve some outline functionality in page splitting	2018-12-21 19:11:57 -05:00
Jay Berkenbilt	d5d179f441	Add document and object helpers for outlines (bookmarks)	2018-12-21 19:11:57 -05:00
Jay Berkenbilt	30a0c070e4	Add QPDFObjectHandle::getJSON()	2018-12-21 18:34:56 -05:00
Jay Berkenbilt	651179b5da	Add simple JSON serializer	2018-12-21 18:34:56 -05:00
Jay Berkenbilt	0776c00129	Add QPDFNameTreeObjectHelper	2018-12-21 18:34:56 -05:00
Jay Berkenbilt	352ce9b22b	Preserve page labels (numbers) when splitting and merging	2018-12-18 16:59:24 -05:00
Jay Berkenbilt	6ef9e31233	Add QPDFPageLabelDocumentHelper	2018-12-18 16:59:24 -05:00
Jay Berkenbilt	f38df27aa3	Add QPDFNumberTreeObjectHelper	2018-12-18 16:46:10 -05:00
Jay Berkenbilt	077d3d4512	Add QPDFObjectHandle::wrapInArray() Wrap an object in an array if it is not already an array.	2018-12-18 16:45:48 -05:00
Jay Berkenbilt	a5ee55f2e8	ChangeLog	2018-10-11 19:16:26 -04:00
Jay Berkenbilt	4628461383	Set up Azure Pipelines Use free Azure Pipelines to do Linux, Windows, and Mac build and test and to generate Windows binary distributions.	2018-10-11 15:07:51 -04:00
Jay Berkenbilt	6ee761fc86	Prepare 8.2.1 release	2018-08-18 10:56:19 -04:00
Jay Berkenbilt	28453a4908	Add --keep-files-open flag (fixes #237 )	2018-08-18 10:56:01 -04:00
Jay Berkenbilt	5e9e17e62a	Prepare 8.2.0 release	2018-08-16 11:53:10 -04:00
Jay Berkenbilt	723b054bf9	Spell check	2018-08-16 11:53:10 -04:00
Jay Berkenbilt	e37ce85190	Clarify static vs. import library on Windows (fixes #225 )	2018-08-14 16:57:37 -04:00
Jay Berkenbilt	b4bdc42b4f	New exception class QPDFSystemError (fixes #221 )	2018-08-13 20:01:51 -04:00
Jay Berkenbilt	fb1e29476c	Add --no-warn option to suppress warnings (fixes #232 )	2018-08-12 22:20:40 -04:00
Jay Berkenbilt	3d6615b276	Pl_Buffer: reduce memory growth (fixes #228 ) Rather than keeping a list of buffers for every write, accumulate bytes in a single buffer, doubling the size of the buffer when needed to accommodate new data. This is not the best possible implementation, but the change was implemented in this way to avoid changing the shape of Pl_Buffer and thus breaking backward compatibility.	2018-08-12 17:45:43 -04:00
Jay Berkenbilt	4a4736c695	Fix EOL handling inside strings (fixes #226 ) CR, CRLF, and LF are all supposed to be treated as LF; only one EOL is to be ignored after backslash.	2018-08-05 20:48:35 -04:00
Jay Berkenbilt	e1cd5891af	Fix infinite loop on small files with progress reporting (fixes #230 ) Turns out you can keep adding zero to a number over and over again and it just doesn't get any bigger. Who would have known?	2018-08-05 15:43:34 -04:00
Jay Berkenbilt	fe769f2723	Keep file open while adding its pages during merge (fixes #217 )	2018-08-04 19:58:13 -04:00
Jay Berkenbilt	4f4c627b77	ClosedFileInputSource: add method to keep file open During periods of intensive operation on a specific file, this method can reduce the overhead of repeated open/close operations.	2018-08-04 19:52:46 -04:00
Jay Berkenbilt	1bd2a2e79b	Prepare 8.1.0 release	2018-06-23 07:50:11 -04:00
Jay Berkenbilt	6bf47ac6e8	With --verbose, give information on processing merge inputs	2018-06-22 16:14:54 -04:00
Jay Berkenbilt	a433ed24f9	Add progress reporting for QPDFWriter (fixes #200 )	2018-06-22 16:14:54 -04:00
Jay Berkenbilt	2a82f6e1e0	Add method to get count of objects in QPDF	2018-06-22 15:53:40 -04:00
Jay Berkenbilt	99593e0eef	Use ClosedFileInputSource when merging files (fixes #154 )	2018-06-22 12:53:41 -04:00
Jay Berkenbilt	4ccc8b1a44	Add ClosedFileInputSource ClosedFileInputSource is an input source that keeps the file closed when not reading it.	2018-06-22 12:52:45 -04:00
Jay Berkenbilt	c71dc6888c	Don't prune resource dictionaries on errors or by request If we are unable to filter a page's content streams, don't attempt to remove objects from the page's resource dictionary. Also provide a command line option to suppress resource removal in case we ever need this as a workaround for some bug or broken PDF files.	2018-06-22 10:45:31 -04:00
Jay Berkenbilt	6c89d4b35b	When splitting files, remove unreferenced objects (fixes #203 )	2018-06-21 21:03:30 -04:00
Jay Berkenbilt	84cd53f5af	Make page range optional in --rotate (fixes #211 )	2018-06-21 16:28:44 -04:00
Jay Berkenbilt	2e8a3e163f	Add interactive form example	2018-06-21 16:04:54 -04:00
Jay Berkenbilt	397b097c46	Allow setting a form field's value	2018-06-21 15:57:13 -04:00
Jay Berkenbilt	952a665a4e	Better support for creating Unicode strings	2018-06-21 15:57:13 -04:00
Jay Berkenbilt	0b05111db8	Implement helper class for interactive forms	2018-06-21 15:57:13 -04:00
Jay Berkenbilt	2e6e1204a5	Convert examples to use new page helper classes	2018-06-21 15:57:13 -04:00
Jay Berkenbilt	2e7ee23bf6	Add QPDFPageDocumentHelper and QPDFPageObjectHelper This is the beginning of higher-level API support using helper classes. The goal is to be able to add more helpers without continuing to pollute QPDF's and QPDFObjectHandle's public interfaces.	2018-06-21 15:57:13 -04:00
Jay Berkenbilt	4cded10821	Add QPDFObjectHandle::Rectangle type Provide a convenient way of accessing rectangles.	2018-06-21 15:57:13 -04:00
Jay Berkenbilt	078cf9bf90	newline before endstream fix for object streams (fixes #205 )	2018-05-12 13:17:43 -04:00
Jay Berkenbilt	b4d6cf6836	Limit depth of nesting in direct objects (fixes #202 ) This fixes CVE-2018-9918.	2018-04-15 16:11:22 -04:00
Jay Berkenbilt	f8c8e4dcc0	Prepare 8.0.2 release	2018-03-06 11:34:07 -05:00
Jay Berkenbilt	e4e2e26d99	Properly handle pages with no contents (fixes #194 ) Remove calls to assertPageObject(). All cases in the library that called assertPageObject() work fine if you don't call assertPageObject() because nothing assumes anything that was being checked by that call. Removing the calls enables more files to be successfully processed.	2018-03-06 11:34:07 -05:00
Jay Berkenbilt	ee44aef8d0	Treat loop in xref tables as damage (fixes #192 ) Prior to this fix, if there was a loop detected in following /Prev pointers in xref streams/tables, it would cause qpdf to lose data. Note that this condition causes many PDF readers to hang or fail.	2018-03-05 14:26:58 -05:00
Jay Berkenbilt	6fe1e9de40	Prepare 8.0.1 release	2018-03-04 07:16:20 -05:00
Jay Berkenbilt	666f794393	Support "r" in page ranges (fixes #155 )	2018-03-04 07:05:14 -05:00
Jay Berkenbilt	7b9f23a99a	Ignore zlib data check errors (fixes #191 )	2018-03-03 11:35:01 -05:00
Jay Berkenbilt	3e8b643ae3	Release 8.0.0	2018-02-25 16:00:11 -05:00
Jay Berkenbilt	4bb3046f0b	Properly handle strings with PDF Doc Encoding (fixes #179 ) The QPDF_String::getUTF8Val() method was not treating strings that weren't explicitly Unicode as PDF Doc Encoded. This only affects characters in the range 0x80 through 0xa0.	2018-02-18 21:06:27 -05:00
Jay Berkenbilt	2780a1871d	Add C API for checking PDF files	2018-02-18 21:06:27 -05:00
Jay Berkenbilt	d0e99f195a	More robust handling of type errors Give objects descriptions and context so it is possible to issue warnings instead of fatal errors for attempts to access objects of the wrong type.	2018-02-18 21:06:27 -05:00
Jay Berkenbilt	c2e16827b6	Replace "file position" with "offset" in error messages Sometimes it's an offset in an object stream or a content stream, so file position is confusing in some cases.	2018-02-18 21:06:27 -05:00
Jay Berkenbilt	52e024f701	Include omitted object description in error message	2018-02-18 21:06:27 -05:00
Jay Berkenbilt	cb3b705cf9	Include filename in object stream parse error	2018-02-18 21:06:27 -05:00
Jay Berkenbilt	5708b5d0aa	Add additional interface for filtering page contents	2018-02-18 21:05:47 -05:00
Jay Berkenbilt	510d45d00d	General comment in ChangeLog	2018-02-18 21:05:47 -05:00
Jay Berkenbilt	5136238f2a	Detect and report bad tokens in content normalization	2018-02-18 21:05:47 -05:00
Jay Berkenbilt	30709935af	Filter tokens example	2018-02-18 21:05:47 -05:00
Jay Berkenbilt	9910104442	Implement TokenFilter and refactor Pl_QPDFTokenizer Implement a TokenFilter class and refactor Pl_QPDFTokenizer to use a TokenFilter class called ContentNormalizer. Pl_QPDFTokenizer is now a general filter that passes data through a TokenFilter.	2018-02-18 21:05:46 -05:00
Jay Berkenbilt	b8723e97f4	Add coalesce contents capability	2018-02-18 21:05:46 -05:00
Jay Berkenbilt	25988e8d10	Bug fix: content normalizer should not add trailing newline Adding a trailing newline in content normalization damages files whose contents are split across streams in the middle of tokens. Let QPDFWriter add the newline with the indicator to ignore the newline, which it already does. This changes the way some qdf files look.	2018-02-18 21:05:46 -05:00
Jay Berkenbilt	6afe83978f	Switch from parseContentStream to parsePageContents	2018-02-18 21:05:46 -05:00
Jay Berkenbilt	fcd611b61e	Refactor parseContentStream	2018-02-18 21:05:46 -05:00
Jay Berkenbilt	fefe25030e	Inline image token type	2018-02-18 21:05:46 -05:00
Jay Berkenbilt	d97474868d	Lexer enhancements: EOF, comment, space Significant enhancements to the lexer to improve EOF handling and to support comments and spaces as tokens. Various other minor issues were fixed as well.	2018-02-18 20:18:40 -05:00
Jay Berkenbilt	ebd5ed63de	Add option to save pass 1 of linearization This is useful only for debugging the linearization code.	2018-02-18 20:18:40 -05:00
Jay Berkenbilt	2ebdd6929e	Prepare 7.1.1 release	2018-02-04 18:31:42 -05:00
Jay Berkenbilt	2e4ca7ecf4	Update version numbers for 7.1.0	2018-01-14 20:09:20 -05:00
Jay Berkenbilt	569d74d36b	Allow raw encryption key to be specified Add options to enable the raw encryption key to be directly shown or specified. Thanks to Didier Stevens <didier.stevens@gmail.com> for the idea and contribution of one implementation of this idea.	2018-01-14 10:21:05 -05:00
Jay Berkenbilt	791e0db762	Allow trailing . in numeric token (fixes #165 )	2018-01-13 20:05:40 -05:00
Jay Berkenbilt	6299c64cf3	Use correct link directory order (fixes #158 ) Make sure to link from the source tree before linking from the system. In many environments, this is necessary to allow a newly built qpdf to link properly instead of trying to link or resolve libraries from an older installed version.	2018-01-13 19:53:52 -05:00
Jay Berkenbilt	ec0087e3ce	Support TIFF Predictor (fixes #171 )	2018-01-13 19:49:42 -05:00
Jay Berkenbilt	48864b8d6e	Clarify documentation of advanced parsing options	2017-12-25 18:42:33 -05:00
Jay Berkenbilt	794b649e5b	Update TODO and ChangeLog. Fixes #166 , #83	2017-12-25 18:29:18 -05:00
Jay Berkenbilt	0f1ce8e646	Prepare 7.0.0 release	2017-09-16 13:22:15 -04:00
Jay Berkenbilt	07c8bb2843	Additionally license under Apache License version 2.0 The Apache License version 2.0 is now the primary license for qpdf. However, users may, at their option, continue to use Artistic version 2.0.	2017-09-14 12:59:25 -04:00
Jay Berkenbilt	d31a7b76e7	Improve message for stream decoding error Tweak the message so that we inform the user that we are mitigating data loss.	2017-09-12 16:03:48 -04:00
Jay Berkenbilt	eaacf94005	Update C API with new QPDFWriter methods	2017-09-12 14:30:39 -04:00
Jay Berkenbilt	ad527a64f9	Parse iteratively to avoid stack overflow (fixes #146 )	2017-08-25 21:56:45 -04:00
Jay Berkenbilt	85f05cc57f	Detect xref pointer infinite loop (fixes #149 )	2017-08-25 19:58:31 -04:00
Jay Berkenbilt	1e52d33822	Bump soname to 18 and version to 7.0.b1	2017-08-22 16:50:48 -04:00
Jay Berkenbilt	6219111ed7	Update references to README files Most of the README files have been renamed. Refer to the new names.	2017-08-22 14:13:10 -04:00
Jay Berkenbilt	4b908ade70	Update header documentation and ChangeLog entry for PCLm	2017-08-21 21:05:44 -04:00
Jay Berkenbilt	9744414c66	Enable finer grained control of stream decoding This commit adds several API methods that enable control over which types of filters QPDF will attempt to decode. It also adds support for /RunLengthDecode and /DCTDecode filters for both encoding and decoding.	2017-08-21 17:44:22 -04:00
Jay Berkenbilt	ae0399ef87	Revert "Add page rotation example in contrib" This reverts commit `8ee83ca722`. This is being removed because qpdf now has its own page rotation. The example was an excellent contribution to qpdf, but now it illustrates rotating pages "by hand", which is no longer needed because of QPDFObjectHandle::rotatePage.	2017-08-12 22:58:11 -04:00
Jay Berkenbilt	cfa2eb97fb	Add page rotation (fixes #132 )	2017-08-12 22:57:38 -04:00
Jay Berkenbilt	d926d78059	Add --verbose flag	2017-08-12 12:30:18 -04:00
Jay Berkenbilt	df33c368b4	Change --single-pages to --split-pages This is in preparation for implementing page groups.	2017-08-12 11:49:04 -04:00
Jay Berkenbilt	36b3fe5af7	Fix --newline-before-endstream option (fixes #133 ) Add a newline unconditionally before endstream even if a newline was already written as part of the stream data.	2017-08-11 20:57:05 -04:00
Jay Berkenbilt	8fe0b06cd8	Pad encryption parameters that are too short (fixes #96 )	2017-08-11 19:53:56 -04:00
Jay Berkenbilt	9a96e233b0	Remove PCRE	2017-08-10 21:30:32 -04:00
Jay Berkenbilt	30f109e244	Read xref table without PCRE Also accept more errors than before.	2017-08-10 21:30:32 -04:00
Jay Berkenbilt	ca5b1d267a	Improve stream length recovery Eliminate PCRE and find endobj not preceded by endstream. Be more lax about placement of endstream and endobj.	2017-08-10 21:30:32 -04:00
Jay Berkenbilt	c5dc6d8067	Remove unused PointerHolder interface Also fix a bug resulting from incorrect use of PointerHolder because of this unused parameter.	2017-08-10 19:01:38 -04:00
Jay Berkenbilt	49825e5cb6	Add --split-pages option (fixes #30 )	2017-08-05 10:22:33 -04:00
Jay Berkenbilt	909daf9543	Move page spec processing earlier	2017-08-05 10:22:33 -04:00
Jay Berkenbilt	c88eaae2f2	Fix off-by-one error in --pages argument parsing (fixes #129 )	2017-08-02 21:08:43 -04:00
iskander.sharipov	8ee83ca722	Add page rotation example in contrib This is added to contrib rather than examples because it requires c++-11 and lacks a test suite, but it is still useful enough to include with the distribution.	2017-07-30 08:55:15 -04:00
Jay Berkenbilt	2d5b854468	Allow reading command-line args from files (fixes #16 )	2017-07-29 22:23:21 -04:00
Jay Berkenbilt	5993c3e83c	Detect input file = output file (fixes #29 )	2017-07-29 20:58:01 -04:00
Jay Berkenbilt	885b8781cc	Allow --check to coexist with and precede other operations (fixes #42 )	2017-07-29 19:56:21 -04:00
Jay Berkenbilt	b43a0ac237	When recovering stream length, indicate the length (fixes #44 )	2017-07-29 19:15:06 -04:00
Jay Berkenbilt	f37d399d82	Add newline-before-endstream option (fixes #103 )	2017-07-29 12:21:38 -04:00
Jay Berkenbilt	6a7d53ad2b	Handle zlib data errors better (fixes #106 )	2017-07-29 12:19:04 -04:00
Jay Berkenbilt	07d6f770b2	Better recovery of bad stream start (fixes #104 )	2017-07-29 12:19:04 -04:00
Jay Berkenbilt	b389268f16	Better handle split content streams (fixes #73 ) When parsing content streams, allow content to be split arbitrarily across stream boundaries.	2017-07-29 12:19:04 -04:00
Jay Berkenbilt	3a1ff5ded9	Add option to preserve unreferenced objects	2017-07-28 19:19:11 -04:00
Jay Berkenbilt	7f8892525f	Add precheck streams capability When requested, QPDFWriter will do more aggressive prechecking of streams to make sure it can actually succeed in decoding them before attempting to do so. This will allow preservation of raw data even when the raw data is corrupted relative to the specified filters.	2017-07-27 23:42:27 -04:00
Jay Berkenbilt	a4fd4b91c6	Convert stream filtering errors to warnings	2017-07-27 18:43:07 -04:00
Jay Berkenbilt	40f00122b8	Convert object parsing errors to warnings QPDFObjectHandle::parseInternal now issues warnings instead of throwing exceptions for all error conditions that it finds (except internal logic errors) and has stronger recovery for things like invalid tokens and malformed dictionaries. This should improve qpdf's ability to recover from a wide range of broken files that currently cause it to fail.	2017-07-27 18:20:31 -04:00
Jay Berkenbilt	ac3c81a8ed	Include tests for other infinite loop bugs (fixes #117, #118, #119, #120) Several other infinite loop bugs were fixed by previous changes. Include their test files in the test suite.	2017-07-26 06:24:07 -04:00
Jay Berkenbilt	12db09898e	Don't interpret word tokens in content streams (fixes #82 )	2017-07-26 06:24:07 -04:00
Jay Berkenbilt	701b518d5c	Detect recursion loops resolving objects (fixes #51 ) During parsing of an object, sometimes parts of the object have to be resolved. An example is stream lengths. If such an object directly or indirectly points to the object being parsed, it can cause an infinite loop. Guard against all cases of re-entrant resolution of objects.	2017-07-26 06:24:07 -04:00
Jay Berkenbilt	afe0242b26	Handle object ID 0 (fixes #99 ) This is CVE-2017-9208. The QPDF library uses object ID 0 internally as a sentinel to represent a direct object, but prior to this fix, was not blocking handling of 0 0 obj or 0 0 R as a special case. Creating an object in the file with 0 0 obj could cause various infinite loops. The PDF spec doesn't allow for object 0. Having qpdf handle object 0 might be a better fix, but changing all the places in the code that assumes objid == 0 means direct would be risky.	2017-07-26 06:24:07 -04:00
Jay Berkenbilt	315092dd98	Avoid xref reconstruction infinite loop (fixes #100 ) This is CVE-2017-9209.	2017-07-26 06:24:07 -04:00
Jay Berkenbilt	603f222365	Fix infinite loop while reporting an error (fixes #101 ) This is CVE-2017-9210. The description string for an error message included unparsing an object, which is too complex of a thing to try to do while throwing an exception. There was only one example of this in the entire codebase, so it is not a pervasive problem. Fixing this eliminated one class of infinite loop errors.	2017-07-26 06:24:07 -04:00
Jay Berkenbilt	b7302a9b72	Prepare 6.0.0 release	2015-11-10 12:48:52 -05:00
Jay Berkenbilt	e5abc789a2	Prepare 5.2.0 release	2015-11-01 16:40:01 -05:00
Jay Berkenbilt	b62cbe2508	Tolerate some mangled xref tables If xref table entries lack the spec-required trailing whitespace or contain a small amount of extra space, handle them anyway.	2015-10-31 18:56:43 -04:00
Jay Berkenbilt	b8bdef0ad1	Implement deterministic ID For non-encrypted files, deterministic ID generation uses file contents instead of timestamp and file name. At a small runtime cost, this enables generation of the same /ID if the same inputs are converted in the same way multiple times.	2015-10-31 18:56:42 -04:00
Jay Berkenbilt	94e55394ed	Prepare 5.1.3 release	2015-05-24 17:26:49 -04:00
Jay Berkenbilt	b356b9dfa2	fix-qdf: handle object streams with > 255 objects fix-qdf was previously hard-coding the number of bytes for the f2 field of the xref stream entry. This addresses issue #37. Thanks aluebcke for reporting.	2015-05-24 16:52:42 -04:00
Jay Berkenbilt	cf43882e9f	Handle Microsoft crypt provider without prior keys As reported in issue #40, a call to CryptAcquireContext in SecureRandomDataProvider fails in a fresh windows install prior to any user keys being created in AppData\Roaming\Microsoft\Crypto\RSA. Thanks michalrames.	2015-05-24 16:52:42 -04:00
Jay Berkenbilt	857bb208d3	include time.h in QUtil.hh QUtil.hh needs time.h to get time_t on some platforms. Thanks Peter Korsgaard <peter@korsgaard.com>	2015-05-24 16:26:05 -04:00
Jay Berkenbilt	a11549a566	Detect loops in /Pages structure Pushing inherited objects to pages and getting all pages were both prone to stack overflow infinite loops if there were loops in the Pages dictionary. There is a general weakness in the code in that any part of the code that traverses the Pages structure would be prone to this and would have to implement its own loop detection. A more robust fix may provide some general method for handling the Pages structure, but it's probably not worth doing. Note: addition of *Internal2 private functions was done rather than changing signatures of existing methods to avoid breaking compatibility.	2015-02-21 19:47:11 -05:00
Jay Berkenbilt	28a9df5119	Avoid buffer overrun copying digest Converting a password to an encryption key is supposed to copy up to a certain number of bytes from a digest. Make sure never to copy more than the size of the digest.	2015-02-21 17:51:08 -05:00
Jay Berkenbilt	c729e07d55	Avoid resolving arguments to R When checking two objects preceding R while parsing, ensure that the objects are direct. This avoids stuff like 1 0 obj containing 1 0 R 0 R from causing an infinite loop in object resolution.	2015-02-21 17:51:08 -05:00
Jay Berkenbilt	d8900c2255	Handle page tree node with no /Type Originally reported here: https://bugs.launchpad.net/ubuntu/+source/qpdf/+bug/1397413 The PDF specification says that the /Type key for nodes in the pages dictionary (both /Page and /Pages) is required, but some PDF files omit them. Use the presence of other keys to determine the type of pages tree node this is if the type key is not found.	2014-12-29 10:17:21 -05:00
Jay Berkenbilt	caab1b0e16	Handle pages with no /Contents from getPageContents() The spec allows /Contents to be omitted for pages that are blank, but QPDFObjectHandle::getPageContents() was throwing an exception in this case.	2014-11-14 13:43:34 -05:00
Jay Berkenbilt	4071db59aa	Prepare 5.1.2 release	2014-06-07 17:16:52 -04:00
Jay Berkenbilt	3c5e602a1e	Windows build (msvc): target Windows 5.0.1 (XP) Without this, qpdf executables work only on Vista or newer. Fixes #35	2014-06-07 17:16:50 -04:00
Jay Berkenbilt	0b2e9cb168	Example: fast split into single pages This is faster than using qpdf --pages to do it.	2014-06-07 16:40:38 -04:00
Jay Berkenbilt	9f8aba1db7	Handle indirect stream filter/decode parameters QPDFWriter was trying to make /Filter and /DecodeParms direct in all cases, but there are some cases where /DecodeParms may refer to a stream, which can't be direct. QPDFWriter doesn't actually need /DecodeParms to be direct in that case because it won't be able to filter the stream. Until we can handle this type of stream, just don't make /Filter and /DecodeParms direct if we can't filter the stream anyway. Fixes #34	2014-06-07 16:31:03 -04:00
Jay Berkenbilt	b0a96ce6aa	Fix calculation of xref stream stream columns Fix problem: if the last object in the first part of a linearized file had an offset that was below 65536 by less than the size of the hint stream, the xref stream was invalid and the resulting file is not usable.	2014-02-22 22:13:31 -05:00
Jay Berkenbilt	247d70efee	Prepare 5.1.1 release	2014-01-14 15:45:35 -05:00
Jay Berkenbilt	c9a9fe9c2f	Avoid traversing same object twice when copying objects This is a performance fix. The output is unchanged. Fixes #28.	2013-12-26 11:51:50 -05:00
Jay Berkenbilt	0b6127558d	Prepare 5.1.0 release	2013-12-17 15:26:07 -05:00
Jay Berkenbilt	235d8f28f8	Increase random data provider support Add a method to get the current random data provider, and document and test the method for resetting it.	2013-12-16 16:21:28 -05:00
Jay Berkenbilt	30287d2d65	Allow OS-provided secure random to be disabled	2013-12-14 15:17:36 -05:00
Jay Berkenbilt	5e3bad2f86	Refactor random data generation Add new RandomDataProvider object and implement existing random number generation in terms of that. This enables end users to supply their own random data providers.	2013-12-14 15:17:35 -05:00
Jay Berkenbilt	e9a319fb95	Allow arbitrary whitespace, not just newline, after xref Fixes #27.	2013-12-14 15:17:23 -05:00
Jay Berkenbilt	478c05fcab	Allow -DNO_GET_ENVIRONMENT to avoid GetEnvironmentVariable If NO_GET_ENVIRONMENT is #defined at compile time on Windows, do not call GetEnvironmentVariable. QUtil::get_env will always return false. This option is not available through configure. This was added to support a specific user's requirements to avoid calling GetEnvironmentVariable from the Windows API. Nothing in qpdf outside the test coverage system in qtest relies on QUtil::get_env.	2013-11-30 15:58:32 -05:00
Jay Berkenbilt	88c29873e5	Add /FS flag (msvc) for parallel builds	2013-11-30 15:58:32 -05:00
Jay Berkenbilt	b75b19589d	Add more detail to previous ChangeLog entry	2013-11-30 15:58:32 -05:00
Jay Berkenbilt	dc9df97466	Include <algorithm> for std::min, std::max	2013-11-29 10:48:16 -05:00
Jay Berkenbilt	157c936b97	Use 8 bit per sample images in tests In compare image tests, use the gs device tiff24nc instead of tiff12nc since the 4 bit per sample images created by tiff12nc could sometimes trigger a bug in tiffcmp. Fixes #20.	2013-11-21 13:41:37 -05:00
Jay Berkenbilt	c1e39381fa	Add a ChangeLog note for previous fix	2013-11-21 13:30:58 -05:00
Jay Berkenbilt	e1bd72b46c	Prepare for 5.0.1 release	2013-10-18 13:51:30 -04:00
Jay Berkenbilt	a237e92445	Warn when -accessibility=n will be ignored Also accept -accessibility=n with 256 bit keys even though it will be ignored.	2013-10-18 10:45:15 -04:00
Jay Berkenbilt	ac9c1f0d56	Security: replace operator[] with at For std::string and std::vector, replace operator[] with at. This was done using an automated process. See README.hardening for details.	2013-10-18 10:45:14 -04:00
Jay Berkenbilt	4229457068	Security: use a secure random number generator If not available, give an error. The user may also configure qpdf to use an insecure random number generator.	2013-10-18 10:45:12 -04:00
Jay Berkenbilt	e19eb579b2	Replace some assertions with std::logic_error Ideally, the library should never call assert outside of test code, but it does in several places. For some cases where the assertion might conceivably fail because of a problem with the input data, replace assertions with exceptions so that they can be trapped by the calling application. This commit surely misses some cases and replaced some cases unnecessarily, but it should still be an improvement.	2013-10-09 20:57:14 -04:00
Jay Berkenbilt	0bfe902489	Security: avoid pre-allocating vectors based on file data In places where std::vector<T>(size_t) was used, either validate that the size parameter is sane or refactor code to avoid the need to pre-allocate the vector.	2013-10-09 20:57:14 -04:00
Jay Berkenbilt	10bceb552f	Security: sanitize /W in xref stream The /W array was not sanitized, possibly causing an integer overflow in a multiplication. An analysis of the code suggests that there were no possible exploits based on this since the problems were in checking expected values but bounds checks were performed on actual values.	2013-10-09 20:57:07 -04:00
Jay Berkenbilt	3eb4b066ab	Security: better bounds checks for linearization data The faulty code was only used during explicit checks of linearization data. Those checks are not part of normal reading or writing of PDF files.	2013-10-09 19:50:09 -04:00
Jay Berkenbilt	b097d7a81b	Security: handle empty name in normalizeName	2013-10-09 19:50:09 -04:00
Jay Berkenbilt	eb1b1264b4	Security: fix potential multiplication overflow Better sanity check inputs to bit stream reader	2013-10-09 19:50:09 -04:00
Jay Berkenbilt	c2e91d8ec3	Security: keep cur_byte pointing into bytes array	2013-10-09 19:50:07 -04:00
Jay Berkenbilt	66e63b8667	Prepare 5.0.0 release	2013-07-10 12:29:13 -04:00
Jay Berkenbilt	cee2592ed1	Change API/ABI and withdraw 4.2.0 4.2.0 was binary incompatible in spite of there being no deletions or changes to any public methods. As such, we have to bump the ABI and are fixing some API breakage while we're at it. Previous 4.3.0 target is now 5.1.0.	2013-07-10 11:30:13 -04:00
Jay Berkenbilt	f31e526d67	Prepare 4.2.0 release	2013-07-07 19:43:16 -04:00
Jay Berkenbilt	b84f57e56d	Ignore broken DecodeParms for stream with no filters	2013-07-07 19:43:16 -04:00
Jay Berkenbilt	91367239fd	Add --show-npages option to qpdf	2013-07-07 19:43:16 -04:00
Jay Berkenbilt	adccedc02f	Allow numeric range to be omitted qpdf --pages Detect a missing page range and assume 1-z.	2013-07-07 19:43:16 -04:00
Jay Berkenbilt	a85007cb0d	Handle more broken files Space rather than newline after xref, missing /ID in trailer for encrypted file. This enables qpdf to handle some files that xpdf can handle. Adobe Reader can't necessarily handle them.	2013-06-15 12:40:01 -04:00
Jay Berkenbilt	16051788ed	Handle /Outlines dictionary being a direct object Even though this case is not valid according to the spec, it has been seen, and caused an internal error.	2013-06-14 21:36:04 -04:00
Jay Berkenbilt	eae8370cd9	Add optional /Length key in crypt filter dictionary	2013-06-14 20:42:39 -04:00
Jay Berkenbilt	a3576a7359	Bug fix: handle generation > 0 when generating object streams Rework QPDFWriter to always track old object IDs and QPDFObjGen instead of int, thus not discarding the generation number. Switch to QPDF::getCompressibleObjGen() to properly handle the case of an old object eligible for compression that has a generation of other than zero.	2013-06-14 14:58:09 -04:00
Jay Berkenbilt	5039da0b91	Add QPDFObjectHandle::getObjGen() This is safer than getObjectID() and getGeneration() for many uses.	2013-06-14 14:58:09 -04:00
Jay Berkenbilt	d88231e01e	Promote QPDF::ObjGen to top-level object QPDFObjGen	2013-06-14 14:58:08 -04:00
Jay Berkenbilt	f02c5f5e12	Final preparation for 4.1.0 release	2013-04-14 15:03:51 -04:00
Jay Berkenbilt	e8ddac8950	Document casting policy	2013-03-25 14:37:25 -04:00
Jay Berkenbilt	49c7681c58	Windows install: check DLL type When copying dlls, make sure to only consider DLLs whose type matches the type of what is loading them.	2013-03-11 14:10:37 -04:00
Jay Berkenbilt	197af341de	Use ./install-sh instead of install -c	2013-03-07 11:29:56 -05:00
Jay Berkenbilt	119f2a4b68	Add method to terminate content stream parsing	2013-03-05 13:35:46 -05:00
Jay Berkenbilt	fd64959398	Favor strerror_s and fopen_s on MSVC Make remaining calls to fopen and strerror use strerror_s and fopen_s on MSVC.	2013-03-05 13:35:46 -05:00
Jay Berkenbilt	ac4deac187	Call QUtil::safe_fopen in place of fopen fopen was previously called wrapped by QUtil::fopen_wrapper, but QUtil::safe_fopen does this itself, which is less cumbersome.	2013-03-05 13:35:46 -05:00
Jay Berkenbilt	a51ae10b8d	Remove all calls to sprintf	2013-03-05 13:35:46 -05:00
Jay Berkenbilt	8be8277613	Rewrite QUtil::int_to_string and QUtil::double_to_string Make them safer by avoiding any internal limits and replacing sprintf with std::ostringstream.	2013-03-04 16:45:16 -05:00
Jay Berkenbilt	a11081085b	Handle warning flags better Make --enable-werror work properly on msvc, handle extra warnings flags for msvc in configure.ac instead of hardcoding into make/msvc.mk, separate warnings flags into WFLAGS in autoconf.mk to avoid duplication and to make it easier to override.	2013-03-04 16:45:15 -05:00
Jay Berkenbilt	32b62035ce	Replace many calls to sprintf with QUtil::hex_encode Add QUtil::hex_encode to encode binary data as a hexadecimal string, and use it in place of sprintf where possible.	2013-03-04 16:45:15 -05:00
Jay Berkenbilt	6c7bf114dc	Bug fix: properly handle overridden compressed objects When caching objects in an object stream, only cache objects that still resolve to that stream. See Changelog mod from this commit for details.	2013-02-23 17:51:17 -05:00
Jay Berkenbilt	7e7c93951f	Do not remove libqpdf.la Some distributions (like Debian) don't want .la files to be installed, but the responsibility for doing this should lie in the packaging, not in qpdf itself.	2013-01-31 16:16:45 -05:00
Jay Berkenbilt	a5d8783f67	Improve qpdf --check Fix exit status for case of errors without warnings, continue after errors when possible, add test case for parsing a file with content stream errors on some but not all pages.	2013-01-25 11:08:50 -05:00
Jay Berkenbilt	a7e8b8c789	Have qpdf --check parse content streams Also move writing to null and parsing of content streams out of the wrong if block.	2013-01-24 11:47:36 -05:00
Jay Berkenbilt	bfda717749	Cosmetic changes to be closer to Adobe terminology Change object type Keyword to Operator, and place the order of the object types in object_type_e in the same order as they are mentioned in the PDF specification. Note that this change only breaks backward compatibility with code that has not yet been released.	2013-01-23 09:38:05 -05:00
Jay Berkenbilt	913eb5ac35	Add getTypeCode() and getTypeName() Add virtual methods to QPDFObject, wrappers to QPDFObjectHandle, and implementations to all the QPDF_Object types.	2013-01-22 10:01:45 -05:00
Jay Berkenbilt	f81152311e	Add QPDFObjectHandle::parseContentStream method This method allows parsing of the PDF objects in a content stream or array of content streams.	2013-01-20 15:35:39 -05:00
Jay Berkenbilt	1d88955fa6	Added new QPDFObjectHandle types Keyword and InlineImage These object types are to facilitate content stream parsing.	2013-01-20 15:35:39 -05:00
Jay Berkenbilt	8708fd373d	Prepare 4.0.1 release	2013-01-17 09:51:04 -05:00
Jay Berkenbilt	0e9949afde	Update versions for 4.0.0 release	2012-12-31 11:43:27 -05:00
Jay Berkenbilt	f8306913ba	Update "C" API with functions for new features	2012-12-31 10:32:32 -05:00
Jay Berkenbilt	ae1385cd8a	Update ChangeLog with recent changes	2012-12-31 10:32:32 -05:00
Jay Berkenbilt	04c203ae06	Eliminate flattenScalarReferences	2012-12-31 05:36:48 -05:00
Jay Berkenbilt	7f84239cad	Find PDF header anywhere in the first 1024 bytes	2012-12-25 14:43:37 -05:00
Jay Berkenbilt	739a78e200	Add Requires.private to libqpdf.pc for static linking	2012-11-20 13:57:37 -05:00
Jay Berkenbilt	f256670eba	Ignore objects with offset 0	2012-11-20 13:57:37 -05:00
Jay Berkenbilt	041397fdab	Allow reading from InputSource and writing to Pipeline Allowing users to subclass InputSource and Pipeline to read and write from/to arbitrary sources provides the maximum flexibility for users who want to read and write from other than files or memory.	2012-09-23 17:42:26 -04:00
Jay Berkenbilt	b4dc0f072a	Prepare 3.0.2 release	2012-09-06 15:47:58 -04:00
Jay Berkenbilt	c1627d0438	Add QPDFWriter::setExtraHeaderText	2012-09-06 15:31:12 -04:00
Jay Berkenbilt	fc4c82a950	Reset state in QPDF::calculateLinearizationData This makes it possible to use two different writers to write linearized files from the same QPDF object.	2012-09-06 15:28:16 -04:00
Jay Berkenbilt	8d2b29ef98	Fix segmentation fault with use of QPDFWriter::setOutputMemory	2012-09-06 14:39:06 -04:00

560 Commits