octoleo/qpdf - qpdf - Vast Development Method

mirror of https://github.com/qpdf/qpdf.git synced 2024-06-05 11:50:53 +00:00

Author	SHA1	Message	Date
Jay Berkenbilt	f1ae55a430	Better indirect filter test case The test suite now contains test cases that fail with both 10.0.1 and 10.0.2 and reproduce the internal error from #449.	2020-10-31 09:02:30 -04:00
Jay Berkenbilt	f8e4b6161c	With --no-warn, suppress warnings in split-pages Warnings issued on the output QPDF object were not suppressing warnings since that option was only set on the input QPDF object.	2020-10-23 16:27:51 -04:00
Jay Berkenbilt	b30deaeeab	Avoid merging adjacent tokens when concatenating contents (fixes #444 )	2020-10-23 08:00:04 -04:00
Jay Berkenbilt	956c8f6432	Obscure bug fix copying foreign streams in special cases (fixes #449 ) Specifically, if a stream had its stream data replaced and had indirect /Filter or /DecodeParms, it would result in non-silent loss of data and/or internal error.	2020-10-21 19:23:23 -04:00
Jay Berkenbilt	deeface146	Add automated test for shell wildcard expansion Wildcard expansion is different in Windows from non-Windows and sometimes requires special link options to work. Add tests that fail if we link incorrectly.	2020-10-21 14:15:31 -04:00
Jay Berkenbilt	758e3e38f5	Add option --warning-exit-0 to exit 0 instead of 3 with warnings	2020-10-20 18:02:39 -04:00
Jay Berkenbilt	4977a7efa5	Bug fix: getStreamData should on unfilterable stream (fixes #425 )	2020-04-08 18:52:04 -04:00
Jay Berkenbilt	0837932164	Update documentation and test suite to lock in hard page copy Issue #399 mentioned a use case for which qpdf has support, but the fact that it is supported was not documented or in the test suite, making it vulerable to accidental breakage.	2020-04-05 20:07:13 -04:00
Jay Berkenbilt	893d38b87e	Allow propagation of errors and retry through StreamDataProvider StreamDataProvider::provideStreamData now has a rich enough API for it to effectively proxy to pipeStreamData.	2020-04-05 20:07:13 -04:00
Jay Berkenbilt	67d5ed3a64	Implement remove-unreferenced-resources=auto	2020-04-04 13:19:49 -04:00
Jay Berkenbilt	1e766dcda2	Add --remove-unreferenced-resources option	2020-04-04 13:19:49 -04:00
Jay Berkenbilt	4f3b89991b	placeFormXObject: allow control of shrink/expand (fixes #409 )	2020-04-03 21:39:17 -04:00
Jay Berkenbilt	dac65a21fb	Look in form XObjects when removing unreferenced resources (fixes #373 ) If a page contains a form XObject, also filter the form XObject and remove its unreferenced resources.	2020-03-31 17:39:20 -04:00
Jay Berkenbilt	bb3137296d	Handle root /Pages pointing to other than page tree root (fixes #398 )	2020-02-22 11:10:31 -05:00
Jay Berkenbilt	57c01ef81f	In qdf mode, don't write extra XRef streams (fixes #386 ) fix-qdf assumes there is exactly one XRef stream and that it is at the end of the file.	2020-01-26 16:50:57 -05:00
Jay Berkenbilt	bbc2f8ffae	Bug fix: handle ColorSpace lookup for inline images (fixes #392 ) If the value of /CS in the inline image dictionary was is key in the page's /Resource -> /ColorSpace dictionary, properly resolve it by referencing the proper colorspace, and not just the name, in the external image dictionary.	2020-01-26 15:29:10 -05:00
Jay Berkenbilt	12777a04ca	Add encrypt key to json	2020-01-26 14:44:03 -05:00
Jay Berkenbilt	656d7bc006	Rename test files This change makes it possible to get both the user and owner password from the file name of all the encryption test files.	2020-01-26 14:42:10 -05:00
Jay Berkenbilt	731c4f711b	Add --is-encrypted and --requires-password (fixes #390 ) Allow exit status-based checking of whether a file is encrypted or requires a password without necessarily supplying the correct password. Useful for scripting.	2020-01-26 11:26:53 -05:00
Jay Berkenbilt	5508f74603	Allow /P in encryption dictionary to be positive (fixes #382 ) Even though this is disallowed by the spec, files like this have been encountered in the wild.	2019-11-09 12:33:15 -05:00
Masamichi Hosoda	5a842792b6	Parse Contents in signature dictionary without encryption Various PDF digital signing tools do not encrypt /Contents value in signature dictionary. Adobe Acrobat Reader DC can handle a PDF with the /Contents value not encrypted. Write Contents in signature dictionary without encryption Tests ensure that string /Contents are not handled specially when not found in sig dicts.	2019-10-22 16:20:21 -04:00
Masamichi Hosoda	cdc46d78f4	Add QPDFObject::getParsedOffset()	2019-10-22 16:19:06 -04:00
Masamichi Hosoda	5cf4090aee	Add QPDFWriter::getRenumberedObjGen()	2019-10-22 16:16:16 -04:00
Masamichi Hosoda	46ac3e21b3	Add QPDF::getXRefTable()	2019-10-22 16:16:16 -04:00
Masamichi Hosoda	06b818dcd3	Exclude signature dictionary from compressible objects It seems better not to compress signature dictionaries. Various PDF digital signing tools, including Adobe Acrobat Reader DC, do not compress signature dictionaries. Table 8.93 "Entries in a signature dictionary" in PDF 1.5 reference describes that /ByteRange in the signature dictionary shall be used to describe a digest that does not include the signature value (/Contents) itself. The byte ranges cannot be determined if the dictionary is compressed.	2019-10-22 16:16:16 -04:00
Masamichi Hosoda	5e0ba12687	Fix /Contents value representation in a signature dictionary Table 8.93 "Entries in a signature dictionary" in PDF 1.5 reference describes that the value of Contents entry is a hexadecimal string representation when ByteRange is specified. This commit makes QPDF always uses hexadecimal strings representation instead of literal strings for it.	2019-10-22 16:16:16 -04:00
Jay Berkenbilt	e188d0fffa	Make --replace-input work with / in path (fixes #365 )	2019-10-12 19:27:50 -04:00
Jay Berkenbilt	8b1e307741	Warn for duplicated dictionary keys (fixes #345 )	2019-09-19 20:22:34 -04:00
Jay Berkenbilt	d492bb0a90	Add --replace-input option (fixes #321 )	2019-08-31 15:51:21 -04:00
Jay Berkenbilt	47a38a942d	Detect stream in object stream, fixing fuzz 16214 It's detected in QPDFWriter instead of at parse time because I can't figure out how to construct a test case in a reasonable time. This commit moves the fuzz file into the regular test suite for a QTC coverage case.	2019-08-28 12:49:04 -04:00
Jay Berkenbilt	9ebb55aff1	Include password match information in show encryption	2019-08-24 11:01:19 -04:00
Jay Berkenbilt	2794bfb1a6	Add flags to control zlib compression level (fixes #113 )	2019-08-23 20:34:21 -04:00
Jay Berkenbilt	4b2e72c4cd	Test for direct, rather than resolved nulls in parser Just because we know an indirect reference is null, doesn't mean we shouldn't keep it indirect.	2019-08-22 17:55:16 -04:00
Jay Berkenbilt	ae5bd7102d	Accept extraneous space before xref (fixes #341 )	2019-08-19 22:24:53 -04:00
Jay Berkenbilt	42d396f1dd	Handle invalid name tokens symmetrically for PDF < 1.2 (fixes #332 )	2019-08-19 19:48:27 -04:00
Jay Berkenbilt	d9dd99eca3	Attempt to repair /Type key in pages nodes (fixes #349 )	2019-08-18 18:54:37 -04:00
Jay Berkenbilt	04f45cf652	Treat all linearization errors as warnings This also reverts the addition of a new checkLinearization that distinguishes errors from warnings. There's no practical distinction between what was considered an error and what was considered a warning.	2019-06-23 13:45:45 -04:00
Jay Berkenbilt	c5ed1b8075	Handle invalid encryption Length (fixes #333 )	2019-06-22 20:57:33 -04:00
Jay Berkenbilt	551dfbf697	Allow set*EncryptionParameters before filename iset (fixes #336 )	2019-06-22 20:57:33 -04:00
Jay Berkenbilt	85a3f95a89	qpdf: exit 3 for linearization warnings without errors (fixes #50 )	2019-06-22 16:57:51 -04:00
Jay Berkenbilt	45dac410b5	Remove broken QPDFTokenizer::expectInlineImage	2019-06-21 22:29:31 -04:00
Jay Berkenbilt	ed7f2a6c76	Add smaller image streams file for testing	2019-06-21 17:39:53 -04:00
Jay Berkenbilt	3608afd5c5	Add new integer accessors to QPDFObjectHandle	2019-06-21 13:17:21 -04:00
Jay Berkenbilt	bcfa407912	As a test suite, run stand-alone fuzzer on seed corpus Temporarily skip fuzz tests on Windows. There are Windows-specific failures to address later.	2019-06-15 17:24:24 -04:00
Jay Berkenbilt	320702c086	Add test files from oss-fuzz bugs (fixes #335 )	2019-06-15 17:24:24 -04:00
Jay Berkenbilt	cf469d7890	Give up reading objects with too many consecutive errors	2019-06-15 08:52:19 -04:00
Jay Berkenbilt	31bde2f9d7	Handle empty DecodeParams array for (fixes #331 ) On read, ignore /DecodeParms when empty list; on write, delete it. Some files have been found that include an empty list for /DecodeParms, but this is not technically compliant with the spec, and the only sensible interpretation is to treat it as if there are no decode parameters.	2019-06-09 17:19:49 -04:00
Jay Berkenbilt	03e27709f3	Improve Unicode filename testing Remove dependency on the behavior of perl for reliable creation of Unicode file names on Windows.	2019-04-27 20:37:33 -04:00
Jay Berkenbilt	7ff234a92f	Remove stray comment	2019-04-27 20:37:33 -04:00
Jay Berkenbilt	12b159118a	Compare versions between CLI and library	2019-04-20 21:00:43 -04:00
Jay Berkenbilt	2b011f9d81	Add --remove-page-labels option (fixes #317 )	2019-04-20 21:00:43 -04:00
Jay Berkenbilt	e50d5201df	Add --keep-files-open-threshold (fixes #288 )	2019-04-20 21:00:43 -04:00
Jay Berkenbilt	011695dfdf	Support Unicode in filenames (fixes #298 )	2019-04-20 21:00:43 -04:00
Jay Berkenbilt	a5a016cdd2	Revert preservations of outlines with --split-pages The preservation of outlines didn't provide very useful behavior anyway as it copied all outlines but most didn't work. This implementation also caused a very significant performance hit and so is being reverted until a proper solution can be coded. The eventual solution will not be compatible with the reverted solution anyway, so it's best not to leave this in.	2019-04-20 21:00:43 -04:00
Thorsten Schöning	af42fe9daf	Don't open more than 50 files. Embarcadero C++Builder doesn't support more than 50 files open at the same time for legacy 32 Bit apps, which makes a test fail trying to open more than that many files. This changes the number of open files for that test to far less to make the test succeed. Alternatively one could reduce the hard coded number of 200 in QPDF itself, which I didn't do currently because it needs adoption of manuals etc. and is something which needs to be discussed with the author of QPDF. I guess chances are better to get the test changed upstream. This fixes #288: https://github.com/qpdf/qpdf/issues/288	2019-03-11 17:14:22 -04:00
Thorsten Schöning	27f18e0f67	The kfo-PDF files for testing need to be copied using "binmode" or Windows will introduce \r\n. qpdf: selecting --keep-open-files=n qpdf: processing 001-kfo.pdf WARNING: 001-kfo.pdf: file is damaged WARNING: 001-kfo.pdf (offset 556): xref not found WARNING: 001-kfo.pdf: Attempting to reconstruct cross-reference table	2019-02-14 18:54:38 +01:00
Jay Berkenbilt	fc2e491f74	Add test for exception handling There have been issues reported where exceptions are not thrown properly across shared library/DLL boundaries, so add a test specifically to ensure that exceptions are caught as thrown.	2019-02-07 19:21:26 -05:00
Jay Berkenbilt	8acf636b4e	Incorporate improved Windows fragility workaround from qtest	2019-02-01 22:25:25 -05:00
Jay Berkenbilt	1fba24aada	Add another test case for weird page trees	2019-01-31 21:29:28 -05:00
Jay Berkenbilt	0a470d2daf	Don't optimize non-8-bit images Also add test cases for additional coverage on image optimization.	2019-01-31 21:29:28 -05:00
Jay Berkenbilt	eb49e07c0a	Make inline image token exactly contain the image data Do not include the trailing EI, and handle cases where EI is not preceded by a delimiter. Such cases have been seen in the wild.	2019-01-31 20:28:44 -05:00
Jay Berkenbilt	5211bcb5ea	Externalize inline images (fixes #278 )	2019-01-31 10:38:13 -05:00
Jay Berkenbilt	22bcdbe786	Remove acroread from tests This hasn't worked or been exercised in years since Adobe stopped releasing a Linux version of reader.	2019-01-31 10:38:13 -05:00
Jay Berkenbilt	2b6c79bcae	Improve locating inline image's EI We've actually seen a PDF file in the wild that contained EI surrounded by delimiters inside the image data, which confused qpdf's naive code. This significantly improves EI detection.	2019-01-31 09:26:37 -05:00
Jay Berkenbilt	ec9e310c9e	Refactor QPDFTokenizer's inline image handling Add a version of expectInlineImage that takes an input source and searches for EI. This is in preparation for improving the way EI is found. This commit just refactors the code without changing the functionality and adds tests to make sure the old and new code behave identically.	2019-01-31 09:26:37 -05:00
Jay Berkenbilt	8a9cfd2605	Handle direct page objects (fixes #164 )	2019-01-29 17:01:36 -05:00
Jay Berkenbilt	2712869cf9	Fix logic for when to compress object and xref streams (fixes #271 )	2019-01-28 21:43:06 -05:00
Jay Berkenbilt	52f9d326a5	Resolve duplicated page objects (fixes #268 ) When linearizing a file or getting the list of all pages in a file, detect if the pages tree contains a duplicated page object and, if so, shallow copy it. This makes it possible to have a one to one mapping of page positions to page objects.	2019-01-28 20:29:58 -05:00
Jay Berkenbilt	426434c772	Add --overlay and --underlay to qpdf CLI (fixes #207 )	2019-01-27 09:30:13 -05:00
Jay Berkenbilt	c2ae35540e	Add boundary condition test for getUniqueResourceName	2019-01-27 09:26:33 -05:00
Jay Berkenbilt	623f5b664e	Convert pages to form XObjects Support conversion of pages to form XObjects and placement of form XObjects on pages.	2019-01-27 07:50:30 -05:00
Jay Berkenbilt	009767d97a	Handle inheritable page attributes Add getAttribute for handling inheritable page attributes, and fix getPageImages and annotation flattening code to use it.	2019-01-25 22:30:05 -05:00
Jay Berkenbilt	930eade6d3	Fix omissions in text appearance generation When generating appearance streams for variable text annotations, properly handle the cases of there being no appearance dictionary, no appearance stream, or an appearance stream with no BMC..EMC marker.	2019-01-20 23:05:58 -05:00
Jay Berkenbilt	65ef0bf313	When flattening, remove annotations with no appearance stream With the exception of form field annotations when /NeedAppearances is true, remove annotations that don't have appearance streams when flattening. There is no reason to keep these when flattening since they are invisible. This may include unchecked checkboxes, unshown popup windows, etc.	2019-01-20 23:05:58 -05:00
Jay Berkenbilt	0a3057dc0a	More testing for Unicode passwords	2019-01-19 14:16:03 -05:00
Jay Berkenbilt	c2030d1f33	Implement password recovery suppression and password mode (fixes #215 ) Allow fine control over how passwords are encoded for writing, and allow password for reading to be given as a hexademical encoded string. Allow suppression of password recovery as a means to ensure that the password you specify is actually the right one.	2019-01-19 10:14:07 -05:00
Jay Berkenbilt	966429e718	Update CLI and manual for new encryption granularity (fixes #214 )	2019-01-17 11:43:56 -05:00
Jay Berkenbilt	5cfcd4f361	Additional checks for unreferenced resources Explicitly abandon removal of unreferenced resources if there are any lexical errors in the page's contents. This case always generated a warning, but it now also prevents removal of unreferenced resources, this strongly decreasing the likelihood of data loss.	2019-01-17 11:43:56 -05:00
Jay Berkenbilt	e09ae710dc	Add tests for shared font/xobject The tests are in a separate commit so the bug-fix commit can be taken as a patch for older versions.	2019-01-17 09:44:29 -05:00
Jay Berkenbilt	654c0e8caf	Allow adding the same page more than once in --pages (fixes #272 )	2019-01-12 10:01:47 -05:00
Jay Berkenbilt	53d8e916b7	Interpret . in --pages as a shortcut for the primary file	2019-01-12 09:59:03 -05:00
Jay Berkenbilt	c3cee5f154	Exercise out of scope original pdf for copyForeignObject	2019-01-07 07:38:03 -05:00
Jay Berkenbilt	ee2aad4381	Add CLI flags for image optimization	2019-01-04 21:33:14 -05:00
Jay Berkenbilt	7b6ab900dc	Support page collation with --collate (fixes #259 )	2019-01-04 15:13:02 -05:00
Jay Berkenbilt	16fd6e64f9	Add QPDFWriter::getFinalVersion (fixes #266 )	2019-01-04 12:37:22 -05:00
Jay Berkenbilt	a01359189b	Fix dangling references (fixes #240 ) On certain operations, such as iterating through all objects and adding new indirect objects, walk through the entire object structure and explicitly resolve any indirect references to non-existent objects. That prevents new objects from springing into existence and causing the previously dangling references to point to them.	2019-01-04 10:29:29 -05:00
Jay Berkenbilt	158156d506	Add basic appearance stream generation	2019-01-04 08:00:19 -05:00
Jay Berkenbilt	b55567a0fa	Add special case setV code for button fields	2019-01-03 23:18:13 -05:00
Jay Berkenbilt	1342612308	Replace need-appearances.pdf Create a new need-appearances.pdf based on newer test files with more modified fields.	2019-01-03 23:18:13 -05:00
Jay Berkenbilt	e3144ac417	Add form fields to json output Also add some additional methods for detecting form field types to assist in the json creation and for later use.	2019-01-03 23:18:13 -05:00
Jay Berkenbilt	87f855dbfc	Rename test file	2019-01-03 23:18:13 -05:00
Jay Berkenbilt	ca94ac68d9	Honor flags when flattening annotations	2019-01-03 11:59:55 -05:00
Jay Berkenbilt	3e74916c5a	Fix seg fault on empty xref stream (fixes #263 ) Thanks to @p-cher for supplying a patch.	2019-01-03 09:17:43 -05:00
Jay Berkenbilt	23bcfeb336	Remove bogus test cheating code	2019-01-02 21:49:47 -05:00
Jay Berkenbilt	3b8ce4f12a	Annotation flattening including form fields Flatten annotations by integrating their appearance streams into the content stream of the containing page. In the case of form fields, only flatten if /NeedAppearance is false (or equivalently absent). If flattening form fields, also remove /AcroForm from the document catalog.	2019-01-01 08:14:15 -05:00
Jay Berkenbilt	95d6b17a89	Add QPDFObjectHandle::mergeDictionary()	2019-01-01 08:12:56 -05:00
Jay Berkenbilt	6048c6e2f0	Don't crash on @file when file doesn't exist (fixes #265 ) When @file is used and file doesn't exist, just treat it as a normal argument.	2018-12-23 11:46:56 -05:00
Jay Berkenbilt	968e7e60b7	Add json tests	2018-12-23 11:21:59 -05:00
Jay Berkenbilt	64c1579544	Support zsh completion	2018-12-23 11:21:59 -05:00
Jay Berkenbilt	52a0b767c8	Slightly improve bash completion arg parsing	2018-12-23 09:15:40 -05:00

1 2 3 4 5 ...

339 Commits