octoleo/qpdf - qpdf - Vast Development Method

mirror of https://github.com/qpdf/qpdf.git synced 2024-11-16 17:45:09 +00:00

Author	SHA1	Message	Date
m-holger	eeb6162f76	Add optional parameter separator to QPDFObjGen::unparse Also, revert inlining of unparse and operator << from commit `4c6640c` in order to avoid exposing QUtil.	2022-07-24 15:41:48 +01:00
Jay Berkenbilt	3a7ee7e938	Move C-based ProgressReporter helper into QPDFWriter	2022-06-19 08:46:58 -04:00
m-holger	6c69a747b9	Code clean up: use range-style for loops wherever possible Remove variables obsoleted by commit `4f24617`.	2022-05-21 16:06:29 -04:00
Jay Berkenbilt	6c2fb5b8f0	Add test for bad data and bad datafile	2022-05-20 13:33:30 -04:00
Jay Berkenbilt	21d6e3231f	Make use of the new Pipeline methods in some places	2022-05-03 18:31:23 -04:00
Jay Berkenbilt	59f3e09edf	Make Pipeline::write take an unsigned char const* (API change)	2022-05-03 18:31:22 -04:00
Jay Berkenbilt	62bf296a9c	Make assert handling less error-prone Prevent my future self or other contributors from using assert in tests and then having that assert not do anything because of the NDEBUG macro.	2022-05-03 18:31:22 -04:00
Jay Berkenbilt	92b692466f	Remove remaining incorrect assert calls from implementation	2022-05-03 18:31:22 -04:00
Jay Berkenbilt	8ccd3a8a89	Mark weak encryption with API changes (fixes #576 )	2022-04-30 17:24:15 -04:00
Jay Berkenbilt	2213ed0c3d	Remove deprecated (pre-8.4.0) encryption APIs	2022-04-30 17:23:58 -04:00
Jay Berkenbilt	4f24617e1e	Code clean up: use range-style for loops wherever possible Where not possible, use "auto" to get the iterator type. Editorial note: I have avoided this change for a long time because of not wanting to make gratuitous changes to version history, which can obscure when certain changes were made, but with having recently touched every single file to apply automatic code formatting and with making several broad changes to the API, I decided it was time to take the plunge and get rid of the older (pre-C++11) verbose iterator syntax. The new code is just easier to read and understand, and in many cases, it will be more efficient as fewer temporary copies are being made. m-holger, if you're reading, you can see that I've finally come around. :-)	2022-04-30 13:27:18 -04:00
Jay Berkenbilt	7f023701dd	Formatting: remove space in range-style for loops Change .clang-format and commit automated changes from a fresh run of format-code	2022-04-30 13:26:43 -04:00
Jay Berkenbilt	d8fdf632a9	Use replaceKeyAndGet in a few places in existing code	2022-04-29 20:28:02 -04:00
Jay Berkenbilt	cdd0b4fb7d	Use = default and = delete where possible in classes	2022-04-16 11:39:14 -04:00
Jay Berkenbilt	a68703b07e	Replace PointerHolder with std::shared_ptr in library sources only (patrepl and cleanpatch are my own utilities) patrepl s/PointerHolder/std::shared_ptr/g {include,libqpdf}/qpdf/*.hh patrepl s/PointerHolder/std::shared_ptr/g libqpdf/*.cc patrepl s/make_pointer_holder/std::make_shared/g libqpdf/*.cc patrepl s/make_array_pointer_holder/QUtil::make_shared_array/g libqpdf/*.cc patrepl s,qpdf/std::shared_ptr,qpdf/PointerHolder, */.cc */.hh git restore include/qpdf/PointerHolder.hh cleanpatch ./format-code	2022-04-09 17:33:29 -04:00
Jay Berkenbilt	12f1eb15ca	Programmatically apply new formatting to code Run this: for i in */.cc */.c */.h */.hh; do clang-format < $i >| $i.new && mv $i.new $i; done	2022-04-04 08:10:40 -04:00
Jay Berkenbilt	f91b21c7d4	Preserve input PDF version on pages/split-pages (fixes #610 )	2022-02-08 12:34:14 -05:00
Jay Berkenbilt	cb769c62e5	WHITESPACE ONLY -- expand tabs in source code This commit expands all tabs using an 8-character tab-width. You should ignore this commit when using git blame or use git blame -w. In the early days, I used to use tabs where possible for indentation, since emacs did this automatically. In recent years, I have switched to only using spaces, which means qpdf source code has been a mixture of spaces and tabs. I have avoided cleaning this up because of not wanting gratuitous whitespace changes to cloud the output of git blame, but I changed my mind after discussing with users who view qpdf source code in editors/IDEs that have other tab widths by default and in light of the fact that I am planning to start applying automatic code formatting soon.	2022-02-08 11:51:15 -05:00
Jay Berkenbilt	c62e8e2b28	Update for clean compile with POINTERHOLDER_TRANSITION=2	2022-02-07 17:38:22 -05:00
Jay Berkenbilt	cfaae47dc6	Add getBufferSharedPointer() to Pl_Buffer and QPDFWriter	2022-02-07 12:53:28 -05:00
Jay Berkenbilt	5f3f78822b	Improve use of std::unique_ptr * Use unique_ptr in place of shared_ptr in some cases * unique_ptr for arrays does not require a custom deleter * use std::make_unique (c++14) where possible	2022-02-05 11:24:56 -05:00
Jay Berkenbilt	2229e37e88	Add a blank line after the first header included in each source	2022-02-04 16:31:31 -05:00
Jay Berkenbilt	abc300f05c	Replace containers of PointerHolder with containers of std::shared_ptr None of these are in the public API.	2022-02-04 13:12:37 -05:00
Jay Berkenbilt	9044a24097	PointerHolder: deprecate getPointer() and getRefcount() Use get() and use_count() instead. Add #define NO_POINTERHOLDER_DEPRECATION to remove deprecation markers for these only. This commit also removes all deprecated PointerHolder API calls from qpdf's code except in PointerHolder's test suite, which must continue to test the deprecated APIs.	2022-02-04 13:12:37 -05:00
Jay Berkenbilt	76c4f78b5c	Add QUtil::make_shared_cstr Replace most of the calls to QUtil::copy_string with this instead.	2022-01-30 13:11:03 -05:00
m-holger	07db3200cb	Remove some if statements and simplify some boolean expressions Use QPDFObjectHandle::isNameAndEquals, isDictionaryOfType and isStreamOfType.	2022-01-27 07:31:12 -06:00
Jay Berkenbilt	3cacb27a90	Performance fix on preserveObjectStreams	2021-05-09 07:51:14 -04:00
Jay Berkenbilt	30ac51bc78	Exclude unreferenced objects in object streams (fixes #520 )	2021-05-08 09:42:09 -04:00
Jay Berkenbilt	12ecd2019a	Add QPDFObjectHandle::setFilterOnWrite	2020-12-28 12:58:19 -05:00
Jay Berkenbilt	858c7b89bc	Let optimize filter stream parameters instead of making them direct Also removes preclusion of stream references in stream parameters of filterable streams and reduces write times by about 8% by eliminating an extra traversal of the objects.	2020-12-28 12:58:19 -05:00
Jay Berkenbilt	09027344b9	Refactor: separate code that determines whether to filter a stream	2020-12-28 12:58:19 -05:00
Jay Berkenbilt	6971f78ff6	Fix stack overflow on direct root (fuzz issue 26761)	2020-10-31 13:10:39 -04:00
Jay Berkenbilt	30bb4c64ee	Minor code cleanup * Return rather than exiting from realmain in qpdf.cc * Remove extraneous blank line * Don't assign temporary to const reference	2020-10-22 15:39:36 -04:00
Jay Berkenbilt	92d3cbecd4	Fix warnings reported by -Wshadow=local (fixes #431 )	2020-04-16 12:41:43 -04:00
Jay Berkenbilt	70665cb381	Internally use unsafeShallowCopy where we can	2020-04-03 12:16:24 -04:00
Jay Berkenbilt	57c01ef81f	In qdf mode, don't write extra XRef streams (fixes #386 ) fix-qdf assumes there is exactly one XRef stream and that it is at the end of the file.	2020-01-26 16:50:57 -05:00
Jay Berkenbilt	5508f74603	Allow /P in encryption dictionary to be positive (fixes #382 ) Even though this is disallowed by the spec, files like this have been encountered in the wild.	2019-11-09 12:33:15 -05:00
Masamichi Hosoda	5a842792b6	Parse Contents in signature dictionary without encryption Various PDF digital signing tools do not encrypt /Contents value in signature dictionary. Adobe Acrobat Reader DC can handle a PDF with the /Contents value not encrypted. Write Contents in signature dictionary without encryption Tests ensure that string /Contents are not handled specially when not found in sig dicts.	2019-10-22 16:20:21 -04:00
Masamichi Hosoda	50b329ee9f	Add QPDFWriter::getWrittenXRefTable()	2019-10-22 16:16:16 -04:00
Masamichi Hosoda	5cf4090aee	Add QPDFWriter::getRenumberedObjGen()	2019-10-22 16:16:16 -04:00
Masamichi Hosoda	5e0ba12687	Fix /Contents value representation in a signature dictionary Table 8.93 "Entries in a signature dictionary" in PDF 1.5 reference describes that the value of Contents entry is a hexadecimal string representation when ByteRange is specified. This commit makes QPDF always use hexadecimal string representation instead of literal strings for it.	2019-10-22 16:16:16 -04:00
Jay Berkenbilt	0e51a9aca6	Don't encrypt trailer, fixes fuzz issue 15983 Ordinarily the trailer doesn't contain any strings, so this is usually a non-issue, but if the trailer contains strings, linearizing and encrypting with object streams would include encrypted strings in the trailer, which would blow out the padding because encrypted strings are longer than their cleartext counterparts.	2019-08-28 23:06:32 -04:00
Jay Berkenbilt	47a38a942d	Detect stream in object stream, fixing fuzz 16214 It's detected in QPDFWriter instead of at parse time because I can't figure out how to construct a test case in a reasonable time. This commit moves the fuzz file into the regular test suite for a QTC coverage case.	2019-08-28 12:49:04 -04:00
Jay Berkenbilt	ba5fb69164	Make popping pipeline stack safer Use destructors to pop the pipeline stack, and ensure that code that pops the stack is actually popping the intended thing.	2019-08-27 22:27:47 -04:00
Jay Berkenbilt	2794bfb1a6	Add flags to control zlib compression level (fixes #113 )	2019-08-23 20:34:21 -04:00
Jay Berkenbilt	3f3dbe22ea	Remove array null flattening For some reason, qpdf from the beginning was replacing indirect references to null with literal null in arrays even after removing the old behavior of flattening scalar references. This seems like a bad idea.	2019-08-22 17:55:16 -04:00
Jay Berkenbilt	551dfbf697	Allow set*EncryptionParameters before filename is set (fixes #336 )	2019-06-22 20:57:33 -04:00
Jay Berkenbilt	6c39aa8763	In shippable code, favor smart pointers (fixes #235 ) Use PointerHolder in several places where manual memory allocation and deallocation were being used. This helps to protect against memory leaks when exceptions are thrown in surprising places.	2019-06-22 16:57:52 -04:00
Jay Berkenbilt	658b5bb3be	QPDFWriter: clean up overloaded functions In a small number of cases, it makes sense to replace an overloaded function with a function that takes a default argument. We can do this now because we've already broken binary compatibility since the last release.	2019-06-22 10:13:27 -04:00
Jay Berkenbilt	b07ad6794e	Fix bugs found by fuzz tests * Several assertions in linearization were not always true; change them to run time errors * Handle a few cases of uninitialized objects * Handle pages with no contents when doing form operations * Handle invalid page tree nodes when traversing pages	2019-06-21 17:56:24 -04:00
Jay Berkenbilt	d71f05ca07	Fix sign and conversion warnings (major) This makes all integer type conversions that have potential data loss explicit with calls that do range checks and raise an exception. After this commit, qpdf builds with no warnings when -Wsign-conversion -Wconversion is used with gcc or clang or when -W3 -Wd4800 is used with MSVC. This significantly reduces the likelihood of potential crashes from bogus integer values. There are some parts of the code that take int when they should take size_t or an offset. Such places would make qpdf not support files with more than 2^31 of something that usually wouldn't be so large. In the event that such a file shows up and is valid, at least qpdf would raise an error in the right spot so the issue could be legitimately addressed rather than failing in some weird way because of a silent overflow condition.	2019-06-21 13:17:21 -04:00
Jay Berkenbilt	eb7948876b	Fix problems found in fuzz corpus	2019-06-15 17:24:24 -04:00
Jay Berkenbilt	31bde2f9d7	Handle empty DecodeParams array (fixes #331 ) On read, ignore /DecodeParms when empty list; on write, delete it. Some files have been found that include an empty list for /DecodeParms, but this is not technically compliant with the spec, and the only sensible interpretation is to treat it as if there are no decode parameters.	2019-06-09 17:19:49 -04:00
Jay Berkenbilt	2712869cf9	Fix logic for when to compress object and xref streams (fixes #271 )	2019-01-28 21:43:06 -05:00
Jay Berkenbilt	6ec22f117d	Modernize encryption API for more granularity Setting encryption permissions for R >= 3 set permission bits in groups corresponding to menu options in Acrobat 5. The new API allows the bits to be set individually.	2019-01-17 11:43:56 -05:00
Jay Berkenbilt	16fd6e64f9	Add QPDFWriter::getFinalVersion (fixes #266 )	2019-01-04 12:37:22 -05:00
Jay Berkenbilt	a01359189b	Fix dangling references (fixes #240 ) On certain operations, such as iterating through all objects and adding new indirect objects, walk through the entire object structure and explicitly resolve any indirect references to non-existent objects. That prevents new objects from springing into existence and causing the previously dangling references to point to them.	2019-01-04 10:29:29 -05:00
Jay Berkenbilt	b6e414b10b	Remove some extraneous null pointer checks (fixes #234 ) There were a few places in the code that were checking that a pointer wasn't null before deleting it, even though C++ has always allowed delete 0. Most of the code did not perform these checks.	2018-08-12 12:58:39 -04:00
Jay Berkenbilt	e1cd5891af	Fix infinite loop on small files with progress reporting (fixes #230 ) Turns out you can keep adding zero to a number over and over again and it just doesn't get any bigger. Who would have known?	2018-08-05 15:43:34 -04:00
Jay Berkenbilt	a433ed24f9	Add progress reporting for QPDFWriter (fixes #200 )	2018-06-22 16:14:54 -04:00
Jay Berkenbilt	c81836076f	Correct incorrect comment	2018-06-22 13:13:09 -04:00
Jay Berkenbilt	078cf9bf90	newline before endstream fix for object streams (fixes #205 )	2018-05-12 13:17:43 -04:00
Jay Berkenbilt	9910104442	Implement TokenFilter and refactor Pl_QPDFTokenizer Implement a TokenFilter class and refactor Pl_QPDFTokenizer to use a TokenFilter class called ContentNormalizer. Pl_QPDFTokenizer is now a general filter that passes data through a TokenFilter.	2018-02-18 21:05:46 -05:00
Jay Berkenbilt	ebd5ed63de	Add option to save pass 1 of linearization This is useful only for debugging the linearization code.	2018-02-18 20:18:40 -05:00
Jay Berkenbilt	e3167c1a60	Fix linearization for files with nonstandard ID length	2018-02-04 18:16:23 -05:00
Jay Berkenbilt	34a9b835b0	Fix indentation	2018-02-04 14:19:00 -05:00
Jay Berkenbilt	a3a55be9cd	Correct errors in PNG filters and make use from library	2017-12-25 14:24:48 -05:00
Jay Berkenbilt	d31a7b76e7	Improve message for stream decoding error Tweak the message so that we inform the user that we are mitigating data loss.	2017-09-12 16:03:48 -04:00
Jay Berkenbilt	1868a10f8b	Replace all atoi calls with QUtil::string_to_int The latter catches underflow/overflow.	2017-08-29 12:28:32 -04:00
Jay Berkenbilt	e452d9dca6	Spell check	2017-08-22 14:22:20 -04:00
Jay Berkenbilt	ce435222b2	Push QPDFWriter member variables into a nested class	2017-08-21 22:04:07 -04:00
Jay Berkenbilt	198856a825	Improve pclm parameter settings	2017-08-21 21:05:48 -04:00
Jay Berkenbilt	8ab52fa558	Combine writePCLm with writeStandard Reduce code duplication	2017-08-21 21:05:48 -04:00
Jay Berkenbilt	9f60a864a0	Combine PCLm header into writeHeader	2017-08-21 21:05:47 -04:00
Jay Berkenbilt	adbcfcff2d	Remove duplicated coverage cases Remove duplicated coverage cases from Sahil's code so existing test suite passes.	2017-08-21 18:55:02 -04:00
Sahil Arora	b19210fa7d	QPDFWriter: Add setPCLm() and writePCLm() methods * Add support for PCLm using setPCLm() and writePCLm() methods in QPDFWriter.hh and QPDFWriter.cc * Add a function writePCLmHeader() for PCLm header in QPDFWriter	2017-08-21 18:55:02 -04:00
Jay Berkenbilt	ddc6cf0cf6	Precheck streams by default There is no need for a --precheck-streams option. We can do the precheck without imposing any penalty, only re-encoding the stream if it fails the first time.	2017-08-21 17:44:22 -04:00
Jay Berkenbilt	9744414c66	Enable finer grained control of stream decoding This commit adds several API methods that enable control over which types of filters QPDF will attempt to decode. It also adds support for /RunLengthDecode and /DCTDecode filters for both encoding and decoding.	2017-08-21 17:44:22 -04:00
Jay Berkenbilt	8249a26d69	Fix infinite loop in QPDFWriter (fixes #143 )	2017-08-12 08:36:36 -04:00
Jay Berkenbilt	36b3fe5af7	Fix --newline-before-endstream option (fixes #133 ) Add a newline unconditionally before endstream even if a newline was already written as part of the stream data.	2017-08-11 20:57:05 -04:00
Jay Berkenbilt	46611f0710	Prevent a division by zero error (fixes #141 ) Bad /W in an xref stream could cause a division by zero error. Now this is handled as a special case.	2017-08-11 20:11:19 -04:00
Jay Berkenbilt	f37d399d82	Add newline-before-endstream option (fixes #103 )	2017-07-29 12:21:38 -04:00
Jay Berkenbilt	a136824243	Fix exception catch	2017-07-29 12:19:04 -04:00
Jay Berkenbilt	3a1ff5ded9	Add option to preserve unreferenced objects	2017-07-28 19:19:11 -04:00
Jay Berkenbilt	7f8892525f	Add precheck streams capability When requested, QPDFWriter will do more aggressive prechecking of streams to make sure it can actually succeed in decoding them before attempting to do so. This will allow preservation of raw data even when the raw data is corrupted relative to the specified filters.	2017-07-27 23:42:27 -04:00
Jay Berkenbilt	e0e9d64674	Remove some ABI compatibility private methods Since we have to bump soname, remove some private methods that were just there for binary compatibility	2015-11-10 12:22:40 -05:00
Jay Berkenbilt	b8bdef0ad1	Implement deterministic ID For non-encrypted files, deterministic ID generation uses file contents instead of timestamp and file name. At a small runtime cost, this enables generation of the same /ID if the same inputs are converted in the same way multiple times.	2015-10-31 18:56:42 -04:00
Jay Berkenbilt	9f8aba1db7	Handle indirect stream filter/decode parameters QPDFWriter was trying to make /Filter and /DecodeParms direct in all cases, but there are some cases where /DecodeParms may refer to a stream, which can't be direct. QPDFWriter doesn't actually need /DecodeParms to be direct in that case because it won't be able to filter the stream. Until we can handle this type of stream, just don't make /Filter and /DecodeParms direct if we can't filter the stream anyway. Fixes #34	2014-06-07 16:31:03 -04:00
Jay Berkenbilt	b0a96ce6aa	Fix calculation of xref stream columns Fix problem: if the last object in the first part of a linearized file had an offset that was below 65536 by less than the size of the hint stream, the xref stream was invalid and the resulting file was not usable.	2014-02-22 22:13:31 -05:00
Jay Berkenbilt	b802ca47e9	Comments about incremental update support Also remove some trivial, non-functional code.	2013-12-14 15:17:36 -05:00
Jay Berkenbilt	dc9df97466	Include <algorithm> for std::min, std::max	2013-11-29 10:48:16 -05:00
Jay Berkenbilt	a237e92445	Warn when -accessibility=n will be ignored Also accept -accessibility=n with 256 bit keys even though it will be ignored.	2013-10-18 10:45:15 -04:00
Jay Berkenbilt	ac9c1f0d56	Security: replace operator[] with at For std::string and std::vector, replace operator[] with at. This was done using an automated process. See README.hardening for details.	2013-10-18 10:45:14 -04:00
Jay Berkenbilt	e19eb579b2	Replace some assertions with std::logic_error Ideally, the library should never call assert outside of test code, but it does in several places. For some cases where the assertion might conceivably fail because of a problem with the input data, replace assertions with exceptions so that they can be trapped by the calling application. This commit surely misses some cases and replaced some cases unnecessarily, but it should still be an improvement.	2013-10-09 20:57:14 -04:00
Jay Berkenbilt	cee2592ed1	Change API/ABI and withdraw 4.2.0 4.2.0 was binary incompatible in spite of there being no deletions or changes to any public methods. As such, we have to bump the ABI and are fixing some API breakage while we're at it. Previous 4.3.0 target is now 5.1.0.	2013-07-10 11:30:13 -04:00
Jay Berkenbilt	212812d837	Fix errors reported by Coverity Thanks to Jiri Popelka from Red Hat for sending the output of a Coverity run over qpdf.	2013-07-07 15:36:51 -04:00
Jay Berkenbilt	eae8370cd9	Add optional /Length key in crypt filter dictionary	2013-06-14 20:42:39 -04:00
Jay Berkenbilt	a3576a7359	Bug fix: handle generation > 0 when generating object streams Rework QPDFWriter to always track old object IDs and QPDFObjGen instead of int, thus not discarding the generation number. Switch to QPDF::getCompressibleObjGen() to properly handle the case of an old object eligible for compression that has a generation of other than zero.	2013-06-14 14:58:09 -04:00
Jay Berkenbilt	690d6031db	Remove duplicated comment	2013-06-08 18:58:31 -04:00
Jay Berkenbilt	ac4deac187	Call QUtil::safe_fopen in place of fopen fopen was previously called wrapped by QUtil::fopen_wrapper, but QUtil::safe_fopen does this itself, which is less cumbersome.	2013-03-05 13:35:46 -05:00

213 Commits