mirror of
https://github.com/qpdf/qpdf.git
synced 2024-11-04 12:34:20 +00:00
d71f05ca07
This makes all integer type conversions that have potential data loss explicit with calls that do range checks and raise an exception. After this commit, qpdf builds with no warnings when -Wsign-conversion -Wconversion is used with gcc or clang or when -W3 -Wd4800 is used with MSVC. This significantly reduces the likelihood of potential crashes from bogus integer values. There are some parts of the code that take int when they should take size_t or an offset. Such places would make qpdf not support files with more than 2^31 of something that usually wouldn't be so large. In the event that such a file shows up and is valid, at least qpdf would raise an error in the right spot so the issue could be legitimately addressed rather than failing in some weird way because of a silent overflow condition.
2526 lines
99 KiB
Plaintext
2526 lines
99 KiB
Plaintext
2019-06-20 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Fix all integer sign and conversion warnings. This makes all
|
|
integer type conversions that have potential data loss explicit
|
|
with calls that do range checks and raise an exception.
|
|
|
|
* Change out_bufsize argument to Pl_Flate's constructor for int to
|
|
unsigned int for compatibility with underlying zlib
|
|
implementation.
|
|
|
|
* Change QPDFObjectHandle::pipeStreamData's encode_flags argument
|
|
from unsigned long to int since int is the underlying type of the
|
|
enumerated type values that are passed to it. This change should
|
|
be invisible to virtually all code unless you are compiling with
|
|
strict warning flags and explicitly casting to unsigned long.
|
|
|
|
* Add methods to QPDFObjectHandle to return the value of Integer
|
|
objects as int and unsigned int with range checking and fallback
|
|
behavior to avoid silent underflow/overflow conditions.
|
|
|
|
* Add functions to QUtil to convert unsigned integers to strings,
|
|
avoiding implicit conversion between unsigned and signed integer
|
|
types.
|
|
|
|
* Add QIC.hh, containing integer type converters that do range
|
|
checking.
|
|
|
|
2019-06-18 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Remove previously submitted qpdf_read_memory_fuzzer as it is a
|
|
small subset of qpdf_fuzzer.
|
|
|
|
2019-06-15 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Update CI (Azure Pipelines) to run tests with some sanitizers.
|
|
|
|
* Do "ideal integration" with oss-fuzz. This includes adding a
|
|
better fuzzer with a seed corpus and adding automated tests of the
|
|
fuzzer with the test data.
|
|
|
|
* When parsing files, while reading an object, if there are too
|
|
many consecutive errors without enough intervening successes, give
|
|
up on the specific object. This reduces cases in which very badly
|
|
damaged files send qpdf into a tail spin reading one character at
|
|
a time and reporting warnings.
|
|
|
|
2019-06-13 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Perform initial integration of Google's oss-fuzz project by
|
|
copying the fuzzer someone from Google already did into the qpdf
|
|
repository and adding build support. This shift in control is in
|
|
preparation for an ideal integration with oss-fuzz.
|
|
|
|
2019-06-09 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* When /DecodeParms is an empty list, ignore it on read and delete
|
|
it on write. Fixes #331.
|
|
|
|
2019-05-18 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 8.4.2: release
|
|
|
|
2019-05-16 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Fix memory error in Windows-only code from typo. Fixes #330.
|
|
|
|
2019-04-27 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 8.4.1: release
|
|
|
|
2019-04-20 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* When qpdf --version is run, it will detect if the qpdf CLI was
|
|
built with a different version of qpdf than the library. This
|
|
usually indicates that multiple versions of qpdf are installed and
|
|
that the library path is not set up properly. This situation
|
|
sometimes causes confusing behavior for users who are not actually
|
|
running the version of qpdf they think they are running.
|
|
|
|
* Add parameter --remove-page-labels to remove page labels from
|
|
output. In qpdf 8.3.0, the behavior changed so that page labels
|
|
were preserved when merging and splitting files. Some users were
|
|
relying on the fact that if you ran qpdf --empty --pages ... all
|
|
page labels were dropped. This option makes it possible to get
|
|
that behavior if it is explicitly desired. Fixes #317.
|
|
|
|
* Add parameter --keep-files-open-threshold to override the
|
|
maximum number of files that qpdf will allow to be kept open at
|
|
once. Fixes #288.
|
|
|
|
* Handle Unicode characters in filenames properly on Windows. The
|
|
changes to support Unicode on the CLI in Windows broke Unicode
|
|
filenames on that platform. Fixes #298.
|
|
|
|
* Slightly tighten logic that determines whether an object is a
|
|
page. The previous logic was sometimes failing to preserve
|
|
annotations because they were passing the overly loose test for
|
|
whether something was a page. This fix has a slight risk of
|
|
causing some extraneous objects to be copied during page splitting
|
|
and merging for erroneous PDF files whose page objects contain
|
|
invalid types or are missing the /Type key entirely, both of which
|
|
would be invalid according to the PDF specification.
|
|
|
|
* Revert change that included preservation of outlines (bookmarks)
|
|
in --split-pages. The way it was implemented caused a very
|
|
significant performance penalty when splitting pages with
|
|
outlines. We need a better solution that only copies the relevant
|
|
items, not the whole tree.
|
|
|
|
2019-03-11 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* JSON serialization: add missing leading 0 to decimal values
|
|
between -1 and 1. Fixes #308.
|
|
|
|
2019-02-01 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 8.4.0: release
|
|
|
|
2019-01-31 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Bug fix: do better pre-checks on images before optimizing;
|
|
refuse to optimize images that can't be converted to JPEG because
|
|
of colorspace or depth.
|
|
|
|
* Add new options --externalize-inline-images, which converts
|
|
inline images larger than a specified size to regular images, and
|
|
--ii-min-bytes, which tweaks that size.
|
|
|
|
* When optimizing images, inline images are now included in the
|
|
optimization, first being converted to regular images. Use
|
|
--keep-inline-images to exclude them from optimization. Fixes #278.
|
|
|
|
* Add method QPDFPageObjectHelper::externalizeInlineImages, which
|
|
converts inline images whose size is at least a specified amount
|
|
to regular images.
|
|
|
|
* Remove traces of acroread, which hasn't been available in Linux
|
|
for a long time.
|
|
|
|
2019-01-30 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Do not include space after ID operator in inline image data. The
|
|
token now correctly contains the image data, the EI operator,
|
|
and the delimiter that precedes the EI operator.
|
|
|
|
* Improve locating of an inline image's EI operator to correctly
|
|
handle the case of EI appearing inside the image data.
|
|
|
|
* Very low-level QPDFTokenizer API now includes an
|
|
expectInlineImage method that takes an input stream, enabling it
|
|
to locate an inline image's EI operator better. When this method
|
|
is called, the inline image token returned will not contain the EI
|
|
operator and will contain correct image data. This is called
|
|
automatically everywhere within the qpdf library. Most user code
|
|
will never have to use the low-level tokenizer API. If you use
|
|
Pl_QPDFTokenizer, this will be done automatically for you. If you
|
|
use the low-level API and call expectInlineImage, you should call
|
|
the new version.
|
|
|
|
2019-01-29 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Bug fix: when returning an inline image token, the tokenizer no
|
|
longer includes the delimiter that follows EI. The
|
|
QPDFObjectHandle created from the token was correct.
|
|
|
|
* Handle files with direct page objects, which is not allowed by
|
|
the PDF spec but has been seen in the wild. Fixes #164.
|
|
|
|
2019-01-28 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Bug fix: when using --stream-data=compress, object streams and
|
|
xref streams were not compressed. They were compressed if no
|
|
--stream-data option was specified. Fixes #271.
|
|
|
|
* When linearizing or getting the list of all pages in a file,
|
|
replace duplicated page objects with a shallow copy of the page
|
|
object. Linearization and all page manipulation APIs require page
|
|
objects to be unique. Pages that were originally duplicated will
|
|
still share contents and any other indirect resources. Fixes #268.
|
|
|
|
2019-01-26 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add --overlay and --underlay options. Fixes #207.
|
|
|
|
* Create examples/pdf-overlay-page.cc to demonstrate use of
|
|
page/form XObject interaction
|
|
|
|
* Add new methods QPDFPageObjectHelper::getFormXObjectForPage,
|
|
which creates a form XObject equivalent to a page, and
|
|
QPDFObjectHandle::placeFormXObject, which generates content stream
|
|
code to placing a form XObject on a page.
|
|
|
|
2019-01-25 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add new method QPDFObjectHandle::getUniqueResourceName() to
|
|
return an unused key available to be used in a resource
|
|
dictionary.
|
|
|
|
* Add new method QPDFPageObjectHelper::getAttribute() that
|
|
properly handles inherited attributes and allows for creation of a
|
|
copy of shared attributes. This is very useful if you are getting
|
|
an attribute of a page dictionary with the intent to modify it
|
|
privately for that page.
|
|
|
|
* Fix QPDFPageObjectHelper::getPageImages (and the legacy
|
|
QPDFObjectHandle::getPageImages()) to properly handle images in
|
|
inherited resources dictionaries.
|
|
|
|
2019-01-20 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Tweak the content code generated for variable text fields to
|
|
better handle font sizes and multi-line text.
|
|
|
|
* When generating appearance streams for variable text
|
|
annotations, properly handle the cases of there being no
|
|
appearance dictionary, no appearance stream, or an appearance
|
|
stream with no BMC..EMC marker.
|
|
|
|
* When flattening annotations, remove annotations from the file
|
|
that don't have appearance streams. These were previously being
|
|
preserved, but since they are invisible, there is no reason to
|
|
preserve them when flattening annotations.
|
|
|
|
2019-01-19 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* NOTE: qpdf CLI: some non-compatible changes were made to how
|
|
qpdf interprets password arguments that contain Unicode characters
|
|
that fall outside of ASCII. On Windows, the non-compatibility was
|
|
unavoidable, as explained in the release notes. On all platforms,
|
|
it is possible to get the old behavior if desired, though the old
|
|
behavior would almost always result in files that other
|
|
applications were unable to open. As it stands, qpdf should now be
|
|
able to open passwords encrypted with a wide range of passwords
|
|
that some other viewers might not handle, though even now, qpdf's
|
|
Unicode password handling is not 100% complete.
|
|
|
|
* Add --password-mode option, which allows fine-grained control of
|
|
how password arguments are treated. This is discussed fully in the
|
|
manual. Fixes #215.
|
|
|
|
* Add option --suppress-password-recovery to disable the behavior
|
|
of searching for a correct password by re-encoding the provided
|
|
password. This option can be useful if you want to ensure you know
|
|
exactly what password is being used.
|
|
|
|
2019-01-17 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* When attempting to open an encrypted file with a password, if
|
|
the password doesn't work, try alternative passwords created by
|
|
re-interpreting the supplied password with different string
|
|
encodings. This makes qpdf able to recover passwords with
|
|
non-ASCII characters when either the decryption or encryption
|
|
operation was performed with an incorrectly encoded password.
|
|
|
|
* Fix data loss bug: qpdf was discarding referenced resources in
|
|
the case in which a page's resource dictionary contained an
|
|
indirect reference for either /Font or /XObject that contained
|
|
fonts or XObjects not referenced on all pages that shared the
|
|
resource. This was a "typo" in the code. The comment explained the
|
|
correct behavior, and the code was clearly intended to handle this
|
|
issue, but the implementation had an error in it. This is fixed by
|
|
a single-line change, which can be found in git commit
|
|
4bc434000c42a7191e705c8a38216ca6743ad9ff. That commit can be used
|
|
as a patch that applies cleanly against qpdf 8.1.0 and forward.
|
|
The bug was introduced in version 8.1.0. For the record, this is
|
|
the first bug in qpdf's history that could result in silent loss
|
|
of data when processing a correct input file. Fixes #276.
|
|
|
|
2019-01-15 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add QUtil::possible_repaired_encodings which, given a string,
|
|
generates other strings that represent re-interpretation of the
|
|
bytes in a different coding system. This is used to help recover
|
|
passwords if the password string was improperly encoded on a
|
|
different system due to user error or a software bug.
|
|
|
|
2019-01-14 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add new CLI flags to 128-bit and 256-bit encryption: --assemble,
|
|
--annotate, --form, and --modify-other to control encryption
|
|
permissions with more granularity than was allowed with the
|
|
--modify flag. Fixes #214.
|
|
|
|
* Add new versions of
|
|
QPDFWriter::setR{3,4,5,6}EncryptionParameters that allow
|
|
individual setting of the various permission bits. The old
|
|
interfaces are retained for backward compatibility. In the "C"
|
|
API, add qpdf_set_r{3,4,5,6}_encryption_parameters2. The new
|
|
interfaces use separate booleans for various permissions instead
|
|
of the qpdf_r3_modify_e enumerated type, which set permission bits
|
|
in predefined groups.
|
|
|
|
* Add versions of utf8 to single-byte character transcoders that
|
|
return a success code.
|
|
|
|
2019-01-13 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add several more string transcoding and analysis methods to
|
|
QUtil for bidirectional conversion between PDF Doc, Win Ansi, Mac
|
|
Roman, UTF-6, and UTF-16 along with detection of valid UTF-8 and
|
|
UTF-16.
|
|
|
|
2019-01-12 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* In the --pages option, allow the same page to be specified more
|
|
than once. You can now do "--pages A.pdf 1,1 --" or
|
|
"--pages A.pdf 1 A.pdf 1" instead of having to use two different
|
|
paths to specify A.pdf. Fixes #272.
|
|
|
|
* Add QPDFPageObjectHelper::shallowCopyPage(). This method creates
|
|
a new page object that is a "shallow copy" of the given page as
|
|
described in the comments in QPDFPageObjectHelper. The resulting
|
|
object has not been added anywhere but is ready to be passed to
|
|
QPDFPageDocumentHelper::addPage of its own QPDF or another QPDF
|
|
object.
|
|
|
|
* Add QPDF::getUniqueId() method to return an identifier that is
|
|
intended to be unique within the scope of all QPDF objects created
|
|
by the calling application in a single run.
|
|
|
|
* In --pages, allow "." as a replacement for the current input
|
|
file, making it possible to say "qpdf A.pdf --pages . 1-3 --"
|
|
instead of having to repeat the input filename.
|
|
|
|
2019-01-10 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add new configure option --enable-avoid-windows-handle, which
|
|
causes the symbol AVOID_WINDOWS_HANDLE to be defined. If set, we
|
|
avoid using Windows I/O HANDLE, which is disallowed in some
|
|
versions of the Windows SDK, such as for Windows phones.
|
|
QUtil::same_file will always return false in this case. Only
|
|
applies to Windows builds.
|
|
|
|
* Add new method QPDF::setImmediateCopyFrom. When called on a
|
|
source QPDF object, streams can be copied FROM that object to
|
|
other ones without having to keep the source QPDF or its input
|
|
source around. The cost is copying the streams into RAM. See
|
|
comments in QPDF.hh for setImmediateCopyFrom for a detailed
|
|
explanation.
|
|
|
|
2019-01-07 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 8.3.0: release
|
|
|
|
* Add sample completion files in completions. These can be used by
|
|
packagers to install on the system wherever bash and zsh keep
|
|
their vendor-supplied completions.
|
|
|
|
* Add configure flag --enable-check-autofiles, which is on by
|
|
default. Packagers whose packaging systems automatically refresh
|
|
autoconf or libtool files should pass --disable-check-autofiles to
|
|
./configure to suppress warnings about automatically generated
|
|
files being outdated.
|
|
|
|
2019-01-06 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Remove the restriction in most cases that the source QPDF used
|
|
in a copyForeignObject call has to stick around until the
|
|
destination QPDF is written. The exceptional case is when the
|
|
source stream gets is data using a
|
|
QPDFObjectHandle::StreamDataProvider. For a more in-depth
|
|
discussion, see comments around copyForeignObject in QPDF.hh.
|
|
Fixes #219.
|
|
|
|
2019-01-05 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* When generating appearances, if the font uses one of the
|
|
standard, built-in encodings, restrict the character set to that
|
|
rather than just to ASCII. This will allow most appearances to
|
|
contain characters from the ISO-Latin-1 range plus a few
|
|
additional characters.
|
|
|
|
* Add methods QUtil::utf8_to_win_ansi and
|
|
QUtil::utf8_to_mac_roman.
|
|
|
|
* Add method QUtil::utf8_to_utf16.
|
|
|
|
2019-01-04 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add new option --optimize-images, which recompresses every image
|
|
using DCT (JPEG) compression as long as the image is not already
|
|
compressed with lossy compression and recompressing the image
|
|
reduces its size. The additional options --oi-min-width,
|
|
--oi-min-height, and --oi-min-area prevent recompression of images
|
|
whose width, height, or pixel area (width * height) are below a
|
|
specified threshold.
|
|
|
|
* Add new option --collate. When specified, the semantics of
|
|
--pages change from concatenation to collation. See the manual for
|
|
a more detailed discussion. Fixes #259.
|
|
|
|
* Add new method QPDFWriter::getFinalVersion, which returns the
|
|
PDF version that will ultimately be written to the final file. See
|
|
comments in QPDFWriter.hh for some restrictions on its use. Fixes
|
|
#266.
|
|
|
|
* When unexpected errors are found while checking linearization
|
|
data, print an error message instead of calling assert, which
|
|
cause the program to crash. Fixes #209, #231.
|
|
|
|
* Detect and recover from dangling references. If a PDF file
|
|
contained an indirect reference to a non-existent object (which is
|
|
valid), when adding a new object to the file, it was possible for
|
|
the new object to take the object ID of the dangling reference,
|
|
thereby causing the dangling reference to point to the new object.
|
|
This case is now prevented. Fixes #240.
|
|
|
|
2019-01-03 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add --generate-appearances flag to the qpdf command-line tool to
|
|
trigger generation of appearance streams.
|
|
|
|
* Fix behavior of form field value setting to handle the following
|
|
cases:
|
|
- Strings are always written as UTF-16
|
|
- Check boxes and radio buttons are handled properly with
|
|
synchronization of values and appearance states
|
|
|
|
* Define constants in qpdf/Constants.h for interpretation of
|
|
annotation and form field flags
|
|
|
|
* Add QPDFAnnotationObjectHelper::getFlags
|
|
|
|
* Add many new methods to QPDFFormFieldObjectHelper for querying
|
|
flags and field types
|
|
|
|
* Add new methods for appearance stream generation. See comments
|
|
in QPDFFormFieldObjectHelper.hh for generateAppearance() for a
|
|
description of limitations.
|
|
- QPDFAcroFormDocumentHelper::generateAppearancesIfNeeded
|
|
- QPDFFormFieldObjectHelper::generateAppearance
|
|
|
|
* Bug fix: when writing form field values, always write string
|
|
values encoded as UTF-16.
|
|
|
|
* Add method QUtil::utf8_to_ascii, which returns an ASCII string
|
|
for a UTF-8 string, replacing out-of-range characters with a
|
|
specified substitute.
|
|
|
|
2019-01-02 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add method QPDFObjectHandle::getResourceNames that returns a set
|
|
of strings representing all second-level keys in a dictionary
|
|
(i.e. all keys of all direct dictionary members).
|
|
|
|
2018-12-31 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add --flatten-annotations flag to the qpdf command-line tool for
|
|
annotation flattening.
|
|
|
|
* Add methods for flattening form fields and annotations:
|
|
- QPDFPageDocumentHelper::flattenAnnotations - integrate
|
|
annotation appearance streams into page contents with special
|
|
handling for form fields: if appearance streams are up to date
|
|
(/NeedAppearances is false in /AcroForm), the /AcroForm key of
|
|
the document catalog is removed. Otherwise, a warning is
|
|
issued, and form fields are ignored. Non-form-field
|
|
annotations are always flattened if an appearance stream can
|
|
be found.
|
|
- QPDFAnnotationObjectHelper::getPageContentForAppearance -
|
|
generate the content stream fragment to render an appearance
|
|
stream in a page's content stream as a form xobject. Called by
|
|
flattenAnnotations.
|
|
|
|
* Add method QPDFObjectHandle::mergeResources(), which merges
|
|
resource dictionaries. See detailed description in
|
|
QPDFObjectHandle.hh.
|
|
|
|
* Add QPDFObjectHandle::Matrix, similar to
|
|
QPDFObjectHandle::Rectangle, as a convenience class for
|
|
six-element arrays that are used as matrices.
|
|
|
|
2018-12-23 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* When specifying @arg on the command line, if the file "arg" does
|
|
not exist, just treat this is a normal argument. This makes it
|
|
easier to deal with files whose names start with the @ character.
|
|
Fixes #265.
|
|
|
|
* Tweak completion so it works with zsh as well using
|
|
bashcompinit.
|
|
|
|
2018-12-22 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add new options --json, --json-key, and --json-object to
|
|
generate a json representation of the PDF file. This is described
|
|
in more depth in the manual. You can also run qpdf --json-help to
|
|
get a description of the json format.
|
|
|
|
2018-12-21 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Allow --show-object=trailer for showing the document trailer.
|
|
|
|
* You can now use eval $(qpdf --completion-bash) to enable bash
|
|
completion for qpdf. It's not perfect, but it works pretty well.
|
|
|
|
2018-12-19 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* When splitting pages using --split-pages, the outlines
|
|
dictionary and some supporting metadata are copied into the split
|
|
files. The result is that all bookmarks from the original file
|
|
appear, and those that point to pages that are preserved work
|
|
while those that point to pages that are not preserved don't do
|
|
anything. This is an interim step toward proper support for
|
|
bookmark preservation in split files.
|
|
|
|
* Add QPDFOutlineDocumentHelper and QPDFOutlineObjectHelper for
|
|
handling outlines (bookmarks) including bidirectionally mapping
|
|
between bookmarks and pages. Initially there is no support for
|
|
modifying the outlines hierarchy.
|
|
|
|
2018-12-18 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* New method QPDFObjectHandle::getJSON() returns a JSON object
|
|
with a partial representation of the object. See
|
|
QPDFObjectHandle.hh for a detailed description.
|
|
|
|
* Add a simple JSON serializer. This is not a complete or
|
|
general-purpose JSON library. It allows assembly and serialization
|
|
of JSON structures with some restrictions, which are described in
|
|
the header file.
|
|
|
|
* Add QPDFNameTreeObjectHelper class. This class provides useful
|
|
methods for dealing with name trees, which are discussed in
|
|
section 7.9.6 of the PDF spec (ISO-32000).
|
|
|
|
* Preserve page labels when merging and splitting files. Prior
|
|
versions of qpdf simply preserved the page label information from
|
|
the first file, which usually wouldn't make any sense in the
|
|
merged file. Now any page that had a page number in any original
|
|
file will have the same page number after merging or splitting.
|
|
|
|
* Add QPDFPageLabelDocumentHelper class. This is a document helper
|
|
class that provides useful methods for dealing with page labels.
|
|
It abstracts the fact that they are stored as number trees and
|
|
deals with interpolating intermediate values that are not in the
|
|
tree. It also has helper functions used by the qpdf command line
|
|
tool to preserve page labels when merging and splitting files.
|
|
|
|
* Add QPDFNumberTreeObjectHelper class. This class provides useful
|
|
methods for dealing with number trees, which are discussed in
|
|
section 7.9.7 of the PDF spec (ISO-32000). Page label dictionaries
|
|
are represented as number trees.
|
|
|
|
* New method QPDFObjectHandle::wrapInArray returns the object
|
|
itself if it is an array. Otherwise, it returns an array
|
|
containing the object. This is useful for dealing with PDF data
|
|
that is sometimes expressed as a single element and sometimes
|
|
expressed as an array, which is a somewhat common PDF idiom.
|
|
|
|
2018-10-11 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Files generated by autogen.sh are now committed so that it is
|
|
possible to build on platforms without autoconf directly from a
|
|
clean checkout of the repository. The configure script detects if
|
|
the files are out of date when it also determines that the tools
|
|
are present to regenerate them.
|
|
|
|
* Add build in Azure Pipelines, now that it is free for open
|
|
source projects.
|
|
|
|
2018-08-18 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 8.2.1: release
|
|
|
|
* Add new option --keep-files-open=[yn] to control whether qpdf
|
|
keeps files open when merging. Prior to version 8.1.0, qpdf always
|
|
kept all files open, but this meant that the number of files that
|
|
could be merged was limited by the operating system's open file
|
|
limit. Version 8.1.0 opened files as they were referenced, but
|
|
this caused a major performance impact. Version 8.2.0 optimized
|
|
the performance but did so in a way that, for local file systems,
|
|
there was a small but unavoidable performance hit, but for
|
|
networked file systems, the performance impact could be very high.
|
|
Starting with version 8.2.1, the default behavior is that files
|
|
are kept open if no more than 200 files are specified, but that
|
|
the behavior can be explicitly overridden with the
|
|
--keep-files-open flag. If you are merging more than 200 files but
|
|
less than the operating system's max open files limit, you may
|
|
want to use --keep-files-open=y. If you are using a local file
|
|
system where the overhead is low and you might sometimes merge
|
|
more than the OS limit's number of files, you may want to specify
|
|
--keep-files-open=n. Fixes #237.
|
|
|
|
2018-08-16 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 8.2.0: release
|
|
|
|
2018-08-14 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* For the mingw builds, change the name of the DLL import library
|
|
from libqpdf.a to libqpdf.dll.a to avoid confusing it with a
|
|
static library. This potentially clears the way for supporting a
|
|
static library in the future, though presently, the qpdf Windows
|
|
build only builds the DLL and executables. Fixes #225.
|
|
|
|
2018-08-13 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add new class QPDFSystemError, derived from std::runtime_error,
|
|
which is now thrown by QUtil::throw_system_error. This enables the
|
|
triggering errno value to be retrieved. Fixes #221.
|
|
|
|
2018-08-12 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* qpdf command line: add --no-warn option to suppress issuing
|
|
warning messages. If there are any conditions that would have
|
|
caused warnings to be issued, the exit status is still 3.
|
|
|
|
* Rewrite the internals of Pl_Buffer to be much more efficient in
|
|
use of memory at a very slight performance cost. The old
|
|
implementation could cause memory usage to go out of control for
|
|
files with large images compressed using the TIFF predictor.
|
|
Fixes #228.
|
|
|
|
2018-08-05 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Bug fix: end of line characters were not properly handled inside
|
|
strings in some cases. Fixes #226.
|
|
|
|
* Bug fix: infinite loop on progress reporting for very small
|
|
files. Fixes #230.
|
|
|
|
2018-08-04 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Performance fix: optimize page merging operation to avoid
|
|
unnecessary open/close calls on files being merged. Fixes #217.
|
|
|
|
* Add ClosedFileInputSource::stayOpen method, enabling a
|
|
ClosedFileInputSource to stay open during manually indicated
|
|
periods of high activity, thus reducing the overhead of frequent
|
|
open/close operations.
|
|
|
|
2018-06-23 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 8.1.0: release
|
|
|
|
2018-06-22 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Bug fix: properly decrypt files with 40-bit keys that use
|
|
revision 3 of the security handler. Prior to this, qpdf was
|
|
reporting "invalid password" in this case. Fixes #212.
|
|
|
|
* With --verbose, print information about each input file when
|
|
merging files.
|
|
|
|
* Add progress reporting to QPDFWriter. Programmatically, you can
|
|
register a progress reporter with registerProgressReporter(). From
|
|
the command line, passing --progress will give progress indicators
|
|
in increments of no less than 1% as output files are written.
|
|
Fixes #200.
|
|
|
|
* Add new method QPDF::getObjectCount(). This gives an approximate
|
|
(upper bound) account of objects in the QPDF object.
|
|
|
|
* Don't leave files open when merging. This makes it possible
|
|
merge more files at once than the operating system's open file
|
|
limit. Fixes #154.
|
|
|
|
* Add ClosedFileInputSource class, and input source that keeps its
|
|
input file closed when not reading it. At the expense of some
|
|
performance, this allows you to operate on many files without
|
|
opening too many files at the operating system level.
|
|
|
|
* Add new option --preserve-unreferenced-resources, which
|
|
suppresses removal of unreferenced objects from page resource
|
|
dictionaries during page splitting operations.
|
|
|
|
2018-06-21 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add method QPDFPageObjectHelper::removeUnreferencedResources and
|
|
also QPDFPageDocumentHelper::removeUnreferencedResources that
|
|
calls the former on every page. This method removes any XObject or
|
|
Font references from the page's resource dictionary if they are
|
|
not referenced anywhere in any of the content streams. This
|
|
significantly reduces the size of split files whose pages
|
|
internally share resource dictionaries. Fixes #203.
|
|
|
|
* The --rotate option to qpdf no longer requires an explicit page
|
|
range. You can now rotate all pages of a document with
|
|
qpdf --rotate=angle in.pdf out.pdf. Fixes #211.
|
|
|
|
* Create examples/pdf-set-form-values.cc to illustrate use of
|
|
interactive form helpers.
|
|
|
|
* Added methods QPDFAcroFormDocumentHelper::setNeedAppearances and
|
|
added methods to QPDFFormFieldObjectHelper to set a field's value,
|
|
optionally updating the document to indicate that appearance
|
|
streams need to be regenerated.
|
|
|
|
* Added QPDFObject::newUnicodeString and QPDFObject::unparseBinary
|
|
to allow for more convenient creation of strings that are
|
|
explicitly encoded in UTF-16 BE. This is useful for creating
|
|
Unicode strings that appear outside of content streams, such as in
|
|
page labels, outlines, form field values, etc.
|
|
|
|
2018-06-20 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Added new classes QPDFAcroFormDocumentHelper,
|
|
QPDFFormFieldObjectHelper, and QPDFAnnotationObjectHelper to
|
|
assist with working with interactive forms in PDF files. At
|
|
present, API methods for reading forms, form fields, and widget
|
|
annotations have been added. It is likely that some additional
|
|
methods for modifying forms will be added in the future. Note that
|
|
qpdf remains a library whose function is primarily focused around
|
|
document structure and metadata rather than content. As such, it
|
|
is not expected that qpdf will have higher level APIs for
|
|
generating form contents, but qpdf will hopefully gain the
|
|
capability to deal with the bookkeeping aspects of wiring up all
|
|
the objects, which could make it a useful library for other
|
|
software that works with PDF interactive forms. PDF forms are
|
|
complex, and the terminology around them is confusing. Please see
|
|
comments at the top of QPDFAcroFormDocumentHelper.hh for
|
|
additional discussion.
|
|
|
|
* Added new classes QPDFPageDocumentHelper and QPDFPageObjectHelper
|
|
for page-level API functions. These classes introduce a new API
|
|
pattern of document helpers and object helpers in qpdf. The helper
|
|
classes provide a higher level API for working with certain types
|
|
of structural features of PDF while still staying true to qpdf's
|
|
philosophy of not isolating the user from the underlying
|
|
structure. Please see the chapter in the documentation entitled
|
|
"Design and Library Notes" for additional discussion. The examples
|
|
have also been updated to use QPDFPageDocumentHelper and
|
|
QPDFPageObjectHelper when performing page-level operations.
|
|
|
|
2018-06-19 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* New QPDFObject::Rectangle class will convert to and from arrays
|
|
of four numerical values. Rectangles are used in various places
|
|
within the PDF file format and are called out as a specific data
|
|
type in the PDF specification.
|
|
|
|
2018-05-12 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* In newline before endstream mode, an extra newline was not
|
|
inserted prior to the endstream that ends object streams.
|
|
Fixes #205.
|
|
|
|
2018-04-15 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Arbitrarily limit the depth of data structures represented by
|
|
direct object. This is CVE-2018-9918. Fixes #202.
|
|
|
|
2018-03-06 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 8.0.2: release
|
|
|
|
* Properly handle pages with no contents. Fixes #194.
|
|
|
|
2018-03-05 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Improve handling of loops while following cross reference
|
|
tables. Fixes #192.
|
|
|
|
2018-03-04 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 8.0.1: release
|
|
|
|
* On the command line when specifying page ranges, support
|
|
preceding a page number by "r" to indicate that it should be
|
|
counted from the end. For example, the range r3-r1 would indicate
|
|
the last three pages of a document.
|
|
|
|
2018-03-03 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Ignore zlib data check errors while uncompressing streams. This
|
|
is consistent with behaviors of other readers and enables handling
|
|
of some incorrectly written zlib streams. Fixes #191.
|
|
|
|
2018-02-25 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 8.0.0: release
|
|
|
|
2018-02-17 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Fix QPDFObjectHandle::getUTF8Val() to properly handle strings
|
|
that are encoded with PDF Doc Encoding. Fixes #179.
|
|
|
|
* Add qpdf_check_pdf to the "C" API. This method just attempts to
|
|
read the entire file and produce no output, making possible to
|
|
assess whether the file has any errors that qpdf can detect.
|
|
|
|
* Major enhancements to handling of type errors within the qpdf
|
|
library. This fix is intended to eliminate those annoying cases
|
|
where qpdf would exit with a message like "operation for
|
|
dictionary object attempted on object of wrong type" without
|
|
providing any context. Now qpdf keeps enough context to be able to
|
|
issue a proper warning and to handle such conditions in a sensible
|
|
way. This should greatly increase the number of bad files that
|
|
qpdf can recover, and it should make it much easier to figure out
|
|
what's broken when a file contains errors.
|
|
|
|
* Error message fix: replace "file position" with "offset" in
|
|
error messages that report lexical or parsing errors. Sometimes
|
|
it's an offset in an object stream or a content stream rather than
|
|
a file position, so this makes the error message less confusing in
|
|
those cases. It still requires some knowledge to find the exact
|
|
position of the error, since when it's not a file offset, it's
|
|
probably an offset into a stream after uncompressing it.
|
|
|
|
* Error message fix: correct some cases in which the object that
|
|
contained a lexical error was omitted from the error message.
|
|
|
|
* Error message fix: improve file name in the error message when
|
|
there is a parser error inside an object stream.
|
|
|
|
2018-02-11 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add QPDFObjectHandle::filterPageContents method to provide a
|
|
different interface for applying token filters to page contents
|
|
without modifying the ultimate output.
|
|
|
|
2018-02-04 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Changes listed on today's date are numerous and reflect
|
|
significant enhancements to qpdf's lexical layer. While many
|
|
nuances are discussed and a handful of small bugs were fixed, it
|
|
should be emphasized that none of these issues have any impact on
|
|
any output or behavior of qpdf under "normal" operation. There are
|
|
some changes that have an effect on content stream normalization
|
|
as with qdf mode or on code that interacts with PDF files
|
|
lexically using QPDFTokenizer. There are no incompatible changes
|
|
for normal operation. There are a few changes that will affect the
|
|
exact error messages issued on certain bad files, and there is a
|
|
small non-compatible enhancement regarding the behavior of
|
|
manually constructed QPDFTokenizer::Token objects. Users of the
|
|
qpdf command line tool will see no changes other than the addition
|
|
of a new command-line flag and possibly some improved error
|
|
messages.
|
|
|
|
* Significant lexer (tokenizer) enhancements. These are changes to
|
|
the QPDFTokenizer class. These changes are of concern only to
|
|
people who are operating with PDF files at the lexical layer using
|
|
qpdf. They have little or no impact on most high-level interfaces
|
|
or the command-line tool.
|
|
|
|
New token types tt_space and tt_comment to recognize whitespace
|
|
and comments. this makes it possible to tokenize a PDF file or
|
|
stream and preserve everything about it.
|
|
|
|
For backward compatibility, space and comment tokens are not
|
|
returned by the tokenizer unless QPDFTokenizer.includeIgnorable()
|
|
is called.
|
|
|
|
Better handling of null bytes. These are now included in space
|
|
tokens rather than being their own "tt_word" tokens. This should
|
|
have no impact on any correct PDF file and has no impact on
|
|
output, but it may change offsets in some error messages when
|
|
trying to parse contents of bad files. Under default operation,
|
|
qpdf does not attempt to parse content streams, so this change is
|
|
mostly invisible.
|
|
|
|
Bug fix to handling of bad tokens at ends of streams. Now, when
|
|
allowEOF() has been called, these are treated as bad tokens
|
|
(tt_bad or an exception, depending on invocation), and a
|
|
separate tt_eof token is returned. Before the bad token
|
|
contents were returned as the value of a tt_eof token. tt_eof
|
|
tokens are always empty now.
|
|
|
|
Fix a bug that would, on rare occasions, report the offset in an
|
|
error message in the wrong space because of spaces or comments
|
|
adjacent to a bad token.
|
|
|
|
Clarify in comments exactly where the input source is positioned
|
|
surrounding calls to readToken and getToken.
|
|
|
|
* Add a new token type for inline images. This token type is only
|
|
returned by QPDFTokenizer immediately following a call to
|
|
expectInlineImage(). This change includes internal refactoring of
|
|
a handful of places that all separately handled inline images, The
|
|
logic of detecting inline images in content streams is now handled
|
|
in one place in the code. Also we are more flexible about what
|
|
characters may surround the EI operator that marks the end of an
|
|
inline image.
|
|
|
|
* New method QPDFObjectHandle::parsePageContents() to improve upon
|
|
QPDFObjectHandle::parseContentStream(). The parseContentStream
|
|
method used to operate on a single content stream, but was fixed
|
|
to properly handle pages with contents split across multiple
|
|
streams in an earlier release. The new method parsePageContents()
|
|
can be called on the page object rather than the value of the
|
|
page dictionary's /Contents key. This removes a few lines of
|
|
boiler-plate code from any code that uses parseContentStream, and
|
|
it also enables creation of more helpful error messages if
|
|
problems are encountered as the error messages can include
|
|
information about which page the streams come from.
|
|
|
|
* Update content stream parsing example
|
|
(examples/pdf-parse-content.cc) to use new
|
|
QPDFObjectHandle::parsePageContents() method in favor of the older
|
|
QPDFObjectHandle::parseContentStream() method.
|
|
|
|
* Bug fix: change where the trailing newline is added to a stream
|
|
in QDF mode when content normalization is enabled (the default for
|
|
QDF mode). Before, the content normalizer ensured that the output
|
|
ended with a trailing newline, but this had the undesired side
|
|
effect of including the newline in the stream data for purposes of
|
|
length computation. QPDFWriter already appends a newline without
|
|
counting in length for better readability. Ordinarily this makes
|
|
no difference, but in the rare case of a page's contents being
|
|
split in the middle of a token, the old behavior could cause the
|
|
extra newline to be interpreted as part of the token. This bug
|
|
could only be triggered in qdf mode, which is a mode intended for
|
|
manual inspection of PDF files' contents, so it is very unlikely
|
|
to have caused any actual problems for people using qpdf for
|
|
production use. Even if it did, it would be very unusual for a PDF
|
|
file to actually be adversely affected by this issue.
|
|
|
|
* Add support for coalescing a page's contents into a single
|
|
stream if they are represented as an array of streams. This can be
|
|
performed from the command line using the --coalesce-contents
|
|
option. Coalescing content streams can simplify things for
|
|
software that wants to operate on a page's content streams without
|
|
having to handle weird edge cases like content streams split in
|
|
the middle of tokens. Note that
|
|
QPDFObjectHandle::parsePageContents and
|
|
QPDFObjectHandle::parseContentStream already handled split content
|
|
streams. This is mainly to set the stage for new methods of
|
|
operating on page contents. The new method
|
|
QPDFObjectHandle::pipeContentStreams will pipe all of a page's
|
|
content streams though a single pipeline. The new method
|
|
QPDFObjectHandle.coalesceContentStreams, when called on a page
|
|
object, will do nothing if the page's contents are a single
|
|
stream, but if they are an array of streams, it will replace the
|
|
page's contents with a single stream whose contents are the
|
|
concatenation of the original streams.
|
|
|
|
* A few library routines throw exceptions if called on non-page
|
|
objects. These constraints have been relaxed somewhat to make qpdf
|
|
more tolerant of files whose page dictionaries are not properly
|
|
marked as such. Mostly exceptions about page operations being
|
|
called on non page objects will only be thrown in cases where the
|
|
operation had no chance of succeeding anyway. This change has no
|
|
impact on any default mode operations, but it could allow
|
|
applications that use page-level APIs in QPDFObjectHandle to be
|
|
more tolerant of certain types of damaged files.
|
|
|
|
* Add QPDFObjectHandle::TokenFilter class and methods to use it to
|
|
perform lexical filtering on content streams. You can call
|
|
QPDFObjectHandle::addTokenFilter on stream object, or you can call
|
|
the higher level QPDFObjectHandle::addContentTokenFilter on a page
|
|
object to cause the stream's contents to passed through a token
|
|
filter while being retrieved by QPDFWriter or any other consumer.
|
|
For details on using TokenFilter, please see comments in
|
|
QPDFObjectHandle.hh.
|
|
|
|
* Enhance the string, type QPDFTokenizer::Token constructor to
|
|
initialize a raw value in addition to a value. Tokens have a
|
|
value, which is a canonical representation, and a raw value. For
|
|
all tokens except strings and names, the raw value and the value
|
|
are the same. For strings, the value excludes the outer delimiters
|
|
and has non-printing characters normalized. For names, the value
|
|
resolves non-printing characters. In order to better facilitate
|
|
token filters that mostly preserve contents and to enable
|
|
developers to be mostly unconcerned about the nuances of token
|
|
values and raw values, creating string and name tokens now
|
|
properly handles this subtlety of values and raw values. When
|
|
constructing string tokens, take care to avoid passing in the
|
|
outer delimiters. This has always been the case, but it is now
|
|
clarified in comments in QPDFObjectHandle.hh::TokenFilter. This
|
|
has no impact on any existing code unless there's some code
|
|
somewhere that was relying on Token::getRawValue() returning an
|
|
empty string for a manually constructed token. The token class's
|
|
operator== method still only looks at type and value, not raw
|
|
value. For example, string tokens for <41> and (A) would still be
|
|
equal because both are representations of the string "A".
|
|
|
|
* Add QPDFObjectHandle::isDataModified method. This method just
|
|
returns true if addTokenFilter has been called on the stream. It
|
|
enables a caller to determine whether it is safe to optimize away
|
|
piping of stream data in cases where the input and output are
|
|
expected to be the same. QPDFWriter uses this internally to skip
|
|
the optimization of not re-compressing already compressed streams
|
|
if addTokenFilter has been called. Most developers will not have
|
|
to worry about this as it is used internally in the library in the
|
|
places that need it. If you are manually retrieving stream data
|
|
with QPDFObjectHandle::getStreamData or
|
|
QPDFObjectHandle::pipeStreamData, you don't need to worry about
|
|
this at all.
|
|
|
|
* Provide heavily annotated examples/pdf-filter-tokens.cc example
|
|
that illustrates use of some simple token filters.
|
|
|
|
* When normalizing content streams, as in qdf mode, issue warning
|
|
about bad tokens. Content streams are only normalized when this is
|
|
explicitly requested, so this has no impact on normal operation.
|
|
However, in qdf mode, if qpdf detects a bad token, it means that
|
|
either there's a bug in qpdf's lexer, that the file is damaged, or
|
|
that the page's contents are split in a weird way. In any of those
|
|
cases, qpdf could potentially damage the stream's contents by
|
|
replacing carriage returns with newlines or otherwise messing with
|
|
spaces. The mostly likely case of this would be an inline image's
|
|
compressed data being divided across two streams and having the
|
|
compressed data in the second stream contain a carriage return as
|
|
part of its binary data. If you are using qdf mode just to look at
|
|
PDF files in text editors, this usually doesn't matter. In cases
|
|
of contents split across multiple streams, coalescing streams
|
|
would eliminate the problem, so the warning mentions this. Prior
|
|
to this enhancement, the chances of qdf mode writing incorrect
|
|
data were already very low. This change should make it nearly
|
|
impossible for qdf mode to unknowingly write invalid data.
|
|
|
|
2018-02-04 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add QPDFWriter::setLinearizationPass1Filename method and
|
|
--linearize-pass1 command line option to allow specification of a
|
|
file into which QPDFWriter will write its intermediate
|
|
linearization pass 1 file. This is useful only for debugging qpdf.
|
|
qpdf creates linearized files by computing the output in two
|
|
passes. Ordinarily the first pass is discarded and not written
|
|
anywhere. This option allows it to be inspected.
|
|
|
|
2018-02-04 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 7.1.1: release
|
|
|
|
* Bug fix: properly linearize files whose /ID has a length of
|
|
other than 16 bytes.
|
|
|
|
* Rename some test files to avoid files with three dots in their
|
|
names. Fixes #173.
|
|
|
|
* Fix various build and compilation issues on some platforms and
|
|
compilers. Fixes #176, #172, #177
|
|
|
|
* Fix a few typos and clarify a few comments in header files.
|
|
|
|
2018-01-14 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 7.1.0: release
|
|
|
|
* Allow raw encryption key to be specified in library and command
|
|
line with the QPDF::setPasswordIsHexKey method and
|
|
--password-is-hex-key option. Allow encryption key to be displayed
|
|
with --show-encryption-key option. Thanks to Didier Stevens
|
|
<didier.stevens@gmail.com> for the idea and contribution of one
|
|
implementation of this idea. See his blog post at
|
|
https://blog.didierstevens.com/2017/12/28/cracking-encrypted-pdfs-part-3/
|
|
for a discussion of using this for cracking encrypted PDFs. I hope
|
|
that a future release of qpdf will include some additional
|
|
recovery options that may also make use of this capability.
|
|
|
|
2018-01-13 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Fix lexical error: the PDF specification allows floating point
|
|
numbers to end with ".". Fixes #165.
|
|
|
|
* Fix link order in the build to avoid conflicts when building
|
|
from source while an older version of qpdf is installed. Fixes #158.
|
|
|
|
* Add support for TIFF predictor for LZW and Flate streams. Now
|
|
all predictor functions are supported. Fixes #171.
|
|
|
|
2017-12-25 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Clarify documentation around options that control parsing but
|
|
not output creation. Two options: --suppress-recovery and
|
|
--ignore-xref-streams, were documented in the "Advanced
|
|
Transformation Options" section of the manual and --help output
|
|
even though they are not related to output. These are now
|
|
described in a separate section called "Advanced Parsing Options."
|
|
|
|
* Implement remaining PNG filters for decode. Prior versions could
|
|
decode only the "up" filter. Now all PNG filters (sub, up,
|
|
average, Paeth, optimal) are supported for decoding. Thanks to
|
|
Tobias Hoffmann for providing a test PDF file that has images with
|
|
all PNG filters along with different numbers of bits per sample
|
|
and samples per pixel, and thanks to Casey Rojas for providing
|
|
implementations of the remaining PNG filters.
|
|
|
|
The implementation of the remaining PNG filters changed the
|
|
interface to the private Pl_PNGFilter class, but this class's
|
|
header file is not in the installation, and there is no public
|
|
interface to the class. Within the library, the class is never
|
|
allocated on the stack; it is only ever dynamically allocated. As
|
|
such, this does not actually break binary compatibility of the
|
|
library.
|
|
|
|
2017-09-15 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 7.0.0: release
|
|
|
|
2017-09-12 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Relicense qpdf under version 2.0 of the Apache License rather
|
|
than version 2.0 of the Artistic License. Both are fine, but the
|
|
Apache License is in more widespread use, and I like it a little
|
|
better than Artistic-2.0. It is my intention that there be no
|
|
change in what you can or can't do with qpdf. Versions of qpdf
|
|
prior to version 7 were released under the terms of version 2.0 of
|
|
the Artistic License. At your option, you may continue to consider
|
|
qpdf to be licensed under those terms. Please see the manual for
|
|
additional information.
|
|
|
|
* Improve the error message that is issued when QPDFWriter
|
|
encounters a stream that can't be decoded. In particular, mention
|
|
that the stream will be copied without filtering to avoid data
|
|
loss.
|
|
|
|
* Add new methods to the C API to correspond to new additions to
|
|
QPDFWriter:
|
|
- qpdf_set_compress_streams
|
|
- qpdf_set_decode_level
|
|
- qpdf_set_preserve_unreferenced_objects
|
|
- qpdf_set_newline_before_endstream
|
|
|
|
2017-08-25 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Re-implement parser iteratively to avoid stack overflow on very
|
|
deeply nested arrays and dictionaries. Fixes #146.
|
|
|
|
* Detect infinite loop while finding additional xref tables. Fixes
|
|
#149.
|
|
|
|
2017-08-22 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 7.0.b1: release
|
|
|
|
* Convert all README files to markdown. Names changed as follows:
|
|
- README --> README.md
|
|
- README.hardening --> README-hardening.md
|
|
- README.maintainer --> README-maintainer.md
|
|
- README-what-to-download.txt --> README-what-to-download.md
|
|
- README-windows.txt --> README-windows.md
|
|
The file README-windows-install.txt remains a text file.
|
|
|
|
2017-08-21 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add support for writing PCLm files. Most of the work was done by
|
|
Sahil Arora <sahilarora.535@gmail.com> as part of a Google Summer
|
|
of Code project in 2017. PCLm support is useful only for clients
|
|
that specifically know how to create PCLm files. Support in qpdf
|
|
is just for ensuring that objects are written in the correct order
|
|
and for including some additional material in the output that is
|
|
required by the PCLm standard.
|
|
|
|
2017-08-19 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Remove --precheck-streams. This is enabled by default now
|
|
without any efficiency cost. This feature was never released.
|
|
|
|
* Update pdf-create example to illustrate use of additional image
|
|
compression filters.
|
|
|
|
* Add support for /RunLengthDecode and /DCTDecode:
|
|
- New pipeline types Pl_RunLength and Pl_DCT
|
|
- New command-line flags --compress-streams and --decode-level
|
|
to replace/enhance --stream-data
|
|
- New QPDFWriter::setCompressStreams and
|
|
QPDFWriter::setDecodeLevel methods
|
|
Please see documentation, header files, and help messages for
|
|
details on these new features.
|
|
|
|
2017-08-12 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add QPDFObjectHandle::rotatePage to apply rotation to a page
|
|
object. Add --rotate option to qpdf to specify page rotation from
|
|
the command line.
|
|
|
|
* Provide --verbose option that causes qpdf to print an indication
|
|
of what files it is writing.
|
|
|
|
* Change --single-pages to --split-pages and make it take an
|
|
optional argument specifying the number of pages per file.
|
|
|
|
2017-08-11 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Fix --newline-before-endstream to always add a newline before
|
|
endstream even if the last character was already a newline. This
|
|
is actually what's required by PDF/A. Fixes #133.
|
|
|
|
* Handle encrypted files whose encryption parameters are too
|
|
short. Fixes #96.
|
|
|
|
2017-08-10 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Remove dependency on libpcre.
|
|
|
|
* Be more forgiving of certain types of errors in the xref table
|
|
that don't interfere with interpreting the table.
|
|
|
|
* Remove unused "tracing" parameter from PointerHolder's
|
|
(T*, bool) constructor. This change breaks source code
|
|
compatibility, but since this argument to PointerHolder has not
|
|
used for a long time and the presence of a boolean parameter in
|
|
the primary constructor makes it too easy to use that by mistake
|
|
when trying to use PointerHolder for arrays, it seems like it's
|
|
finally time to take it out. If you have a compile error because
|
|
of this change, please check to see whether you intended to use
|
|
the (bool, T*) version of the constructor instead. If not, just
|
|
remove the second parameter.
|
|
|
|
2017-08-09 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* When recovering stream length, find endobj without endstream as
|
|
well as just looking for endstream. Be a little more lax about
|
|
where we allow it to be found.
|
|
|
|
2017-08-05 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add --single-pages option to cause output to be written to a
|
|
separate file for each page rather than one big file.
|
|
|
|
* Process --pages options earlier so that certain inspection
|
|
options, like --show-pages, can show the state after the merging
|
|
operations.
|
|
|
|
2017-08-02 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Fix off-by-one error in parsing pages options. Fixes #129.
|
|
|
|
2017-07-29 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Support @filename and @- in the qpdf command-line tool to read
|
|
command-line arguments, one per line, from the named file. @-
|
|
reads from standard input. Fixes #16.
|
|
|
|
* Detect when input file and output file are the same and exit to
|
|
avoid overwriting and losing input file. Fixes #29.
|
|
|
|
* When passing multiple inspection arguments, run --check first,
|
|
and defer exit until after all the checks have been run. This
|
|
makes it possible to force operations such as --show-xref to be
|
|
delayed until after recovery attempts have been made. For example,
|
|
if you have a file with a syntactically valid xref table that has
|
|
some offsets that are incorrect, running qpdf --check --show-xref
|
|
on that file will first recover the xref and the dump the
|
|
recovered xref, while just running qpdf --show-xref will show the
|
|
xref table as present in the file. Fixes #42.
|
|
|
|
* When recovering stream length, indicate the recovered length.
|
|
Fixes #44.
|
|
|
|
* Add --newline-before-endstream command-line option and
|
|
setNewlineBeforeEndstream method to QPDFWriter. This forces qpdf
|
|
to always add a newline before the endstream keyword. It is a
|
|
necessary but not sufficient condition for PDF/A compliance. Fixes
|
|
#103.
|
|
|
|
* Handle zlib data errors when decoding streams. Fixes #106.
|
|
|
|
* Improve handling of files where the "stream" keyword is not
|
|
followed by proper line terminators. Fixes #104.
|
|
|
|
* Fix content stream parsing to handle cases of structures within
|
|
the stream split across stream boundaries. Fixes #73.
|
|
|
|
2017-07-28 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add --preserve-unreferenced command-line option and
|
|
setPreserveUnreferencedObjects method to QPDFWriter. This option
|
|
causes QPDFWriter to write all objects from the input file to the
|
|
output file regardless of whether the objects are referenced.
|
|
Objects are written to the output file in numerical order from the
|
|
input file. This option has no effect for linearized files.
|
|
|
|
2017-07-27 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add --precheck-streams command-line option and setStreamPrecheck
|
|
method to QPDFWriter to tell QPDFWriter to attempt decoding a
|
|
stream fully before deciding whether to filter it or not.
|
|
|
|
* Recover gracefully from streams that aren't filterable because
|
|
the filter parameters are invalid in the stream dictionary or the
|
|
dictionary itself is invalid.
|
|
|
|
* Significantly improve recoverability from invalid qpdf objects.
|
|
Most conditions in basic object parsing that used to cause qpdf to
|
|
exit are now warnings. There are still many more opportunities for
|
|
improvements of this sort beyond just object parsing.
|
|
|
|
2017-07-26 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Fixes to infinite loops below also fix problems reported in
|
|
other issues and cover CVE-2017-11624, CVE-2017-11625,
|
|
CVE-2017-11626, and CVE-2017-11627.
|
|
|
|
* Don't attempt to interpret syntactic keywords (like R and
|
|
endobj) found while parsing content streams.
|
|
|
|
* Detect infinite loops while resolving objects. This could happen
|
|
if something inside an object that had to be resolved during
|
|
parsing, such as a stream length, recursively referenced the
|
|
object being resolved.
|
|
|
|
* CVE-2017-9208: Handle references to and appearance of object 0
|
|
as a special case. Object 0 is not allowed, and qpdf was using it
|
|
internally to represent direct objects.
|
|
|
|
* CVE-2017-9209: Fix infinite loop caused by attempting to
|
|
reconstruct the xref table while already in the process of
|
|
reconstructing the xref table.
|
|
|
|
* CVE-2017-9210: Fix infinite loop caused by attempting to unparse
|
|
an object for inclusion in the text of an exception.
|
|
|
|
2015-11-10 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 6.0.0: release
|
|
|
|
* No changes from 5.2.0. The 5.2.0 release broke binary
|
|
compatibility and was withdrawn.
|
|
|
|
2015-10-31 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 5.2.0: release
|
|
|
|
* libqpdf/QPDF.cc (read_xrefTable): Be tolerant of some malformed
|
|
xref tables that don't have the required trailing space after each
|
|
line.
|
|
|
|
2015-10-29 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Implement QPDFWriter::setDeterministicID and --deterministic-id
|
|
commandline-flag to qpdf to request generation of a deterministic
|
|
/ID for non-encrypted files.
|
|
|
|
2015-05-24 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 5.1.3: release
|
|
|
|
* Bug fix: fix-qdf was not handling object streams with more than
|
|
255 objects in them.
|
|
|
|
* Handle Microsoft crypt provider initialization properly for case
|
|
where no keys have been previously created, such as in a fresh
|
|
Windows installation.
|
|
|
|
* Include time.h in QUtil.hh for time_t
|
|
|
|
2015-02-21 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Detect loops in Pages structure. Thanks to Gynvael Coldwind and
|
|
Mateusz Jurczyk of the Google Security Team for providing a sample
|
|
file with this problem.
|
|
|
|
* Prevent buffer overrun when converting a password to an
|
|
encryption key. Thanks to Gynvael Coldwind and Mateusz Jurczyk of
|
|
the Google Security Team for providing a sample file with this
|
|
problem.
|
|
|
|
* Ensure that arguments to "R" when parsing the file are direct
|
|
objects before trying to resolve them. This prevents specially
|
|
crafted files from causing qpdf to crash with a stack overflow.
|
|
Thanks to Gynvael Coldwind and Mateusz Jurczyk of the Google
|
|
Security Team for providing a sample file with this problem.
|
|
|
|
2014-12-01 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Some broken PDF files lack the required /Type key for /Page and
|
|
/Pages nodes in the page dictionary. QPDF now uses other methods
|
|
to figure out what kind of node it is looking at so that it can
|
|
handle those files. Original reported at
|
|
https://bugs.launchpad.net/ubuntu/+source/qpdf/+bug/1397413
|
|
|
|
2014-11-14 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Bug fix: QPDFObjectHandle::getPageContents() no longer throws an
|
|
exception when called on a page that has no /Contents key in its
|
|
dictionary. This is allowed by the spec, and some software
|
|
packages generate files like this for pages that are blank in the
|
|
original.
|
|
|
|
2014-06-07 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 5.1.2: release
|
|
|
|
* MS Visual C++ build: explicitly target Windows 5.0.1 (XP)
|
|
|
|
* New example program: pdf-split-pages: efficiently split PDF
|
|
files into individual pages.
|
|
|
|
* Bug fix: don't fail on files that contain streams where /Filter
|
|
or /DecodeParms references a stream. Before, qpdf would try to
|
|
convert these to direct objects, which would fail because of the
|
|
stream.
|
|
|
|
2014-02-22 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Bug fix: if the last object in the first part of a linearized
|
|
file had an offset that was below 65536 by less than the size of
|
|
the hint stream, the xref stream was invalid and the resulting file
|
|
is not usable. This is now fixed.
|
|
|
|
2014-01-14 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 5.1.1: release
|
|
|
|
2013-12-26 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Bug fix: when copying foreign objects (which occurs during page
|
|
splitting among other cases), avoid traversing the same object
|
|
more than once if it appears more than once in the same direct
|
|
object. This bug is performance-only and does not affect the
|
|
actual output.
|
|
|
|
2013-12-17 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 5.1.0: release
|
|
|
|
2013-12-16 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Document and make explicit that passing null to
|
|
QUtil::setRandomDataProvider() resets the random data provider.
|
|
|
|
* Provide QUtil::getRandomDataProvider().
|
|
|
|
2013-12-14 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Allow any space rather than just newline to follow xref header.
|
|
This allows qpdf to read a wider range of damaged files.
|
|
|
|
2013-11-30 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Allow user-supplied random data provider to be used in place of
|
|
OS-provided or insecure random number generation. See
|
|
documentation for 5.1.0 for details.
|
|
|
|
* Add configure option --enable-os-secure-random (enabled by
|
|
default). Pass --disable-os-secure-random or define
|
|
SKIP_OS_SECURE_RANDOM to avoid attempts to use the operating
|
|
system-provided secure random number generation. This can be
|
|
especially useful on Windows if you wish to avoid any dependency
|
|
on Microsoft's cryptography system.
|
|
|
|
2013-11-29 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* If NO_GET_ENVIRONMENT is #defined, for Windows only,
|
|
QUtil::get_env will always return false. This was added to
|
|
support a user who needs to avoid calling GetEnvironmentVariable
|
|
from the Windows API. QUtil::get_env is not used for any
|
|
functionality in qpdf and exists only to support the test suite
|
|
including test coverage support with QTC (part of qtest).
|
|
|
|
* Add /FS to msvc builds to allow parallel builds to work with
|
|
Visual C++ 2013.
|
|
|
|
* Add missing #include <algorithm> in some files that use std::min
|
|
and std::max.
|
|
|
|
2013-11-21 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Change image comparison tests, which are disabled by default, to
|
|
use tiff files with 8 bits per sample rather than 4. This works
|
|
around a bug in tiffcmp but also increases time and disk space for
|
|
image comparison tests.
|
|
|
|
2013-10-28 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Fix MacOS compilation errors by adding a missing #include
|
|
<string> in a header file.
|
|
|
|
2013-10-18 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 5.0.1: release
|
|
|
|
* Warn when -accessibility=n is specified with a modern encryption
|
|
format (R > 3). Also, accept this flag (and ignore with warning)
|
|
with 256-bit encryption. qpdf has always ignored the
|
|
accessibility setting with R > 3, but it previously did so
|
|
silently.
|
|
|
|
2013-10-05 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Replace operator[] in std::string and std::vector with "at" in
|
|
order to get bounds checking. This reduces the chances that
|
|
incorrect code will result in data exposure or buffer overruns.
|
|
See README.hardening for additional notes.
|
|
|
|
* Use cryptographically secure random number generation when
|
|
available. See additional notes in README.
|
|
|
|
* Replace some assert() calls with std::logic_error exceptions.
|
|
Ideally there shouldn't be assert() calls outside of testing.
|
|
This change may make a few more potential code errors in handling
|
|
invalid data recoverable.
|
|
|
|
* Security fix: In places where std::vector<T>(size_t) was used,
|
|
either validate that the size parameter is sane or refactor code
|
|
to avoid the need to pre-allocate the vector. This reduces the
|
|
likelihood of allocating a lot of memory in response to invalid
|
|
data in linearization hint streams.
|
|
|
|
* Security fix: sanitize /W array in cross reference stream to
|
|
avoid a potential integer overflow in a multiplication. It is
|
|
unlikely that any exploits were possible from this bug as
|
|
additional checks were also performed.
|
|
|
|
* Security fix: avoid buffer overrun that could be caused by bogus
|
|
data in linearization hint streams. The incorrect code could only
|
|
be triggered when checking linearization data, which must be
|
|
invoked explicitly. qpdf does not check linearization data when
|
|
reading or writing linearized files, but the qpdf --check command
|
|
does check linearization data.
|
|
|
|
* Security fix: properly handle empty strings in
|
|
QPDF_Name::normalizeName. The empty string is not a valid name
|
|
and would never be parsed as a name, so there were no known
|
|
conditions where this method could be called with an empty string.
|
|
|
|
* Security fix: perform additional argument sanity checks when
|
|
reading bit streams.
|
|
|
|
* Security fix: in QUtil::toUTF8, change bounds checking to avoid
|
|
having a pointer point temporarily outside the bounds of an
|
|
array. Some compiler optimizations could have made the original
|
|
code unsafe.
|
|
|
|
2013-07-10 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 5.0.0: release
|
|
|
|
* 4.2.0 turned out to be binary incompatible on some platforms
|
|
even though there were no changes to the public API. Therefore
|
|
the 4.2.0 release has been withdrawn, and is being replaced with a
|
|
5.0.0 release that acknowledges the ABI change and also removes
|
|
some problematic methods from the public API.
|
|
|
|
* Remove methods from public API that were only intended to be
|
|
used by QPDFWriter and really didn't make sense to call from
|
|
anywhere else as they required internal knowledge that only
|
|
QPDFWriter had:
|
|
- QPDF::getLinearizedParts
|
|
- QPDF::generateHintStream
|
|
- QPDF::getObjectStreamData
|
|
- QPDF::getCompressibleObjGens
|
|
- QPDF::getCompressibleObjects
|
|
|
|
2013-07-07 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 4.2.0: release [withdrawn]
|
|
|
|
* Ignore error case of a stream's decode parameters having invalid
|
|
length when there are no stream filters.
|
|
|
|
* qpdf: add --show-npages command-line option, which causes the
|
|
number of pages in the input file to be printed on a line by
|
|
itself.
|
|
|
|
* qpdf: allow omission of range in --pages. If range is omitted
|
|
such that an argument that is supposed to be a range is an invalid
|
|
range and a valid file name, the range of 1-z is assumed. This
|
|
makes it possible to merge a bunch of files with something like
|
|
qpdf --empty out.pdf --pages *.pdf --
|
|
|
|
2013-06-15 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Handle some additional broken files with missing /ID in trailer
|
|
for encrypted files and with space rather than newline after xref.
|
|
|
|
2013-06-14 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Detect and correct /Outlines dictionary being a direct object
|
|
when linearizing files. This is not allowed by the spec but has
|
|
been seen in the wild. Prior to this change, such a file would
|
|
cause an internal error in the linearization code, which assumed
|
|
/Outlines was indirect.
|
|
|
|
* Add /Length key to crypt filter dictionary for encrypted files.
|
|
This key is optional, but some version of MacOS reportedly fail to
|
|
open encrypted PDF files without this key.
|
|
|
|
* Bug fix: properly handle object stream generation when the
|
|
original file has some compressible objects with generation != 0.
|
|
|
|
* Add QPDF::getCompressibleObjGens() and deprecate
|
|
QPDF::getCompressibleObjects(), which had a flaw in its logic.
|
|
|
|
* Add new QPDFObjectHandle::getObjGen() method and indicate in
|
|
comments that its use is favored over getObjectID() and
|
|
getGeneration() for most cases.
|
|
|
|
* Add new QPDFObjGen object to represent an object ID/generation
|
|
pair.
|
|
|
|
2013-04-14 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 4.1.0: release
|
|
|
|
2013-03-25 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* manual/qpdf-manual.xml: Document the casting policy that is
|
|
followed in qpdf's implementation.
|
|
|
|
2013-03-11 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* When creating Windows binary distributions, make sure to only
|
|
copy DLLs of the correct type. The ensures that the 32-bit
|
|
distributions contain 32-bit DLLs and the 64-bit distributions
|
|
contain 64-bit DLLs.
|
|
|
|
2013-03-07 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Use ./install-sh (already present) instead of "install -c" to
|
|
install executables to fix portability problems against different
|
|
UNIX variants.
|
|
|
|
2013-03-03 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add protected terminateParsing method to
|
|
QPDFObjectHandle::ParserCallbacks that implementor can call to
|
|
terminate parsing of a content stream.
|
|
|
|
2013-02-28 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Favor fopen_s and strerror_s on MSVC to avoid CRT security
|
|
warnings. This is useful for people who may want to use qpdf in
|
|
an application that is Windows 8 certified.
|
|
|
|
* New method QUtil::safe_fopen to wrap calls to fopen. This is
|
|
less cumbersome than calling QUtil::fopen_wrapper.
|
|
|
|
* Remove all calls to sprintf
|
|
|
|
* New method QUtil::int_to_string_base to convert to octal or
|
|
hexadecimal (or decimal) strings without using sprintf
|
|
|
|
2013-02-26 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Rewrite QUtil::int_to_string and QUtil::double_to_string to
|
|
remove internal length limits but to remain backward compatible
|
|
with the old versions for valid inputs.
|
|
|
|
2013-02-23 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Bug fix: properly handle overridden compressed objects. When
|
|
caching objects from an object stream, only cache objects that,
|
|
based on the xref table, would actually be resolved into this
|
|
stream. Prior to this fix, if an object stream A contained an
|
|
object B that was overridden by an appended section of the file,
|
|
qpdf would cache the old value of B if any non-overridden member
|
|
of A was accessed before B. This commit fixes that bug.
|
|
|
|
2013-01-31 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Do not remove libtool's .la file during the make install step.
|
|
Note to packagers: if your distribution wants to you remove the
|
|
.la file, you will have to do that yourself now.
|
|
|
|
2013-01-25 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* New method QUtil::hex_encode to encode binary data as a
|
|
hexadecimal string
|
|
|
|
* qpdf --check was exiting with status 0 in some rare cases even
|
|
when errors were found. It now always exits with one of the
|
|
document error codes (0 for success, 2 for errors, 3 or warnings).
|
|
|
|
2013-01-24 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Make --enable-werror work for MSVC, and generally handle warning
|
|
options better for that compiler. Warning flags for that compiler
|
|
were previous hard-coded into the build with /WX enabled
|
|
unconditionally.
|
|
|
|
* Split warning flags into WFLAGS in autoconf.mk to make them
|
|
easier to override. Before they were repeated in CFLAGS and
|
|
CXXFLAGS and were commingled with other compiler flags.
|
|
|
|
* qpdf --check now does syntactic checks all pages' content
|
|
streams as well as checking overall document structure. Semantic
|
|
errors are still not checked, and there are no plans to add
|
|
semantic checks.
|
|
|
|
2013-01-22 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add QPDFObjectHandle::getTypeCode(). This method returns a
|
|
unique integer (enumerated type) value corresponding to the object
|
|
type of the QPDFObjectHandle. It can be used as an alternative to
|
|
the QPDFObjectHandle::is* methods for type testing, particularly
|
|
where there is a desire to use a switch statement or optimize for
|
|
performance when testing object types.
|
|
|
|
* Add QPDFObjectHandle::getTypeName(). This method returns a
|
|
string literal describing the object type. It is useful for
|
|
testing and debugging.
|
|
|
|
2013-01-20 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add QPDFObjectHandle::parseContentStream, which parses the
|
|
objects in a content stream and calls handlers in a callback
|
|
class. The example pdf-parse-content illustrates it use.
|
|
|
|
* Add QPDF_Operator and QPDF_InlineImage types along with
|
|
appropriate wrapper methods in QPDFObjectHandle. These new object
|
|
types are to facilitate content stream parsing.
|
|
|
|
2013-01-17 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 4.0.1: release
|
|
|
|
* Add clarifying comment in QPDF.hh for methods that return the
|
|
user password to state that it is no longer possible with newer
|
|
encryption formats to recover the user password knowing the owner
|
|
password.
|
|
|
|
* Fix detection of binary attachments in the test suite. This
|
|
resolves false test failures on some platforms. No changes to the
|
|
actual QPDF code were made.
|
|
|
|
2012-12-31 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 4.0.0: release
|
|
|
|
* Add new methods qpdf_get_pdf_extension_level,
|
|
qpdf_set_r5_encryption_parameters,
|
|
qpdf_set_r6_encryption_parameters,
|
|
qpdf_set_minimum_pdf_version_and_extension, and
|
|
qpdf_force_pdf_version_and_extension to support new functionality
|
|
from the C API.
|
|
|
|
2012-12-30 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Fix long-standing bug that could theoretically have resulted in
|
|
possible misinterpretation of decode parameters in streams. As
|
|
far as I can tell, it is extremely unlikely that files with the
|
|
characteristics that would have triggered the bug actually exist
|
|
in cases that qpdf versions prior to 4.0.0 could have read.
|
|
Unencrypted files with encrypted attachments would have triggered
|
|
this bug, but qpdf versions prior to 4.0.0 already refused to open
|
|
such files.
|
|
|
|
* Fix long-standing bug in which a stream that used a crypt
|
|
filter and was otherwise not filterable by qpdf would be decrypted
|
|
properly but would retain the crypt filter indication in the
|
|
file. There are no known ways to create files like this, so it is
|
|
unlikely that anyone ever hit this bug.
|
|
|
|
2012-12-29 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add read/write support for both the deprecated Acrobat IX
|
|
encryption format and the Acrobat X/PDF 2.0 encryption format
|
|
using 256-bit AES keys. Using the Acrobat IX format (R=5) forces
|
|
the version of the file to 1.7 with extension level 3. Using the
|
|
PDF 2.0 format (R=6) forces it to 1.7 extension level 8.
|
|
|
|
* Add new method QPDF::getEncryptionKey to return the actual
|
|
encryption key used for encryption of data in the file. The key
|
|
is returned as a std::string.
|
|
|
|
* Non-compatible API change: change signature of
|
|
QPDF::compute_data_key to take the R and V values from the
|
|
encryption dictionary. There is no reason for any application
|
|
code to call this method since handling of encryption is done
|
|
automatically by the qpdf library. It is used internally by
|
|
QPDFWriter.
|
|
|
|
* Support reading and decryption of files whose main text is not
|
|
encrypted but whose attachments are. More generally, support the
|
|
case of files and streams encrypted differently with some
|
|
limitations, described in the documentation. This was not
|
|
previously supported due to lack of test files, but I created test
|
|
files using a trial version of Acrobat XI to fully implement this
|
|
case.
|
|
|
|
* Incorporate sha2 code from sphlib 3.0. See README for
|
|
licensing. Create private pipeline class for computing hashes
|
|
with sha256, sha384, and sha512.
|
|
|
|
* Allow specification of initialization vector when using AES
|
|
filtering. This is required to compute the hash used in /R=6 (PDF
|
|
2.0) encryption.
|
|
|
|
2012-12-28 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add random number generation functions to QUtil.
|
|
|
|
* Fix old bug that could cause an infinite loop if user password
|
|
recovery methods were called and a password contained the "("
|
|
character (which happens to be the first byte of padding used by
|
|
older PDF encryption formats). This bug was noticed while reading
|
|
code and would not happen under ordinary usage patterns even if
|
|
the password contained that character.
|
|
|
|
2012-12-27 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add awareness of extension level to PDF Version methods for both
|
|
reading and writing. This includes adding method
|
|
QPDF::getExtensionLevel and new versions of
|
|
QPDFWriter::setMinimumPDFVersion and QPDFWriter::forcePDFVersion
|
|
that support extension levels. The qpdf command-line tool
|
|
interprets version numbers of the form x.y.z as version x.y at
|
|
extension level z.
|
|
|
|
* Update AES classes to support use of 256-bit keys.
|
|
|
|
* Non-compatible API change: Removed public method
|
|
QPDF::flattenScalarReferences. Instead, just flatten the scalar
|
|
references we actually need to flatten. Flattening scalar
|
|
references was a wrong decision years ago and has occasionally
|
|
caused other problems, among which were that it caused qpdf to
|
|
visit otherwise unreferenced and possibly erroneous objects in the
|
|
file when it didn't have to. There's no reason that any
|
|
non-internal code would have had to call this.
|
|
|
|
* Non-compatible API change: Removed public method
|
|
QPDF::decodeStreams which was previously used by qpdf --check but
|
|
is no longer used. The decodeStreams method could generate false
|
|
positives since it would attempt to access all objects in the file
|
|
including those that were not referenced. There's no reason that
|
|
any non-internal code would have had to call this.
|
|
|
|
* Non-compatible API change: Removed public method
|
|
QPDF::trimTrailerForWrite, which was only intended for use by
|
|
QPDFWriter and which is no longer used.
|
|
|
|
2012-12-26 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add new fields to QPDF::EncryptionData to support newer
|
|
encryption formats (V=5, R=5 and R=6)
|
|
|
|
* Non-compatible API change: Change public nested class
|
|
QPDF::EncryptionData to make all member fields private and to add
|
|
method calls. This is a non-compatible API change, but changing
|
|
EncryptionData is necessary to support newer encryption formats,
|
|
and making this change will prevent the need from making a
|
|
non-compatible change in the future if new fields are added. A
|
|
public nested class should never have had public members to begin
|
|
with.
|
|
|
|
2012-12-25 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Allow PDF header to appear anywhere in the first 1024 bytes of
|
|
the file as recommended in the implementation notes of the Adobe
|
|
version of the PDF spec.
|
|
|
|
2012-11-20 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add zlib and libpcre to Requires.private in the pkg-config file
|
|
to support static linking. Thanks Tobias Hoffmann for pointing
|
|
out the omission.
|
|
|
|
* Ignore (with warning) non-freed objects in the xref table whose
|
|
offset is 0. Some PDF producers (incorrectly) do this. See
|
|
https://bugs.linuxfoundation.org/show_bug.cgi?id=1081.
|
|
|
|
2012-09-23 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add public methods QPDF::processInputSource and
|
|
QPDFWriter::setOutputPipeline to allow users to read from custom
|
|
input sources and to write to custom pipelines. This allows the
|
|
maximum flexibility in sources for reading and writing PDF files.
|
|
|
|
2012-09-06 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 3.0.2: release
|
|
|
|
* Add new method QPDFWriter::setExtraHeaderText to add extra text,
|
|
such as application-specific comments, to near the beginning of a
|
|
PDF file. For linearized files, this appears after the
|
|
linearization parameter dictionary. For non-linearized files, it
|
|
appears right after the PDF header and non-ASCII comment.
|
|
|
|
* Make it possible to write the same QPDF object with two
|
|
different QPDFWriter objects that have both called
|
|
setLinearization(true) by making private method
|
|
QPDF::calculateLinearizationData() properly initialize its state.
|
|
|
|
* Bug fix: Writing after calling QPDFWriter::setOutputMemory()
|
|
would cause a segmentation fault because of an internal field not
|
|
being initialized, rendering that method useless. This has been
|
|
corrected.
|
|
|
|
2012-08-11 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 3.0.1: release
|
|
|
|
* Bug fix: let EOF terminate a literal token as well as
|
|
whitespace or comments.
|
|
|
|
2012-07-31 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 3.0.0: release
|
|
|
|
2012-07-29 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 3.0.rc1: release
|
|
|
|
2012-07-25 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* From Tobias: add QPDFObjectHandle::replaceStreamData that takes
|
|
a std::string analogous to the QPDFObjectHandle::newStream that
|
|
takes a string that was added earlier.
|
|
|
|
2012-07-21 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Change configure to have image comparison tests disabled by
|
|
default. Update README and README.maintainer with information
|
|
about running them.
|
|
|
|
* Add --pages command-line option to qpdf to enable page-based
|
|
merging and splitting.
|
|
|
|
* Add new method QPDFObjectHandle::replaceDict to replace a
|
|
stream's dictionary. Use with caution; see comments in
|
|
QPDFObjectHandle.hh.
|
|
|
|
* Add new method QPDFObjectHandle::parse for creation of
|
|
QPDFObjectHandle objects from string representations of the
|
|
objects. Thanks to Tobias Hoffmann for the idea.
|
|
|
|
2012-07-15 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* add new QPDF::isEncrypted method that returns some additional
|
|
information beyond other versions.
|
|
|
|
* libqpdf/QPDFWriter.cc: fix copyEncryptionParameters to fix the
|
|
minimum PDF version based on other file's encryption needs. This
|
|
is a fix to code added on 2012-07-14 and did not impact previously
|
|
released code.
|
|
|
|
* libqpdf/QPDFWriter.cc (copyEncryptionParameters): Bug fix: qpdf
|
|
was not preserving whether or not AES encryption was being used
|
|
when copying encryption parameters. The file would still have
|
|
been properly encrypted, but a file that started off encrypted
|
|
with AES could have become encrypted with RC4.
|
|
|
|
2012-07-14 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* QPDFWriter: add public copyEncryptionParameters to allow copying
|
|
encryption parameters from another file.
|
|
|
|
* QPDFWriter: detect if the user has inserted an indirect object
|
|
from another QPDF object and throw an exception directing the user
|
|
to copyForeignObject.
|
|
|
|
2012-07-11 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Added new APIs to copy objects from one QPDF to another. This
|
|
includes letting QPDF::addPage() (and QPDF::addPageAt()) accept a
|
|
page object from another QPDF and adding
|
|
QPDF::copyForeignObject(). See QPDF.hh for details.
|
|
|
|
* Add method QPDFObjectHandle::getOwningQPDF() to return the QPDF
|
|
object associated with an indirect QPDFObjectHandle.
|
|
|
|
* Add convenience methods to QPDFObjectHandle: assertIndirect(),
|
|
isPageObject(), isPagesObject()
|
|
|
|
* Cache when QPDF::pushInheritedAttributesToPage() has been called
|
|
to avoid traversing the pages trees multiple times. This state is
|
|
cleared by QPDF::updateAllPagesCache() and ignored by
|
|
QPDF::flattenPagesTree().
|
|
|
|
2012-07-08 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add QPDFObjectHandle::newReserved to create a reserved object
|
|
and QPDF::replaceReserved to replace it with a real object.
|
|
QPDFObjectHandle::newReserved reserves an object ID in a QPDF
|
|
object and ensures that any references to it remain unresolved.
|
|
When QPDF::replaceReserved is later called, previous references to
|
|
the reserved object will properly resolve to the replaced object.
|
|
|
|
2012-07-07 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* NOTE: BREAKING API CHANGE. Remove previously required length
|
|
parameter from the version QPDFObjectHandle::replaceStreamData
|
|
that uses a stream data provider. Prior to qpdf 3.0.0, you had to
|
|
compute the stream length in advance so that qpdf could internally
|
|
verify that the stream data had the same length every time the
|
|
provider was invoked. Now this requirement is enforced a
|
|
different way, and the length parameter is no longer required.
|
|
Note that I take API-breaking changes very seriously and only did
|
|
it in this case since the lack of need to know length in advance
|
|
could significantly simplify people's code. If you were
|
|
previously going to a lot of trouble to compute the length of the
|
|
new stream data in advance, you now no longer have to do that.
|
|
You can just drop the length parameter and remove any code that
|
|
was previously computing the length. Thanks to Tobias Hoffmann
|
|
for pointing out how annoying the original interface was.
|
|
|
|
2012-07-05 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add QPDFWriter methods to write to an already open stdio FILE*.
|
|
Implementation and idea area based on contributions from Tobias
|
|
Hoffmann.
|
|
|
|
2012-07-04 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Accept changes from Tobias Hoffmann: add public method
|
|
QPDF::pushInheritedAttributesToPage including warnings for
|
|
non-inherited keys that may be discarded from /Pages by
|
|
non-conformant PDF files when the /Pages tree is flattened.
|
|
|
|
2012-06-27 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add Pl_Concatenate pipeline for stream concatenation also
|
|
implemented by Tobias Hoffmann. Also added test code
|
|
(libtests/concatenate.cc).
|
|
|
|
* Add new methods implemented by Tobias Hoffmann:
|
|
QPDFObjectHandle::newReal(double) and
|
|
QPDFObjectHandle::newStream(QPDF*, std::string const&).
|
|
|
|
2012-06-26 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Minor changes so that support for PDF files larger than 4GB
|
|
works well with 32-bit and 64-bit Linux and also with 32-bit and
|
|
64-bit Windows with both MSVC and mingw.
|
|
|
|
* Rework internal methods for doing recovery of the cross
|
|
reference tables for much greater efficiency both in terms of time
|
|
and memory usage.
|
|
|
|
2012-06-24 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Support PDF files larger than 4 GB. This involved many changes
|
|
to the ABI to increase the size of integer types used in various
|
|
places as well as increasing the amount of padding used when
|
|
creating linearized files. Automated tests for large files are
|
|
disabled by default. Run ./configure --help for information on
|
|
enabling them. Running the tests requires 11 GB of free disk
|
|
space and takes several minutes.
|
|
|
|
2012-06-22 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* examples/pdf-create.cc: Provide an example of creating a PDF
|
|
from scratch. This simple PDF has a single page with some text
|
|
and an image.
|
|
|
|
* Add empty QPDFObjectHandle factories for array and dictionary.
|
|
With PDF-from-scratch capability, it is useful to be able to
|
|
create empty arrays and dictionaries and add keys to them.
|
|
Updated pdf_from_scratch.cc to use these interfaces.
|
|
|
|
2012-06-21 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add QPDF::emptyPDF() to create an empty QPDF object suitable for
|
|
adding pages and other objects to. pdf_from_scratch.cc is test
|
|
code that exercises it.
|
|
|
|
* make/libtool.mk: Place user-specified CPPFLAGS and LDFLAGS later
|
|
in the compilation so that if a user installs things in a
|
|
non-standard place that they have to tell the build about, earlier
|
|
versions of qpdf installed there won't break the build. Thanks to
|
|
Macports for reporting this. (Fixes bug 3468860.)
|
|
|
|
* Instead of using off_t in the public APIs, use qpdf_offset_t
|
|
instead. This is defined as long long in qpdf/Types.h. If your
|
|
system doesn't support long long, you can redefine it.
|
|
|
|
* Add pkg-config files
|
|
|
|
* QPDFObjectHandle: add shallowCopy() method
|
|
|
|
* QPDF: add new APIs for adding and removing pages. This includes
|
|
addPage(), addPageAt(), and removePage(). Also a method
|
|
updateAllPagesCache() is now available to force update of the
|
|
internal pages cache if you should modify the pages structure
|
|
manually.
|
|
|
|
* QPDF: new processFile method that takes an open FILE*
|
|
instead of a filename.
|
|
|
|
2012-06-20 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add new array mutation routines to QPDFObjectHandle.
|
|
Implemented by Tobias Hoffmann.
|
|
|
|
* Rework APIs that use size_t, off_t, and primitive integer types
|
|
so that size_t is used for sizes of memory and off_t is used for
|
|
file offsets. Also set _FILE_OFFSET_BITS so that large files can
|
|
be supported on 32-bit UNIX/Linux platforms. The code assumes in
|
|
places that sizeof(off_t) >= sizeof(size_t). This resulted in
|
|
non-compatible ABI changes and hopefully clears the way for QPDF
|
|
to work with files that are larger than 4 GiB in size.
|
|
|
|
* Add support for versioned symbols on ELF platforms.
|
|
|
|
* Various fixes for gcc 4.7
|
|
|
|
2011-04-06 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Fix PCRE to stop using deprecated (and now dropped) interfaces.
|
|
|
|
2011-12-28 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.3.1: release
|
|
|
|
* include <stdint.h> if available to support MSVC 2010
|
|
|
|
* Since PCRE is not necessarily thread safe, don't declare any
|
|
PCRE objects to be static.
|
|
|
|
* Disregard stderr output from ghostscript when using it to
|
|
compare images in the test suite; see comments in qpdf.test for
|
|
details.
|
|
|
|
* Fixed a few documentation errors.
|
|
|
|
2011-08-11 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.3.0: release
|
|
|
|
* include/qpdf/qpdf-c.h ("C"): add new methods
|
|
qpdf_init_write_memory, qpdf_get_buffer_length, and
|
|
qpdf_get_buffer to support writing to memory from the C API.
|
|
|
|
* include/qpdf/qpdf-c.h ("C"): add new methods qpdf_get_info_key
|
|
and qpdf_set_info_key for manipulating text fields of the /Info
|
|
dictionary.
|
|
|
|
2011-08-10 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* libqpdf/QPDFWriter.cc (copyEncryptionParameters): preserve
|
|
whether metadata is encryption. This fixes part of bug 3173659:
|
|
the password becomes invalid if qpdf copies an encrypted file with
|
|
cleartext-metadata.
|
|
|
|
* include/qpdf/QPDFWriter.hh: add a new constructor that takes
|
|
only a QPDF reference and leaves specification of output for
|
|
later. Add methods setOutputFilename() to set the output to a
|
|
filename or stdout, and setOutputMemory() to indicate that output
|
|
should go to a memory buffer. Add method getBuffer() to retrieve
|
|
the buffer used if output was saved to a memory buffer.
|
|
|
|
* include/qpdf/QPDF.hh: add methods replaceObject() and
|
|
swapObjects() to allow replacement of an object and swapping of
|
|
two objects by object ID.
|
|
|
|
* include/qpdf/QPDFObjectHandle.hh: add new methods getDictAsMap()
|
|
and getArrayAsVector() for returning the elements of a dictionary
|
|
or an array as a map or vector.
|
|
|
|
2011-06-25 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.2.4: release
|
|
|
|
2011-06-23 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* make/libtool.mk (install): Do not strip executables and shared
|
|
libraries during installation. Leave that up to the packager.
|
|
|
|
* configure.ac: disable -Werror by default.
|
|
|
|
2011-05-07 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* libqpdf/QPDF_linearization.cc (isLinearized): remove unused
|
|
offset variable, found by a gcc 4.6 warning.
|
|
|
|
2011-04-30 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.2.3: release
|
|
|
|
* libqpdf/QPDF.cc (readObjectInternal): Accept the case of the
|
|
stream keyword being followed by carriage return by itself. While
|
|
this is not permitted by the specification, there are PDF files
|
|
that do this, and other readers can read them.
|
|
|
|
* libqpdf/Pl_QPDFTokenizer.cc (processChar): When an inline image
|
|
is detected, suspend normalization only up to the end of the
|
|
inline image rather than for the remainder of the content stream.
|
|
(Fixes qpdf-Bugs 3152169.)
|
|
|
|
2011-01-31 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* libqpdf/QPDF.cc (readObjectAtOffset): use -1 rather than 0 when
|
|
reading an object at a given to indicate that no object number is
|
|
expected. This allows xref recovery to proceed even if a file
|
|
uses the invalid object number 0 as a regular object.
|
|
|
|
* libqpdf/QPDF_linearization.cc (isLinearized): use -1 rather than
|
|
0 as a sentinel for not having found the first object in the
|
|
file. Since -1 can never match the regular expression, this
|
|
prevents an infinite loop when checking a file that starts with
|
|
(erroneous) 0 0 obj. (Fixes qpdf-Bugs-3159950.)
|
|
|
|
2010-10-04 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.2.2: release
|
|
|
|
* include/qpdf/qpdf-c.h: Add qpdf_read_memory to C API to call
|
|
QPDF::processMemoryFile.
|
|
|
|
2010-10-01 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.2.1: release
|
|
|
|
* include/qpdf/QPDF.hh: Add setOutputStreams method to allow
|
|
redirection of library-generated output/error to alternative
|
|
streams.
|
|
|
|
* include/qpdf/QPDF.hh: Add processMemoryFile method for
|
|
processing a PDF file from a memory buffer instead of a file.
|
|
|
|
2010-09-24 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* libqpdf/QPDF.cc: change private "file" method to be a
|
|
PointerHolder<InputSource> to prepare qpdf for being able to work
|
|
with PDF files loaded into memory in addition to working with
|
|
files on disk.
|
|
|
|
* include/qpdf/PointerHolder.hh: add operator* and operator->
|
|
methods so that PointerHolder objects can be used like pointers.
|
|
This is consistent with the smart pointer objects in the next
|
|
revision of C++.
|
|
|
|
2010-09-05 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* libqpdf/QPDF.cc (readObjectInternal): Recognize empty objects
|
|
and treat them as null.
|
|
|
|
* libqpdf/QPDF_Stream.cc (filterable): Handle inline image filter
|
|
abbreviations as stream filter abbreviations. Although this is
|
|
not technically allowed by the PDF specification, table H.1 in the
|
|
pre-ISO spec indicates that Adobe's readers accept them. Thanks
|
|
to Jian Ma <stronghorse@tom.com> for bringing this to my
|
|
attention.
|
|
|
|
2010-08-14 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.2.0: release
|
|
|
|
* Rename README.windows to README-windows.txt and convert its line
|
|
endings to Windows-style line endings. Also mention Jian Ma's VC6
|
|
port in the manual and README-windows.txt.
|
|
|
|
2010-08-09 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add QPDFObjectHandle::getRawStreamData to return raw
|
|
(unfiltered) stream data.
|
|
|
|
2010-08-08 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.2.rc1: release
|
|
|
|
2010-08-05 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add QPDFObjectHandle::addPageContents, a convenience routine for
|
|
appending or prepending new streams to a page's content streams.
|
|
The "pdf-double-page-size" example illustrates its use.
|
|
|
|
* Add new methods to QPDFObjectHandle: replaceStreamData and
|
|
newStream. These methods allow users of the qpdf library to add
|
|
new streams and to replace data of existing streams. The
|
|
"pdf-double-page-size" and "pdf-invert-images" examples illustrate
|
|
their use.
|
|
|
|
2010-06-06 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Fix memory leak for QPDF objects whose underlying PDF objects
|
|
contain circular references. Thanks to Jian Ma
|
|
<stronghorse@tom.com> for calling my attention to the memory leak.
|
|
|
|
2010-04-25 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.1.5: release
|
|
|
|
* libqpdf/QPDF_encryption.cc (compute_encryption_key): remove
|
|
restrictions on length of file identifier string. (Fixes
|
|
qpdf-Bugs-2991412.)
|
|
|
|
2010-04-18 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.1.4: release
|
|
|
|
* libqpdf/QPDFWriter.cc (writeLinearized): the padding calculation
|
|
fix in 2.1.2 was applied in only one place but it was needed in
|
|
two places since there are actually two cross reference streams in
|
|
a linearized file. The new padding calculation is now used for
|
|
both streams. Hopefully this should put an end to linearization
|
|
padding problems. (Fixes qpdf-Bugs-2979219.)
|
|
|
|
2010-04-10 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* qpdf/qpdf.cc (main): Since qpdf --check only checks syntax and
|
|
stream encoding without doing any semantic checks, make the output
|
|
clearer when no errors around found. This is inspired by
|
|
qpdf-Bugs-2983225.
|
|
|
|
2010-03-27 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.1.3: release
|
|
|
|
* libqpdf/QPDF_optimization.cc (flattenScalarReferences): Flatten
|
|
scalar references for unreferenced objects as well as those seen
|
|
during traversal of the file. This matters when preserving object
|
|
streams that contain unreferenced objects with indirect scalars.
|
|
(Fixes qpdf-Bugs-2974522.) Updated TODO with a description of a
|
|
possibly better fix involving removal of flattenScalarReferences.
|
|
|
|
* libqpdf/Pl_AES_PDF.cc (finish): Don't complain if an AES input
|
|
buffer is not a multiple of 16 bytes. Instead, just pad with
|
|
nulls and hope for the best. PDF files have been encountered "in
|
|
the wild" that contain AES buffers that aren't a multiple of 16
|
|
bytes.
|
|
|
|
2010-01-24 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.1.2: release
|
|
|
|
* libqpdf/QPDFWriter.cc: fix logic error in padding calculation.
|
|
When writing linearized files with cross reference streams, the
|
|
padding calculation failed to take differences in sizes of
|
|
compressed data between pass 1 and pass 2 into consideration.
|
|
|
|
2009-12-14 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.1.1: release
|
|
|
|
* qpdf/qtest/qpdf.test: improve test for acroread to make sure it
|
|
actually works and is not just present in the path.
|
|
|
|
2009-12-13 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* libqpdf/qpdf/Pl_AES_PDF.hh: include <stdint.h>, if available, so
|
|
we have valid definitions of uint32_t.
|
|
|
|
2009-10-30 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.1: release
|
|
|
|
* libqpdf/QPDF.cc: be more forgiving of extraneous whitespace in
|
|
the xref table and while recovering from error conditions.
|
|
|
|
2009-10-26 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Work around failure of PCRE test case; this test case exercises
|
|
an aspect of PCRE that qpdf does not use, and the test fails with
|
|
the version of PCRE on Red Hat Enterprise Linux 5, so we ignore
|
|
failure on this particular test case.
|
|
|
|
* Fix RPM .spec file to include "C" examples
|
|
|
|
2009-10-24 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.1.rc1: release
|
|
|
|
* Provide interfaces for getting qpdf's own version number
|
|
|
|
2009-10-19 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* include/qpdf/QPDF.hh (QPDF): getWarnings now returns a list of
|
|
QPDFExc rather than a list of strings. This way, warnings may be
|
|
inspected in more detail.
|
|
|
|
* Include information about the last object read in most error
|
|
messages. Most of the time, this will provide a good hint as to
|
|
which object contains the error, but it's possible that the last
|
|
object read may not necessarily be the one that has the error if
|
|
the erroneous object was previously read and cached.
|
|
|
|
2009-10-18 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* If forcing version, disable object stream creation and/or
|
|
encryption if previous specifications are incompatible with new
|
|
version. It is still possible that PDF content, compression
|
|
schemes, etc., may be incompatible with the new version, but at
|
|
least this way, older viewers will at least have a chance.
|
|
|
|
* libqpdf/QPDFWriter.cc (unparseObject): avoid compressing
|
|
Metadata streams if possible.
|
|
|
|
2009-10-13 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Upgrade embedded qtest to version 1.4, which allows the test
|
|
suite to be run in Windows with MSYS and ActiveState Perl rather
|
|
than requiring Cygwin perl.
|
|
|
|
2009-10-04 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Implement support AES encrypt and crypt filters. Implementation
|
|
is not fully tested due to lack of test data but has been tested
|
|
for several cases.
|
|
|
|
2009-10-04 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add methods to QPDFWriter and corresponding command line
|
|
arguments to qpdf to set the minimum output PDF version and also
|
|
to force the version to a particular value.
|
|
|
|
* libqpdf/QPDF.cc (processXRefStream): warn and ignore extra xref
|
|
stream entries when stream is larger than reported size. This
|
|
used to be a fatal error. (Fixes qpdf-Bugs-2872265.)
|
|
|
|
2009-09-27 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Add several methods to query permissions controlled by the
|
|
encryption dictionary. Note that qpdf does not enforce these
|
|
permissions even though it allows the user to query them.
|
|
|
|
* The function QPDF::getUserPassword returned the user password
|
|
with the required padding as specified by the PDF specification.
|
|
This is seldom useful to users. This function has been replaced
|
|
by QPDF::getPaddedUserPassword. Call the new
|
|
QPDF::getTrimmedUserPassword to retrieve the user password in a
|
|
human-readable format.
|
|
|
|
* qpdf/qpdf.cc (main): qpdf --check now prints the PDF version
|
|
number in addition to its other output.
|
|
|
|
2009-09-26 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Removed all references to QEXC; now using std::runtime_error and
|
|
std::logic_error and their subclasses for all exceptions.
|
|
|
|
2009-05-03 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.0.6: release
|
|
|
|
* libqpdf/QPDF_Stream.cc (filterable): ignore /DecodeParms if it's
|
|
not a type we recognize. (Fixes qpdf-Bugs-2779746.)
|
|
|
|
2009-03-10 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.0.5: release
|
|
|
|
2009-03-09 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* libqpdf/Pl_LZWDecoder.cc: adjust LZWDecoder full table
|
|
detection, now having been able to adequately test boundary
|
|
conditions both and with and without early code change. Also
|
|
compared implementation with other LZW decoders.
|
|
|
|
2009-03-08 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* qpdf/fix-qdf (write_ostream): Adjust offsets while writing
|
|
object streams to account for changes in the length of the
|
|
dictionary and offset tables.
|
|
|
|
* qpdf/qpdf.cc (main): In check mode, in addition to checking
|
|
structure of file, attempt to decode all stream data.
|
|
|
|
* libqpdf/QPDFWriter.cc (QPDFWriter::writeObject): In QDF mode,
|
|
write a comment to the QDF file before each object that indicates
|
|
the object ID of the corresponding object from the original file.
|
|
Add --no-original-object-ids flag to qpdf and
|
|
setSuppressOriginalObjectIDs() method to QPDFWriter to turn this
|
|
behavior off.
|
|
|
|
* libqpdf/QPDF.cc (QPDF::pipeStreamData): Issue a warning instead
|
|
of failing if there is a problem found while decoding stream.
|
|
|
|
* qpdf/qpdf.cc: Exit with a status of 3 if warnings were found
|
|
regardless of what mode we're in.
|
|
|
|
2009-02-21 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.0.4: release
|
|
|
|
2009-02-20 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Fix many typos in comments and strings.
|
|
|
|
* qpdf/qpdf.cc: in --check mode, if there are warnings but no
|
|
errors, exit with a status of 3.
|
|
|
|
* libqpdf/QPDF.cc (QPDF::insertXrefEntry): when recovering the
|
|
cross-reference table, have objects we encounter later in the file
|
|
supersede those we found earlier. This improves the chances of
|
|
being able to recover appended files with damaged cross-reference
|
|
tables.
|
|
|
|
2009-02-19 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* libqpdf/Pl_LZWDecoder.cc: correct logic error for previously
|
|
untested case of running the LZW decoder without the "early code
|
|
change" flag. Thanks to a bug report from "Atom Smasher", I
|
|
finally was able to obtain an input stream compressed in this way.
|
|
|
|
2009-02-15 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.0.3: release
|
|
|
|
2008-12-11 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* qpdf/qpdf.cc (main): Accept -help and -version as well as --help
|
|
and --version
|
|
|
|
2008-11-23 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* Include stdio.h in a few files for proper compilation with (yet
|
|
to be released) gcc 4.4
|
|
|
|
* updated embedded qtest to version 1.3
|
|
|
|
* libqpdf/QPDF_String.cc (QPDF_String::getUTF8Val): handle
|
|
UTF-16BE properly rather than just treating the string as a string
|
|
of 16-bit characters.
|
|
|
|
2008-06-30 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.0.2: release
|
|
|
|
* updated embedded qtest to version 1.2 (includes previous
|
|
changes)
|
|
|
|
2008-06-07 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* qpdf/qtest/qpdf/diff-encrypted: change == to = so that the test
|
|
suite passes when /bin/sh is not bash
|
|
|
|
2008-05-07 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* qtest/bin/qtest-driver (run_test): increase timeout for qtest to
|
|
be more tolerant of slow machines
|
|
|
|
2008-05-06 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.0.1: release
|
|
|
|
* make/rules.mk: fix logic with .dep generation for .lo files so
|
|
that dependencies work properly with libtool
|
|
|
|
2008-05-05 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* libqpdf/qpdf/MD5.hh: fix header to be 64-bit clean
|
|
|
|
* configure.ac: add tests for sized integer types
|
|
|
|
2008-05-04 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* libqpdf/QPDF_encryption.cc: do not assume size_t is unsigned int
|
|
|
|
* qpdf/qtest/qpdf.test: removed locale-specific tests. These were
|
|
really to check bugs in perl 5.8.0 and are obsolete now. They
|
|
also make the test suite fail in some environments that don't have
|
|
all the locales fully configured.
|
|
|
|
* various: updated several files for gcc 4.3 by adding missing
|
|
includes (string.h, stdlib.h)
|
|
|
|
2008-04-26 Jay Berkenbilt <ejb@ql.org>
|
|
|
|
* 2.0: initial public release
|