2022-02-01 07:18:23 -05:00
|
|
|
|
|
|
|
|
|
.. _qpdf-job:
|
|
|
|
|
|
|
|
|
|
QPDFJob: a Job-Based Interface
|
|
|
|
|
==============================
|
|
|
|
|
|
|
|
|
|
All of the functionality from the :command:`qpdf` command-line
|
|
|
|
|
executable is available from inside the C++ library using the
|
|
|
|
|
``QPDFJob`` class. There are several ways to access this functionality:
|
|
|
|
|
|
|
|
|
|
- Command-line options
|
|
|
|
|
|
|
|
|
|
- Run the :command:`qpdf` command line
|
|
|
|
|
|
|
|
|
|
- Use from the C++ API with ``QPDFJob::initializeFromArgv``
|
|
|
|
|
|
2022-02-05 15:32:53 -05:00
|
|
|
|
- Use from the C API with ``qpdfjob_run_from_argv`` from
|
|
|
|
|
:file:`qpdfjob-c.h`. If you are calling from a Windows-style main
|
|
|
|
|
and have an argv array of ``wchar_t``, you can use
|
|
|
|
|
``qpdfjob_run_from_wide_argv``.
|
2022-02-01 07:18:23 -05:00
|
|
|
|
|
|
|
|
|
- The job JSON file format
|
|
|
|
|
|
|
|
|
|
- Use from the CLI with the :qpdf:ref:`--job-json-file` parameter
|
|
|
|
|
|
|
|
|
|
- Use from the C++ API with ``QPDFJob::initializeFromJson``
|
|
|
|
|
|
2022-02-01 08:55:18 -05:00
|
|
|
|
- Use from the C API with ``qpdfjob_run_from_json`` from :file:`qpdfjob-c.h`
|
2022-02-01 07:18:23 -05:00
|
|
|
|
|
|
|
|
|
- The ``QPDFJob`` C++ API
|
|
|
|
|
|
|
|
|
|
If you can understand how to use the :command:`qpdf` CLI, you can
|
2022-02-05 12:37:15 -05:00
|
|
|
|
understand the ``QPDFJob`` class and the JSON file. qpdf guarantees
|
2022-02-01 07:18:23 -05:00
|
|
|
|
that all of the above methods are in sync. Here's how it works:
|
|
|
|
|
|
|
|
|
|
.. list-table:: QPDFJob Interfaces
|
|
|
|
|
:widths: 30 30 30
|
|
|
|
|
:header-rows: 1
|
|
|
|
|
|
|
|
|
|
- - CLI
|
|
|
|
|
- JSON
|
|
|
|
|
- C++
|
|
|
|
|
|
|
|
|
|
- - ``--some-option``
|
|
|
|
|
- ``"someOption": ""``
|
|
|
|
|
- ``config()->someOption()``
|
|
|
|
|
|
|
|
|
|
- - ``--some-option=value``
|
|
|
|
|
- ``"someOption": "value"``
|
|
|
|
|
- ``config()->someOption("value")``
|
|
|
|
|
|
|
|
|
|
- - positional argument
|
|
|
|
|
- ``"otherOption": "value"``
|
|
|
|
|
- ``config()->otherOption("value")``
|
|
|
|
|
|
|
|
|
|
In the JSON file, the JSON structure is an object (dictionary) whose
|
|
|
|
|
keys are command-line flags converted to camelCase. Positional
|
|
|
|
|
arguments have some corresponding key, which you can find by running
|
|
|
|
|
``qpdf`` with the :qpdf:ref:`--job-json-help` flag. For example, input
|
|
|
|
|
and output files are named by positional arguments on the CLI. In the
|
2022-02-05 12:37:15 -05:00
|
|
|
|
JSON, they appear in the ``"inputFile"`` and ``"outputFile"`` keys.
|
|
|
|
|
The following are equivalent:
|
2022-02-01 07:18:23 -05:00
|
|
|
|
|
|
|
|
|
.. It would be nice to have an automated test that these are all the
|
|
|
|
|
same, but we have so few live examples that it's not worth it for
|
|
|
|
|
now.
|
|
|
|
|
|
|
|
|
|
CLI:
|
|
|
|
|
::
|
|
|
|
|
|
|
|
|
|
qpdf infile.pdf outfile.pdf \
|
|
|
|
|
--pages . other.pdf --password=x 1-5 -- \
|
|
|
|
|
--encrypt user owner 256 --print=low -- \
|
|
|
|
|
--object-streams=generate
|
|
|
|
|
|
|
|
|
|
Job JSON:
|
|
|
|
|
.. code-block:: json
|
|
|
|
|
|
|
|
|
|
{
|
|
|
|
|
"inputFile": "infile.pdf",
|
|
|
|
|
"outputFile": "outfile.pdf",
|
|
|
|
|
"pages": [
|
|
|
|
|
{
|
|
|
|
|
"file": "."
|
|
|
|
|
},
|
|
|
|
|
{
|
|
|
|
|
"file": "other.pdf",
|
|
|
|
|
"password": "x",
|
|
|
|
|
"range": "1-5"
|
|
|
|
|
}
|
|
|
|
|
],
|
|
|
|
|
"encrypt": {
|
|
|
|
|
"userPassword": "user",
|
|
|
|
|
"ownerPassword": "owner",
|
|
|
|
|
"256bit": {
|
|
|
|
|
"print": "low"
|
|
|
|
|
}
|
|
|
|
|
},
|
|
|
|
|
"objectStreams": "generate"
|
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
C++ code:
|
|
|
|
|
.. code-block:: c++
|
|
|
|
|
|
|
|
|
|
#include <qpdf/QPDFJob.hh>
|
|
|
|
|
#include <qpdf/QPDFUsage.hh>
|
|
|
|
|
#include <iostream>
|
|
|
|
|
|
|
|
|
|
int main(int argc, char* argv[])
|
|
|
|
|
{
|
|
|
|
|
try
|
|
|
|
|
{
|
|
|
|
|
QPDFJob j;
|
|
|
|
|
j.config()
|
|
|
|
|
->inputFile("infile.pdf")
|
|
|
|
|
->outputFile("outfile.pdf")
|
|
|
|
|
->pages()
|
|
|
|
|
->pageSpec(".", "1-z")
|
|
|
|
|
->pageSpec("other.pdf", "1-5", "x")
|
|
|
|
|
->endPages()
|
|
|
|
|
->encrypt(256, "user", "owner")
|
|
|
|
|
->print("low")
|
|
|
|
|
->endEncrypt()
|
|
|
|
|
->objectStreams("generate")
|
|
|
|
|
->checkConfiguration();
|
|
|
|
|
j.run();
|
|
|
|
|
}
|
|
|
|
|
catch (QPDFUsage& e)
|
|
|
|
|
{
|
|
|
|
|
std::cerr << "configuration error: " << e.what() << std::endl;
|
|
|
|
|
return 2;
|
|
|
|
|
}
|
|
|
|
|
catch (std::exception& e)
|
|
|
|
|
{
|
|
|
|
|
std::cerr << "other error: " << e.what() << std::endl;
|
|
|
|
|
return 2;
|
|
|
|
|
}
|
|
|
|
|
return 0;
|
|
|
|
|
}
|
|
|
|
|
|
2022-02-05 15:32:53 -05:00
|
|
|
|
Note the ``QPDFUsage`` exception above. This is thrown whenever a
|
|
|
|
|
configuration error occurs. These exactly correspond to usage messages
|
|
|
|
|
issued by the :command:`qpdf` CLI for things like omitting an output
|
|
|
|
|
file, specifying `--pages` multiple times, or other invalid
|
|
|
|
|
combinations of options. ``QPDFUsage`` is thrown by the argv and JSON
|
|
|
|
|
interfaces as well as the native ``QPDFJob`` interface.
|
|
|
|
|
|
2022-02-05 12:37:15 -05:00
|
|
|
|
It is also possible to mix and match command-line options and JSON
|
2022-02-01 07:18:23 -05:00
|
|
|
|
from the CLI. For example, you could create a file called
|
|
|
|
|
:file:`my-options.json` containing the following:
|
|
|
|
|
|
|
|
|
|
.. code-block:: json
|
|
|
|
|
|
|
|
|
|
{
|
|
|
|
|
"encrypt": {
|
|
|
|
|
"userPassword": "",
|
|
|
|
|
"ownerPassword": "owner",
|
|
|
|
|
"256bit": {
|
|
|
|
|
}
|
|
|
|
|
},
|
|
|
|
|
"objectStreams": "generate"
|
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
and use it with other options to create 256-bit encrypted (but
|
|
|
|
|
unrestricted) files with object streams while specifying other
|
|
|
|
|
parameters on the command line, such as
|
|
|
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
|
|
qpdf infile.pdf outfile.pdf --job-json-file=my-options.json
|
|
|
|
|
|
|
|
|
|
.. _qpdfjob-design:
|
|
|
|
|
|
|
|
|
|
See also :file:`examples/qpdf-job.cc` in the source distribution as
|
|
|
|
|
well as comments in ``QPDFJob.hh``.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
QPDFJob Design
|
|
|
|
|
--------------
|
|
|
|
|
|
|
|
|
|
This section describes some of the design rationale and history behind
|
|
|
|
|
``QPDFJob``.
|
|
|
|
|
|
|
|
|
|
Documentation of ``QPDFJob`` is divided among three places:
|
|
|
|
|
|
|
|
|
|
- "HOW TO ADD A COMMAND-LINE ARGUMENT" in :file:`README-maintainer`
|
2022-02-08 16:19:08 +00:00
|
|
|
|
provides a quick reminder of how to add a command-line argument.
|
2022-02-01 07:18:23 -05:00
|
|
|
|
|
|
|
|
|
- The source file :file:`generate_auto_job` has a detailed explanation
|
2022-02-08 16:19:08 +00:00
|
|
|
|
about how ``QPDFJob`` and ``generate_auto_job`` work together.
|
2022-02-01 07:18:23 -05:00
|
|
|
|
|
|
|
|
|
- This chapter of the manual has other details.
|
|
|
|
|
|
|
|
|
|
Prior to qpdf version 10.6.0, the qpdf CLI executable had a lot of
|
2022-02-05 12:37:15 -05:00
|
|
|
|
functionality built into it that was not callable from the library as
|
|
|
|
|
such. This created a number of problems:
|
2022-02-01 07:18:23 -05:00
|
|
|
|
|
|
|
|
|
- Some of the logic in :file:`qpdf.cc` was pretty complex, such as
|
2022-02-05 12:37:15 -05:00
|
|
|
|
image optimization, generating JSON output, and many of the page
|
2022-02-01 07:18:23 -05:00
|
|
|
|
manipulations. While those things could all be coded using the C++
|
|
|
|
|
API, there would be a lot of duplicated code.
|
|
|
|
|
|
|
|
|
|
- Page splitting and merging will get more complicated over time as
|
|
|
|
|
qpdf supports a wider range of document-level options. It would be
|
|
|
|
|
nice to be able to expose this to library users instead of baking it
|
|
|
|
|
all into the CLI.
|
|
|
|
|
|
|
|
|
|
- Users of other languages who just wanted an interface to do things
|
|
|
|
|
that the CLI could do didn't have a good way to do it, such as just
|
2022-02-08 16:19:08 +00:00
|
|
|
|
handing a library call a set of command-line options or an
|
2022-02-01 07:18:23 -05:00
|
|
|
|
equivalent JSON object that could be passed in as a string.
|
|
|
|
|
|
|
|
|
|
- The qpdf CLI itself was almost 8,000 lines of code. It needed to be
|
|
|
|
|
refactored, cleaned up, and split.
|
|
|
|
|
|
|
|
|
|
- Exposing a new feature via the command-line required making lots of
|
|
|
|
|
small edits to lots of small bits of code, and it was easy to forget
|
|
|
|
|
something. Adding a code generator, while complex in some ways,
|
|
|
|
|
greatly reduces the chances of error when extending qpdf.
|
|
|
|
|
|
|
|
|
|
Here are a few notes on some design decisions about QPDFJob and its
|
|
|
|
|
various interfaces.
|
|
|
|
|
|
|
|
|
|
- Bare command-line options (flags with no parameter) map to config
|
2022-02-05 12:37:15 -05:00
|
|
|
|
functions that take no options and to JSON keys whose values are
|
2022-02-01 07:18:23 -05:00
|
|
|
|
required to be the empty string. The rationale is that we can later
|
|
|
|
|
change these bare options to options that take an optional parameter
|
|
|
|
|
without breaking backward compatibility in the CLI or the JSON.
|
|
|
|
|
Options that take optional parameters generate two config functions:
|
|
|
|
|
one has no arguments, and one that has a ``char const*`` argument.
|
|
|
|
|
This means that adding an optional parameter to a previously bare
|
|
|
|
|
option also doesn't break binary compatibility.
|
|
|
|
|
|
|
|
|
|
- Adding a new argument to :file:`job.yml` automatically triggers
|
|
|
|
|
almost everything by declaring and referencing things that you have
|
|
|
|
|
to implement. This way, once you get the code to compile and link,
|
|
|
|
|
you know you haven't forgotten anything. There are two tricky cases:
|
|
|
|
|
|
|
|
|
|
- If an argument handler has to do something special, like call a
|
|
|
|
|
nested config method or select an option table, you have to
|
|
|
|
|
implement it manually. This is discussed in
|
|
|
|
|
:file:`generate_auto_job`.
|
|
|
|
|
|
|
|
|
|
- When you add an option that has optional parameters or choices,
|
|
|
|
|
both of the handlers described above are declared, but only the
|
|
|
|
|
one that takes an argument is referenced. You have to remember to
|
|
|
|
|
implement the one that doesn't take an argument or else people
|
|
|
|
|
will get a linker error if they try to call it. The assumption is
|
|
|
|
|
that things with optional parameters started out as bare, so the
|
|
|
|
|
argument-less version is already there.
|
|
|
|
|
|
|
|
|
|
- If you have to add a new option that requires its own option table,
|
|
|
|
|
you will have to do some extra work including adding a new nested
|
|
|
|
|
Config class, adding a config member variable to ``ArgParser`` in
|
|
|
|
|
:file:`QPDFJob_argv.cc` and ``Handlers`` in :file:`QPDFJob_json.cc`,
|
|
|
|
|
and make sure that manually implemented handlers are consistent with
|
2022-02-05 12:37:15 -05:00
|
|
|
|
each other. It is best to add explicit test cases for all the
|
|
|
|
|
various ways to get to the option.
|