octoleo/qpdf - qpdf - Vast Development Method

mirror of https://github.com/qpdf/qpdf.git synced 2024-06-05 11:50:53 +00:00

Author	SHA1	Message	Date
Jay Berkenbilt	b3e6d445cb	Tweak "AndGet" mutator functions again Remove any ambiguity around whether old or new value is being returned.	2022-07-24 15:42:23 -04:00
m-holger	afd35f9a30	Overload StreamDataProvider::provideStreamData Use 'QPDFObjGen const&' instead of 'int, int' in signature.	2022-07-24 16:02:35 +01:00
m-holger	f0a8178091	Refactor QPDFObject creation and cloning Move responsibility for creating shared pointers to objects and cloning from QPDFObjectHandle to QPDFObject.	2022-06-27 12:47:02 -04:00
Jay Berkenbilt	0c7c7e4ba4	Track whether certain page modifying methods have been called We need to know whether pushInheritedAttributesToPage or getAllPages have been called when generating JSON output. When reading the JSON back in, we have to call the same methods so that object numbers will line up properly.	2022-06-25 13:55:45 -04:00
Jay Berkenbilt	8a32515a62	Add warnings for some additional page tree repair	2022-06-25 13:25:35 -04:00
Jay Berkenbilt	eae75dbe44	Add Pl_Function -- a generic function pipeline	2022-06-19 09:12:29 -04:00
Jay Berkenbilt	bb0ea2f8e7	Add qpdfjob_register_progress_reporter	2022-06-19 08:46:58 -04:00
Jay Berkenbilt	87412eb05b	Add QPDFJob::registerProgressReporter	2022-06-19 08:46:58 -04:00
Jay Berkenbilt	3a7ee7e938	Move C-based ProgressReporter helper into QPDFWriter	2022-06-19 08:46:58 -04:00
Jay Berkenbilt	daef4e8fb8	Add more flexible functions to qpdfjob C API	2022-06-19 08:46:58 -04:00
Jay Berkenbilt	e0720eaa78	Use the default logger for other writes to stdout/stderr When there is no context for writing output or error messages, use the default logger.	2022-06-18 10:38:50 -04:00
Jay Berkenbilt	83be2191b4	Use "save" logger when saving data to standard output This includes the output PDF, streams from --show-object and attachments from --save-attachment. This also enables --verbose and --progress to work with saving to stdout.	2022-06-18 09:54:40 -04:00
Jay Berkenbilt	641e92c6a7	QPDF, QPDFJob: use QPDFLogger instead of custom output streams	2022-06-18 09:02:55 -04:00
Jay Berkenbilt	f1f711963b	Add and test QPDFLogger class	2022-06-18 09:02:55 -04:00
Jay Berkenbilt	b7bbf12e85	In json mode, reveal recovered user password when otherwise unavailable	2022-05-30 20:03:08 -04:00
Jay Berkenbilt	f049a77c59	Add additional information when listing attachments	2022-05-30 20:03:08 -04:00
Jay Berkenbilt	27a42c16c7	Change default decode level to "none" with --json-output	2022-05-21 17:51:34 -04:00
Jay Berkenbilt	b0f1564376	Add another binary utf8 to JSON test	2022-05-21 17:39:35 -04:00
Jay Berkenbilt	752f43d4e4	Allow empty b: binary JSON strings	2022-05-21 17:36:32 -04:00
m-holger	6c69a747b9	Code clean up: use range-style for loops wherever possible Remove variables obsoleted by commit `4f24617`.	2022-05-21 16:06:29 -04:00
Jay Berkenbilt	905f47a55f	Add json to large file test	2022-05-21 09:43:45 -04:00
Jay Berkenbilt	9b2eb01e25	Exercise object description in tests	2022-05-20 14:23:32 -04:00
Jay Berkenbilt	6c2fb5b8f0	Add test for bad data and bad datafile	2022-05-20 13:33:30 -04:00
Jay Berkenbilt	d065098089	Test --update-from-json	2022-05-20 11:10:12 -04:00
Jay Berkenbilt	6d4e3ba8a4	Test (and fix) handling of dangling references	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	35b1e1c493	Explicitly test ignoring unknown keys in JSON input	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	dc8df962d8	Make version default to latest for --json-output (like --json)	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	907df2c823	Round-trip tests with --json-stream-data=file	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	a83b7b0611	Tests with manually constructed qpdf json	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	7f8c4b183d	Add tests for --json-input	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	1ec561daa4	Add more names and strings in good13 * native UTF-8 strings * names whose PDF and canonical syntax differ in both dictionary key positions and other positions For json, names are converted both as names and directly when used as dictionary keys.	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	6c5e590673	Rename all test files: _ to -	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	6f43bf8de3	Major rework -- see long comments * Replace --create-from-json=file with --json-input, which causes the regular input to be treated as json. * Eliminate --to-json * In --json=2, bring back "objects" and eliminate "objectinfo". Stream data is never present. * In --json-output=2, write "qpdf-v2" with "objects" and include stream data.	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	56f1b411fe	Back out fluent QPDFObjectHandle methods. Keep the andGet methods. I decided these were confusing and inconsistent with how JSON works. They muddle the API rather than improving it.	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	7e7a9c4379	Parse objects; stream data is not yet handled	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	7fa5d1773b	Implement top-level qpdf json parsing	2022-05-16 13:41:40 -04:00
Jay Berkenbilt	9a0e9a1a9e	Remove offset from missing /Root error The last offset is irrelevant to not being able to find /Root.	2022-05-16 13:39:26 -04:00
Jay Berkenbilt	173b944ef8	Split qpdf.test into multiple test suites This makes it a lot easier to run parts of the test suite.	2022-05-14 17:35:06 -04:00
Jay Berkenbilt	2a2f7f1bba	Add maxobjectid to JSON	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	e9390aeaaa	Add --to-json option	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	2e87d593eb	Test inline stream data with different decode levels	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	f08f398920	Test json v2 with invalid stream data	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	c76536dd9a	Implement JSON v2 output	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	bdfc4da510	Apply script across future v2 test files There is one unexpected pass in this commit. This script was applied to the files changed in this commit:
----------
#!/usr/bin/env python3
import json
import sys


def json_dumps(data):
    return json.dumps(data, ensure_ascii=False, indent=2, separators=(',', ': '))


for filename in sys.argv[1:]:
    with open(filename, 'r') as f:
        data = json.loads(f.read())
    data['version'] = 2
    objectinfo = {}
    if 'objectinfo' in data:
        objectinfo = data['objectinfo']
        del data['objectinfo']
    if 'objects' not in data:
        continue
    qpdf = {'jsonversion': 2, 'pdfversion': '1.3', 'objects': {}}
    for k, v in data['objects'].items():
        is_stream = objectinfo.get(k, {}).get('stream', {}).get('is', False)
        if k.endswith(' R'):
            k = 'obj:' + k
        if is_stream:
            v = {'stream': {'dict': v}}
        else:
            v = {'value': v}
        qpdf['objects'][k] = v
    data['qpdf'] = qpdf
    del data['objects']
    print(json_dumps(data))
----------	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	8d348974aa	Prepare test suite for json v2	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	15272662f6	Fix typo in json output key name moddify -> modify. Also carefully spell checked all remaining keys by splitting them into words and running a spell checker, not just relying on visual proofreading. That was the only one.	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	1bc8abfdd3	Implement JSON v2 for Stream Not fully exercised in this commit	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	3246923cf2	Implement JSON v2 for String Also refine the heuristic for deciding whether to use hexadecimal notation for a string.	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	16f4f94cd9	Prepare code for JSON v2 Update getJSON() methods and calls to them	2022-05-07 11:12:01 -04:00
Jay Berkenbilt	a9fbbd5dca	Objectinfo json: write incrementally and in numeric order This script was used on test data:
----------
#!/usr/bin/env python3
import json
import sys
import re


def json_dumps(data):
    return json.dumps(data, ensure_ascii=False, indent=2, separators=(',', ': '))


for filename in sys.argv[1:]:
    with open(filename, 'r') as f:
        data = json.loads(f.read())
    if 'objectinfo' not in data:
        continue
    trailer = None
    to_sort = []
    for k, v in data['objectinfo'].items():
        if k == 'trailer':
            trailer = v
        else:
            m = re.match(r'^(\d+) \d+ R', k)
            if m:
                to_sort.append([int(m.group(1)), k, v])
    newobjectinfo = {x[1]: x[2] for x in sorted(to_sort)}
    if trailer is not None:
        newobjectinfo['trailer'] = trailer
    data['objectinfo'] = newobjectinfo
    print(json_dumps(data))
----------	2022-05-07 08:26:31 -04:00

814 Commits