octoleo/qpdf - qpdf - Vast Development Method

mirror of https://github.com/qpdf/qpdf.git synced 2024-11-02 11:46:35 +00:00

Author	SHA1	Message	Date
m-holger	057bd659bc	Code tidy: remove redundant variable in QPDF::writeJSON	2022-06-05 18:46:21 -04:00
Jay Berkenbilt	0bd908b550	Update documentation for qpdf JSON v2	2022-05-30 20:03:08 -04:00
Jay Berkenbilt	b7bbf12e85	In json mode, reveal recovered user password when otherwise unavailable	2022-05-30 20:03:08 -04:00
Jay Berkenbilt	f049a77c59	Add additional information when listing attachments	2022-05-30 20:03:08 -04:00
Jay Berkenbilt	04fc7c4bea	Add conversions to ISO-8601 date format	2022-05-30 20:03:08 -04:00
Jay Berkenbilt	27a42c16c7	Change default decode level to "none" with --json-output	2022-05-21 17:51:34 -04:00
Jay Berkenbilt	752f43d4e4	Allow empty b: binary JSON strings	2022-05-21 17:36:32 -04:00
Jay Berkenbilt	05460d405c	Format code	2022-05-21 16:11:42 -04:00
m-holger	6c69a747b9	Code clean up: use range-style for loops wherever possible Remove variables obsoleted by commit `4f24617`.	2022-05-21 16:06:29 -04:00
Jay Berkenbilt	c56a9ca7f6	JSON: Fix large file support	2022-05-21 09:43:45 -04:00
Jay Berkenbilt	47c093c48b	Replace std::regex with validators for better performance	2022-05-21 08:43:21 -04:00
Jay Berkenbilt	9b2eb01e25	Exercise object description in tests	2022-05-20 14:23:32 -04:00
Jay Berkenbilt	6c2fb5b8f0	Add test for bad data and bad datafile	2022-05-20 13:33:30 -04:00
Jay Berkenbilt	d065098089	Test --update-from-json	2022-05-20 11:10:12 -04:00
Jay Berkenbilt	ef955b04b5	Bug fix: don't clobber stream length with replaceDict	2022-05-20 11:09:45 -04:00
Jay Berkenbilt	3eb77a7004	JSON: detect duplicate dictionary keys while parsing	2022-05-20 10:13:15 -04:00
Jay Berkenbilt	6d4e3ba8a4	Test (and fix) handling of dangling references	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	5a2aa59479	Bug fix: isReserved() true for indirect reference to reserved object	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	35b1e1c493	Explicitly test ignoring unknown keys in JSON input	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	dc8df962d8	Make version default to latest for --json-output (like --json)	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	6c7326b290	JSON fix: correctly parse UTF-16 surrogate pairs	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	6f43bf8de3	Major rework -- see long comments * Replace --create-from-json=file with --json-input, which causes the regular input to be treated as json. * Eliminate --to-json * In --json=2, bring back "objects" and eliminate "objectinfo". Stream data is never present. * In --json-output=2, write "qpdf-v2" with "objects" and include stream data.	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	23fc6756f1	Add QUtil::FileCloser to the public API	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	0fe8d44762	Support stream data -- not tested There are no automated tests yet, but committing work so far in preparation for some refactoring.	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	63c7eefe9d	replaceStreamData: accept uninitialized filter/decode_parms These mean to leave the original values alone. This is needed for reconstructing streams from JSON given that the stream data and stream dictionary may appear in any order in the JSON.	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	56f1b411fe	Back out fluent QPDFObjectHandle methods. Keep the andGet methods. I decided these were confusing and inconsistent with how JSON works. They muddle the API rather than improving it.	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	7e7a9c4379	Parse objects; stream data is not yet handled	2022-05-20 09:16:25 -04:00
Jay Berkenbilt	9064542b5f	Add private methods for reserving specific objects	2022-05-20 07:54:09 -04:00
Jay Berkenbilt	7fa5d1773b	Implement top-level qpdf json parsing	2022-05-16 13:41:40 -04:00
Jay Berkenbilt	8d42eb2632	Add scaffolding for QPDF JSON reactor	2022-05-16 13:41:40 -04:00
Jay Berkenbilt	4fe2e06b47	Add --create-from-json and --update-from-json arguments Also add stubs for top-level QPDF methods (createFromJSON, updateFromJSON)	2022-05-16 13:41:40 -04:00
Jay Berkenbilt	9a0e9a1a9e	Remove offset from missing /Root error The last offset is irrelevant to not being able to find /Root.	2022-05-16 13:39:26 -04:00
Jay Berkenbilt	051ae7c282	Improve handling of replacing stream data with empty strings When an empty string was passed to replaceStreamData, the code was passing a null pointer to memcpy. Since a 0 size was also passed, this was harmless, but it triggers sanitizer errors. The code properly handles a null pointer as the buffer in other places.	2022-05-16 13:39:26 -04:00
Jay Berkenbilt	60ec94a7c3	Add QUtil::is_long_long	2022-05-16 13:39:26 -04:00
Jay Berkenbilt	4c7cfd5cbc	JSON reactor: improve handling of nested containers Call the parent container's item method before calling the child item's start method so we can easily know the current nesting level when nested items are added.	2022-05-14 17:35:06 -04:00
Jay Berkenbilt	2a2f7f1bba	Add maxobjectid to JSON	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	e9390aeaaa	Add --to-json option	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	c76536dd9a	Implement JSON v2 output	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	15272662f6	Fix typo in json output key name moddify -> modify. Also carefully spell checked all remaining keys by splitting them into words and running a spell checker, not just relying on visual proofreading. That was the only one.	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	1bc8abfdd3	Implement JSON v2 for Stream Not fully exercised in this commit	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	3246923cf2	Implement JSON v2 for String Also refine the herustic for deciding whether to use hexadecimal notation for a string.	2022-05-08 13:45:20 -04:00
Jay Berkenbilt	16f4f94cd9	Prepare code for JSON v2 Update getJSON() methods and calls to them	2022-05-07 11:12:01 -04:00
Jay Berkenbilt	a9fbbd5dca	Objectinfo json: write incrementally and in numeric order This script was used on test data: ---------- #!/usr/bin/env python3 import json import sys import re def json_dumps(data): return json.dumps(data, ensure_ascii=False, indent=2, separators=(',', ': ')) for filename in sys.argv[1:]: with open(filename, 'r') as f: data = json.loads(f.read()) if 'objectinfo' not in data: continue trailer = None to_sort = [] for k, v in data['objectinfo'].items(): if k == 'trailer': trailer = v else: m = re.match(r'^(\d+) \d+ R', k) if m: to_sort.append([int(m.group(1)), k, v]) newobjectinfo = {x[1]: x[2] for x in sorted(to_sort)} if trailer is not None: newobjectinfo['trailer'] = trailer data['objectinfo'] = newobjectinfo print(json_dumps(data)) ----------	2022-05-07 08:26:31 -04:00
Jay Berkenbilt	948de60990	Objects json: write incrementally and in numeric order The following script was used to adjust test data: ---------- #!/usr/bin/env python3 import json import sys import re def json_dumps(data): return json.dumps(data, ensure_ascii=False, indent=2, separators=(',', ': ')) for filename in sys.argv[1:]: with open(filename, 'r') as f: data = json.loads(f.read()) if 'objects' not in data: continue trailer = None to_sort = [] for k, v in data['objects'].items(): if k == 'trailer': trailer = v else: m = re.match(r'^(\d+) \d+ R', k) if m: to_sort.append([int(m.group(1)), k, v]) newobjects = {x[1]: x[2] for x in sorted(to_sort)} if trailer is not None: newobjects['trailer'] = trailer data['objects'] = newobjects print(json_dumps(data)) ----------	2022-05-07 08:26:31 -04:00
Jay Berkenbilt	f50274ef46	Pages json: write each page incrementally	2022-05-07 08:26:31 -04:00
Jay Berkenbilt	dc9b7287cd	Top-level json: write incrementally This commit just changes the order in which fields are written to the json without changing their content. All the json files in the test suite were modified with this script to ensure that we didn't get any changes other than ordering. ---------- #!/usr/bin/env python3 import json import sys def json_dumps(data): return json.dumps(data, ensure_ascii=False, indent=2, separators=(',', ': ')) for filename in sys.argv[1:]: with open(filename, 'r') as f: data = json.loads(f.read()) newdata = {} for i in ('version', 'parameters', 'pages', 'pagelabels', 'acroform', 'attachments', 'encrypt', 'outlines', 'objects', 'objectinfo'): if i in data: newdata[i] = data[i] print(json_dumps(newdata)) ----------	2022-05-07 08:26:31 -04:00
Jay Berkenbilt	7f65a5c21f	Test json against schema only on demand Testing json against schema requires an in-memory copy, so do it only when requested by the test suite.	2022-05-07 08:26:31 -04:00
Jay Berkenbilt	a3c9980395	Add next to Pl_String and fix comments	2022-05-07 08:26:31 -04:00
Jay Berkenbilt	b361c5ce19	Add --test-json-schema command-line option	2022-05-07 08:26:31 -04:00
Jay Berkenbilt	7604ac5cb2	QPDFJob: have doJSON write to a pipeline	2022-05-07 08:26:31 -04:00

1 2 3 4 5 ...

1168 Commits