mirror of
https://github.com/qpdf/qpdf.git
synced 2024-06-26 07:12:45 +00:00
TODO note about sanitizer
This commit is contained in:
parent
8ed3e8c79b
commit
4f103c6182
24
TODO
24
TODO
|
@ -491,17 +491,19 @@ I find it useful to make reference to them in this list.
|
||||||
by making it possible to run the lexer (tokenizer) over a whole
|
by making it possible to run the lexer (tokenizer) over a whole
|
||||||
file. Make it possible to replace all strings in a file lexically
|
file. Make it possible to replace all strings in a file lexically
|
||||||
even on badly broken files. Ideally this should work files that are
|
even on badly broken files. Ideally this should work files that are
|
||||||
lacking xref, have broken links, etc., and ideally it should work
|
lacking xref, have broken links, duplicated dictionary keys, syntax
|
||||||
with encrypted files if possible. This should go through the
|
errors, etc., and ideally it should work with encrypted files if
|
||||||
streams and strings and replace them with fixed or random
|
possible. This should go through the streams and strings and
|
||||||
characters, preferably, but not necessarily, in a manner that works
|
replace them with fixed or random characters, preferably, but not
|
||||||
with fonts. One possibility would be to detect whether a string
|
necessarily, in a manner that works with fonts. One possibility
|
||||||
contains characters with normal encoding, and if so, use 0x41. If
|
would be to detect whether a string contains characters with normal
|
||||||
the string uses character maps, use 0x01. The output should
|
encoding, and if so, use 0x41. If the string uses character maps,
|
||||||
otherwise be unrelated to the input. This could be built after the
|
use 0x01. The output should otherwise be unrelated to the input.
|
||||||
filtering and tokenizer rewrite and should be done in a manner that
|
This could be built after the filtering and tokenizer rewrite and
|
||||||
takes advantage of the other lexical features. This sanitizer
|
should be done in a manner that takes advantage of the other
|
||||||
should also clear metadata and replace images.
|
lexical features. This sanitizer should also clear metadata and
|
||||||
|
replace images. If I ever do this, the file from issue #494 would
|
||||||
|
be a great one to look at.
|
||||||
|
|
||||||
* Here are some notes about having stream data providers modify
|
* Here are some notes about having stream data providers modify
|
||||||
stream dictionaries. I had wanted to add this functionality to make
|
stream dictionaries. I had wanted to add this functionality to make
|
||||||
|
|
Loading…
Reference in New Issue
Block a user