Is there a way to digitally mark up a PDF so it's not OCR-readable?
from cheese_greater@lemmy.world to nostupidquestions@lemmy.ca on 06 Nov 23:16
https://lemmy.world/post/21713633

Want to ensure financial documents can't be parsed by automated systems.

#nostupidquestions

OsrsNeedsF2P@lemmy.ml on 06 Nov 23:29

PDF scanning is done by both OCR and PDF analysis, so no. If you, a human, can read it, a bot can read it too.

Your best bet is the classic trick of inserting BS text in a font color one hex step off white (e.g. #FEFEFE on a white page).
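
A minimal sketch of that off-white decoy trick, using only the Python standard library. Everything here is an assumption made for illustration: the helper names (`off_white`, `build_pdf`, `extract_text_layer`), the sample balances, the Helvetica font, and the uncompressed single-page layout. It writes one visible black line and one near-white decoy line, then reads the text layer back the way a parser that never renders the page might.

```python
import re


def off_white(step: int = 1) -> str:
    """Hex color `step` levels below pure white on every channel."""
    v = 0xFF - step
    return f"#{v:02X}{v:02X}{v:02X}"


def pdf_escape(text: str) -> str:
    """Escape the characters that are special inside a PDF literal string."""
    return text.replace("\\", r"\\").replace("(", r"\(").replace(")", r"\)")


def build_pdf(real: str, decoy: str, step: int = 1) -> bytes:
    """One-page PDF: `real` in black, `decoy` in a gray one step off white."""
    gray = (0xFF - step) / 0xFF  # 254/255 for the default step
    content = (
        f"BT /F1 12 Tf 72 720 Td 0 g ({pdf_escape(real)}) Tj ET\n"
        f"BT /F1 12 Tf 72 700 Td {gray:.4f} g ({pdf_escape(decoy)}) Tj ET"
    ).encode("latin-1")
    objects = [
        b"<< /Type /Catalog /Pages 2 0 R >>",
        b"<< /Type /Pages /Kids [3 0 R] /Count 1 >>",
        b"<< /Type /Page /Parent 2 0 R /MediaBox [0 0 612 792] "
        b"/Contents 4 0 R /Resources << /Font << /F1 5 0 R >> >> >>",
        b"<< /Length %d >>\nstream\n" % len(content) + content + b"\nendstream",
        b"<< /Type /Font /Subtype /Type1 /BaseFont /Helvetica >>",
    ]
    out = bytearray(b"%PDF-1.4\n")
    offsets = []
    for number, body in enumerate(objects, start=1):
        offsets.append(len(out))  # byte offset of each object for the xref table
        out += b"%d 0 obj\n" % number + body + b"\nendobj\n"
    xref_pos = len(out)
    out += b"xref\n0 %d\n" % (len(objects) + 1)
    out += b"0000000000 65535 f \n"
    for off in offsets:
        out += b"%010d 00000 n \n" % off
    out += b"trailer\n<< /Size %d /Root 1 0 R >>\n" % (len(objects) + 1)
    out += b"startxref\n%d\n%%%%EOF\n" % xref_pos
    return bytes(out)


def extract_text_layer(pdf: bytes) -> list[str]:
    """Naive text-layer read: every string shown with the Tj operator.

    Fill color is ignored, and this only works because the content
    stream above is left uncompressed.
    """
    pattern = rb"\(((?:\\.|[^\\()])*)\)\s*Tj"
    return [m.decode("latin-1") for m in re.findall(pattern, pdf)]


if __name__ == "__main__":
    pdf = build_pdf("Balance: 1,204.50", "Balance: 9,999.99")
    with open("decoy.pdf", "wb") as fh:
        fh.write(pdf)
    print(off_white(), extract_text_layer(pdf))
```

Note that the extractor returns both strings: the decoy only adds noise to the text layer and the real figure is still there. OCR on a rendered image would probably miss text that faint, so the decoy mostly affects parsers that read the PDF structure directly.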

umt@lemmynsfw.com on 06 Nov 23:55

This is correct. There are also typefaces designed to be difficult for OCR, e.g. www.librarystack.org/zxx/

These are, however, difficult for humans to read as well.