Is there a way to digitally markup a pdf so its not OCR-readable?
from cheese_greater@lemmy.world to nostupidquestions@lemmy.ca on 06 Nov 23:16
https://lemmy.world/post/21713633
from cheese_greater@lemmy.world to nostupidquestions@lemmy.ca on 06 Nov 23:16
https://lemmy.world/post/21713633
Want to ensure financial documents cant be parsed by automated systems
#nostupidquestions
threaded - newest
PDF scanning is done by both OCR and PDF analysis so no. If you, a human can read it, a bot can read it too.
Your best bet is the classic inserting BS in a 1-hex-off-white font
This is correct. There are also typefaces that are designed to be difficult for OCR eg www.librarystack.org/zxx/
These are, however, difficult for humans to read as well.