-
Notifications
You must be signed in to change notification settings - Fork 118
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Linearize PDFs for better metadata removal #111
Comments
Thanks for bringing this to my attention! I have to look into this more but it sounds promising. I suppose we'd just have to include the latest 64-bit qpdf binary for each platform with the distribution, then for PDFs run While I do want to keep the number of settings and buttons to a minimum, I also want the main feature of the app, removing metadata, to be comprehensive. For this reason I'm also exploring removing extended filesystem attributes. So better PDF handling is something I'd like to add if it can be done well. |
It would be amazing because currently, the only other options for secure PDF cleanup are:
There actually is one other option that's super easy and straightforward, that most people's operating systems support natively. And that's simply to print as PDF, which apparently flattens the document and removes all the metadata. But, I'm not confident it's 100% fool-proof. It would be nicer to actually see the before and after (what your app does) to verify it's been cleaned. |
Why does QPDF produce different files everytime? I used the same source file to generate files through QPDF at two instances and the binary file comparison shows differences in the two PDFs produced, despite the input file being the same. |
I think that until you'll update ExifCleaner to make it permanently remove PDF metadata, it would be best to remove all claims of PDF support altogether from github.com/szTheory/exifcleaner and exifcleaner.com In the former it starts by saying "Desktop app to clean metadata from images, videos, PDFs, and other files." without any warning. In the "Benefits" section it warns that support is "partial" and links to this discussion, but that's not good enough either as NOTHING is truly removed. It's only appears to be removed, on the surface. That's a serious problem as users are expecting this tool to protect their privacy by permanently removing harmful metadata. The "Supported File Types" also mentions PDF without any warning. It's only in the "File writer limitations" that it's said properly that "The original metadata is never actually removed." It's important to update these pages so that people will not get the wrong impression. |
I read that you wish to keep this app minimalistic, and as someone who shares the same philosophy, I can appreciate that.
I'm wondering if adding linearization of PDF files (so that meta data is actually removed), would be within that scope, or overkill?
Here, someone is using QPDF to compliment ExifTool to accomplish that:
https://blog.joshlemon.com.au/protecting-your-pdf-files-and-metadata/
The text was updated successfully, but these errors were encountered: