I see what you mean - but that's not a particularly good example. The file was uploaded more than 13 years ago. The Archive workflow has got so much better in that time. As a test, I've re-uploaded the images to see what the new workflow can do: Home Computing Weekly issue 40.That's not been my experience -- several PDFs I've downloaded from the Internet Archive have been shockingly bad (to the point of unreadable). Downloading the raw images and recreating the PDFs produces much better results.
This is a sample from the 2011 upload: And this from my recent upload: While I couldn't get them to exactly the same zoom factor, I think you'll agree the new upload is much more crisp. It looks like the 2011 compression setting were set way too high. The new PDF file size is just under 40% larger than the old one
The tools available to the user are also much better than 2011. I don't think either img2pdf or ocrmypdf were available back then
The guide for uploading scanned page images is here: How to upload scanned images to make a book – Internet Archive Help Center. I have to refer to it far more often than I should.
Statistics: Posted by scruss — Sun Feb 23, 2025 1:27 am