Date: Wed, 10 Jun 2009 09:08:27 +0700 (ICT) From: Olivier Nicole <on@cs.ait.ac.th> To: djuatdelta@gmail.com Cc: freebsd-questions@freebsd.org Subject: Re: PDF inventory software Message-ID: <200906100208.n5A28RYg062023@banyan.cs.ait.ac.th> In-Reply-To: <b6c05a470906091018k3a2f7527o640a526949596141@mail.gmail.com> (message from Daniel Underwood on Tue, 9 Jun 2009 13:18:56 -0400) References: <b6c05a470906081417x370edb66yb86fac71b462eab8@mail.gmail.com> <3D527043-AF88-4A26-8029-FD51159E6ABB@yahoo.fr> <b6c05a470906091018k3a2f7527o640a526949596141@mail.gmail.com>
next in thread | previous in thread | raw e-mail | index | archive | help
Daniel, > I'm trying to convert all PDF files in a directory to text using > "pdftotext". I tried the following command: Aside from the syntax of the command find(1) and some article that may be in corrupted PDF, you may consider hacking pdftotext to skip the "do not print" flag in some of the PDF articles. I don't think that many scientific articles would set the flag that prevent from printing them. But some PDF filess have that flag set, and pdftotext would not work on them, unless you patch it (which is easy, could even be a compile option, I don't remember). Best regards, Olivier
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200906100208.n5A28RYg062023>