Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 10 Jun 2009 09:08:27 +0700 (ICT)
From:      Olivier Nicole <on@cs.ait.ac.th>
To:        djuatdelta@gmail.com
Cc:        freebsd-questions@freebsd.org
Subject:   Re: PDF inventory software
Message-ID:  <200906100208.n5A28RYg062023@banyan.cs.ait.ac.th>
In-Reply-To: <b6c05a470906091018k3a2f7527o640a526949596141@mail.gmail.com> (message from Daniel Underwood on Tue, 9 Jun 2009 13:18:56 -0400)
References:  <b6c05a470906081417x370edb66yb86fac71b462eab8@mail.gmail.com> <3D527043-AF88-4A26-8029-FD51159E6ABB@yahoo.fr> <b6c05a470906091018k3a2f7527o640a526949596141@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
Daniel,

> I'm trying to convert all PDF files in a directory to text using
> "pdftotext".  I tried the following command:

Aside from the syntax of the command find(1) and some article that may
be in corrupted PDF, you may consider hacking pdftotext to skip the
"do not print" flag in some of the PDF articles.

I don't think that many scientific articles would set the flag that
prevent from printing them. But some PDF filess have that flag set,
and pdftotext would not work on them, unless you patch it (which is
easy, could even be a compile option, I don't remember).

Best regards,

Olivier



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200906100208.n5A28RYg062023>