Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 26 Jan 2009 13:36:48 -0800
From:      Charlie Kester <corky1951@comcast.net>
To:        FreeBSD Mailing List <freebsd-questions@freebsd.org>
Subject:   Re: can i split a pdf file?
Message-ID:  <20090126213648.GL66858@comcast.net>
In-Reply-To: <20090126091623.a0b50f64.freebsd@edvax.de>
References:  <20090126001822.GA38314@thought.org> <20090126005156.GJ66858@comcast.net> <497D0FF3.6090402@telenix.org> <20090126080618.GA51983@thought.org> <20090126091623.a0b50f64.freebsd@edvax.de>

next in thread | previous in thread | raw e-mail | index | archive | help
On Mon 26 Jan 2009 at 00:16:23 PST Polytropon wrote:
>On Mon, 26 Jan 2009 00:06:18 -0800, Gary Kline <kline@thought.org> wrote:
>> 	Thanks, Gents,
>> 
>> 	But according to one smallish pdf file that I send to a web based
>> 	tool, it was not a real pdf.  Or, more accurately, it (the pdf to 
>> 	speech program) couldn't decode it.
>
>This is a typical problem with "poorly engineered" PDFs where the
>author puts in the text as images (you'll see this stupidity across
>the Web, too).

In most cases where I've seen this, it's because they had scanned an
actual printed document.  Many old, out-of-print books are being made
newly available this way, so I'm not inclined to complain.

Unfortunately, OCR software still isn't reliable enough (or, if
reliable, cheap enough) to convert these scanned images to actual text.



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20090126213648.GL66858>