Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 6 Sep 2010 16:33:17 -0700
From:      Charlie Kester <corky1951@comcast.net>
To:        freebsd-questions@freebsd.org
Subject:   Re: PDF to HTML translations
Message-ID:  <20100906233317.GB6385@comcast.net>
In-Reply-To: <20100906230941.GA6385@comcast.net>
References:  <20100904230920.GA20735@guilt.hydra> <20100905065711.GA34993@slackbox.erewhon.net> <20100905083154.GA89704@owl.midgard.homeip.net> <20100906184802.GC28608@guilt.hydra> <alpine.BSF.2.00.1009061258460.15237@wonkity.com> <20100906230941.GA6385@comcast.net>

next in thread | previous in thread | raw e-mail | index | archive | help
On Mon 06 Sep 2010 at 16:09:41 PDT Charlie Kester wrote:
>On Mon 06 Sep 2010 at 12:02:07 PDT Warren Block wrote:
>>On Mon, 6 Sep 2010, Chad Perrin wrote:
>>>I've started looking at the Xpdf tools as well as pdftohtml.  Other
>>>suggestions from within ports would be appreciated.  Additional options
>>>other than what can be found in ports might also be useful, understanding
>>>the needs I sketched out above.  The script itself is Perl, in case that
>>>matters.
>>
>>An alternative might be to render the PDF to a relatively low-res 
>>bitmap.  Then the HTML becomes just an IMG.  You can do that 
>>directly
>>with Ghostscript, or use ImageMagick/GraphicsMagick.
>
>Which, if I correctly understand the description on freshmeat, is almost
>exactly what pdf2html does.
>
>http://freshmeat.net/projects/pdf2html/
>
>I downloaded the latest version just now and tried building it.  The
>build failed with some syntax errors in pbm2png.c, so if anyone wants
>to add this to ports, they'll have some cleanup work to do.

FWIW, the syntax errors are all due to some misplaced line breaks.
Re-joining all the lines that generate a warning about a missing
terminating " character fixes this. 

(This was using gcc 4.2.1.)




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20100906233317.GB6385>