Date: Mon, 6 Sep 2010 16:33:17 -0700 From: Charlie Kester <corky1951@comcast.net> To: freebsd-questions@freebsd.org Subject: Re: PDF to HTML translations Message-ID: <20100906233317.GB6385@comcast.net> In-Reply-To: <20100906230941.GA6385@comcast.net> References: <20100904230920.GA20735@guilt.hydra> <20100905065711.GA34993@slackbox.erewhon.net> <20100905083154.GA89704@owl.midgard.homeip.net> <20100906184802.GC28608@guilt.hydra> <alpine.BSF.2.00.1009061258460.15237@wonkity.com> <20100906230941.GA6385@comcast.net>
next in thread | previous in thread | raw e-mail | index | archive | help
On Mon 06 Sep 2010 at 16:09:41 PDT Charlie Kester wrote: >On Mon 06 Sep 2010 at 12:02:07 PDT Warren Block wrote: >>On Mon, 6 Sep 2010, Chad Perrin wrote: >>>I've started looking at the Xpdf tools as well as pdftohtml. Other >>>suggestions from within ports would be appreciated. Additional options >>>other than what can be found in ports might also be useful, understanding >>>the needs I sketched out above. The script itself is Perl, in case that >>>matters. >> >>An alternative might be to render the PDF to a relatively low-res >>bitmap. Then the HTML becomes just an IMG. You can do that >>directly >>with Ghostscript, or use ImageMagick/GraphicsMagick. > >Which, if I correctly understand the description on freshmeat, is almost >exactly what pdf2html does. > >http://freshmeat.net/projects/pdf2html/ > >I downloaded the latest version just now and tried building it. The >build failed with some syntax errors in pbm2png.c, so if anyone wants >to add this to ports, they'll have some cleanup work to do. FWIW, the syntax errors are all due to some misplaced line breaks. Re-joining all the lines that generate a warning about a missing terminating " character fixes this. (This was using gcc 4.2.1.)
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20100906233317.GB6385>