From owner-freebsd-questions@FreeBSD.ORG Mon Sep 6 23:33:21 2010
Date: Mon, 6 Sep 2010 16:33:17 -0700
From: Charlie Kester
To: freebsd-questions@freebsd.org
Subject: Re: PDF to HTML translations
Message-ID: <20100906233317.GB6385@comcast.net>
In-Reply-To: <20100906230941.GA6385@comcast.net>
References: <20100904230920.GA20735@guilt.hydra> <20100905065711.GA34993@slackbox.erewhon.net> <20100905083154.GA89704@owl.midgard.homeip.net> <20100906184802.GC28608@guilt.hydra> <20100906230941.GA6385@comcast.net>

On Mon 06 Sep 2010 at 16:09:41 PDT Charlie Kester wrote:
>On Mon 06 Sep 2010 at 12:02:07 PDT Warren Block wrote:
>>On Mon, 6 Sep 2010, Chad Perrin wrote:
>>>I've started looking at the Xpdf tools as well as pdftohtml. Other
>>>suggestions from within ports would be appreciated. Additional options
>>>other than what can be found in ports might also be useful, understanding
>>>the needs I sketched out above. The script itself is Perl, in case that
>>>matters.
>>
>>An alternative might be to render the PDF to a relatively low-res
>>bitmap. Then the HTML becomes just an IMG. You can do that directly
>>with Ghostscript, or use ImageMagick/GraphicsMagick.
>
>Which, if I correctly understand the description on freshmeat, is almost
>exactly what pdf2html does.
>
>http://freshmeat.net/projects/pdf2html/
>
>I downloaded the latest version just now and tried building it. The
>build failed with some syntax errors in pbm2png.c, so if anyone wants
>to add this to ports, they'll have some cleanup work to do.

FWIW, the syntax errors are all due to some misplaced line breaks.
Re-joining all the lines that generate a warning about a missing
terminating " character fixes this. (This was using gcc 4.2.1.)
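
For anyone who would rather not re-join the lines by hand, here is a rough sketch of the idea in Python. It is not part of pdf2html, and the function names are made up for illustration. It assumes that a line with an odd number of unescaped " characters has an unterminated string literal, and that the misplaced break should become a single space (pass sep="" if the break was inserted mid-word). It does not understand comments, so check the result against the compiler warnings.

```python
def count_unescaped_quotes(line):
    """Count double quotes that are neither backslash-escaped nor the
    character literal '"'."""
    count = 0
    i = 0
    while i < len(line):
        ch = line[i]
        if ch == "\\":
            i += 2  # skip the backslash and the character it escapes
            continue
        if line[i:i + 3] == "'\"'":
            i += 3  # skip the C character literal '"'
            continue
        if ch == '"':
            count += 1
        i += 1
    return count


def rejoin_broken_strings(lines, sep=" "):
    """Join each line that leaves a string literal open with the lines
    that follow it, until the quotes balance again.

    Assumption: an odd quote count means the line break fell inside a
    string literal. Leading whitespace of the continuation is dropped.
    """
    fixed = []
    pending = None
    for line in lines:
        line = line.rstrip("\n")
        if pending is None:
            current = line
        else:
            current = pending + sep + line.lstrip()
        if count_unescaped_quotes(current) % 2 == 1:
            pending = current  # string still open, keep accumulating
        else:
            fixed.append(current)
            pending = None
    if pending is not None:
        fixed.append(pending)  # unbalanced to the end; emit it unchanged
    return fixed
```

To try it on the file, something like rejoin_broken_strings(open("pbm2png.c").readlines()) returns the repaired lines, which can then be written back out with newlines added.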