From owner-freebsd-questions@FreeBSD.ORG Mon Sep 6 23:09:46 2010 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 622EE10656E4 for ; Mon, 6 Sep 2010 23:09:46 +0000 (UTC) (envelope-from corky1951@comcast.net) Received: from qmta14.westchester.pa.mail.comcast.net (qmta14.westchester.pa.mail.comcast.net [76.96.59.212]) by mx1.freebsd.org (Postfix) with ESMTP id 0CE1E8FC1D for ; Mon, 6 Sep 2010 23:09:44 +0000 (UTC) Received: from omta08.westchester.pa.mail.comcast.net ([76.96.62.12]) by qmta14.westchester.pa.mail.comcast.net with comcast id 3auj1f0010Fqzac5Eb9lRa; Mon, 06 Sep 2010 23:09:45 +0000 Received: from comcast.net ([98.203.142.76]) by omta08.westchester.pa.mail.comcast.net with comcast id 3b9j1f0071f6R9u3Ub9k98; Mon, 06 Sep 2010 23:09:45 +0000 Received: by comcast.net (sSMTP sendmail emulation); Mon, 06 Sep 2010 16:09:41 -0700 Date: Mon, 6 Sep 2010 16:09:41 -0700 From: Charlie Kester To: freebsd-questions@freebsd.org Message-ID: <20100906230941.GA6385@comcast.net> Mail-Followup-To: freebsd-questions@freebsd.org References: <20100904230920.GA20735@guilt.hydra> <20100905065711.GA34993@slackbox.erewhon.net> <20100905083154.GA89704@owl.midgard.homeip.net> <20100906184802.GC28608@guilt.hydra> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii; format=flowed Content-Disposition: inline In-Reply-To: X-Mailer: Mutt 1.5.20 X-Composer: Vim 7.2 User-Agent: Mutt/1.5.20 (2009-06-14) Subject: Re: PDF to HTML translations X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 06 Sep 2010 23:09:46 -0000 On Mon 06 Sep 2010 at 12:02:07 PDT Warren Block wrote: >On Mon, 6 Sep 2010, Chad Perrin wrote: >>I've started looking at the Xpdf tools as well as pdftohtml. Other >>suggestions from within ports would be appreciated. Additional options >>other than what can be found in ports might also be useful, understanding >>the needs I sketched out above. The script itself is Perl, in case that >>matters. > >An alternative might be to render the PDF to a relatively low-res >bitmap. Then the HTML becomes just an IMG. You can do that directly >with Ghostscript, or use ImageMagick/GraphicsMagick. Which, if I correctly understand the description on freshmeat, is almost exactly what pdf2html does. http://freshmeat.net/projects/pdf2html/ I downloaded the latest version just now and tried building it. The build failed with some syntax errors in pbm2png.c, so if anyone wants to add this to ports, they'll have some cleanup work to do.