From: Warren Block
To: Chad Perrin
Cc: FreeBSD Questions
Date: Mon, 6 Sep 2010 13:02:07 -0600 (MDT)
Subject: Re: PDF to HTML translations

On Mon, 6 Sep 2010, Chad Perrin wrote:

> I've started looking at the Xpdf tools as well as pdftohtml.  Other
> suggestions from within ports would be appreciated.  Additional options
> other than what can be found in ports might also be useful, understanding
> the needs I sketched out above.  The script itself is Perl, in case that
> matters.

An alternative might be to render the PDF to a relatively low-res
bitmap.  Then the HTML becomes just an IMG.  You can do that directly
with Ghostscript, or use ImageMagick/GraphicsMagick.
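
As a rough illustration of the Ghostscript route, here is a minimal sketch in Python (the original script is Perl, so treat this only as an outline to port). The file names, the 72 dpi default, the png16m output device, and the helper names ghostscript_command, img_tag, and render are all assumptions made for the example; none of them come from the thread.

```python
import html
import subprocess


def ghostscript_command(pdf_path, png_pattern, dpi=72):
    """Build the argument list that renders each PDF page to a PNG file.

    png_pattern may contain a printf-style page counter such as
    "page-%03d.png" so that every page gets its own file.
    """
    return [
        "gs",
        "-dBATCH",            # exit once the last page is processed
        "-dNOPAUSE",          # do not wait for a keypress between pages
        "-dSAFER",            # restrict file access from the PDF
        "-sDEVICE=png16m",    # 24-bit colour PNG output (assumed choice)
        f"-r{dpi}",           # resolution; lower values give smaller bitmaps
        f"-sOutputFile={png_pattern}",
        pdf_path,
    ]


def img_tag(png_path, alt_text):
    """Return the IMG element that stands in for the rendered page."""
    src = html.escape(png_path, quote=True)
    alt = html.escape(alt_text, quote=True)
    return f'<img src="{src}" alt="{alt}">'


def render(pdf_path, png_pattern, dpi=72):
    """Run Ghostscript; raises CalledProcessError if gs reports failure."""
    subprocess.run(ghostscript_command(pdf_path, png_pattern, dpi), check=True)
```

Calling render("report.pdf", "page-%03d.png") should leave one PNG per page in the current directory, and img_tag("page-001.png", "Page 1") gives the markup to drop into the generated HTML. Keeping the command construction separate from the call that runs it makes it easy to check the arguments without Ghostscript installed.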