From owner-freebsd-questions@FreeBSD.ORG Wed Jan 28 23:09:22 2009 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 7B8301065679 for ; Wed, 28 Jan 2009 23:09:22 +0000 (UTC) (envelope-from kline@thought.org) Received: from aristotle.thought.org (aristotle.thought.org [209.180.213.210]) by mx1.freebsd.org (Postfix) with ESMTP id 2CC3F8FC35 for ; Wed, 28 Jan 2009 23:09:22 +0000 (UTC) (envelope-from kline@thought.org) Received: from thought.org (tao.thought.org [10.47.0.250]) (authenticated bits=0) by aristotle.thought.org (8.14.2/8.14.2) with ESMTP id n0SNA3pW005217; Wed, 28 Jan 2009 15:10:03 -0800 (PST) (envelope-from kline@thought.org) Received: by thought.org (nbSMTP-1.00) for uid 1002 kline@thought.org; Wed, 28 Jan 2009 15:09:17 -0800 (PST) Date: Wed, 28 Jan 2009 15:09:17 -0800 From: Gary Kline To: Andrew Gould Message-ID: <20090128230916.GA29328@thought.org> References: <20090128040802.GA94236@thought.org> <319D789FD18042DBB7A19571DA26E5AE@rivendell> <20090128192211.GB22208@thought.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4.2.3i X-Organization: Thought Unlimited. Public service Unix since 1986. X-Of_Interest: With 22 years of service to the Unix community. X-Spam-Status: No, score=-4.4 required=3.6 tests=ALL_TRUSTED,BAYES_00 autolearn=ham version=3.2.3 X-Spam-Checker-Version: SpamAssassin 3.2.3 (2007-08-08) on aristotle.thought.org Cc: Reko Turja , FreeBSD Mailing List Subject: Re: OCR... X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 28 Jan 2009 23:09:23 -0000 On Wed, Jan 28, 2009 at 01:32:57PM -0600, Andrew Gould wrote: > On Wed, Jan 28, 2009 at 1:22 PM, Gary Kline wrote: > > > On Wed, Jan 28, 2009 at 12:08:55PM +0200, Reko Turja wrote: > > > >so what is the best commercial/shareware that can read a 10pt-font > > > >file? (( also, when i have time to get back into actually hacking, > > > >this [[turning imaged pdf into OCR'able ascii or 8859-1]] is giong > > > >to > > > >be a first target. any idea which team i should go with. gOCR > > > >looks > > > >best so far to me. > > > > > > AABBYY Finereader - Omnipage haven't been able to catch it in several > > > years either feature or qualitywise. No idea if Finereader runs under > > > emulator though. If the file is already a PDF and 72 DPI with text as > > > graphics most of the damage has already been done, and it will be > > > extremely hard to OCR. > > > > > > > well, damage is probably done. how can i check the resolution? > > i tried to increase it by creating huge ppm and tif files, but > > then that's really absurd since there can only be just so much > > data per image. i _could_ try xv and jpeg and smoothing image to > > refine, but too much hassle. > > > > (i used gocr -m 130 and "saw" the glyphs it (presumably) saw. > > seemed pretty much okay to my eyes. but then i'm not a computer > > program. [MAYBE :)] > > > > gary > > > > > > > > > -Reko > > > > > > > -- > > Gary Kline kline@thought.org http://www.thought.org Public Service > > Unix > > http://jottings.thought.org http://transfinite.thought.org > > The 2.23a release of Jottings: http://jottings.thought.org/index.php > > > > At one point in time, the Abby folks were offering a back-end that ran on > FreeBSD. I tried to get the free download; but it never happened. (They > misplaced my signed, faxed license agreement and I finally got tired of the > back-and-forth prerequisite communication.) > > Abby also no longer supports Mac OS X. I use an old version and like it a > lot. > OK, now i know what to expect. I found theit site and signed up to get the linux version; trial. not likrly to go any further.... gary > Andrew -- Gary Kline kline@thought.org http://www.thought.org Public Service Unix http://jottings.thought.org http://transfinite.thought.org The 2.23a release of Jottings: http://jottings.thought.org/index.php