From: Andrew Gould
To: Gary Kline
Cc: Reko Turja, FreeBSD Mailing List
Subject: Re: OCR...
Date: Thu, 29 Jan 2009 06:43:16 -0600
In-Reply-To: <20090129022349.GB34877@thought.org>
References: <20090128040802.GA94236@thought.org>
 <319D789FD18042DBB7A19571DA26E5AE@rivendell>
 <20090128192211.GB22208@thought.org>
 <20090128230916.GA29328@thought.org>
 <20090129022349.GB34877@thought.org>
List-Id: User questions

On Wed, Jan 28, 2009 at 8:23 PM, Gary Kline wrote:

> On Wed, Jan 28, 2009 at 07:33:41PM -0600, Andrew Gould wrote:
> > On Wed, Jan 28, 2009 at 5:09 PM, Gary Kline wrote:
> >
> > > On Wed, Jan 28, 2009 at 01:32:57PM -0600, Andrew Gould wrote:
> > > > On Wed, Jan 28, 2009 at 1:22 PM, Gary Kline wrote:
> > > >
> > > > > On Wed, Jan 28, 2009 at 12:08:55PM +0200, Reko Turja wrote:
> > > > > > >so what is the best commercial/shareware that can read a 10pt-font
> > > > > > >file? (( also, when i have time to get back into actually hacking,
> > > > > > >this [[turning imaged pdf into OCR'able ascii or 8859-1]] is going
> > > > > > >to be a first target. any idea which team i should go with. gOCR
> > > > > > >looks best so far to me.
> > > > > >
> > > > > > ABBYY Finereader - Omnipage hasn't been able to catch it in several
> > > > > > years, either feature- or quality-wise. No idea if Finereader runs
> > > > > > under an emulator though. If the file is already a PDF and 72 DPI with
> > > > > > text as graphics, most of the damage has already been done, and it
> > > > > > will be extremely hard to OCR.
> > > > > >
> > > > >
> > > > > well, damage is probably done. how can i check the resolution?
> > > > > i tried to increase it by creating huge ppm and tif files, but
> > > > > then that's really absurd since there can only be just so much
> > > > > data per image. i _could_ try xv and jpeg and smoothing image to
> > > > > refine, but too much hassle.
> > > > >
> > > > > (i used gocr -m 130 and "saw" the glyphs it (presumably) saw.
> > > > > seemed pretty much okay to my eyes. but then i'm not a computer
> > > > > program. [MAYBE :)]
> > > > >
> > > > > gary
> > > > >
> > > > > >
> > > > > > -Reko
> > > > > >
> > > > >
> > > > > --
> > > > > Gary Kline kline@thought.org http://www.thought.org Public Service Unix
> > > > > http://jottings.thought.org http://transfinite.thought.org
> > > > > The 2.23a release of Jottings: http://jottings.thought.org/index.php
> > > > >
> > > >
> > > > At one point in time, the ABBYY folks were offering a back-end that ran on
> > > > FreeBSD. I tried to get the free download, but it never happened. (They
> > > > misplaced my signed, faxed license agreement and I finally got tired of
> > > > the back-and-forth prerequisite communication.)
> > > >
> > > > ABBYY also no longer supports Mac OS X. I use an old version and like it
> > > > a lot.
> > > >
> > >
> > > OK, now i know what to expect. I found their site and signed up
> > > to get the linux version; trial. not likely to go any
> > > further....
> > >
> > > gary
> > >
> > > > Andrew
> > >
> > > --
> > > Gary Kline kline@thought.org http://www.thought.org Public Service Unix
> > > http://jottings.thought.org http://transfinite.thought.org
> > > The 2.23a release of Jottings: http://jottings.thought.org/index.php
> > >
> >
> > I'm rooting for you! :-)
>
> well, i just got an email from a david hazard who said to look on
> their website; i replied that i had and couldn't find their test
> suite.... if/when this guy replies, i'll share.
>
> gary
>

Start here: http://www.abbyy.com/sdk/?param=59956

I will try again, as well.

Andrew