From owner-freebsd-questions@FreeBSD.ORG Wed Jan 28 04:08:13 2009 Return-Path: Delivered-To: freebsd-questions@FreeBSD.ORG Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 51553106564A for ; Wed, 28 Jan 2009 04:08:13 +0000 (UTC) (envelope-from kline@thought.org) Received: from aristotle.thought.org (ns1.thought.org [209.180.213.210]) by mx1.freebsd.org (Postfix) with ESMTP id 123DE8FC25 for ; Wed, 28 Jan 2009 04:08:12 +0000 (UTC) (envelope-from kline@thought.org) Received: from thought.org (tao.thought.org [10.47.0.250]) (authenticated bits=0) by aristotle.thought.org (8.14.2/8.14.2) with ESMTP id n0S48pge095510 for ; Tue, 27 Jan 2009 20:08:51 -0800 (PST) (envelope-from kline@thought.org) Received: by thought.org (nbSMTP-1.00) for uid 1002 kline@thought.org; Tue, 27 Jan 2009 20:08:05 -0800 (PST) Date: Tue, 27 Jan 2009 20:08:05 -0800 From: Gary Kline To: FreeBSD Mailing List Message-ID: <20090128040802.GA94236@thought.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.4.2.3i X-Organization: Thought Unlimited. Public service Unix since 1986. X-Of_Interest: With 22 years of service to the Unix community. X-Spam-Status: No, score=-4.4 required=3.6 tests=ALL_TRUSTED,BAYES_00 autolearn=ham version=3.2.3 X-Spam-Checker-Version: SpamAssassin 3.2.3 (2007-08-08) on aristotle.thought.org Cc: Subject: OCR... X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 28 Jan 2009 04:08:13 -0000 guys, well, i'm ashamed to admit that i've put at least a dozen hours in trying, then re-re-retrying to OCR a imaged pdf file with as many open source ocr packages as i can find. before i quit for supper tonight, i finally threw in the towel. realized than i would have been THROUGH with all 181 pages of the text on Aristotle if i had just read the bloody thing. but anyway, i'm done. there simply is no freeware that runs on a 'nix computer//real computer. so what is the best commercial/shareware that can read a 10pt-font file? (( also, when i have time to get back into actually hacking, this [[turning imaged pdf into OCR'able ascii or 8859-1]] is giong to be a first target. any idea which team i should go with. gOCR looks best so far to me. gary -- Gary Kline kline@thought.org http://www.thought.org Public Service Unix http://jottings.thought.org http://transfinite.thought.org The 2.23a release of Jottings: http://jottings.thought.org/index.php