Date: Thu, 5 Jan 2006 21:14:39 -0800
From: Gary Kline
To: FreeBSD Mailing List
Message-ID: <20060106051439.GA80045@thought.org>
Subject: how to tell aspell -c to ignore "_", ">", "<", and other bytes

People,

You may remember that I'm trying to scan more than 400 pages from a text. Things work much better using the latest gocr and a greatly enlarged JPEG image, tweaked with xv. I'm almost to the point where I can use aspell -c to correct misinterpreted text.

The gotcha is that the sample jpg files I have are filled with improper non-characters, including "_", "<", and ">", along with punctuation and random integers. Is there any way to tell aspell to look at (say) S_wiss and guess Swiss, an6yle and guess angle, n:otio:1 and guess motion, and di.5tnnce and guess distance?

Because the delimiters are spaces, it is impossible to have aspell recognize things like "i f" for if, and so on. But I have to say that version 0.40 is vastly better than 0.37.

thanks much,

gary

-- 
Gary Kline  kline@thought.org  www.thought.org  Public service Unix
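
One possible way to attack the stray characters described above is to pre-clean the OCR text before handing it to aspell -c, rather than asking aspell itself to ignore them. The following is only a minimal sketch under stated assumptions: the names clean_token and clean_line are hypothetical, and the rule "drop every non-letter inside a token that contains at least one letter" is an assumption drawn from the examples in this message, not something gocr or aspell provides.

```python
import re

# Assumption: inside a word, anything that is not an ASCII letter,
# apostrophe, or hyphen is OCR noise and can be dropped.
NOISE = re.compile(r"[^A-Za-z'-]")

def clean_token(token):
    """Strip OCR noise from one whitespace-delimited token."""
    # Leave tokens with no letters at all (e.g. "0.40", "1986") untouched.
    if not any(c.isalpha() for c in token):
        return token
    # Keep trailing sentence punctuation such as "." or ",".
    core = token.rstrip(".,;:!?")
    tail = token[len(core):]
    return NOISE.sub("", core) + tail

def clean_line(line):
    """Clean every token in a line, preserving the original whitespace."""
    return re.sub(r"\S+", lambda m: clean_token(m.group()), line)

if __name__ == "__main__":
    for word in ["S_wiss", "an6yle", "n:otio:1", "di.5tnnce"]:
        print(word, "->", clean_token(word))
```

With this rule S_wiss becomes Swiss outright, while an6yle, n:otio:1, and di.5tnnce become anyle, notio, and ditnnce, which are ordinary misspellings that aspell -c can then offer suggestions for. The sketch does not address split words such as "i f", and it also drops leading quotes or brackets attached to a word, so the result still needs a manual pass.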