From owner-freebsd-questions@FreeBSD.ORG Thu May 27 23:36:16 2010 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 3D889106566B for ; Thu, 27 May 2010 23:36:16 +0000 (UTC) (envelope-from kline@thought.org) Received: from ethic.thought.org (plato.thought.org [209.180.213.209]) by mx1.freebsd.org (Postfix) with ESMTP id D33CB8FC0C for ; Thu, 27 May 2010 23:36:15 +0000 (UTC) Received: from thought.org (tao.thought.org [10.47.0.250]) (authenticated bits=0) by ethic.thought.org (8.14.3/8.14.3) with ESMTP id o4RNa8SA021460; Thu, 27 May 2010 16:36:09 -0700 (PDT) (envelope-from kline@thought.org) Received: by thought.org (nbSMTP-1.00) for uid 1002 kline@thought.org; Thu, 27 May 2010 16:36:08 -0700 (PDT) Date: Thu, 27 May 2010 16:36:08 -0700 From: Gary Kline To: Polytropon Message-ID: <20100527233607.GD19297@thought.org> References: <20100527013843.GA40751@thought.org> <20100527050302.da39c258.freebsd@edvax.de> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20100527050302.da39c258.freebsd@edvax.de> User-Agent: Mutt/1.4.2.3i X-Organization: Thought Unlimited. Public service Unix since 1986. X-Of_Interest: With 23 years of service to the Unix community. X-Spam-Status: No, score=-0.2 required=3.6 tests=ALL_TRUSTED,BAYES_00, GUARANTEED_100_PERCENT,T_RP_MATCHES_RCVD autolearn=no version=3.3.0 X-Spam-Checker-Version: SpamAssassin 3.3.0 (2010-01-18) on ethic.thought.org Cc: FreeBSD Mailing List Subject: Re: any shortcuts to doc to ascii? X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 27 May 2010 23:36:16 -0000 On Thu, May 27, 2010 at 05:03:02AM +0200, Polytropon wrote: > On Wed, 26 May 2010 18:38:47 -0700, Gary Kline wrote: > > > > > > guys, > > > > is there anything that can take these hex triplets such as > > > > We Don\xe2\x80\x99t > > > > and render them back to the ascii or keyboard equivalents? > > in this case, the \x99 would be an apostrophe. > > thus: > > > > > > We Don't > > > > tia, > > > > gsry > > > > ps: even lynx -dump messes up, i believe. i'm trying to go from > > DOC back to typewriter.... > > > Yes, even a typewriter is better than DOC. :-) > man, you got that right!! > To process DOC files into ASCII, there are several ways, with > different complexity: > > Most complex ones: Use OpenOffice or Abiword, open the file and > save it as ASCII. Included "special characters" should be in > regular ASCII representation now. > > Better: Use (from ports) catdoc or antiword. > i don't see any ascii suffix [for OOo]. i saved as .txt. same krap. the \x94, x9d, \x9c... same with catdoc. i'll try antiword. [forgot about that. ] > I'm not sure in how far conflicting codepages may be involved. > It is known that "Windows" does have problems supporting standards, > and this applies to character sets and language variations, too. > your words could be emblazoned in 24k gold on some Monument of Truth. i've been fighting going for mac to OOo and back... (******) thanks. gary ps: antiword same as catdoc. back to my per substitutions. that works, along with vi's Builtin subs. pps::: .... > > > -- > Polytropon > Magdeburg, Germany > Happy FreeBSD user since 4.0 > Andra moi ennepe, Mousa, ... -- Gary Kline kline@thought.org http://www.thought.org Public Service Unix The 7.83a release of Jottings: http://jottings.thought.org/index.php http://journey.thought.org 99 44/100% Guaranteed Novel