From owner-freebsd-questions Mon Feb 15 15:38:02 1999 Return-Path: Received: (from majordom@localhost) by hub.freebsd.org (8.8.8/8.8.8) id PAA23396 for freebsd-questions-outgoing; Mon, 15 Feb 1999 15:38:02 -0800 (PST) (envelope-from owner-freebsd-questions@FreeBSD.ORG) Received: from phoenix.welearn.com.au (phoenix.welearn.com.au [139.130.44.81] (may be forged)) by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id PAA23349 for ; Mon, 15 Feb 1999 15:37:57 -0800 (PST) (envelope-from sue@phoenix.welearn.com.au) Received: (from sue@localhost) by phoenix.welearn.com.au (8.9.1/8.9.0) id KAA19506; Tue, 16 Feb 1999 10:37:45 +1100 (EST) Message-ID: <19990216103740.60271@welearn.com.au> Date: Tue, 16 Feb 1999 10:37:40 +1100 From: Sue Blake To: Greg Lehey Cc: rick hamell , freebsd-questions@FreeBSD.ORG Subject: Re: cleaning a text file References: <19990215201056.19929@welearn.com.au> <19990216095232.J2207@lemis.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Mailer: Mutt 0.88e In-Reply-To: <19990216095232.J2207@lemis.com>; from Greg Lehey on Tue, Feb 16, 1999 at 09:52:32AM +1030 Sender: owner-freebsd-questions@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG On Tue, Feb 16, 1999 at 09:52:32AM +1030, Greg Lehey wrote: > On Monday, 15 February 1999 at 1:10:36 -0800, rick hamell wrote: > > > >> Also, this file has some very long lines which would get truncated > >> or unexpectedly wrapped when sent as email. And if there is something > >> strange, I have to read it and guess what it should have been. > >> > >> Maybe someone will come up with something for this particular case. > >> I can't believe there's not some little untility for this that's been > >> hanging around unloved for years. > > > > Oy! Ok... how does Greg reformat all those emails? > > With Emacs. I have a collection of macros which I'm constantly > changing to catch up with new tricks that mailers discover. > > To Sue's original question: it depends on what your text looks like. > tr(1) will remove characters if you ask it to. If I knew which characters were there (so I could ask tr to remove them) I would have already removed them with my text editor. > fmt(1) might be useful for wrapping lines. I don't see the long line lengths as a big problem at this stage, but fmt might be useful later. The problem is that I don't know which funny characters exist in the file, if any. I want to find out what they are, so I can search for them and eyeball them before killing them. Just knowing which characters they are would give me many solutions immediately. There still doesn't seem to be a way to find this out :-( Maybe there's a long way... somehow put a linefeed after each character in the file (with sed?) and then sort it and look at the top and bottom of the sorted file. -- Regards, -*Sue*- To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-questions" in the body of the message