Skip site navigation (1)Skip section navigation (2)
Date:      Sun, 17 Sep 2017 22:32:11 +0200
From:      Polytropon <freebsd@edvax.de>
To:        mfv@bway.net
Cc:        freebsd-questions@freebsd.org, Polytropon <freebsd@edvax.de>
Subject:   Re: case command
Message-ID:  <20170917223211.bd017503.freebsd@edvax.de>
In-Reply-To: <20170917160758.18b6000e@gecko4>
References:  <59BE89E1.3050209@gmail.com> <20170917193722.7d2ecbe3.freebsd@edvax.de> <20170917160758.18b6000e@gecko4>

next in thread | previous in thread | raw e-mail | index | archive | help
On Sun, 17 Sep 2017 16:07:58 -0400, mfv wrote:
> > On Sun, 2017-09-17 at 19:37 Polytropon <freebsd@edvax.de> wrote:
> >
> >On Sun, 17 Sep 2017 10:42:41 -0400, Ernie Luzar wrote:
> >> Looking for a system command that a I can pip a file through to
> >> change all uppercase content to lower case.
> >>=20
> >> Is there such a command line command? =20
> >
> >Several ones. One is to use tr:
> >
> >	... | tr '[A-Z]' '[a-z]' | ...
> >
> >Or with character classes:
> >
> >	... | tr '[:upper:]' '[:lower:] | ...
> >
> >You can also use awk for this task:
> >
> >	... | awk '{ print tolower($0) }' | ...
> >
> >You can use this within the awk portion of your script, too.
> >
> >Or shortened:
> >
> >	... | awk '{ print tolower }' | ...
> >
> >But keep in mind: Things like german Umlauts usually won't
> >be processed correctly.
> >
> >Those are a few possible solutions. There are more. ;-)
> >
> >
> >
>=20
> Hello,
>=20
> Yes, Indeed. Here is an alternative using gsed:
>=20
>  gsed -e 's/(.*)/\L\1/' < input | ...
>=20
> To convert from lower case to upper case, change '\L' to '\U'.

This only works with GNU sed (gsed), to be installed from ports.
FreeBSD's native sed implementation does not support \L and \U,
so you'd have to install GNU sed additionally.



> As gsed operates on one line at a time it will not be as fast as other
> solutions but has the merit of working on very large files when memory
> is an issue.

If awk is already part of the pipe chain, it's not a problem
to use it for this task.



> It is also able to convert some Unicode, at least some of the Latin-1
> Supplements and Latin Extended-A.  If conversions are needed for a
> particular language not already covered by gsed then the y-command
> could be added. For example:
>=20
>  y/=C2=C3=C4=C5=C1/=E2=E3=E4=E5=E1/

For localized 1-byte codes (like german Umlauts), dd can be used.
All methods mentioned so far seem to work correctly:

	% echo "M=C4RCHENB=DCGELR=D6STER" | dd conv=3Dlcase
	m=E4rchenb=FCgelr=F6ster
	0+1 records in
	0+1 records out
	19 bytes transferred in 0.000030 secs (632474 bytes/sec)

	% echo "M=C4RCHENB=DCGELR=D6STER" | tr '[A-Z]' '[a-z]'
	m=E4rchenb=FCgelr=F6ster

	% echo "M=C4RCHENB=DCGELR=D6STER" | tr '[:upper:]' '[:lower:]'
	m=E4rchenb=FCgelr=F6ster

	% echo "M=C4RCHENB=DCGELR=D6STER" | awk '{ print tolower }'
	m=E4rchenb=FCgelr=F6ster

This is on a ISO-8859-1 localized system. Even though it is technically
possible, I don't think those "edge cases" will appear in a domain
name list. :-)




--=20
Polytropon
Magdeburg, Germany
Happy FreeBSD user since 4.0
Andra moi ennepe, Mousa, ...



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20170917223211.bd017503.freebsd>