Date: Sun, 7 Apr 2002 20:40:13 +1000 From: "Tim J. Robbins" <tim@robbins.dropbear.id.au> To: freebsd-current@FreeBSD.ORG Cc: "Andrey A. Chernov" <ache@nagual.pp.ru> Subject: Re: NetBSD sort l10n: I give up! Message-ID: <20020407204013.A20639@treetop.robbins.dropbear.id.au>
next in thread | raw e-mail | index | archive | help
Here is a patch to make NetBSD's sort(1) sort by the locale's collating order. The table should not be called ascii[] anymore, but I can't think of a better one, and supplying a patch to change the name would be pointless. It works. It assumes the string strxfrm() outputs is the same length as its input, which is always possible, and true on FreeBSD. $ env LC_COLLATE=fr_FR.ISO8859-1 sort <test.fr | rs Čte elle $ env LC_COLLATE=fr_FR.ISO8859-1 ./sort <test.fr | rs Čte elle $ rs <test.fr elle Čte Enjoy (?) Tim Index: init.c =================================================================== RCS file: /home/ncvs/src/contrib/sort/init.c,v retrieving revision 1.2 diff -u -r1.2 init.c --- init.c 2002/04/07 00:49:00 1.2 +++ init.c 2002/04/07 10:29:59 @@ -46,6 +46,7 @@ #endif /* not lint */ #include <ctype.h> +#include <err.h> #include <string.h> static void insertcol __P((struct field *)); @@ -291,8 +292,7 @@ * Note: when sorting in forward order, to encode character zero in a key, * use \001\001; character 1 becomes \001\002. In this case, character 0 * is reserved for the field delimiter. Analagously for -r (fld_d = 255). - * Note: this is only good for ASCII sorting. For different LC 's, - * all bets are off. See also num_init in number.c + * See also num_init in number.c */ void settables(gflags) @@ -300,8 +300,20 @@ { u_char *wts; int i, incr; + static int warned; + char abuf[2], xbuf[8]; + + abuf[1] = '\0'; for (i=0; i < 256; i++) { - ascii[i] = i; + if (i != 0) { + *abuf = i; + if (strxfrm(xbuf, abuf, sizeof(xbuf)) > 1 && !warned) { + warnx("collating order too complicated"); + warned = 1; + } + ascii[i] = *xbuf; + } else + ascii[i] = 0; if (i > REC_D && i < 255 - REC_D+1) Rascii[i] = 255 - i + 1; else To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-current" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20020407204013.A20639>