Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 3 Feb 2023 12:36:47 -0500
From:      George Mitchell <george+freebsd@m5p.com>
To:        stable@freebsd.org
Subject:   Re: Grep with non-ascii
Message-ID:  <c77ad75b-b8c4-9014-0bc7-f1a0ec78272c@m5p.com>
In-Reply-To: <20230204010605.4874609f80eed28543407807@dec.sakura.ne.jp>
References:  <20230203110642.70e4a076@elg.hjerdalen.lokalnett> <819a4336-9689-bdbe-a90d-8f1d7b842662@grosbein.net> <20230203151853.02732bd6@elg.hjerdalen.lokalnett> <20230204010605.4874609f80eed28543407807@dec.sakura.ne.jp>

next in thread | previous in thread | raw e-mail | index | archive | help
On 2/3/23 11:06, Tomoaki AOKI wrote:
> [...]
> If this is the case like above, the only solution is to move to
> character set containing ALL characters all over the world.
> 
> AFAIK, the only candidates are only two, TRON code [1] and Unicode (UCS,
> ISO/IEC 10646) [2]. And TRON code is very rarely used, actual candidate
> would be Unicode only.
> Note that Unicode is usually encoded to any of UTF-8, UTF-16 or UTF-32
> for data transfer (sometimes raw UCS-2?).
> [...]

The one positive development in the world of computing that I would
credit to Java is the earliest big push toward the adoption of UTF-8.
I strongly hope UTF-8 becomes universally used sooner rather than
later.                                                     -- George




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?c77ad75b-b8c4-9014-0bc7-f1a0ec78272c>