Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 2 Jul 2009 08:37:26 +1000
From:      Edwin Groothuis <edwin@mavetju.org>
To:        freebsd-i18n@freebsd.org
Subject:   CLDR import for src/share/*def definitions
Message-ID:  <20090701223726.GA36335@mavetju.org>

next in thread | raw e-mail | index | archive | help
I have been playing with the CLDR database to see if I can get the
monetary, time, messages and numerical definitions right. The CLDR
is in UTF-8, I use iconv to translate to other charactersets.

So far most of it is fine, except (subset of issues):

- A couple of languages are not known (es_FR, es_IT)

- A couple of languages have a different abbrevation
        no_NO   -> nb_NO nn_NO
        *_YU    -> *_RS

- A couple of charactersets are not known to iconv:
        (CP1131 ISCII-DECV)

- A couple of translations went wrong:
        Writing to fi_FI in ISO8859-1
        Could not convert currency_symbol from UTF-8 to ISO8859-1

- It is not clear what the difference between "Long month names (as
  in a date)" and "Long month names (without case ending)" is. (could
  be my language problem :-)

The biggest problem so far is not a technical: WHich data is more 
authoritative - The one in the CLDR database or the one we have 
collected over the years from various sources and people?

Another problem I'm facing is that there is little documentation 
on what the format of the *def/ files is, it is mostly a UTSL 
approach in lib/libc/locale, but that doesn't show me neither if I 
can safely replace (for example in uk_UA)
     # yesstr
    -<E2><D0><DA>
    +<E2><D0><DA>:<E2>:<C2><B0><BA>:<C2>:yes:y:YES:Y

So euhm... Is there anybody who wants to give their opinion or
wisdom about things, please speak up, I need it :-)

Edwin

-- 
Edwin Groothuis		Website: http://www.mavetju.org/
edwin@mavetju.org	Weblog:  http://www.mavetju.org/weblog/



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20090701223726.GA36335>