Date: Thu, 2 Jul 2009 08:37:26 +1000
From: Edwin Groothuis <edwin@mavetju.org>
To: freebsd-i18n@freebsd.org
Subject: CLDR import for src/share/*def definitions
Message-ID: <20090701223726.GA36335@mavetju.org>
I have been playing with the CLDR database to see if I can get the
monetary, time, messages and numerical definitions right. The CLDR
is in UTF-8; I use iconv to translate it to other character sets.

So far most of it is fine, except (subset of issues):

- A couple of languages are not known (es_FR, es_IT)
- A couple of languages have a different abbreviation:
    no_NO -> nb_NO nn_NO
    *_YU -> *_RS
- A couple of character sets are not known to iconv:
    (CP1131 ISCII-DECV)
- A couple of translations went wrong:
    Writing to fi_FI in ISO8859-1
    Could not convert currency_symbol from UTF-8 to ISO8859-1
- It is not clear what the difference is between "Long month names
  (as in a date)" and "Long month names (without case ending)".
  (could be my language problem :-)

The biggest problem so far is not a technical one: Which data is more
authoritative, the one in the CLDR database or the one we have
collected over the years from various sources and people?

Another problem I'm facing is that there is little documentation
on what the format of the *def/ files is. It is mostly a UTSL approach
in lib/libc/locale, but that doesn't tell me either whether I can safely
replace (for example in uk_UA):

     # yesstr
    -<E2><D0><DA>
    +<E2><D0><DA>:<E2>:<C2><B0><BA>:<C2>:yes:y:YES:Y

So euhm... If there is anybody who wants to give their opinion or
wisdom about things, please speak up, I need it :-)

Edwin

-- 
Edwin Groothuis		Website: http://www.mavetju.org/
edwin@mavetju.org	Weblog:  http://www.mavetju.org/weblog/
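
For reference, the "Could not convert currency_symbol" failure above can be
reproduced in a small sketch. It uses Python's built-in codecs rather than
iconv, and the helper name convert_field is hypothetical. It also assumes
that the symbol in question is the euro sign, which the message does not
actually state; the euro sign simply has no code point in ISO8859-1, while
ISO8859-15 has one.

```python
def convert_field(name, value, charset):
    """Encode a UTF-8 sourced string into the target charset.

    Returns the encoded bytes, or None if some character has no
    representation in that charset. (Hypothetical helper, not part of
    any FreeBSD or CLDR tooling.)
    """
    try:
        return value.encode(charset)
    except UnicodeEncodeError as err:
        bad = value[err.start:err.end]
        print(f"Could not convert {name} from UTF-8 to {charset}: {bad!r}")
        return None


# Assumption: the currency symbol is the euro sign (U+20AC).
euro = "\u20ac"

latin1 = convert_field("currency_symbol", euro, "ISO8859-1")   # not representable
latin9 = convert_field("currency_symbol", euro, "ISO8859-15")  # representable
```

A conversion script could use a check like this to report every field that
cannot be represented in a given character set, instead of stopping at the
first one, so that the affected locales can be reviewed by hand.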