Date: Mon, 6 Jul 2009 20:00:55 +0200 From: Wolfgang Zenker <wolfgang@lyxys.ka.sub.org> To: freebsd-i18n@freebsd.org Subject: Re: CLDR import for src/share/*def definitions Message-ID: <20090706180055.GA18458@lyxys.ka.sub.org> In-Reply-To: <20090701223726.GA36335@mavetju.org> References: <20090701223726.GA36335@mavetju.org>
next in thread | previous in thread | raw e-mail | index | archive | help
Hi, * Edwin Groothuis <edwin@mavetju.org> [090702 00:37]: > I have been playing with the CLDR database to see if I can get the > monetary, time, messages and numerical definitions right. The CLDR > is in UTF-8, I use iconv to translate to other charactersets. > So far most of it is fine, except (subset of issues): > - A couple of languages are not known (es_FR, es_IT) what do you mean by "not known"? Both locales specify a spanish language locale, one for use in France and one in Italy. There might even be no different language contructs from es_ES, just different ways to format dates ore something like that. > - A couple of languages have a different abbrevation > no_NO -> nb_NO nn_NO > *_YU -> *_RS You best ask a norwegian about the locales to use for Norway; for the second line I'ld go with *_RS, as the former Yugoslavia hac split into aming others the Republic of Serbia (which i assume the code RS is supposed to mean). > - A couple of charactersets are not known to iconv: > (CP1131 ISCII-DECV) Never heard of them. > - A couple of translations went wrong: > Writing to fi_FI in ISO8859-1 > Could not convert currency_symbol from UTF-8 to ISO8859-1 Thats because Finland uses the Euro and the Euro-sign does not exist in ISO8859-1; you have to use ISO8859-15 if you want the € currency symbol in an ISO8859-* charset. > - It is not clear what the difference between "Long month names (as > in a date)" and "Long month names (without case ending)" is. (could > be my language problem :-) I don't know either; could you give an example where the two are different? Preferably in a language where we find some speaekers here :-) I do speak english, german, latin and some arabic, if that is of any help. > The biggest problem so far is not a technical: WHich data is more > authoritative - The one in the CLDR database or the one we have > collected over the years from various sources and people? I _think_ the CLDR has been relying on people coming forward with information the same way that we have, so I consider neither _the_ authoritative source. Best to just list conflicting entries and ask around for locals on the lit that could help resolve conflicts. > Another problem I'm facing is that there is little documentation > on what the format of the *def/ files is, it is mostly a UTSL > approach in lib/libc/locale, but that doesn't show me neither if I > can safely replace (for example in uk_UA) > # yesstr > -<E2><D0><DA> > +<E2><D0><DA>:<E2>:<C2><B0><BA>:<C2>:yes:y:YES:Y Sorry, no clue, can't help you here. > So euhm... Is there anybody who wants to give their opinion or > wisdom about things, please speak up, I need it :-) I don't know if it was of any help, but here you got my 2¢ Wolfgang
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20090706180055.GA18458>