From owner-soc-status@FreeBSD.ORG Mon Aug 18 18:33:24 2014 Return-Path: Delivered-To: soc-status@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 8C72F920; Mon, 18 Aug 2014 18:33:24 +0000 (UTC) Received: from mail-wi0-x233.google.com (mail-wi0-x233.google.com [IPv6:2a00:1450:400c:c05::233]) (using TLSv1 with cipher ECDHE-RSA-RC4-SHA (128/128 bits)) (Client CN "smtp.gmail.com", Issuer "Google Internet Authority G2" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id AE6443625; Mon, 18 Aug 2014 18:33:23 +0000 (UTC) Received: by mail-wi0-f179.google.com with SMTP id f8so4138769wiw.0 for ; Mon, 18 Aug 2014 11:33:21 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:reply-to:from:date:message-id:subject:to:content-type; bh=ItBzq8ubAHmjacpabv9sDtgH614eJJ2YgnQ3ZXvOlME=; b=zzRrfq4Nf4Ew2La672wsE7RAFrAWh7qW4D7qA5dWvuEfMAO5JR1w8pn6nMlIcR+l/A 9KMUMVC63oe7DcHRDZV+ptMttVLZCnEdGcsoJdybofUxauQKoPDheSd8FkX5+0uWgpXj f7LH63URMJj0DjoYcSmzsvVFZaF6i+dEkAYy5a8XVqjnziRBM52oLIjjUrs24xLFTv5f odH/CcdsgpX6Vzhl2rZhwI+eoISoL4SWVU4avNFRyXntiirmHWSsjnAJSD6l37jr0vBI qV1XEgCIOs6qkwl+R1ldP/Xfs08Bx7dvrz6HNtVEk/YvraEry6djgaArj+8dNv1FGg4K GefQ== X-Received: by 10.194.58.244 with SMTP id u20mr33226046wjq.36.1408386801748; Mon, 18 Aug 2014 11:33:21 -0700 (PDT) MIME-Version: 1.0 Received: by 10.194.40.33 with HTTP; Mon, 18 Aug 2014 11:33:01 -0700 (PDT) Reply-To: ghostmansd@gmail.com From: Dmitry Selyutin Date: Mon, 18 Aug 2014 22:33:01 +0400 Message-ID: Subject: Report #8: Unicode support To: soc-status@freebsd.org, Pedro Giffuni , David Chisnall , Konrad Jankowski Content-Type: text/plain; charset=UTF-8 X-Content-Filtered-By: Mailman/MimeDel 2.1.18-1 X-BeenThere: soc-status@freebsd.org X-Mailman-Version: 2.1.18-1 Precedence: list List-Id: Summer of Code Status Reports and Discussion List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 18 Aug 2014 18:33:24 -0000 Hello everyone! Here are the last news about the Unicode support project[0]. You can always check my repository[1]. During these days I've been testing and fixing a ton of bugs and memory leaks using valgrind and gdb. I've finished libcolldb and its libc internal clone. Each collation database is now a part of reference-counted locale system. If locale is either "C" or "POSIX" or has no collation rules, we use either strcmp() or wcscmp() instead of strcoll() and wcscoll(). If LC_COLLATE file for the given locale not found or is not a collation database, then try to use /usr/share/locale/UTF-8/LC_COLLATE file. If the latter is not found or is not a collation database, we use original algorithm to maintain backward compatibility. In order to generate LC_COLLATE as collation database, one must use share/examples/collation/colldb.py script. usage: colldb.py share/colldb/root.src /usr/share/locale/UTF-8/LC_COLLATE. The C version of this tool is on the way. [0] https://wiki.freebsd.org/SummerOfCode2014/Unicode [1] https://socsvn.freebsd.org/socsvn/soc2014/ghostmansd -- With best regards, Dmitry Selyutin