From owner-soc-status@FreeBSD.ORG Sun Jun 28 20:10:03 2009 Return-Path: Delivered-To: soc-status@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1420F1065670 for ; Sun, 28 Jun 2009 20:10:03 +0000 (UTC) (envelope-from gabor@FreeBSD.org) Received: from server.mypc.hu (server.mypc.hu [87.229.73.95]) by mx1.freebsd.org (Postfix) with ESMTP id C55A38FC17 for ; Sun, 28 Jun 2009 20:10:02 +0000 (UTC) (envelope-from gabor@FreeBSD.org) Received: from localhost (localhost [127.0.0.1]) by server.mypc.hu (Postfix) with ESMTP id 4BF8414D8B60 for ; Sun, 28 Jun 2009 22:10:01 +0200 (CEST) X-Virus-Scanned: amavisd-new at t-hosting.hu Received: from server.mypc.hu ([127.0.0.1]) by localhost (server.mypc.hu [127.0.0.1]) (amavisd-new, port 10024) with LMTP id ZENfp05A9hVK for ; Sun, 28 Jun 2009 22:10:00 +0200 (CEST) Received: from [192.168.1.105] (catv-80-98-231-64.catv.broadband.hu [80.98.231.64]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by server.mypc.hu (Postfix) with ESMTPSA id A332214D8A5D for ; Sun, 28 Jun 2009 22:10:00 +0200 (CEST) Message-ID: <4A47CE16.6080907@FreeBSD.org> Date: Sun, 28 Jun 2009 22:09:58 +0200 From: Gabor Kovesdan User-Agent: Thunderbird 2.0.0.22 (Windows/20090605) MIME-Version: 1.0 To: soc-status@freebsd.org Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Subject: libiconv status report #4 X-BeenThere: soc-status@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Summer of Code Status Reports and Discussion List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 28 Jun 2009 20:10:03 -0000 Hi, this week's progress: - Investigated about UTF-8 and found out that the differences between GNU iconv and Citrus are bugs in GNU because it allows unpaired surrogates in the output, which shouldn't be allowed according to unicode.org. - Also checked UTF-7 and the behaviour seems to be completely the same apart from that Citrus gives some warnings on invalid sequences, which is fine I think. - Added mapper_parallel modul, which wasn't built before - Little optimization to UTF-8: static tail length lookup tables instead of filling the table with for loops. - Fix Kazakh conversion table paths, so Kazakh conversions work now. I have some more changes in progress but they aren't ready yet, so I haven't merged them. I consulted with my mentor and I'll organize my work around GNU iconv compatibility so that we can start to use iconv as soon as possible. Optimization can be done when I find something like this UTF-8 nit and I'll review the whole code at the end of the program. My mentor also pointed out that glibc iconv seems to be even better than GNU iconv, so I'll take a look at it, as well. From the next week I'll have limited connectivity for some time but as I do weekly merges, you might not notice it at all. Cheers, -- Gabor Kovesdan FreeBSD Volunteer EMAIL: gabor@FreeBSD.org .:|:. gabor@kovesdan.org WEB: http://people.FreeBSD.org/~gabor .:|:. http://kovesdan.org