From owner-freebsd-performance@FreeBSD.ORG Sun Nov 4 05:22:11 2007 Return-Path: Delivered-To: freebsd-performance@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id C7CDB16A469 for ; Sun, 4 Nov 2007 05:22:11 +0000 (UTC) (envelope-from josh.carroll@gmail.com) Received: from py-out-1112.google.com (py-out-1112.google.com [64.233.166.179]) by mx1.freebsd.org (Postfix) with ESMTP id 76FAE13C48E for ; Sun, 4 Nov 2007 05:22:11 +0000 (UTC) (envelope-from josh.carroll@gmail.com) Received: by py-out-1112.google.com with SMTP id u77so2459942pyb for ; Sat, 03 Nov 2007 22:21:54 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:reply-to:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; bh=5ep1M1cMHd+Lxp36P+KS3gtdEx9UL3aVXh3BKJtEYWg=; b=j+gw1xsD8nBxuJfImcNHvEsDbRVQmHG/O2LZxVTGzYgGxHlhZt6mwYmCivkgyAMiNU/rg+us13NDRiR5+nz18ckfrPMwop0TIDkmDpU+UzZktJX9llfepBZJYEGL3KI7ENtb4Bkrqug5Qwpe3AU7JAcEM0oOdjXeW2r65Vj81K8= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:reply-to:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=mNUBGqdY+hOY9JQIqDLszWqmuuRlzs9P1wMOkGxqVjRJ3ipl/dLMPy52g0IgMqu+uLH6jqALDaCoWJ/hNwdQAJJEIycDyV7BpIiJEDWvFyurnEZ0h2KqA87nyagh9wTsUJS9Apj0VugpQ8dZHlfV7A6brQEIEnI9NFqgeFV/wTI= Received: by 10.35.116.12 with SMTP id t12mr4298032pym.1194153714064; Sat, 03 Nov 2007 22:21:54 -0700 (PDT) Received: by 10.35.110.17 with HTTP; Sat, 3 Nov 2007 22:21:53 -0700 (PDT) Message-ID: <8cb6106e0711032221i2ecf1d59ge82f368d4162a827@mail.gmail.com> Date: Sun, 4 Nov 2007 01:21:53 -0400 From: "Josh Carroll" To: "Jeff Roberson" In-Reply-To: <20071103133711.Q544@10.0.0.1> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <8cb6106e0710230902x4edf2c8eu2d912d5de1f5d4a2@mail.gmail.com> <8cb6106e0710251925s2db0117cvcb67321b08d7b2a1@mail.gmail.com> <20071102102331.G544@10.0.0.1> <8cb6106e0711021306w10c48a15s99eab526064ac814@mail.gmail.com> <20071102150028.R544@10.0.0.1> <8cb6106e0711021839i4714bdfbh74d412b3e14e808@mail.gmail.com> <8cb6106e0711031227t1e829969s2e6062e1db5167af@mail.gmail.com> <20071103121921.M544@10.0.0.1> <8cb6106e0711031416l4183e2beueb87fade0d56f504@mail.gmail.com> <20071103133711.Q544@10.0.0.1> Cc: freebsd-performance@freebsd.org Subject: Re: ULE vs. 4BSD in RELENG_7 X-BeenThere: freebsd-performance@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: josh.carroll@gmail.com List-Id: Performance/tuning List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 04 Nov 2007 05:22:11 -0000 > Josh, thanks for your help so far. This has been very useful. You're welcome, glad to help! Thanks for the effort and the patch. > Any testing you can run this through is appreciated. Anyone else lurking > in this thread who would like to is also welcome to report back findings. Here are a few benchmarks comparing ULE and the patched ULE. I experimented in changing the slice_min value from 2 to 4, in case that might be useful info for you. Hopefully that helps a bit, but if not it's just a few minutes of CPU time wasted :) Sysbench results: # threads slice=7 slice=13 slice_min=4 slice_min=2 4 2265.67 2250.36 2261.71 2297.08 8 2300.25 2310.02 2306.79 2313.61 12 2269.54 2304.04 2296.54 2279.73 16 2249.26 2252.04 2260.53 2245.76 It looks like with the default minimum (2), the performance for systat is better with 4 and 8 threads (on a 4 core system), but worse for 12 and 16 threads. Here are the results for ffmpeg (-threads 8): slice=7 slice=13 slice_min=4 slice_min=2 1:37.00 1:39.09 1:38.12 1:38.06 The patch definitely improves things there, though not quite as good as using a slice value of 7. But it does improve things. So it slightly improves things for ffmpeg and also slightly increases the performance of sysbench/MySQL (with 8 threads). I also ran through buildworld for both slice_min of 2 and 4, and here are the results, again with ULE as a base line: slice=7 slice=13 slice_min=4 slice_min=2 13:40.56 13:44.28 13:46.64 13:45.80 So buildworld performance is about the same as with the default ULE and default slice value. Thanks, Josh