From owner-freebsd-hackers@FreeBSD.ORG Sat Oct 20 13:00:11 2012 Return-Path: Delivered-To: freebsd-hackers@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 377D7937; Sat, 20 Oct 2012 13:00:11 +0000 (UTC) (envelope-from ndenev@gmail.com) Received: from mail-wg0-f42.google.com (mail-wg0-f42.google.com [74.125.82.42]) by mx1.freebsd.org (Postfix) with ESMTP id 8D14D8FC17; Sat, 20 Oct 2012 13:00:10 +0000 (UTC) Received: by mail-wg0-f42.google.com with SMTP id fm10so652080wgb.1 for ; Sat, 20 Oct 2012 06:00:04 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=subject:mime-version:content-type:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to:x-mailer; bh=iCf7zOwb41trqzi1oKnj/9ho1Rc051rOTPPSVQNGpsA=; b=SvfoV3DishYCLiATdfSC6iMxwLWV2YTRdQibwtVpKlkYIKSJGrM2P3sXPIZZogoO+n Rzab3c4axJAyeHH+EWVmB8aIPQCaJpNEEAuoNfZRnK/7gAzxHcTrXgnyr/1DjsgpkZxt iRSvorX6nINTW4kZRTC2jH0zE9sPAAqcBXEAuQUV4v387UHnNG01a6xIUfEe6Ev6IhTy VF/9yWjXuCLn1bJSbk3NNb+L15vLjRg1386rHXRiBxVlYHilRg84zqjDm9h57NPQilmJ zSzcXr3jzitgbULLmGZ+ROE0SC0B2eDCPho6NVWIxKQhcTdebLWX6aoY8H//Zh2qAJ6x nDEA== Received: by 10.216.203.1 with SMTP id e1mr2538238weo.103.1350738004180; Sat, 20 Oct 2012 06:00:04 -0700 (PDT) Received: from [10.0.0.86] ([93.152.184.10]) by mx.google.com with ESMTPS id ay10sm9461879wib.2.2012.10.20.06.00.02 (version=TLSv1/SSLv3 cipher=OTHER); Sat, 20 Oct 2012 06:00:03 -0700 (PDT) Subject: Re: NFS server bottlenecks Mime-Version: 1.0 (Mac OS X Mail 6.1 \(1498\)) Content-Type: text/plain; charset=us-ascii From: Nikolay Denev In-Reply-To: Date: Sat, 20 Oct 2012 16:00:01 +0300 Content-Transfer-Encoding: quoted-printable Message-Id: References: <937460294.2185822.1350093954059.JavaMail.root@erie.cs.uoguelph.ca> <302BF685-4B9D-49C8-8000-8D0F6540C8F7@gmail.com> <0857D79A-6276-433F-9603-D52125CF190F@gmail.com> <6DAAB1E6-4AC7-4B08-8CAD-0D8584D039DE@gmail.com> <23D7CB3A-BD66-427E-A7F5-6C9D3890EE1B@gmail.com> To: Ivan Voras X-Mailer: Apple Mail (2.1498) Cc: "freebsd-hackers@freebsd.org Hackers" , Rick Macklem X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 20 Oct 2012 13:00:11 -0000 On Oct 20, 2012, at 3:11 PM, Ivan Voras wrote: > On 20 October 2012 13:42, Nikolay Denev wrote: >=20 >> Here are the results from testing both patches : = http://home.totalterror.net/freebsd/nfstest/results.html >> Both tests ran for about 14 hours ( a bit too much, but I wanted to = compare different zfs recordsize settings ), >> and were done first after a fresh reboot. >> The only noticeable difference seems to be much more context switches = with Ivan's patch. >=20 > Thank you very much for your extensive testing! >=20 > I don't know how to interpret the rise in context switches; as this is > kernel code, I'd expect no context switches. I hope someone else can > explain. >=20 > But, you have also shown that my patch doesn't do any better than > Rick's even on a fairly large configuration, so I don't think there's > value in adding the extra complexity, and Rick knows NFS much better > than I do. >=20 > But there are a few things other than that I'm interested in: like why > does your load average spike almost to 20-ties, and how come that with > 24 drives in RAID-10 you only push through 600 MBit/s through the 10 > GBit/s Ethernet. Have you tested your drive setup locally (AESNI > shouldn't be a bottleneck, you should be able to encrypt well into > Gbyte/s range) and the network? >=20 > If you have the time, could you repeat the tests but with a recent > Samba server and a CIFS mount on the client side? This is probably not > important, but I'm just curious of how would it perform on your > machine. The first iozone local run finished, I'll paste just the result here, = and also the same test over NFS for comparison: (This is iozone doing 8k sized IO ops, on ZFS dataset with = recordsize=3D8k) NFS: random = random bkwd record stride =20 KB reclen write rewrite read reread read = write read rewrite read =20 33554432 8 4973 5522 2930 2906 2908 = 3886 =20 Local: random = random bkwd record stride =20 KB reclen write rewrite read reread read = write read rewrite read =20 33554432 8 34740 41390 135442 142534 24992 = 12493 =20 P.S.: I forgot to mention that the network is with 9K mtu.=