From owner-svn-src-all@FreeBSD.ORG Sun Sep 7 16:01:51 2014 Return-Path: Delivered-To: svn-src-all@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 6B609FA4; Sun, 7 Sep 2014 16:01:51 +0000 (UTC) Received: from zxy.spb.ru (zxy.spb.ru [195.70.199.98]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 2310815D2; Sun, 7 Sep 2014 16:01:51 +0000 (UTC) Received: from slw by zxy.spb.ru with local (Exim 4.82 (FreeBSD)) (envelope-from ) id 1XQeuT-000LaH-9L; Sun, 07 Sep 2014 20:01:49 +0400 Date: Sun, 7 Sep 2014 20:01:49 +0400 From: Slawa Olhovchenkov To: Steven Hartland Subject: Re: svn commit: r256956 - in head/sys: cam/ata cam/scsi cddl/contrib/opensolaris/uts/common/fs/zfs cddl/contrib/opensolaris/uts/common/fs/zfs/sys geom sys Message-ID: <20140907160149.GC72244@zxy.spb.ru> References: <201310230954.r9N9sxAZ011750@svn.freebsd.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <201310230954.r9N9sxAZ011750@svn.freebsd.org> User-Agent: Mutt/1.5.23 (2014-03-12) X-SA-Exim-Connect-IP: X-SA-Exim-Mail-From: slw@zxy.spb.ru X-SA-Exim-Scanned: No (on zxy.spb.ru); SAEximRunCond expanded to false Cc: svn-src-head@freebsd.org, svn-src-all@freebsd.org, src-committers@freebsd.org X-BeenThere: svn-src-all@freebsd.org X-Mailman-Version: 2.1.18-1 Precedence: list List-Id: "SVN commit messages for the entire src tree \(except for " user" and " projects" \)" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 07 Sep 2014 16:01:51 -0000 On Wed, Oct 23, 2013 at 09:54:59AM +0000, Steven Hartland wrote: > Author: smh > Date: Wed Oct 23 09:54:58 2013 > New Revision: 256956 > URL: http://svnweb.freebsd.org/changeset/base/256956 > > Log: > Improve ZFS N-way mirror read performance by using load and locality > information. > > The existing algorithm selects a preferred leaf vdev based on offset of the zio > request modulo the number of members in the mirror. It assumes the devices are > of equal performance and that spreading the requests randomly over both drives > will be sufficient to saturate them. In practice this results in the leaf vdevs > being under utilized. > > The new algorithm takes into the following additional factors: > * Load of the vdevs (number outstanding I/O requests) > * The locality of last queued I/O vs the new I/O request. > > Within the locality calculation additional knowledge about the underlying vdev > is considered such as; is the device backing the vdev a rotating media device. > > This results in performance increases across the board as well as significant > increases for predominantly streaming loads and for configurations which don't > have evenly performing devices. > > The following are results from a setup with 3 Way Mirror with 2 x HD's and > 1 x SSD from a basic test running multiple parrallel dd's. > > With pre-fetch disabled (vfs.zfs.prefetch_disable=1): > > == Stripe Balanced (default) == > Read 15360MB using bs: 1048576, readers: 3, took 161 seconds @ 95 MB/s > == Load Balanced (zfslinux) == > Read 15360MB using bs: 1048576, readers: 3, took 297 seconds @ 51 MB/s > == Load Balanced (locality freebsd) == > Read 15360MB using bs: 1048576, readers: 3, took 54 seconds @ 284 MB/s > > With pre-fetch enabled (vfs.zfs.prefetch_disable=0): > > == Stripe Balanced (default) == > Read 15360MB using bs: 1048576, readers: 3, took 91 seconds @ 168 MB/s > == Load Balanced (zfslinux) == > Read 15360MB using bs: 1048576, readers: 3, took 108 seconds @ 142 MB/s > == Load Balanced (locality freebsd) == > Read 15360MB using bs: 1048576, readers: 3, took 48 seconds @ 320 MB/s > > In addition to the performance changes the code was also restructured, with > the help of Justin Gibbs, to provide a more logical flow which also ensures > vdevs loads are only calculated from the set of valid candidates. > > The following additional sysctls where added to allow the administrator > to tune the behaviour of the load algorithm: > * vfs.zfs.vdev.mirror.rotating_inc > * vfs.zfs.vdev.mirror.rotating_seek_inc > * vfs.zfs.vdev.mirror.rotating_seek_offset > * vfs.zfs.vdev.mirror.non_rotating_inc > * vfs.zfs.vdev.mirror.non_rotating_seek_inc > > These changes where based on work started by the zfsonlinux developers: > https://github.com/zfsonlinux/zfs/pull/1487 > > Reviewed by: gibbs, mav, will > MFC after: 2 weeks > Sponsored by: Multiplay No MFC yet.