From: Olivier Smedts <olivier@gid0.org>
To: Martin Matuska
Cc: freebsd-current@freebsd.org
Date: Sun, 22 Aug 2010 17:44:52 +0200
Subject: Re: [CFT] Improved ZFS metaslab code (faster write speed)
In-Reply-To: <4C713EF5.8080402@FreeBSD.org>
References: <4C713EF5.8080402@FreeBSD.org>

2010/8/22 Martin Matuska:
> Dear FreeBSD community,
>
> many of our [2] (and Solaris [3]) users today are complaining about slow
> ZFS writes. One of the causes of these slow writes is the selection of the
> allocation method used for new blocks [3] [4]. Another
> issue is a write slowdown during TXG sync times.
>
> Solaris 10 (and OpenSolaris up to November 2009) have the
> following scenario:
>
> - pool has more than 30% free space: use first fit method [1]
> - pool has less than 30% free space: use best fit method [1]
>
> This causes a major slowdown of the writes if we go below 30% of free
> space. On large pools, 30% may be terabytes of free space.
>
> OpenSolaris changed this in November 2009 and the Oracle Storage
> Appliances also included the new code in Q1/2010 [1].
>
> The source [1] states that with this change they achieved a speedup
> of: "50% Improved OLTP Performance, 70% Reduced Variability, 200%
> Improvement on MS Exchange"
>
> I would like to issue a Call For Testing for the following 9-CURRENT patch:
> http://people.freebsd.org/~mm/patches/zfs/zfs_metaslab.patch
>
> To apply the patch against 8-STABLE, you need to apply the v15 update first:
> http://people.freebsd.org/~mm/patches/zfs/v15/stable-8-v15.patch

As of a few minutes ago, this one no longer applies cleanly:

# svn log -l 1 sys/cddl/contrib/opensolaris/uts/common/fs/zfs/arc.c
------------------------------------------------------------------------
r211599 | avg | 2010-08-22 10:18:32 +0200 (Dim 22 août 2010) | 7 lignes

Fix a mismerge in r211581, MFC of r210427

This is a direct commit.

Reported by: many
Pointyhat to: avg
------------------------------------------------------------------------

But it does not seem hard to correct. Do you want me to submit an updated
patch for 8-stable?
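
For anyone following along, the allocation policy described in the quoted text (first fit above 30% free space, best fit below it) can be sketched roughly as below. This is only an illustration of that description, not the actual metaslab code: the names `first_fit`, `best_fit`, `pick_offset` and `FREE_THRESHOLD`, and the flat list of `(offset, length)` free segments, are assumptions made up for this example.

```python
from typing import List, Optional, Tuple

# Hypothetical representation: a free region as (offset, length).
Segment = Tuple[int, int]

# The 30% figure comes from the quoted description; the name is made up.
FREE_THRESHOLD = 0.30


def first_fit(free: List[Segment], size: int) -> Optional[int]:
    """Return the offset of the lowest-addressed free segment that fits."""
    for offset, length in sorted(free):
        if length >= size:
            return offset
    return None


def best_fit(free: List[Segment], size: int) -> Optional[int]:
    """Return the offset of the smallest free segment that still fits."""
    candidates = [(length, offset) for offset, length in free if length >= size]
    if not candidates:
        return None
    # min() compares by length first, then by offset as a tie-breaker.
    return min(candidates)[1]


def pick_offset(free: List[Segment], size: int, pool_size: int) -> Optional[int]:
    """Choose the allocator according to the fraction of free space."""
    free_fraction = sum(length for _, length in free) / pool_size
    if free_fraction > FREE_THRESHOLD:
        return first_fit(free, size)
    return best_fit(free, size)


if __name__ == "__main__":
    segments = [(0, 8), (100, 4), (200, 16)]
    # 28 of 50 units free: above the threshold, so first fit is used.
    print(pick_offset(segments, 4, pool_size=50))
    # 28 of 1000 units free: below the threshold, so best fit is used.
    print(pick_offset(segments, 4, pool_size=1000))
```

Best fit has to consider every candidate segment, while first fit stops at the first match, which is one plausible reading of why crossing the threshold makes writes noticeably slower.
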
> The patch includes the following OpenSolaris onnv revisions:
> 10921 (partial), 11146, 11728, 12047
>
> And covers the following Bug IDs:
> 6826241 Sync write IOPS drops dramatically during TXG sync
> 6869229 zfs should switch to shiny new metaslabs more frequently
> 6917066 zfs block picking can be improved
> 6918420 zdb -m has issues printing metaslab statistics
>
> References:
> [1] http://blogs.sun.com/roch/entry/doubling_exchange_performance
> [2] http://forums.freebsd.org/showthread.php?t=8270
> [3] http://blogs.everycity.co.uk/alasdair/2010/07/zfs-runs-really-slowly-when-free-disk-usage-goes-above-80/
> [4] http://blogs.sun.com/bonwick/entry/zfs_block_allocation
> [5] http://blogs.sun.com/bonwick/entry/space_maps

-- 
Olivier Smedts                                                 _
                                        ASCII ribbon campaign ( )
e-mail: olivier@gid0.org        - against HTML email & vCards  X
www: http://www.gid0.org    - against proprietary attachments / \

  "There are only 10 kinds of people in the world:
  those who understand binary,
  and those who don't."