From owner-freebsd-fs@freebsd.org Fri Feb 19 13:51:24 2016 Return-Path: Delivered-To: freebsd-fs@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id E30EDAAE0F5 for ; Fri, 19 Feb 2016 13:51:23 +0000 (UTC) (envelope-from killing@multiplay.co.uk) Received: from mail-wm0-x22c.google.com (mail-wm0-x22c.google.com [IPv6:2a00:1450:400c:c09::22c]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (Client CN "smtp.gmail.com", Issuer "Google Internet Authority G2" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 8387718BE for ; Fri, 19 Feb 2016 13:51:23 +0000 (UTC) (envelope-from killing@multiplay.co.uk) Received: by mail-wm0-x22c.google.com with SMTP id c200so76628383wme.0 for ; Fri, 19 Feb 2016 05:51:22 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=multiplay-co-uk.20150623.gappssmtp.com; s=20150623; h=subject:to:references:cc:from:message-id:date:user-agent :mime-version:in-reply-to:content-type:content-transfer-encoding; bh=i5WCWXq8CjP1Gn47s++4VTJJPjct0tkUZThdIMRKcpw=; b=bw/dEFMZXTXcwVOhE5VLEzc9v0jSIIeBraWW6Sq0LZK6GgZ/Ft/vCp1T/jvrVDm+bk Qun5zojzL0FEhRcmMlwXIShiydzBL/3VvAfgKsgjd3Npl9zw5q+Y/3ZnXf60FOl/gjeW XAL592xe0UMe3OqGK7KuRq0eoXl94hSMXvCurFfgCGBTP5ow/WprAwihq107j7UNR8BD 2Ag3dV+g3xMVoFDvXtWq5AD7eKtXaPBKi/cdVNmNrrihPj9BU1FoGNm+YDqzPpyXZ0wQ yUITTr4OzXurTIpZgAMza3LWgXbJRGUkskH/4vmo/MhcUSaNReAuX0LnVD3++kZibAuk iV7Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:subject:to:references:cc:from:message-id:date :user-agent:mime-version:in-reply-to:content-type :content-transfer-encoding; bh=i5WCWXq8CjP1Gn47s++4VTJJPjct0tkUZThdIMRKcpw=; b=li+YuLz6F9UPHI8P+I5yl90UkyQO03I3wVXm4B73ukUEIMSYrFwwIGL9Y8Ihu7sXrH MGnR5g6Plo/iRdvKx2mfPBB9kM7gR1r6VROqG0gAW7fQUyrH5a/TIXcVe4hu4N4Ad/4C uI5x1RL1MAJZ/oVsseZ72L1T9Y+h4vasyb27+NUjd5Bp0U18Sj5pL0bOpy+oPzjYpZTQ /2NRw9MjH45mmMTdOIgRHWkTnniMNZUp9iVvGE6IJevV3HgOiL/aenojcWKxjw5ywWys L4HeHwpmsIHi+hKtr1sil1fl/V/w5W3goE3cDsZSuDhJDQ1HughiMZH66gFoHeV6ruHs hxFw== X-Gm-Message-State: AG10YORMR93nfuky2tzGh7WEyK8E4OltCJw2vlRDzLMZ2HgTOkZK9MXO8mm0gbL/UAJcK5b4 X-Received: by 10.194.133.201 with SMTP id pe9mr13213100wjb.101.1455889881514; Fri, 19 Feb 2016 05:51:21 -0800 (PST) Received: from [10.10.1.58] (liv3d.labs.multiplay.co.uk. [82.69.141.171]) by smtp.gmail.com with ESMTPSA id i5sm11328646wja.23.2016.02.19.05.51.19 (version=TLSv1/SSLv3 cipher=OTHER); Fri, 19 Feb 2016 05:51:19 -0800 (PST) Subject: Re: Zfs heavy io writing | zfskern txg_thread_enter To: Michelle Sullivan References: <56C70365.1050800@sorbs.net> <56C70AB0.6050400@multiplay.co.uk> <56C71350.3020602@sorbs.net> Cc: freebsd-fs@freebsd.org From: Steven Hartland Message-ID: <56C71DD8.3040505@multiplay.co.uk> Date: Fri, 19 Feb 2016 13:51:20 +0000 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:38.0) Gecko/20100101 Thunderbird/38.6.0 MIME-Version: 1.0 In-Reply-To: <56C71350.3020602@sorbs.net> Content-Type: text/plain; charset=windows-1252; format=flowed Content-Transfer-Encoding: 8bit X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 19 Feb 2016 13:51:24 -0000 On 19/02/2016 13:06, Michelle Sullivan wrote: > Steven Hartland wrote: >> >> On 19/02/2016 11:58, Michelle Sullivan wrote: >>> Niccolò Corvini wrote: >>>> Hi, first time here! >>>> We are having a problem with a server running FreeBsd 9.1 with ZFS on a >>>> >>> You should upgrade to a supported version first... 9.3 would probably >>> be the best (rather than 10.x) as it's still supported and uses the same >>> ABI (ie you should need to reinstall all your ports/packages - though >>> you should because it sometimes breaks things - at least check for >>> broken things :) .) >>> >>> If you're not familiar "freebsd-update -r 9.3-RELEASE upgrade" will help >>> you do it without too many problems. >> 9.3 is still ancient, and while "supported" its not in active >> development, and to be blunt no one will be interested in helping to >> diagnose any actual issue on something so old. > So supported is not really supported... Is that an official position? Supported for 9.x, which is a "Legacy Release", I would say is supported for security and other critical issues only, which is the same for pretty much every project / product out there. > >> 10.x has a totally different ZFS IO scheduler for example, so its >> differently for most workloads. > But the user is on 9.x not 10.x and 10.x changes a lot more than just > the ZFS IO scheduler. If this is a production machine, then an upgrade > to 9.3 may be easier as it would require less regression testing.... Or > is this another case of people don't run FreeBSD in production > environments so it doesn't matter...? Yes but 9.x is already legacy and becomes unsupported in December of this year, so the process of migration to 10.x should be well on the way by now tbh. > >>>> single sata drive. Since a few days ago, in the morning the system >>>> becomes >>>> really slow due of a really heavy io writing. We investigated and we >>>> think >>>> it might start at night, maybe correlated to to crondaily (standard) >>>> but we >>>> are not sure. After a few hours the situation returns to normal. >>>> >>> Yeah this sounds like something I am quite familiar with... It's the >>> security check cronjob that runs every day... its looking for any >>> setuid/setgid files, new/modified files...etc... across all file systems >> This is quite likely, so while updating to 10 may not fix the issue >> running on 9.x. > Which means you just told the user to do something that is not likely to > fix the issue but will give them more problems to deal with so they > might forget the original problem in the mean time... You know this was > the reason I was the Technical Lead for the support teams first in > Europe then in AsiaPAC for Netscape back in the day, and why I never > worked for Microsoft... You diagnose the problem as best as possible > with as minimal changes to the system at first, then if all else fails > or you come across evidence that points to a known bug that you know is > fixed you tell them to upgrade to the latest *supported* version > >> Be aware that 10.3-BETA2 has a known issue related to vnode memory >> usage which can be triggered by such workloads so trying BETA3 when >> released, which should address this would be a good idea. >> > ...supported version... i.e. *NOT* a BETA release - especially if that > beta release has other known issues that might well trigger on the very > problem they indicated...! > > Seriously, sorry to pick on you, but "Upgrade to 10.x Beta as it might > help" is not the *first* answer anyone should give... you might as well > have told him to upgrade to 11.... because that has as much chance of > fixing the problem... Beta and soon to become RC, so no its nothing like 11. If no one bothered to update and test until release then you can't expect a good result at the end of the day now can you. There's a reason why people like Netflix run code close to head. Gleb Smirnoff iirc has Youtube vid talking about it, you should watch it. Obviously when updating to any new version, be that on mission critical system or dev test box, then the implications must be considered. At the end of the day it doesn't change the fact that if any investigation results in the issue requiring code changes, you're not going to get that running 9.x, and it may already be fixed, so you'll have wasted your time. Yes you could spend lots of time investigating the problem to come to that conclusion and be at a dead end or you could get on a version which is being actively developed and hence help that process along, which is something that's going to need to be done anyway, so why not take a step in the right direction. At the end of the day its a balancing act and something that only the user can decide given the relevant information. Regards Steve