From nobody Mon Jul 5 14:30:43 2021 X-Original-To: freebsd-stable@mlmmj.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 72D8C11D059D for ; Mon, 5 Jul 2021 14:30:48 +0000 (UTC) (envelope-from petefrench@ingresso.co.uk) Received: from constantine.ingresso.co.uk (constantine.ingresso.co.uk [IPv6:2001:470:6a18:411::3]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 4GJSlr2xHXz3qQK; Mon, 5 Jul 2021 14:30:48 +0000 (UTC) (envelope-from petefrench@ingresso.co.uk) Received: from [2001:470:6cc4:1:cd6:5836:ddba:7b54] (helo=balta.drayhouse.twisted.org.uk) by constantine.ingresso.co.uk with esmtpsa (TLS1.3) tls TLS_AES_128_GCM_SHA256 (Exim 4.94.2 (FreeBSD)) (envelope-from ) id 1m0PcU-000Nm1-2P; Mon, 05 Jul 2021 14:30:46 +0000 Subject: Re: ZFS + mysql appears to be killing my SSD's To: Stefan Esser Cc: FreeBSD Stable Mailing List References: <89c37c3e-22e8-006e-5826-33bd7db7739e@ingresso.co.uk> <2fd9b7e4-dc75-fedc-28d7-b98191167e6b@freebsd.org> From: Pete French Message-ID: <9c71d627-55b8-2464-6cc9-489e4ce98049@ingresso.co.uk> Date: Mon, 5 Jul 2021 15:30:43 +0100 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:78.0) Gecko/20100101 Thunderbird/78.11.0 List-Id: Production branch of FreeBSD source code List-Archive: https://lists.freebsd.org/archives/freebsd-stable List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-freebsd-stable@freebsd.org X-BeenThere: freebsd-stable@freebsd.org MIME-Version: 1.0 In-Reply-To: <2fd9b7e4-dc75-fedc-28d7-b98191167e6b@freebsd.org> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-GB Content-Transfer-Encoding: 7bit X-Rspamd-Queue-Id: 4GJSlr2xHXz3qQK X-Spamd-Bar: ---- Authentication-Results: mx1.freebsd.org; none X-Spamd-Result: default: False [-4.00 / 15.00]; REPLY(-4.00)[] X-Spam: Yes X-ThisMailContainsUnwantedMimeParts: N On 05/07/2021 14:37, Stefan Esser wrote: > Hi Pete, > > have you checked the drive state and statistics with smartctl? Hi, thanks for the reply - yes, I did check the statistics, and they dont make a lot of sense. I was just looking at them again in fact. So, one of the machines that we chnaged a drive on when this first started, which was 4 weeks ago. root@telehouse04:/home/webadmin # smartctl -a /dev/ada0 | grep Perc 169 Remaining_Lifetime_Perc 0x0000 082 082 000 Old_age Offline - 82 root@telehouse04:/home/webadmin # smartctl -a /dev/ada1 | grep Perc 202 Percent_Lifetime_Remain 0x0030 100 100 001 Old_age Offline - 0 Now, from that you might think the 2nd drive was the one changes, but no. Its the first one, which is now at 82% lifetime remaining! The other druve, still at 100%, has been in there a year. The drives are different manufacturers, which makes comparing most of the numbers tricky unfortunately. Am now even more worried than when I sent the first email - if that 18% is accurate then I am going to be doing this again in another 4 months, and thats not sustainable. It also looks as if this problem has got a lot worse recently. Though I wasnt looking at the numbers before, only noticing tyhe failurses. If I look at 'Percentage Used Endurance Indicator' isntead of the 'Percent_Lifetime_Remain' value then I see some of those well over 200%. That value is, on the newer drives, 100 minus the 'Percent_Lifetime_Remain' value, so I guess they ahve the same underlying metric. I didnt mention in my original email, but I am encrypting these with geli. Does geli do any write amplification at all ? That might explain the high write volumes... -pete.