From: Stefan Esser
To: Pete French
Cc: FreeBSD Stable Mailing List
Subject: Re: ZFS + mysql appears to be killing my SSD's
Date: Mon, 5 Jul 2021 15:37:09 +0200
Message-ID: <2fd9b7e4-dc75-fedc-28d7-b98191167e6b@freebsd.org>
In-Reply-To: <89c37c3e-22e8-006e-5826-33bd7db7739e@ingresso.co.uk>
List-Archive: https://lists.freebsd.org/archives/freebsd-stable

On 05.07.21 at 15:15, Pete French wrote:
> I have a network of FreeBSD machines which are running mysql on top of
> zfs. I have been doing this for a while, but a couple of years ago we
> switched to using SSD. After less than a year (I don't remember the
> exact timings), they all started to fail. We assumed a bad batch, and
> had them replaced, and didn't think anything more of it.
>
> A week or so ago, all the replacements started to fail. This was
> shortly after I upgraded to FreeBSD 13 and OpenZFS, but I think this is
> unrelated; however, it's one major change which happened before the
> most recent round of failures.
>
> The thing is, though, that I am not seeing any heavy activity on the
> drives. The load is sustained, but well below the lifetime write
> threshold for the drive. I also do not see the drives as being heavily
> in use when I run gstat. So it's perplexing. I am assuming it's related
> to the mysql load, as this is identical across all machines, and they
> are all dying within a few days of each other.
>
> Any insights would be appreciated... :-)

Hi Pete,

have you checked the drive state and statistics with smartctl?

This is the output that I get from my SSD after use as an L2ARC for 1 year:

$ smartctl -d nvme /dev/nvme0 -a
...
=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

SMART/Health Information (NVMe Log 0x02)
Critical Warning:                   0x00
Temperature:                        27 Celsius
Available Spare:                    100%
Available Spare Threshold:          5%
Percentage Used:                    1%
Data Units Read:                    11,745,658 [6.01 TB]
Data Units Written:                 14,767,823 [7.56 TB]
Host Read Commands:                 522,309,835
Host Write Commands:                69,368,834
Controller Busy Time:               1,198
Power Cycles:                       40
Power On Hours:                     8,514
Unsafe Shutdowns:                   28
Media and Data Integrity Errors:    0
Error Information Log Entries:      120
Warning  Comp. Temperature Time:    0
Critical Comp. Temperature Time:    0

Error Information (NVMe Log 0x01, 16 of 63 entries)
No Errors Logged

That drive is rated for 600 TB written (TBW), and I seem to have used
about 1% of that within that year of use.

Regards, Stefan
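
The roughly 1% wear figure can be cross-checked from the "Data Units
Written" counter in the smartctl output. The following is only a minimal
sketch: it assumes that one NVMe data unit is 1000 blocks of 512 bytes
(which is consistent with smartctl printing 14,767,823 units as 7.56 TB)
and uses the 600 TB rating quoted in the message. The function names are
illustrative and not part of any tool.

```python
# Assumption: one NVMe "data unit" is 1000 * 512 bytes. This matches the
# bracketed TB values that smartctl prints next to the raw counters.
BYTES_PER_DATA_UNIT = 1000 * 512


def data_units_to_tb(units: int) -> float:
    """Convert an NVMe data-unit counter to decimal terabytes (10**12 bytes)."""
    return units * BYTES_PER_DATA_UNIT / 1e12


def percent_of_rated_writes(units_written: int, rated_tbw: float) -> float:
    """Share of the rated write endurance consumed so far, in percent."""
    return 100.0 * data_units_to_tb(units_written) / rated_tbw


# Values taken from the smartctl output in the message.
written_tb = data_units_to_tb(14_767_823)
wear = percent_of_rated_writes(14_767_823, rated_tbw=600.0)
print(f"{written_tb:.2f} TB written, {wear:.2f}% of rated endurance")
```

For this drive the sketch gives about 7.56 TB written and about 1.26% of
the rated endurance, which is in line with the "Percentage Used: 1%"
field. Running the same calculation on the failing drives would show
whether they are anywhere near their rated write volume.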