From owner-freebsd-stable@FreeBSD.ORG Tue Jul 1 10:10:59 2008 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 5A2C21065677; Tue, 1 Jul 2008 10:10:59 +0000 (UTC) (envelope-from daniel_k_eriksson@telia.com) Received: from pne-smtpout1-sn2.hy.skanova.net (pne-smtpout1-sn2.hy.skanova.net [81.228.8.83]) by mx1.freebsd.org (Postfix) with ESMTP id 161E18FC1C; Tue, 1 Jul 2008 10:10:59 +0000 (UTC) (envelope-from daniel_k_eriksson@telia.com) Received: from royal64.emp.zapto.org (195.198.193.168) by pne-smtpout1-sn2.hy.skanova.net (7.3.129) id 483EBD680068BF85; Tue, 1 Jul 2008 11:01:19 +0200 MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Content-class: urn:content-classes:message Date: Tue, 1 Jul 2008 11:01:17 +0200 Message-ID: <4F9C9299A10AE74E89EA580D14AA10A61A1968@royal64.emp.zapto.org> X-MimeOLE: Produced By Microsoft Exchange V6.5.7235.2 X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: MCP55 SATA data corruption in FreeBSD 7 Thread-Index: AcjbWSPaqAfW4rtJQwiRClpfJFg2GA== From: "Daniel Eriksson" To: Cc: legioner.r@gmail.com, morten@lightworkings.dk, sos@FreeBSD.org Subject: MCP55 SATA data corruption in FreeBSD 7 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 01 Jul 2008 10:10:59 -0000 I am having problems with silent data corruption on (some) drives connected to an MCP55 SATA controller. I have two servers, both running RELENG_7_0/amd64. One has the 570 Ultra chipset, the other has 570 SLI. Both chipsets have the MCP55 SATA controller. The server with 570 Ultra chipset has a bunch of older 250GB SATA-150 drives hooked up to the MCP55 controller and it is working just fine. The server with 570 SLI chipset has a bunch of new SATA-300 drives hooked up to the MCP55 controller and it is giving me silent data corruption (easily detectable by running ZFS scrub, every time I run it new checksum errors show up). I know the drives are good because when they are hooked up to another controller they work just fine. Unfortunately the drives does not have a jumper for setting SATA-150 speed (they are Samsung 1 TB drives), and trying to force the drives to SATA-150 speed with the "patch" provided by the manufacturer does not seem to work (the drives still negotiate SATA-300 speed). I will try to get my hands on another older SATA-150 drive (or a new that can be jumpered) to verify if the culprit is the MCP55 revision (see below) or the interface speed. NOT working (570 SLI) --------------------- atapci1@pci0:0:5:0: class=3D0x010185 card=3D0x72501462 = chip=3D0x037f10de rev=3D0xa2 hdr=3D0x00 vendor =3D 'Nvidia Corp' device =3D 'MCP55 SATA Controller' class =3D mass storage subclass =3D ATA Working (570 Ultra) ------------------- atapci1@pci0:0:5:0: class=3D0x010185 card=3D0xcb8410de = chip=3D0x037f10de rev=3D0xa3 hdr=3D0x00 vendor =3D 'Nvidia Corp' device =3D 'MCP55 SATA Controller' class =3D mass storage subclass =3D ATA This is most likely related to kern/120296 (http://www.freebsd.org/cgi/query-pr.cgi?pr=3Dkern/120296) and = kern/121396 (http://www.freebsd.org/cgi/query-pr.cgi?pr=3Dkern/121396). If someone else is having data corruption problems with drives connected to an MCP55 controller it might be worth testing if limiting the drives to SATA-150 makes a difference. It will most likely take me a while before I can verify this. --- Daniel Eriksson (http://www.toomuchdata.com/)