From owner-freebsd-i386@FreeBSD.ORG Tue Sep 20 08:30:19 2005 Return-Path: X-Original-To: freebsd-i386@hub.freebsd.org Delivered-To: freebsd-i386@hub.freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id B314916A41F for ; Tue, 20 Sep 2005 08:30:19 +0000 (GMT) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [216.136.204.21]) by mx1.FreeBSD.org (Postfix) with ESMTP id 3CA4C43D45 for ; Tue, 20 Sep 2005 08:30:19 +0000 (GMT) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (gnats@localhost [127.0.0.1]) by freefall.freebsd.org (8.13.3/8.13.3) with ESMTP id j8K8UIJG096697 for ; Tue, 20 Sep 2005 08:30:18 GMT (envelope-from gnats@freefall.freebsd.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.13.3/8.13.1/Submit) id j8K8UI0v096696; Tue, 20 Sep 2005 08:30:18 GMT (envelope-from gnats) Resent-Date: Tue, 20 Sep 2005 08:30:18 GMT Resent-Message-Id: <200509200830.j8K8UI0v096696@freefall.freebsd.org> Resent-From: FreeBSD-gnats-submit@FreeBSD.org (GNATS Filer) Resent-To: freebsd-i386@FreeBSD.org Resent-Reply-To: FreeBSD-gnats-submit@FreeBSD.org, Thede Loder Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id A4AB816A41F for ; Tue, 20 Sep 2005 08:26:08 +0000 (GMT) (envelope-from nobody@FreeBSD.org) Received: from www.freebsd.org (www.freebsd.org [216.136.204.117]) by mx1.FreeBSD.org (Postfix) with ESMTP id 3401043D45 for ; Tue, 20 Sep 2005 08:26:08 +0000 (GMT) (envelope-from nobody@FreeBSD.org) Received: from www.freebsd.org (localhost [127.0.0.1]) by www.freebsd.org (8.13.1/8.13.1) with ESMTP id j8K8Q7g7014359 for ; Tue, 20 Sep 2005 08:26:07 GMT (envelope-from nobody@www.freebsd.org) Received: (from nobody@localhost) by www.freebsd.org (8.13.1/8.13.1/Submit) id j8K8Q7jO014354; Tue, 20 Sep 2005 08:26:07 GMT (envelope-from nobody) Message-Id: <200509200826.j8K8Q7jO014354@www.freebsd.org> Date: Tue, 20 Sep 2005 08:26:07 GMT From: Thede Loder To: freebsd-gnats-submit@FreeBSD.org X-Send-Pr-Version: www-2.3 Cc: Subject: i386/86364: ATA woes, SATA controller: failed writes, FS corruption, and system hang under heavy loads X-BeenThere: freebsd-i386@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: I386-specific issues for FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 20 Sep 2005 08:30:19 -0000 >Number: 86364 >Category: i386 >Synopsis: ATA woes, SATA controller: failed writes, FS corruption, and system hang under heavy loads >Confidential: no >Severity: serious >Priority: medium >Responsible: freebsd-i386 >State: open >Quarter: >Keywords: >Date-Required: >Class: sw-bug >Submitter-Id: current-users >Arrival-Date: Tue Sep 20 08:30:18 GMT 2005 >Closed-Date: >Last-Modified: >Originator: Thede Loder >Release: FreeBSD-6.0BETA5 >Organization: Paritive, Inc. >Environment: FreeBSD davros.loder.com 6.0-BETA5 FreeBSD 6.0-BETA5 #21: Sun Sep 18 22:34:20 PDT 2005 root@davros.loder.com:/usr/src/sys/i386/compile/DAVROS i386 >Description: Hi all. A little ATA trouble. I've been running an NFS client-driven stress test on a NFS exported file system. It seems just fine unless the FS is hosted on a drive attached to a PCI SATA controller, which is a Promise SATAII150 TX2plus. After a short period of time with the stress test (as little as a few seconds, as long as a minute or two), the exported drive simply hangs, eventually causing writes to timing out on the NFS client The drive device path /dev/ad4 remains visible in /dev, but calls to access the drive do not seem to return. 'umount'ing the filesystem on the hung drive freezes all ATA devices and hangs the system (I am not overclocked). A hard reboot is required to bring things back to normal. Not sure if data is being lost or not, but fsck always finds FS errors, and self reboot is not possible with the console reporting failed buffer writes. I have repeated the stress test using a filesystem on ATA100 drives hosted by the mainboard's VIA 8235 without any problems, so it seems to be specific to the PCI Promise Controller and it's drives. The drive itself is a Western Digital (WDC WD2500JD-50GBB0 02.05D02). Motherboard is a KT3 Ultra 2 with an AMD 1800+ on it. I'm happy to dig into it further and provide more specifics, but need some experienced advice as to where to instrument. >How-To-Repeat: Export, via NFS, a filesystem that is on a drive hosted by the SATA controller. Stress the filesystem (I used an import of mp3 files using iTunes). After a minute or two, (repeatable) the the kernel outputs "ad4: FAILURE - SETFEATURES SET TRANSFER MODE timed out", and the drive becomes unresponsive, halting the NFS activity. A subsequent 'umount' of the file system hangs all ATA devices on the system, preventing login or logout. If "reboot" is issued before the 'umount', the reboot process starts but hangs while flushing buffers. >Fix: >Release-Note: >Audit-Trail: >Unformatted: