From owner-freebsd-i386@FreeBSD.ORG Wed Jul 2 10:20:02 2003
Return-Path:
Delivered-To: freebsd-i386@hub.freebsd.org
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
	by hub.freebsd.org (Postfix) with ESMTP id 075CD37B401
	for ; Wed, 2 Jul 2003 10:20:02 -0700 (PDT)
Received: from freefall.freebsd.org (freefall.freebsd.org [216.136.204.21])
	by mx1.FreeBSD.org (Postfix) with ESMTP id EAA1C43FBD
	for ; Wed, 2 Jul 2003 10:20:00 -0700 (PDT)
	(envelope-from gnats@FreeBSD.org)
Received: from freefall.freebsd.org (gnats@localhost [127.0.0.1])
	by freefall.freebsd.org (8.12.9/8.12.9) with ESMTP id h62HK0Up069267
	for ; Wed, 2 Jul 2003 10:20:00 -0700 (PDT)
	(envelope-from gnats@freefall.freebsd.org)
Received: (from gnats@localhost)
	by freefall.freebsd.org (8.12.9/8.12.9/Submit) id h62HK00E069266;
	Wed, 2 Jul 2003 10:20:00 -0700 (PDT)
Resent-Date: Wed, 2 Jul 2003 10:20:00 -0700 (PDT)
Resent-Message-Id: <200307021720.h62HK00E069266@freefall.freebsd.org>
Resent-From: FreeBSD-gnats-submit@FreeBSD.org (GNATS Filer)
Resent-To: freebsd-i386@FreeBSD.org
Resent-Reply-To: FreeBSD-gnats-submit@FreeBSD.org, Nathan Gardner
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
	by hub.freebsd.org (Postfix) with ESMTP id 8078137B401
	for ; Wed, 2 Jul 2003 10:16:42 -0700 (PDT)
Received: from eclipse.inwa.net (eclipse.inwa.net [66.228.198.133])
	by mx1.FreeBSD.org (Postfix) with SMTP id B984343F93
	for ; Wed, 2 Jul 2003 10:16:39 -0700 (PDT)
	(envelope-from nathan@eclipse.inwa.net)
Received: (qmail 90048 invoked by uid 1000); 26 Jun 2003 19:49:52 -0000
Message-Id: <20030626194952.90047.qmail@eclipse.inwa.net>
Date: 26 Jun 2003 19:49:52 -0000
From: Nathan Gardner
To: FreeBSD-gnats-submit@FreeBSD.org
X-Send-Pr-Version: 3.113
cc: support@inwa.net
Subject: i386/54033: Disk lockup.
X-BeenThere: freebsd-i386@freebsd.org
X-Mailman-Version: 2.1.1
Precedence: list
Reply-To: Nathan Gardner
List-Id: I386-specific issues for FreeBSD
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
X-List-Received-Date: Wed, 02 Jul 2003 17:20:02 -0000

>Number:         54033
>Category:       i386
>Synopsis:       Disk lockup.
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    freebsd-i386
>State:          open
>Quarter:
>Keywords:
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Wed Jul 02 10:20:00 PDT 2003
>Closed-Date:
>Last-Modified:
>Originator:     Nathan Gardner
>Release:        FreeBSD 4.7-RELEASE i386
>Organization:
InWa.net
>Environment:
System: FreeBSD eclipse 4.7-RELEASE FreeBSD 4.7-RELEASE #1: Tue Jun 24 16:57:46 PDT 2003 nathan@host.inwa.net:/usr/src/sys/compile/eclipse.new i386

host# atacontrol list
ATA channel 0:
    Master:  ad0 ATA/ATAPI rev 7
    Slave:       no device present
ATA channel 1:
    Master: acd0 ATA/ATAPI rev 0
    Slave:       no device present
ATA channel 2:
    Master:  ad4 ATA/ATAPI rev 5
    Slave:   ad5 ATA/ATAPI rev 5
ATA channel 3:
    Master:      no device present
    Slave:       no device present
host#

>Description:
After about two weeks of uptime, processes accessing the single hard
disk (/dev/ad0) hang. ps reports the process status as D (disk wait).
The processes cannot be killed, and they never finish. If a hung
process is system critical, the system crashes; if it is not, the
system must still be rebooted before the processes go away and the disk
becomes accessible again. Once one process hangs, any other process
that tries to access the drive hangs as well.

When the system comes back up, the drive seems to work fine, and I have
not found anything in my logs that shows what the cause of the problem
might be. Because the processes hang in disk writes, I haven't been
able to narrow down the cause of the crash with any more certainty.

I had been using the drive for swap and backup (via tar), but after
several crashes I moved swap off the drive. Now it only does backups,
and a hang no longer crashes the whole system.

The current drive is the second hard drive I have tried. The first one
was also a Maxtor drive (an 80GB drive, whereas the new one is 120GB).
I assumed that the error was hardware related, so I swapped the first
drive out and brought it home for testing. The manufacturer's tests
show that there is nothing wrong with the drive. I have not yet been
able to try the system with a drive from a different manufacturer.

>How-To-Repeat:
Use a Maxtor drive regularly for backups for a couple of weeks. Watch
it hang, then reboot.

Contents of /var/run/dmesg.boot
--begin--
Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD 4.7-RELEASE #1: Tue Jun 24 16:57:46 PDT 2003
    nathan@host.inwa.net:/usr/src/sys/compile/eclipse.new
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (1399.33-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x6b1  Stepping = 1
  Features=0x383fbff
real memory  = 1073676288 (1048512K bytes)
avail memory = 1040371712 (1015988K bytes)
Programming 24 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000
 cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id: 2, version: 0x00178011, at 0xfec00000
Preloaded elf kernel "kernel" at 0xc0395000.
ccd0-3: Concatenated disk drivers
Pentium Pro MTRR support enabled
md0: Malloc disk
Using $PIR table, 8 entries at 0xc00fdc60
npx0: on motherboard
npx0: INT 16 interface
pcib0: on motherboard
IOAPIC #0 intpin 11 -> irq 2
pci0: on pcib0
pcib1: at device 1.0 on pci0
pci1: on pcib1
pci1: at 0.0
atapci0: port 0xb000-0xb03f,0xac00-0xac03,0xa800-0xa807,0xa400-0xa403,0xa000-0xa007 mem 0xf8100000-0xf811ffff irq 2 at device 12.0 on pci0
ata2: at 0xa000 on atapci0
ata3: at 0xa800 on atapci0
fxp0: port 0xb400-0xb43f mem 0xf8000000-0xf80fffff,0xf8120000-0xf8120fff irq 5 at device 13.0 on pci0
fxp0: Ethernet address 00:30:48:41:53:be
inphy0: on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
isab0: at device 17.0 on pci0
isa0: on isab0
atapci1: port 0xb800-0xb80f at device 17.1 on pci0
ata0: at 0x1f0 irq 14 on atapci1
ata1: at 0x170 irq 15 on atapci1
pci0: (vendor=0x1106, dev=0x3065) at 18.0 irq 10
orm0: