From: Charles Erik McDonald
Date: Wed, 16 Feb 2005 14:02:12 +0100
To: freebsd-stable@freebsd.org
Subject: Strange disk problems make the system lock up
Message-ID: <42134454.7030306@mcdonald.no>
List-Id: Production branch of FreeBSD source code

Hi,

I have a problem that arises once in a while with my IDE drives under
FreeBSD 5.3.

uname:
FreeBSD bastion 5.3-RELEASE-p5 FreeBSD 5.3-RELEASE-p5 #23: Mon Feb 7 13:28:32 UTC 2005

I have two large disks:

/dev/ad2    144G    124G    9.2G    93%    /mnt/sp1604n
/dev/ad3    226G    202G    5.2G    98%    /mnt/wd2500pb

Each has only one partition. fdisk on /dev/ad3 gives the following
information:

The data for partition 1 is:
sysid 165 (0xa5),(FreeBSD/NetBSD/386BSD)
    start 63, size 488392002 (238472 Meg), flag 80 (active)
        beg: cyl 0/ head 1/ sector 1;
        end: cyl 1023/ head 254/ sector 63

Both disks are in UDMA33
mode, checked with atacontrol. Fsck does not report any errors on either
drive, nor does smartctl.

The problem is that sometimes one of the drives will "disappear":
/mnt/wd2500pb (or the other) simply becomes inaccessible.

bastion(root):/var/run#cd /mnt/wd2500pb
-su: cd: /mnt/wd2500pb: Not a directory

The strange thing is that 'df' still reports the drive, and I can fsck it.
The root fs that holds /mnt/ doesn't report any errors with fsck, but
when I try to list /mnt, the mountpoint for the drive is reported as
having a bad file descriptor:

bastion(root):~#ls -l /mnt/
ls: wd2500pb: Bad file descriptor
total 8
drwxrwxrwx  28 root  operator  1536 Feb 14 22:40 sp1604n
drwxr-xr-x   2 root  wheel      512 Jan 17  2004 usb_thumbdrive

Yesterday I unmounted both drives, removed the directories/mountpoints
and recreated them both, but this morning the same thing happened, only
now on the other drive (sp1604n).

When I try to unmount the affected drive, the whole system freezes and I
am forced to do a cold boot. It does not freeze or lock up if I check
the fs, only when I try to unmount.

Hope this is valid for this group.

So long,
C.Erik

dmesg.boot:

Copyright (c) 1992-2004 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD 5.3-RELEASE-p5 #23: Mon Feb 7 13:28:32 UTC 2005
    root@bastion:/usr/obj/usr/src/sys/BASTION
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Pentium III/Pentium III Xeon/Celeron (548.54-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x673  Stepping = 3
  Features=0x383fbff
real memory  = 805306368 (768 MB)
avail memory = 778305536 (742 MB)
ACPI APIC Table:
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): APIC ID: 1
 cpu1 (AP): APIC ID: 0
MADT: Forcing active-low polarity and level trigger for SCI
ioapic0 irqs 0-23 on motherboard
npx0: [FAST]
npx0: on motherboard
npx0: INT 16 interface
acpi0: on motherboard
Timecounter "ACPI-safe" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0xf808-0xf80b on acpi0
cpu0: on acpi0
cpu1: on acpi0
pcib0: port 0xcf8-0xcff on acpi0
pci0: on pcib0
agp0: mem 0x54000000-0x57ffffff at device 0.0 on pci0
pcib1: at device 1.0 on pci0
pci1: on pcib1
pci1: at device 0.0 (no driver attached)
pci0: at device 11.0 (no driver attached)
fxp0: port 0x5480-0x549f mem 0x52000000-0x520fffff,0x52400000-0x52400fff irq 20 at device 12.0 on pci0
miibus0: on fxp0
inphy0: on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:50:8b:72:a0:f4
xl0: <3Com 3c905C-TX Fast Etherlink XL> port 0x5400-0x547f mem 0x52300000-0x5230007f irq 17 at device 13.0 on pci0
miibus1: on xl0
ukphy0: on miibus1
ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
xl0: Ethernet address: 00:04:75:7c:dd:d2
isab0: at device 20.0 on pci0
isa0: on isab0
atapci0: port 0x54c0-0x54cf,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 20.1 on pci0
ata0: channel #0 on atapci0
ata1: channel #1 on atapci0
uhci0: port 0x54a0-0x54bf irq 19 at device 20.2 on pci0
uhci0: [GIANT-LOCKED]
usb0: on uhci0
usb0: USB revision 1.0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
pci0: at device 20.3 (no driver attached)
acpi_button0: on acpi0
atkbdc0: port 0x64,0x60 irq 1 on acpi0
atkbd0: flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
ppc0: port 0x778-0x77d,0x378-0x37f irq 7 drq 3 on acpi0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/8 bytes threshold
ppbus0: on ppc0
plip0: on ppbus0
lpt0: on ppbus0
lpt0: Interrupt-driven port
ppi0: on ppbus0
sio0: port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
sio1: port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
fdc0: port 0x3f0-0x3f5 irq 6 drq 2 on acpi0
fdc0: [FAST]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
orm0: at iomem 0xec000-0xeffff,0xcc000-0xcc7ff,0xc8000-0xcbfff,0xc0000-0xc7fff on isa0
pmtimer0 on isa0
sc0: at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounters tick every 10.000 msec
ad0: 114473MB [232581/16/63] at ata0-master UDMA33
ad2: 152627MB [310101/16/63] at ata1-master UDMA33
ad3: 238475MB [484521/16/63] at ata1-slave UDMA33
SMP: AP CPU #1 Launched!
Mounting root from ufs:/dev/ad0s1a
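
The symptom described in this report (a mount point that 'df' still lists but that fails on cd or ls) could be caught early by a small periodic check, so the moment it starts can be matched against the logs. What follows is only a minimal sketch, assuming Python is available on the host. The function name check_mountpoint is hypothetical, and the two paths are simply the mount points from the df output above. It does not touch umount, since that is the operation reported to freeze the system.

```python
import os
import stat


def check_mountpoint(path):
    """Return (ok, reason) for a path that should be a usable directory."""
    # A failing stat() covers errors such as "Bad file descriptor".
    try:
        info = os.stat(path)
    except OSError as exc:
        return False, "stat failed: " + str(exc)
    # Mirrors the "Not a directory" error seen with cd.
    if not stat.S_ISDIR(info.st_mode):
        return False, "not a directory"
    # Listing the directory exercises the mounted file system itself.
    try:
        os.listdir(path)
    except OSError as exc:
        return False, "listing failed: " + str(exc)
    return True, "ok"


if __name__ == "__main__":
    # Mount points taken from the df output in the report.
    for mountpoint in ("/mnt/sp1604n", "/mnt/wd2500pb"):
        ok, reason = check_mountpoint(mountpoint)
        print(mountpoint, "OK" if ok else "FAILED", reason)
```

Run from cron with the output appended to a log file, this would record roughly when a mount point stops responding, which may help narrow down what triggers the problem.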