From owner-freebsd-stable Sat Aug 5 1:11:27 2000 Delivered-To: freebsd-stable@freebsd.org Received: from campbell.cwx.net (Campbell.cwx.net [216.17.176.12]) by hub.freebsd.org (Postfix) with ESMTP id B4DE137B607 for ; Sat, 5 Aug 2000 01:11:17 -0700 (PDT) (envelope-from allenc@verinet.com) Received: from verinet.com (pragma. [192.168.1.2]) by campbell.cwx.net (8.9.3/8.9.3) with ESMTP id CAA00667 for ; Sat, 5 Aug 2000 02:11:16 -0600 (MDT) (envelope-from allenc@verinet.com) Message-ID: <398BCC24.3DE06DE8@verinet.com> Date: Sat, 05 Aug 2000 02:11:16 -0600 From: Allen Campbell X-Mailer: Mozilla 4.73 [en] (X11; I; Linux 2.2.13 i686) X-Accept-Language: en MIME-Version: 1.0 To: stable@FreeBSD.ORG Subject: Kernel bug or bad hardware? Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit Sender: owner-freebsd-stable@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG I am experiencing infrequent, spontaneous panics of a 4.0-STABLE machine. The reboots occur while the system is under light load. When the panic occurs the following output is generated by a kernel trap. I have witnessed this on three occasions. On the most recent I bolted to the server and hit PAUSE to preserve the output long enough to copy it verbatim. ad0: WRITE command timeout - resetting ata0: resetting devices .. done ad0: timeout waiting for DRQad-106100774: soft error ECC corrected ad-106100774: timeout waiting for DRQdevstat_end_transaction: HELP!! busy count for (?CA1062206520 is < 0 (-1061721857)! Fatal trap 12: page fault while in kernel mode fault virtual address = 0x60 fault code = supervisor read, page not present instruction pointer = 0x8:0xc0173e8c stack pointer = 0x10:0xc0290aec frame pointer = 0x10:0xc0290b1c code segment = base 0x0, limit 0xfffff, type 0x1b = DPL 0, pres 1, def32 1, gran 1 processor eflags = interrupt enabled, resume, IOPL = 0 current process = Idle interrupt mask = bio trap number = 12 panic: page fault syncing disks... At this point the machine boots, fsck runs to clean up the disks and the machine runs normally. On each occasion the output began with: `ad0: WRITE command timeout - resetting'. This makes me think the problem is not related to faulty memory, but possibly a bad IDE disk or a kernel bug. Other than this problem the machine runs perfectly. uname -a: FreeBSD const. 4.0-STABLE FreeBSD 4.0-STABLE #0: Sat Jul 1 21:46:49 MDT 2000 allenc@pragma.:/usr/src/sys/compile/CONST i386 My most recent STABLE update was performed to obtain fixes for the TCP/IP ICMP bugs reported recently. dmesg output: Copyright (c) 1992-2000 The FreeBSD Project. Copyright (c) 1982, 1986, 1989, 1991, 1993 The Regents of the University of California. All rights reserved. FreeBSD 4.0-STABLE #0: Sat Jul 1 21:46:49 MDT 2000 allenc@pragma.:/usr/src/sys/compile/CONST Timecounter "i8254" frequency 1193182 Hz CPU: AMD Am5x86 Write-Back (486-class CPU) Origin = "AuthenticAMD" Id = 0x4f4 Stepping = 4 Features=0x1 real memory = 67108864 (65536K bytes) config> di psm0 config> en ed0 config> po ed0 0x300 config> ir ed0 11 config> iom ed0 0xd8000 config> f ed0 0 config> q avail memory = 61964288 (60512K bytes) Preloaded elf kernel "kernel" at 0xc0326000. Preloaded userconfig_script "/boot/kernel.conf" at 0xc032609c. md0: Malloc disk npx0: on motherboard npx0: INT 16 interface pcib0: on motherboard pci0: on pcib0 pci0: at 11.0 irq 10 dc0: <82c169 PNIC 10/100BaseTX> port 0x6000-0x60ff mem 0xf4000000-0xf40000ff irq 12 at device 13.0 on pci0 dc0: Ethernet address: 00:a0:cc:54:e6:b9 miibus0: on dc0 ukphy0: on miibus0 ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto ncr0: port 0x6100-0x61ff mem 0xf4001000-0xf40010ff irq 12 at device 15.0 on pci0 ncr0: driver is using old-style compatability shims isa0: on motherboard fdc0: at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0 fd0: <1440-KB 3.5" drive> on fdc0 drive 0 ata0 at port 0x1f0-0x1f7,0x3f6 irq 14 on isa0 ata1 at port 0x170-0x177,0x376 irq 15 on isa0 atkbdc0: at port 0x60,0x64 on isa0 atkbd0: irq 1 on atkbdc0 vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 sc0: on isa0 sc0: VGA <16 virtual consoles, flags=0x200> sio0 at port 0x3f8-0x3ff irq 4 on isa0 sio0: type 16550A sio1 at port 0x2f8-0x2ff irq 3 on isa0 sio1: type 16550A ppc0: at port 0x378-0x37f irq 7 on isa0 ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode ppbus0: IEEE1284 device found /NIBBLE/ECP Probing for PnP devices on ppbus0: ppbus0: PJL,MLC,PCL,PCLXL ppi0: on ppbus0 lpt0: on ppbus0 lpt0: Interrupt-driven port plip0: on ppbus0 ed0 at port 0x300-0x31f iomem 0xd8000 irq 11 drq 0 on isa0 ed0: address 00:80:29:67:e0:a3, type NE2000 (16 bit) IP packet filtering initialized, divert enabled, rule-based forwarding disabled, default to deny, logging disabled DUMMYNET initialized (000608) ncp_load: [210-213] IP Filter: initialized. Default = pass all, Logging = disabled IP Filter: v3.3.8 ad0: 6149MB [13328/15/63] at ata0-master using BIOSPIO (null): MODE_SENSE_BIG command timeout - resetting ata1: resetting devices .. done ata1-master: no status, reselecting device (null): read data overrun 29/1 (null): MODE_SENSE_BIG command timeout - resetting ata1: resetting devices .. done ata1-master: no status, reselecting device (null): read data overrun 29/0 acd0: CDROM at ata1-master using BIOSPIO Waiting 15 seconds for SCSI devices to settle sa0 at ncr0 bus 0 target 2 lun 0 sa0: Removable Sequential Access SCSI-2 device sa0: 3.300MB/s transfers cd0 at ncr0 bus 0 target 6 lun 0 cd0: Removable CD-ROM SCSI-2 device cd0: 10.000MB/s transfers (10.000MHz, offset 8) cd0: cd present [320584 x 2048 byte records] Mounting root from ufs:/dev/ad0s2a -- Allen Campbell | Lurking at the bottom of the allenc@verinet.com | gravity well, getting old. To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-stable" in the body of the message