From owner-freebsd-stable Tue Mar 21 20:50:45 2000 Delivered-To: freebsd-stable@freebsd.org Received: from BSDpc.geek4food.org (adsl-216-103-105-71.dsl.snfc21.pacbell.net [216.103.105.71]) by hub.freebsd.org (Postfix) with ESMTP id 0DA3937C120 for ; Tue, 21 Mar 2000 20:50:36 -0800 (PST) (envelope-from andy@geek4food.org) Received: from mega.geek4food.org (mega.geek4food.org [192.168.1.57]) by BSDpc.geek4food.org (Postfix) with ESMTP id 3959B420 for ; Wed, 22 Mar 2000 04:50:35 +0000 (GMT) Received: from mega.geek4food.org (localhost.geek4food.org [127.0.0.1]) by mega.geek4food.org (8.9.3/8.9.3) with ESMTP id UAA31802 for ; Tue, 21 Mar 2000 20:50:34 -0800 (PST) (envelope-from andy@geek4food.org) Message-Id: <200003220450.UAA31802@mega.geek4food.org> To: stable@FreeBSD.ORG Subject: ncr problems on 3.4-Stable Date: Tue, 21 Mar 2000 20:50:34 -0800 From: Andy Sparrow Sender: owner-freebsd-stable@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG Hi all. I've been running 3.x-stable on a firewall box for months and months, and it's been very, well, stable :=) Recently, it's started intermittantly (every week or so) giving problems like the following (somewhat trimmed) output: Mar 17 18:34:32 BSDpc syslogd: /var/log/messages: Input/output error Mar 17 18:34:32 BSDpc /kernel: ncr0:0: ERROR (0:c1) (9-ab-2) (e0/5) @ (script 6bc:190001ff). Mar 17 18:34:32 BSDpc /kernel: ncr0: script cmd = 89030000 Mar 17 18:34:32 BSDpc /kernel: ncr0: regdump: da 10 80 05 47 e0 00 0f fe 09 80 ab 80 00 0b 00. Mar 17 18:34:32 BSDpc /kernel: ncr0: have to clear fifos. Mar 17 18:34:32 BSDpc /kernel: ncr0: restart (fatal error). Mar 17 18:34:32 BSDpc /kernel: (da0:ncr0:0:0:0): COMMAND FAILED (9 ff) @0xc078f200. Mar 17 18:34:32 BSDpc /kernel: swap_pager: indefinite wait buffer: device: 0x30401, blkno: 832, size: 4096 Mar 17 18:34:32 BSDpc last message repeated 2 times Mar 17 18:34:32 BSDpc /kernel: ncr0: timeout nccb=0xc078f200 (skip) Mar 17 18:34:32 BSDpc last message repeated 2 times Mar 17 18:34:32 BSDpc /kernel: ncr0: timeout nccb=0xc078f200 (skip) Mar 17 18:34:32 BSDpc /kernel: ncr0: timeout nccb=0xc0791e00 (skip) Mar 17 18:34:32 BSDpc /kernel: ncr0: timeout nccb=0xc0791c00 (skip) Mar 17 18:34:32 BSDpc /kernel: swap_pager: indefinite wait buffer: device: 0x30401, blkno: 832, size: 4096 Mar 17 18:34:32 BSDpc /kernel: swap_pager: indefinite wait buffer: device: 0x30401, blkno: 28272, size: 8192 Mar 17 18:34:32 BSDpc /kernel: ncr0: timeout nccb=0xc078f200 (skip) Mar 17 18:34:32 BSDpc /kernel: ncr0: timeout nccb=0xc0791e00 (skip) Mar 17 18:34:32 BSDpc /kernel: ncr0: timeout nccb=0xc0791c00 (skip) Mar 17 18:34:32 BSDpc /kernel: swap_pager: indefinite wait buffer: device: 0x30401, blkno: 832, size: 4096 Mar 17 18:34:32 BSDpc /kernel: swap_pager: indefinite wait buffer: device: 0x30401, blkno: 28272, size: 8192 Mar 17 18:34:32 BSDpc /kernel: ncr0: timeout nccb=0xc078f200 (skip) Mar 17 18:34:32 BSDpc /kernel: swap_pager: I/O error - pagein failed; blkno 28272, size 8192, error 5 Mar 17 18:34:32 BSDpc /kernel: vm_fault: pager read error, pid 447 (getty) Mar 17 18:34:32 BSDpc /kernel: swap_pager: I/O error - pagein failed; blkno 832, size 4096, error 5 Mar 17 18:34:32 BSDpc /kernel: vm_fault: pager read error, pid 30582 (master) Mar 17 18:34:33 BSDpc /kernel: ncr0: timeout nccb=0xc0791e00 (skip) Mar 17 18:34:36 BSDpc /kernel: ncr0: timeout nccb=0xc0791c00 (skip) Mar 17 18:34:52 BSDpc /kernel: swap_pager: indefinite wait buffer: device: 0x30401, blkno: 10720, size: 4096 At this point, rebooting is really the only option. Hardware is an Intel 430VX chipset with a 233MMX CPU fitted, 32Mb true parity memory, and two PCI NIC cards, along with a TekRam DC-390 F (I think, AMD 53c875 chipset, anyway) SCSI card. Here's the dmesg output: Copyright (c) 1992-1999 FreeBSD Inc. Copyright (c) 1982, 1986, 1989, 1991, 1993 The Regents of the University of California. All rights reserved. FreeBSD 3.4-STABLE #19: Sun Mar 12 13:07:22 PST 2000 root@BSDpc.geek4food.org:/usr/src/sys/compile/BSDpc Timecounter "i8254" frequency 1193182 Hz Timecounter "TSC" frequency 233864380 Hz CPU: Pentium/P55C (233.86-MHz 586-class CPU) Origin = "GenuineIntel" Id = 0x543 Stepping = 3 Features=0x8001bf real memory = 33554432 (32768K bytes) avail memory = 29917184 (29216K bytes) Preloaded elf kernel "kernel" at 0xc02d1000. Probing for devices on PCI bus 0: chip0: rev 0x02 on pci0.0.0 chip1: rev 0x01 on pci0.7.0 ide_pci0: rev 0x00 on pci0.7.1 fxp0: rev 0x02 int a irq 11 on pci0.8.0 fxp0: Ethernet address 00:a0:c9:6f:ac:13 de0: rev 0x23 int a irq 10 on pci0.9.0 de0: SMC 8432BTA 21040 [10Mb/s] pass 2.3 de0: address 00:00:c0:80:04:be ncr0: rev 0x03 int a irq 9 on pci0.10.0 Probing for PnP devices: CSN 1 Vendor ID: BRI9400 [0x0094490a] Serial 0xfd15c320 Comp ID: @@@0000 [0x00000000] Probing for devices on the ISA bus: sc0 on isa sc0: VGA color <16 virtual consoles, flags=0x0> ed0 not found at 0x280 atkbdc0 at 0x60-0x6f on motherboard atkbd0 irq 1 on isa sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa sio0: type 16550A sio1 at 0x2f8-0x2ff irq 3 on isa sio1: type 16550A ppc0 at 0x378 irq 7 on isa ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode ppc0: FIFO with 16/16/16 bytes threshold ppb0: IEEE1284 device found /NIBBLE Probing for PnP devices on ppbus0: ppbus0: PRINTER BJL,BJRaster3,BSCC lpt0: on ppbus 0 lpt0: Interrupt-driven port fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa fd0: 1.44MB 3.5in wdc0 at 0x1f0-0x1f7 irq 14 flags 0xa0ffa0ff on isa wdc0: unit 0 (wd0): , 32-bit, multi-block-32 wd0: 516MB (1058400 sectors), 1050 cyls, 16 heads, 63 S/T, 512 B/S wdc1 at 0x170-0x177 irq 15 flags 0xa0ffa0ff on isa wdc1: unit 0 (wd2): , DMA, 32-bit, multi-block-16 wd2: 3020MB (6185088 sectors), 6136 cyls, 16 heads, 63 S/T, 512 B/S wdc1: unit 1 (atapi): , removable, accel, dma, iordy acd0: drive speed 1377 - 2755KB/sec, 128KB cache acd0: supported read types: CD-R, CD-RW, CD-DA, packet track acd0: Audio: play, 255 volume levels acd0: Mechanism: ejectable tray acd0: Medium: CD-ROM 120mm data disc loaded, unlocked vga0 at 0x3b0-0x3df maddr 0xa0000 msize 131072 on isa npx0 on motherboard npx0: INT 16 interface Intel Pentium detected, installing workaround for F00F bug Errr, what's going on? Is this a hardware problem (the boot/swap drive is a slow old Fujitsu 500Mb drive, which always seemed to work fine before), or something wacky with either CAM or the NCR support? I can happily replace the hardware, but if there's anything that should be done to identify some weird problem, I'd be happy to help out as best I can (like providing access, additional information etc). Cheers, AS To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-stable" in the body of the message