Date: Sat, 20 Dec 2003 19:07:41 +0100 From: "Oivind H. Danielsen" <oivind.danielsen@kopek.net> To: <freebsd-stable@freebsd.org> Subject: WRITE command timeout Message-ID: <NMEPLAHDNAPMGKOIJMLLIEFNCAAA.oivind.danielsen@kopek.net>
next in thread | raw e-mail | index | archive | help
This is a multi-part message in MIME format. ------=_NextPart_000_0004_01C3C72C.8AD09600 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Hello. We have been running FreeBSD 4.6-5.1 systems for 1.5 years and are being plagued by these: Dec 18 15:15:39 <> /kernel: ad0: WRITE command timeout tag=0 serv=0 - resetting Dec 19 15:03:23 <> /kernel: ad0: READ command timeout tag=0 serv=0 - resetting In our rack we have 34 identical drives (IBM IC35L080AVVA07). 24 drives on Windows 2000 : no problems. 4 drives on Linux 2.4.x : no problems. 2 drives on RELENG_4_8 (VIA 82C686, VIA C3) : no problems 4 drives on RELENG_4_8 (nVIDIA nForce, XP 2000+) : r/w timeouts, fs corruption. (1 drive/system, 6 FreeBSD boxes) The good systems have been running the 1.5 years without a hitch. The four identical RELENG_4_8 systems have all had corrupted filesystems (at least once every two months). We have tried the following: - Changed ATA100 cables (3 diff. types, all 80-wire) - Disabled DMA (use PIO4) (hw.ata.ata_dma="0" in loader.conf) - Disabled DMA in BIOS setup - Changed motherboard (MSI MS6734, VIA KM400, vt8235 ATA) - Changed power supply (added 100W) - RELENG_5_1. None of these changes has helped. The only change seen when disabling DMA is additional messages: "timeout waiting for DRQ - resetting". I have searched the net for more information on this topic for over a year, and all I find is replies like: - "Just change the cable, dude.." (did that, still timeouts) - "IBM drives are bad for you." (seen this with other drives too) (drives work well on Linux/W2k) - "Disabling DMA fixes it." (tried that, it didn't) - "ATA is for wimps. SCSI rulezz." (different discussion) # sysctl hw.ata hw.ata.ata_dma: 0 hw.ata.wc: 1 hw.ata.tags: 0 hw.ata.atapi_dma: 0 # atacontrol mode 0 Master = PIO4 Slave = ??? # atacontrol info 0 Master: ad0 <IC35L080AVVA07-0/VA4OA52A> ATA/ATAPI rev 5 Slave: no device present dmesg, pciconf and kernel config are attached. No special compilation options (except -DIPFW2) are used. I can provide more information on request. We're now running FreeBSD 4.8-RELEASE-p14 and FreeBSD 5.1-RELEASE-p8, but the problem has been around since we started out with 4.6 I believe. The "good" and "bad" FreeBSD systems all use the same kernel/world. The reason why we have used such low-end hardware in these boxes is that they are part of a highly redundant cluster solution for crypto processing (no storage is used for application purposes). This means the system can cope with the occasional fs corruption, but we would still prefer to get rid of it. I know this problem has been discussed before, but wanted to add more data to the discussion. I don't think all of the reports should be attributed to bad HW. Nevertheless, even if the hardware is broken, the system should preferably function equally well/bad as with Linux/W2k. Any help is greatly appreciated. Best Regards, Oivind H. Danielsen ------=_NextPart_000_0004_01C3C72C.8AD09600 Content-Type: text/plain; name="dmesg.txt" Content-Transfer-Encoding: quoted-printable Content-Disposition: attachment; filename="dmesg.txt" Copyright (c) 1992-2003 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 4.8-RELEASE-p14 #0: Sat Dec 6 00:30:50 CET 2003 Timecounter "i8254" frequency 1193182 Hz CPU: AMD Athlon(TM) XP 2000+ (1662.38-MHz 686-class CPU) Origin =3D "AuthenticAMD" Id =3D 0x662 Stepping =3D 2 = Features=3D0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE= ,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE> AMD Features=3D0xc0400000<AMIE,DSP,3DNow!> real memory =3D 503234560 (491440K bytes) avail memory =3D 485507072 (474128K bytes) Preloaded elf kernel "kernel" at 0xc03e3000. Pentium Pro MTRR support enabled md0: Malloc disk Using $PIR table, 10 entries at 0xc00f20b0 npx0: <math processor> on motherboard npx0: INT 16 interface pcib0: <Host to PCI bridge> on motherboard pci0: <PCI bus> on pcib0 pci0: <unknown card> (vendor=3D0x10de, dev=3D0x01ac) at 0.1 pci0: <unknown card> (vendor=3D0x10de, dev=3D0x01ad) at 0.2 pci0: <unknown card> (vendor=3D0x10de, dev=3D0x01aa) at 0.3 isab0: <PCI to ISA bridge (vendor=3D10de device=3D01b2)> at device 1.0 = on pci0 isa0: <ISA bus> on isab0 pci0: <unknown card> (vendor=3D0x10de, dev=3D0x01b4) at 1.1 irq 5 ohci0: <OHCI (generic) USB controller> mem 0xe7000000-0xe7000fff irq 10 = at device 2.0 on pci0 usb0: OHCI version 1.0 usb0: SMM does not respond, resetting usb0: <OHCI (generic) USB controller> on ohci0 usb0: USB revision 1.0 uhub0: (0x10de) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 3 ports with 3 removable, self powered ohci1: <OHCI (generic) USB controller> mem 0xe6800000-0xe6800fff irq 10 = at device 3.0 on pci0 usb1: OHCI version 1.0 usb1: <OHCI (generic) USB controller> on ohci1 usb1: USB revision 1.0 uhub1: (0x10de) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub1: 3 ports with 3 removable, self powered pci0: <unknown card> (vendor=3D0x10de, dev=3D0x01c3) at 4.0 irq 10 pci0: <unknown card> (vendor=3D0x10de, dev=3D0x01b0) at 5.0 irq 5 pci0: <unknown card> (vendor=3D0x10de, dev=3D0x01b1) at 6.0 irq 11 pcib1: <PCI to PCI bridge (vendor=3D10de device=3D01b8)> at device 8.0 = on pci0 pci1: <PCI bus> on pcib1 pcib2: <DEC 21150 PCI-PCI bridge> at device 6.0 on pci1 pci2: <PCI bus> on pcib2 fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xb800-0xb83f mem = 0xe4000000-0xe40fffff,0xe4800000-0xe4800fff irq 5 at device 4.0 on pci2 fxp0: Ethernet address 00:08:9b:14:72:0f inphy0: <i82555 10/100 media interface> on miibus0 inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto fxp1: <Intel Pro 10/100B/100+ Ethernet> port 0xb400-0xb43f mem = 0xe3000000-0xe30fffff,0xe3800000-0xe3800fff irq 5 at device 5.0 on pci2 fxp1: Ethernet address 00:08:9b:14:72:10 inphy1: <i82555 10/100 media interface> on miibus1 inphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto fxp2: <Intel Pro 10/100B/100+ Ethernet> port 0xb000-0xb03f mem = 0xe2000000-0xe20fffff,0xe2800000-0xe2800fff irq 5 at device 6.0 on pci2 fxp2: Ethernet address 00:08:9b:14:71:9b inphy2: <i82555 10/100 media interface> on miibus2 inphy2: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto fxp3: <Intel Pro 10/100B/100+ Ethernet> port 0xa800-0xa83f mem = 0xe1000000-0xe10fffff,0xe1800000-0xe1800fff irq 5 at device 7.0 on pci2 fxp3: Ethernet address 00:08:9b:14:71:9c inphy3: <i82555 10/100 media interface> on miibus3 inphy3: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto atapci0: <nVIDIA nForce ATA100 controller> port 0x9800-0x980f at device = 9.0 on pci0 ata0: at 0x1f0 irq 14 on atapci0 ata1: at 0x170 irq 15 on atapci0 pcib3: <PCI to PCI bridge (vendor=3D10de device=3D01b7)> at device 30.0 = on pci0 pci3: <PCI bus> on pcib3 pci3: <NVidia model 01a0 graphics accelerator> at 0.0 irq 11 orm0: <Option ROM> at iomem 0xd0000-0xd3fff on isa0 fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on = isa0 fdc0: FIFO enabled, 8 bytes threshold fd0: <1440-KB 3.5" drive> on fdc0 drive 0 atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0 vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on = isa0 sc0: <System console> at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=3D0x100> sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0 sio0: type 16550A, console sio1 at port 0x2f8-0x2ff irq 3 on isa0 sio1: type 16550A ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0 ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode ppc0: FIFO with 16/16/9 bytes threshold ipfw2 initialized, divert enabled, rule-based forwarding enabled, = default to deny, logging limited to 100 packets/entry by default ad0: 78533MB <IC35L080AVVA07-0> [159560/16/63] at ata0-master PIO4 Mounting root from ufs:/dev/ad0s1a ------=_NextPart_000_0004_01C3C72C.8AD09600 Content-Type: text/plain; name="kernel_config.txt" Content-Transfer-Encoding: quoted-printable Content-Disposition: attachment; filename="kernel_config.txt" machine i386 cpu I386_CPU cpu I486_CPU cpu I586_CPU cpu I686_CPU ident GENERIC maxusers 0 makeoptions DEBUG=3D-g #Build kernel with gdb(1) debug symbols options MATH_EMULATE #Support for x87 emulation options INET #InterNETworking options INET6 #IPv6 communications protocols options FFS #Berkeley Fast Filesystem options FFS_ROOT #FFS usable as root device [keep this!] options SOFTUPDATES #Enable FFS soft updates support options UFS_DIRHASH #Improve performance on big directories options MFS #Memory Filesystem options MD_ROOT #MD is a potential root device options NFS #Network Filesystem options NFS_ROOT #NFS usable as root device, NFS required options MSDOSFS #MSDOS Filesystem options CD9660 #ISO 9660 Filesystem options CD9660_ROOT #CD-ROM usable as root, CD9660 required options PROCFS #Process filesystem options COMPAT_43 #Compatible with BSD 4.3 [KEEP THIS!] options SCSI_DELAY=3D15000 #Delay (in ms) before probing SCSI options UCONSOLE #Allow users to grab the console options USERCONFIG #boot -c editor options VISUAL_USERCONFIG #visual boot -c editor options KTRACE #ktrace(1) support options SYSVSHM #SYSV-style shared memory options SYSVMSG #SYSV-style message queues options SYSVSEM #SYSV-style semaphores options P1003_1B #Posix P1003_1B real-time extensions options _KPOSIX_PRIORITY_SCHEDULING options ICMP_BANDLIM #Rate limit bad replies options KBD_INSTALL_CDEV # install a CDEV entry in /dev options COMPAT_LINUX # To make an SMP kernel, the next two are needed #options SMP # Symmetric MultiProcessor Kernel #options APIC_IO # Symmetric (APIC) I/O device isa device pci # Floppy drives device fdc0 at isa? port IO_FD1 irq 6 drq 2 device fd0 at fdc0 drive 0 # ATA and ATAPI devices device ata0 at isa? port IO_WD1 irq 14 device ata1 at isa? port IO_WD2 irq 15 device ata device atadisk # ATA disk drives device atapicd # ATAPI CDROM drives options ATA_STATIC_ID #Static device numbering # SCSI peripherals device scbus # SCSI bus (required) device da # Direct Access (disks) device sa # Sequential Access (tape etc) device cd # CD device pass # Passthrough device (direct SCSI access) device twe # 3ware Escalade # atkbdc0 controls both the keyboard and the PS/2 mouse device atkbdc0 at isa? port IO_KBD device atkbd0 at atkbdc? irq 1 flags 0x1 device psm0 at atkbdc? irq 12 device vga0 at isa? # splash screen/screen saver pseudo-device splash # syscons is the default console driver, resembling an SCO console device sc0 at isa? flags 0x100 # Floating point support - do not disable. device npx0 at nexus? port IO_NPX irq 13 # Power management support (see LINT for more options) device apm0 at nexus? disable flags 0x20 # Advanced Power Management # Serial (COM) ports device sio0 at isa? port IO_COM1 flags 0x10 irq 4 device sio1 at isa? port IO_COM2 irq 3 # Parallel port device ppc0 at isa? irq 7 device ppbus # Parallel port bus (required) # PCI Ethernet NICs. device de # DEC/Intel DC21x4x (``Tulip'') device em # Intel PRO/1000 adapter Gigabit Ethernet Card (``Wiseman'') device txp # 3Com 3cR990 (``Typhoon'') device vx # 3Com 3c590, 3c595 (``Vortex'') # PCI Ethernet NICs that use the common MII bus controller code. # NOTE: Be sure to keep the 'device miibus' line in order to use these = NICs! device miibus # MII bus support device dc # DEC/Intel 21143 and various workalikes device fxp # Intel EtherExpress PRO/100B (82557, 82558) device pcn # AMD Am79C97x PCI 10/100 NICs device rl # RealTek 8129/8139 device sf # Adaptec AIC-6915 (``Starfire'') device sis # Silicon Integrated Systems SiS 900/SiS 7016 device ste # Sundance ST201 (D-Link DFE-550TX) device tl # Texas Instruments ThunderLAN device tx # SMC EtherPower II (83c170 ``EPIC'') device vr # VIA Rhine, Rhine II device wb # Winbond W89C840F device xl # 3Com 3c90x (``Boomerang'', ``Cyclone'') device bge # Broadcom BCM570x (``Tigon III'') # Pseudo devices - the number indicates how many units to allocate. pseudo-device loop # Network loopback pseudo-device ether # Ethernet support pseudo-device vlan 4 #VLAN support pseudo-device tun # Packet tunnel. pseudo-device pty # Pseudo-ttys (telnet etc) pseudo-device md # Memory "disks" pseudo-device gif # IPv6 and IPv4 tunneling pseudo-device faith 1 # IPv6-to-IPv4 relaying (translation) # The `bpf' pseudo-device enables the Berkeley Packet Filter. # Be aware of the administrative consequences of enabling this! pseudo-device bpf #Berkeley packet filter # USB support device uhci # UHCI PCI->USB interface device ohci # OHCI PCI->USB interface device usb # USB Bus (required) device ugen # Generic device uhid # "Human Interface Devices" device ukbd # Keyboard device ulpt # Printer device ums # Mouse options IPFIREWALL #firewall options IPFW2 options IPDIVERT #divert sockets options IPFIREWALL_VERBOSE #enable logging to syslogd(8) options IPFIREWALL_VERBOSE_LIMIT=3D100 #limit verbosity ------=_NextPart_000_0004_01C3C72C.8AD09600 Content-Type: text/plain; name="pciconf.txt" Content-Transfer-Encoding: quoted-printable Content-Disposition: attachment; filename="pciconf.txt" chip0@pci0:0:0: class=3D0x060000 card=3D0x00000000 chip=3D0x01a410de = rev=3D0xb2 hdr=3D0x00 vendor =3D 'NVIDIA Corporation' device =3D 'nForce AGP Controller' class =3D bridge subclass =3D HOST-PCI none0@pci0:0:1: class=3D0x050000 card=3D0x0c1110de chip=3D0x01ac10de = rev=3D0xb2 hdr=3D0x00 vendor =3D 'NVIDIA Corporation' device =3D 'nForce 2x0/415/4x0 Memory Controller' class =3D memory subclass =3D RAM none1@pci0:0:2: class=3D0x050000 card=3D0x0c1110de chip=3D0x01ad10de = rev=3D0xb2 hdr=3D0x00 vendor =3D 'NVIDIA Corporation' device =3D 'nForce 2x0/415/4x0 Memory Controller' class =3D memory subclass =3D RAM none2@pci0:0:3: class=3D0x050000 card=3D0x0c1110de chip=3D0x01aa10de = rev=3D0xb2 hdr=3D0x00 vendor =3D 'NVIDIA Corporation' device =3D 'nForce 220/230 Memory Controller (DDR)' class =3D memory subclass =3D RAM isab0@pci0:1:0: class=3D0x060100 card=3D0x0c1110de chip=3D0x01b210de = rev=3D0xc3 hdr=3D0x00 vendor =3D 'NVIDIA Corporation' device =3D 'nForce HUB Interface' class =3D bridge subclass =3D PCI-ISA none3@pci0:1:1: class=3D0x0c0500 card=3D0x0c1110de chip=3D0x01b410de = rev=3D0xc1 hdr=3D0x00 vendor =3D 'NVIDIA Corporation' device =3D 'nForce MCP SMBus Controller' class =3D serial bus subclass =3D SMBus ohci0@pci0:2:0: class=3D0x0c0310 card=3D0x0c1110de chip=3D0x01c210de = rev=3D0xc3 hdr=3D0x00 vendor =3D 'NVIDIA Corporation' device =3D 'nForce MCP OHCI USB Controller' class =3D serial bus subclass =3D USB ohci1@pci0:3:0: class=3D0x0c0310 card=3D0x0c1110de chip=3D0x01c210de = rev=3D0xc3 hdr=3D0x00 vendor =3D 'NVIDIA Corporation' device =3D 'nForce MCP OHCI USB Controller' class =3D serial bus subclass =3D USB none4@pci0:4:0: class=3D0x020000 card=3D0x0c111043 chip=3D0x01c310de = rev=3D0xc2 hdr=3D0x00 vendor =3D 'NVIDIA Corporation' device =3D 'nForce MCP Networking Adapter' class =3D network subclass =3D ethernet none5@pci0:5:0: class=3D0x040100 card=3D0x0c111043 chip=3D0x01b010de = rev=3D0xc2 hdr=3D0x00 vendor =3D 'NVIDIA Corporation' device =3D 'nForce MCP Audio Processing Unit (Dolby Digital)' class =3D multimedia subclass =3D audio none6@pci0:6:0: class=3D0x040100 card=3D0x83841043 chip=3D0x01b110de = rev=3D0xc2 hdr=3D0x00 vendor =3D 'NVIDIA Corporation' device =3D 'nForce MCP Audio Codec Interface' class =3D multimedia subclass =3D audio pcib1@pci0:8:0: class=3D0x060400 card=3D0x00000044 chip=3D0x01b810de = rev=3D0xc2 hdr=3D0x01 vendor =3D 'NVIDIA Corporation' device =3D 'nForce PCI Bridge' class =3D bridge subclass =3D PCI-PCI atapci0@pci0:9:0: class=3D0x01018a card=3D0x0c1110de chip=3D0x01bc10de = rev=3D0xc3 hdr=3D0x00 vendor =3D 'NVIDIA Corporation' device =3D 'nForce MCP ATA Controller' class =3D mass storage subclass =3D ATA pcib3@pci0:30:0: class=3D0x060400 card=3D0x00000000 chip=3D0x01b710de = rev=3D0xb2 hdr=3D0x01 vendor =3D 'NVIDIA Corporation' device =3D 'nForce AGP Host to PCI Bridge' class =3D bridge subclass =3D PCI-PCI pcib2@pci1:6:0: class=3D0x060400 card=3D0x000000dc chip=3D0x00221011 = rev=3D0x06 hdr=3D0x01 vendor =3D 'Digital Equipment Corporation' device =3D '21150-AA PCI to PCI Bridge' class =3D bridge subclass =3D PCI-PCI fxp0@pci2:4:0: class=3D0x020000 card=3D0x100c8086 chip=3D0x12298086 = rev=3D0x08 hdr=3D0x00 vendor =3D 'Intel Corporation' device =3D '82557/8/9 EtherExpress PRO/100(B) Ethernet Adapter' class =3D network subclass =3D ethernet fxp1@pci2:5:0: class=3D0x020000 card=3D0x100c8086 chip=3D0x12298086 = rev=3D0x08 hdr=3D0x00 vendor =3D 'Intel Corporation' device =3D '82557/8/9 EtherExpress PRO/100(B) Ethernet Adapter' class =3D network subclass =3D ethernet fxp2@pci2:6:0: class=3D0x020000 card=3D0x100c8086 chip=3D0x12298086 = rev=3D0x08 hdr=3D0x00 vendor =3D 'Intel Corporation' device =3D '82557/8/9 EtherExpress PRO/100(B) Ethernet Adapter' class =3D network subclass =3D ethernet fxp3@pci2:7:0: class=3D0x020000 card=3D0x100c8086 chip=3D0x12298086 = rev=3D0x08 hdr=3D0x00 vendor =3D 'Intel Corporation' device =3D '82557/8/9 EtherExpress PRO/100(B) Ethernet Adapter' class =3D network subclass =3D ethernet none7@pci3:0:0: class=3D0x030000 card=3D0x00000000 chip=3D0x01a010de = rev=3D0xb1 hdr=3D0x00 vendor =3D 'NVIDIA Corporation' device =3D 'GeForce2 [Crush11] Integrated Graphics' class =3D display subclass =3D VGA ------=_NextPart_000_0004_01C3C72C.8AD09600--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?NMEPLAHDNAPMGKOIJMLLIEFNCAAA.oivind.danielsen>