From owner-freebsd-bugs@FreeBSD.ORG Wed Jul 7 19:30:04 2010
Date: Wed, 7 Jul 2010 19:30:04 GMT
Message-Id: <201007071930.o67JU40O076798@freefall.freebsd.org>
To: freebsd-bugs@FreeBSD.org
From: Ted Mittelstaedt
Subject: Re: kern/115152: [ata] Sil 3512 SATA controller panics on 6.2

The following reply was made to PR kern/115152; it has been noted by GNATS.

From: Ted Mittelstaedt
To: bug-followup@FreeBSD.org
Subject: Re: kern/115152: [ata] Sil 3512 SATA controller panics on 6.2
Date: Wed, 07 Jul 2010 12:20:22 -0700

We have the same general problem with this controller under FreeBSD 8.0.
We get numerous

TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=XXXXXXX

messages on the system under moderate to heavy load on the controller,
and eventually the system locks up.

Interestingly, the problem didn't happen with older SATA150 disks plugged
into the controller, but 150GB hard drives aren't that usable nowadays.
There have been numerous postings on the mailing list about these
controllers. Linux does not appear to have difficulty with them, so
people seem to move these systems to Linux servers.

Here's the dmesg from our system; the system has since been rehomed to
Linux:

nas1# dmesg
Copyright (c) 1992-2009 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 8.0-RELEASE #0: Sat Nov 21 15:48:17 UTC 2009
    root@almeida.cse.buffalo.edu:/usr/obj/usr/src/sys/GENERIC
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel Pentium III (937.55-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x68a  Stepping = 10
  Features=0x387fbff
real memory  = 1073741824 (1024 MB)
avail memory = 1036361728 (988 MB)
ACPI APIC Table:
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
FreeBSD/SMP: 2 package(s) x 1 core(s)
 cpu0 (BSP): APIC ID: 3
 cpu1 (AP): APIC ID: 0
ioapic0 irqs 0-23 on motherboard
kbd1 at kbdmux0
acpi0: on motherboard
acpi0: [ITHREAD]
acpi0: Power Button (fixed)
acpi0: reservation of 0, a0000 (3) failed
acpi0: reservation of 100000, 3ff00000 (3) failed
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0xe408-0xe40b on acpi0
acpi_button0: on acpi0
pcib0: port 0xcf8-0xcff on acpi0
pci0: on pcib0
agp0: on hostb0
agp0: aperture size is 256M
pcib1: at device 1.0 on pci0
pci1: on pcib1
vgapci0: port 0xd800-0xd8ff mem 0xf8000000-0xfbffffff,0xf6000000-0xf6003fff irq 16 at device 0.0 on pci1
isab0: at device 4.0 on pci0
isa0: on isab0
atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xb800-0xb80f at device 4.1 on pci0
ata0: on atapci0
ata0: [ITHREAD]
ata1: on atapci0
ata1: [ITHREAD]
uhci0: port 0xb400-0xb41f irq 5 at device 4.2 on pci0
uhci0: [ITHREAD]
usbus0: on uhci0
uhci1: port 0xb000-0xb01f irq 5 at device 4.3 on pci0
uhci1: [ITHREAD]
usbus1: on uhci1
fxp0: port 0xa800-0xa81f mem 0xf7000000-0xf7000fff,0xf5800000-0xf58fffff at device 10.0 on pci0
fxp0: Enabling Rx lock-up workaround
miibus0: on fxp0
nsphy0: PHY 1 on miibus0
nsphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:a0:c9:39:0f:cc
fxp0: [ITHREAD]
atapci1: port 0xa400-0xa407,0xa000-0xa003,0x9800-0x9807,0x9400-0x9403,0x9000-0x900f mem 0xf5000000-0xf50001ff irq 17 at device 11.0 on pci0
atapci1: [ITHREAD]
ata2: on atapci1
ata2: [ITHREAD]
ata3: on atapci1
ata3: [ITHREAD]
sym0: <810a> port 0x8800-0x88ff mem 0xf4800000-0xf48000ff irq 16 at device 12.0 on pci0
sym0: No NVRAM, ID 7, Fast-10, SE, parity checking
sym0: [ITHREAD]
atrtc0: port 0x70-0x73 irq 8 on acpi0
fdc0: port 0x3f2-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fdc0: [FILTER]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
ppc0: port 0x378-0x37f,0x778-0x77b irq 7 drq 3 on acpi0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/8 bytes threshold
ppc0: [ITHREAD]
ppbus0: on ppc0
plip0: on ppbus0
plip0: [ITHREAD]
lpt0: on ppbus0
lpt0: [ITHREAD]
lpt0: Interrupt-driven port
ppi0: on ppbus0
uart0: <16550 or compatible> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
uart0: [FILTER]
uart1: <16550 or compatible> port 0x2f8-0x2ff irq 3 on acpi0
uart1: [FILTER]
atkbdc0: port 0x60,0x64 irq 1 on acpi0
atkbd0: irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
atkbd0: [ITHREAD]
psm0: irq 12 on atkbdc0
psm0: [GIANT-LOCKED]
psm0: [ITHREAD]
psm0: model Generic PS/2 mouse, device ID 0
cpu0: on acpi0
cpu1: on acpi0
pmtimer0 on isa0
orm0: at iomem 0xc0000-0xc7fff,0xcc000-0xd0fff pnpid ORM0000 on isa0
sc0: at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounters tick every 1.000 msec
usbus0: 12Mbps Full Speed USB v1.0
usbus1: 12Mbps Full Speed USB v1.0
acd0: CDROM at ata1-master PIO4
ad4: 1907729MB at ata2-master SATA150
ugen0.1: at usbus0
uhub0: on usbus0
ugen1.1: at usbus1
uhub1: on usbus1
ad6: 1907729MB at ata3-master SATA150
Waiting 5 seconds for SCSI devices to settle
uhub0: 2 ports with 2 removable, self powered
uhub1: 2 ports with 2 removable, self powered
GEOM: ad4s1: geometry does not match label (255h,63s != 16h,63s).
(probe6:sym0:0:6:0): TEST UNIT READY. CDB: 0 0 0 0 0 0
(probe6:sym0:0:6:0): CAM Status: SCSI Status Error
(probe6:sym0:0:6:0): SCSI Status: Check Condition
(probe6:sym0:0:6:0): UNIT ATTENTION asc:29,0
(probe6:sym0:0:6:0): Power on, reset, or bus device reset occurred
(probe6:sym0:0:6:0): Retrying Command (per Sense Data)
(probe6:sym0:0:6:0): TEST UNIT READY. CDB: 0 0 0 0 0 0
(probe6:sym0:0:6:0): CAM Status: SCSI Status Error
(probe6:sym0:0:6:0): SCSI Status: Check Condition
(probe6:sym0:0:6:0): NOT READY asc:3a,0
(probe6:sym0:0:6:0): Medium not present
(probe6:sym0:0:6:0): Unretryable error
sa0 at sym0 bus 0 target 6 lun 0
sa0: Removable Sequential Access SCSI-2 device
sa0: 10.000MB/s transfers (10.000MHz, offset 8)
sa1 at sym0 bus 0 target 6 lun 1
sa1: Removable Sequential Access SCSI-2 device
sa1: 10.000MB/s transfers (10.000MHz, offset 8)
ar0: 1907729MB status: DEGRADED
ar0: disk0 DOWN no device found for this subdisk
ar0: disk1 READY (mirror) using ad4 at ata2-master
SMP: AP CPU #1 Launched!
Trying to mount root from ufs:/dev/ar0s1a
ad6: inserted into ar0 disk0 as spare
ad4: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=753183
ad4: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=753183
ad4: TIMEOUT - WRITE_DMA retrying (0 retries left) LBA=753183
ad4: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=773599
ad4: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=753183
ad4: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=753183
ad4: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=752839
ad4: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=752839
ad4: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=287
ad4: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=10953023
ad4: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=752839