From owner-freebsd-stable@FreeBSD.ORG Mon Aug 7 13:53:26 2006 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 4A15616A4DA for ; Mon, 7 Aug 2006 13:53:26 +0000 (UTC) (envelope-from dom@goodforbusiness.co.uk) Received: from mailhost.graphdata.co.uk (mailhost.graphdata.co.uk [195.12.22.194]) by mx1.FreeBSD.org (Postfix) with ESMTP id 3AB2443D53 for ; Mon, 7 Aug 2006 13:53:10 +0000 (GMT) (envelope-from dom@goodforbusiness.co.uk) Received: from localhost (localhost [127.0.0.1]) by mailhost.graphdata.co.uk (Postfix) with ESMTP id 40F9C114026; Mon, 7 Aug 2006 14:53:09 +0100 (BST) X-Virus-Scanned: amavisd-new at graphdata.co.uk Received: from mailhost.graphdata.co.uk ([127.0.0.1]) by localhost (mailhost.graphdata.co.uk [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id J+ow1+ReHsOi; Mon, 7 Aug 2006 14:53:04 +0100 (BST) Received: from [192.168.0.86] (gdc083.internal.graphdata.co.uk [192.168.0.86]) by mailhost.graphdata.co.uk (Postfix) with ESMTP id 9163F114023; Mon, 7 Aug 2006 14:53:04 +0100 (BST) Message-ID: <44D745C0.2010803@goodforbusiness.co.uk> Date: Mon, 07 Aug 2006 14:53:04 +0100 From: Dominic Marks User-Agent: Thunderbird 1.5.0.4 (X11/20060718) MIME-Version: 1.0 To: Jerome Sobecki References: <20060807101946.GE33821@pasteur.fr> In-Reply-To: <20060807101946.GE33821@pasteur.fr> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: Stability of ICH7 sata on FreeBSD 6.1 ? X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 07 Aug 2006 13:53:26 -0000 Jerome Sobecki wrote: > Hi all, > > We have here some Supermicro Superserver 5015P-TR > (http://www.supermicro.com/products/system/1U/5015/SYS-5015P-TR.cfm) > > Those servers, with a ICH7 controler, are currently working with FreeBSD > 6.1 and everything seems ok, except that it's the third time, on two > different machines, that the system crash because it lost is hard drive. > We have a Subversion sever on a Dell box with an ICH7 chipset. No problems so far (with Western Digital drives). Dominic > The logs we get on console during the last crash (ad4s1g is /var, so we > don't have any other logs): > g_vsf_done() :ad4s1g[WRITE(offset=35657547776, length=16384)]error = 6 > [...] > g_vsf_done() :ad4s1g[WRITE(offset=35662495744, length=16384)]error = 6 > g_vsf_done() :ad4s1g[READ(offset=23900815360, , length=16384)]error = 6 > handle_workitem_freeblocks: block count > > The logs we have in /var/log/message during another crash : > Jul 26 19:34:07 munster2 kernel: ad6: FAILURE - device detached > Jul 26 19:34:07 munster2 kernel: subdisk6: detached > Jul 26 19:34:07 munster2 kernel: ad6: detached > > When the machine crash, the led of the lost DD is fixed on, and a soft > reboot doesn't allow to get the disk back : an electric shutdown is > necessary. > > Before the crash, servers had more than 1 mounth of uptime in > production, and others are still ok... > > information about the machine : > > vieux-lille2% uname -v > FreeBSD 6.1-STABLE #3: Thu Jun 8 12:47:45 CEST 2006 > root@vieux-lille2.sis.pasteur.fr:/usr/obj/usr/src/sys/GENERIC > > It's right it was not up to date, but I didn't see cvs commit about that > problem (but maybe I simply miss it) > > Does anyone had that problem ? Do you think updating the system from 6.1 > sources will be enought ? > > I let you dmesg, if it could help... : > vieux-lille2% dmesg > Copyright (c) 1992-2006 The FreeBSD Project. > Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 > The Regents of the University of California. All rights > reserved. > FreeBSD 6.1-STABLE #3: Thu Jun 8 12:47:45 CEST 2006 > root@vieux-lille2.sis.pasteur.fr:/usr/obj/usr/src/sys/GENERIC > Timecounter "i8254" frequency 1193182 Hz quality 0 > CPU: Intel(R) Pentium(R) 4 CPU 3.40GHz (3400.15-MHz 686-class CPU) > Origin = "GenuineIntel" Id = 0xf43 Stepping = 3 > Features=0xbfebfbff > Features2=0x649d> > AMD Features=0x20100000 > Logical CPUs per core: 2 > real memory = 1072562176 (1022 MB) > avail memory = 1040637952 (992 MB) > ACPI APIC Table: > ioapic0 irqs 0-23 on motherboard > ioapic1 irqs 24-47 on motherboard > ioapic2 irqs 48-71 on motherboard > kbd1 at kbdmux0 > acpi0: on motherboard > acpi0: Power Button (fixed) > Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 > acpi_timer0: <24-bit timer at 3.579545MHz> port 0x1008-0x100b on acpi0 > cpu0: on acpi0 > acpi_throttle0: on cpu0 > pcib0: port 0xcf8-0xcff on acpi0 > pci0: on pcib0 > pcib1: irq 16 at device 1.0 on pci0 > pci1: on pcib1 > pcib2: at device 0.0 on pci1 > pci2: on pcib2 > pci1: at device 0.1 (no driver attached) > pcib3: at device 0.2 on pci1 > pci3: on pcib3 > pci1: at device 0.3 (no driver attached) > pcib4: irq 17 at device 28.0 on pci0 > pci4: on pcib4 > pcib5: irq 17 at device 28.4 on pci0 > pci5: on pcib5 > em0: port 0x4000-0x401f mem 0xed200000-0xed21ffff irq 16 at device 0.0 on pci5 > em0: Ethernet address: 00:30:48:84:89:58 > pcib6: irq 16 at device 28.5 on pci0 > pci6: on pcib6 > em1: port 0x5000-0x501f mem 0xed300000-0xed31ffff irq 17 at device 0.0 on pci6 > em1: Ethernet address: 00:30:48:84:89:59 > uhci0: port 0x3000-0x301f irq 23 at > device 29.0 on pci0 > uhci0: [GIANT-LOCKED] > usb0: on uhci0 > usb0: USB revision 1.0 > uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub0: 2 ports with 2 removable, self powered > uhci1: port 0x3020-0x303f irq 19 at > device 29.1 on pci0 > uhci1: [GIANT-LOCKED] > usb1: on uhci1 > usb1: USB revision 1.0 > uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub1: 2 ports with 2 removable, self powered > uhci2: port 0x3040-0x305f irq 18 at > device 29.2 on pci0 > uhci2: [GIANT-LOCKED] > usb2: on uhci2 > usb2: USB revision 1.0 > uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub2: 2 ports with 2 removable, self powered > uhci3: port 0x3060-0x307f irq 16 at device 29.3 on pci0 > uhci3: [GIANT-LOCKED] > usb3: on uhci3 > usb3: USB revision 1.0 > uhub3: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub3: 2 ports with 2 removable, self powered > ehci0: mem 0xed000000-0xed0003ff irq 23 at device 29.7 on pci0 > ehci0: [GIANT-LOCKED] > usb4: EHCI version 1.0 > usb4: companion controllers, 2 ports each: usb0 usb1 usb2 usb3 > usb4: on ehci0 > usb4: USB revision 2.0 > uhub4: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1 > uhub4: 8 ports with 8 removable, self powered > pcib7: at device 30.0 on pci0 > pci10: on pcib7 > pci10: at device 1.0 (no driver attached) > isab0: at device 31.0 on pci0 > isa0: on isab0 > atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x30a0-0x30af at device 31.1 on pci0 > ata0: on atapci0 > ata1: on atapci0 > atapci1: port 0x30e8-0x30ef,0x30dc-0x30df,0x30e0-0x30e7,0x30d8-0x30db,0x30b0-0x30bf mem 0xed000400-0xed0007ff irq 19 at device 31.2 on pci0 > ata2: on atapci1 > ata3: on atapci1 > pci0: at device 31.3 (no driver attached) > acpi_button0: on acpi0 > atkbdc0: port 0x60,0x64 irq 1 on acpi0 > atkbd0: irq 1 on atkbdc0 > kbd0 at atkbd0 > atkbd0: [GIANT-LOCKED] > sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 > sio0: type 16550A > sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0 > sio1: type 16550A > fdc0: port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0 > fdc0: [FAST] > fd0: <1440-KB 3.5" drive> on fdc0 drive 0 > pmtimer0 on isa0 > orm0: at iomem 0xc0000-0xc7fff on isa0 > ppc0: parallel port not found. > sc0: at flags 0x100 on isa0 > sc0: VGA <16 virtual consoles, flags=0x300> > vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 > Timecounter "TSC" frequency 3400145333 Hz quality 800 > Timecounters tick every 1.000 msec > acd0: DMA limited to UDMA33, controller found non-ATA66 cable > acd0: DVDROM at ata0-slave UDMA33 > ad4: 78167MB at ata2-master SATA150 > ad6: 78167MB at ata3-master SATA150 > Trying to mount root from ufs:/dev/ad4s1a > WARNING: / was not properly dismounted > em0: link state changed to UP > > >