From owner-freebsd-stable@FreeBSD.ORG Sun Feb 12 13:22:17 2006 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 3FCA116A420 for ; Sun, 12 Feb 2006 13:22:17 +0000 (GMT) (envelope-from johan@stromnet.org) Received: from pne-smtpout2-sn1.fre.skanova.net (pne-smtpout2-sn1.fre.skanova.net [81.228.11.159]) by mx1.FreeBSD.org (Postfix) with ESMTP id 6D7FE43D45 for ; Sun, 12 Feb 2006 13:22:15 +0000 (GMT) (envelope-from johan@stromnet.org) Received: from elfi.stromnet.org (213.67.207.70) by pne-smtpout2-sn1.fre.skanova.net (7.2.070) id 43EC2A6A0006D771 for freebsd-stable@freebsd.org; Sun, 12 Feb 2006 14:22:14 +0100 Received: from localhost (localhost [127.0.0.1]) by elfi.stromnet.org (Postfix) with ESMTP id 7C52661CB1 for ; Sun, 12 Feb 2006 14:22:13 +0100 (CET) Received: from elfi.stromnet.org ([127.0.0.1]) by localhost (elfi.stromnet.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 00682-01 for ; Sun, 12 Feb 2006 14:22:09 +0100 (CET) Received: from [10.10.0.6] (vpn1-c1.stromnet.org [10.10.0.6]) by elfi.stromnet.org (Postfix) with ESMTP id DFCFF61CB0 for ; Sun, 12 Feb 2006 14:22:08 +0100 (CET) Mime-Version: 1.0 (Apple Message framework v746.2) In-Reply-To: References: <24ECF01E-9881-41F3-A1D9-4C258489D41F@stromnet.org> Content-Type: multipart/signed; micalg=sha1; boundary=Apple-Mail-3-381948905; protocol="application/pkcs7-signature" Message-Id: <287751E3-AFFC-4ECA-B887-1E8F85943FA6@stromnet.org> From: =?ISO-8859-1?Q?Johan_Str=F6m?= Date: Sun, 12 Feb 2006 14:20:22 +0100 To: freebsd-stable@freebsd.org X-Mailer: Apple Mail (2.746.2) X-Virus-Scanned: amavisd-new at stromnet.org X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Subject: Re: gmirror/disk problems! X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 12 Feb 2006 13:22:17 -0000 --Apple-Mail-3-381948905 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=ISO-8859-1; delsp=yes; format=flowed On 10 feb 2006, at 07.43, Johan Str=F6m wrote: > > On 10 feb 2006, at 07.15, Johan Str=F6m wrote: > >> Hi list! >> >> I've been experiencing problems earlier with gmirror (thread "Page =20= >> fault, GEOM problem??"). My gmirror crashed, and the box =20 >> compleatly froze. >> Now I got a new mobo, and it has been working great since (no =20 >> crashes, and i get decent 40-50mb/s read/write instead of ~10-20). >> This morning i woke up to this: >> >> >> subdisk4: detached >> ad4: detached >> unknown: TIMEOUT - READ_DMA retrying (1 retry left) LBA=3D187595536 >> unknown: timeout waiting to issue command >> unknown: error issueing READ_DMA command >> GEOM_MIRROR: Device gm0s1: provider ad4s1 disconnected. >> GEOM_MIRROR: Request failed (error=3D6). ad4s1[WRITE=20 >> (offset=3D134373376, length=3D16384)] >> GEOM_MIRROR: Request failed (error=3D6). ad4s1[WRITE=20 >> (offset=3D134438912, length=3D16384)] >> GEOM_MIRROR: Request failed (error=3D6). ad4s1[WRITE=20 >> (offset=3D268591104, length=3D16384)] >> GEOM_MIRROR: Request failed (error=3D6). ad4s1[WRITE=20 >> (offset=3D268607488, length=3D16384)] >> GEOM_MIRROR: Request failed (error=3D6). ad4s1[WRITE=20 >> (offset=3D268656640, length=3D16384)] >> GEOM_MIRROR: Request failed (error=3D6). ad4s1[WRITE=20 >> (offset=3D5966399488, length=3D2048)] >> GEOM_MIRROR: Request failed (error=3D5). ad4s1[READ=20 >> (offset=3D96048882176, length=3D32768)] >> >> Just like "old times"... However, no page faults! Yay.. But.. what =20= >> is going on here?? Why does the atacontroler or whatever think they >> need to detach my disk?? And how do i reattach it? I have tried =20 >> some stuff with atacontrol: >> >> $ atacontrol list >> ATA channel 0: >> Master: acd0 ATA/ATAPI revision 0 >> Slave: no device present >> ATA channel 1: >> Master: no device present >> Slave: no device present >> ATA channel 2: >> Master: no device present >> Slave: no device present >> ATA channel 3: >> Master: ad6 Serial ATA v1.0 >> Slave: no device present >> $ atacontrol attach ata2 >> atacontrol: ioctl(IOCATAATTACH): File exists >> $ atacontrol reinit ata2 >> < here i get a long system wide block> >> Master: no device present >> Slave: no device present >> $ >> >> Okay so no luck reiniting it.. I dont realy wanna reboot the box =20 >> (each time this might happen).. But im happy that it doesnt crash =20 >> totally anymore heh... >> >> dmesg of current system: > > Feb 2 19:39:09 elfi syslogd: kernel boot file is /boot/kernel/kernel > Feb 2 19:39:09 elfi kernel: Copyright (c) 1992-2005 The FreeBSD =20 > Project. > Feb 2 19:39:09 elfi kernel: Copyright (c) 1979, 1980, 1983, 1986, =20 > 1988, 1989, 1991, 1992, 1993, 1994 > Feb 2 19:39:09 elfi kernel: The Regents of the University of =20 > California. All rights reserved. > Feb 2 19:39:09 elfi kernel: FreeBSD 6.0-RELEASE #2: Thu Dec 1 =20 > 20:18:30 CET 2005 > Feb 2 19:39:09 elfi kernel: johan@elfi.stromnet.org:/usr/obj/usr/=20 > src/sys/GENERIC > Feb 2 19:39:09 elfi kernel: ACPI APIC Table: > Feb 2 19:39:09 elfi kernel: Timecounter "i8254" frequency 1193182 =20 > Hz quality 0 > Feb 2 19:39:09 elfi kernel: CPU: AMD Athlon(tm) XP (1200.01-MHz =20 > 686-class CPU) > Feb 2 19:39:09 elfi kernel: Origin =3D "AuthenticAMD" Id =3D 0x662 =20= > Stepping =3D 2 > Feb 2 19:39:09 elfi kernel: =20 > Features=3D0x383fbff E,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE> > Feb 2 19:39:09 elfi kernel: AMD Features=3D0xc0480800 +,3DNow+,3DNow> > Feb 2 19:39:09 elfi kernel: real memory =3D 536674304 (511 MB) > Feb 2 19:39:09 elfi kernel: avail memory =3D 515833856 (491 MB) > Feb 2 19:39:09 elfi kernel: ioapic0 irqs 0-23 on =20 > motherboard > Feb 2 19:39:09 elfi kernel: npx0: [FAST] > Feb 2 19:39:09 elfi kernel: npx0: on motherboard > Feb 2 19:39:09 elfi kernel: npx0: INT 16 interface > Feb 2 19:39:09 elfi kernel: acpi0: on motherboard > Feb 2 19:39:09 elfi kernel: acpi0: Power Button (fixed) > Feb 2 19:39:09 elfi kernel: pci_link0: irq 0 =20 > on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link1: irq 5 =20 > on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link2: irq 0 =20 > on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link3: irq 0 =20 > on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link4: irq 11 =20= > on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link5: irq 5 =20 > on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link6: irq 5 =20 > on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link7: irq 3 =20 > on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link8: irq 5 =20 > on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link9: irq 0 =20 > on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link10: irq =20 > 11 on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link11: irq 0 =20= > on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link12: irq 5 =20= > on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link13: irq 0 =20= > on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link14: irq 0 =20= > on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link15: irq =20 > 10 on acpi0 > Feb 2 19:39:09 elfi kernel: pci_link16: irq 0 =20= > on acpi0 > Feb 2 19:39:09 elfi kernel: Timecounter "ACPI-fast" frequency =20 > 3579545 Hz quality 1000 > Feb 2 19:39:09 elfi kernel: acpi_timer0: <24-bit timer at =20 > 3.579545MHz> port 0x4008-0x400b on acpi0 > Feb 2 19:39:09 elfi kernel: cpu0: on acpi0 > Feb 2 19:39:09 elfi kernel: acpi_throttle0: =20 > on cpu0 > Feb 2 19:39:09 elfi kernel: pcib0: port =20 > 0xcf8-0xcff on acpi0 > Feb 2 19:39:09 elfi kernel: pci0: on pcib0 > Feb 2 19:39:09 elfi kernel: agp0: =20 > mem 0xf8000000-0xfbffffff at device 0.0 on pci0 > Feb 2 19:39:09 elfi kernel: pci0: at device 0.1 (no =20 > driver attached) > Feb 2 19:39:09 elfi kernel: pci0: at device 0.2 (no =20 > driver attached) > Feb 2 19:39:09 elfi kernel: pci0: at device 0.3 (no =20 > driver attached) > Feb 2 19:39:09 elfi kernel: pci0: at device 0.4 (no =20 > driver attached) > Feb 2 19:39:09 elfi kernel: pci0: at device 0.5 (no =20 > driver attached) > Feb 2 19:39:09 elfi kernel: isab0: at device 1.0 =20 > on pci0 > Feb 2 19:39:09 elfi kernel: isa0: on isab0 > Feb 2 19:39:09 elfi kernel: pci0: at device =20 > 1.1 (no driver attached) > Feb 2 19:39:09 elfi kernel: ohci0: =20= > mem 0xfebfb000-0xfebfbfff irq 20 at device 2.0 on pci0 > Feb 2 19:39:09 elfi kernel: ohci0: [GIANT-LOCKED] > Feb 2 19:39:09 elfi kernel: usb0: OHCI version 1.0, legacy support > Feb 2 19:39:09 elfi kernel: usb0: =20 > on ohci0 > Feb 2 19:39:09 elfi kernel: usb0: USB revision 1.0 > Feb 2 19:39:09 elfi kernel: uhub0: nVidia OHCI root hub, class =20 > 9/0, rev 1.00/1.00, addr 1 > Feb 2 19:39:09 elfi kernel: uhub0: 4 ports with 4 removable, self =20 > powered > Feb 2 19:39:09 elfi kernel: ohci1: =20= > mem 0xfebfc000-0xfebfcfff irq 21 at device 2.1 on pci0 > Feb 2 19:39:09 elfi kernel: ohci1: [GIANT-LOCKED] > Feb 2 19:39:09 elfi kernel: usb1: OHCI version 1.0, legacy support > Feb 2 19:39:09 elfi kernel: usb1: =20 > on ohci1 > Feb 2 19:39:09 elfi kernel: usb1: USB revision 1.0 > Feb 2 19:39:09 elfi kernel: uhub1: nVidia OHCI root hub, class =20 > 9/0, rev 1.00/1.00, addr 1 > Feb 2 19:39:09 elfi kernel: uhub1: 4 ports with 4 removable, self =20 > powered > Feb 2 19:39:09 elfi kernel: ehci0: controller> mem 0xfebfdc00-0xfebfdcff irq 22 at device 2.2 on pci0 > Feb 2 19:39:09 elfi kernel: ehci0: [GIANT-LOCKED] > Feb 2 19:39:09 elfi kernel: usb2: EHCI version 1.0 > Feb 2 19:39:09 elfi kernel: usb2: companion controllers, 4 ports =20 > each: usb0 usb1 > Feb 2 19:39:09 elfi kernel: usb2: controller> on ehci0 > Feb 2 19:39:09 elfi kernel: usb2: USB revision 2.0 > Feb 2 19:39:09 elfi kernel: uhub2: nVidia EHCI root hub, class =20 > 9/0, rev 2.00/1.00, addr 1 > Feb 2 19:39:09 elfi kernel: uhub2: 8 ports with 8 removable, self =20 > powered > Feb 2 19:39:09 elfi kernel: nve0: Adapter> port 0xdc00-0xdc07 mem 0xfebfe000-0xfebfefff irq 20 at =20 > device 4.0 on pci0 > Feb 2 19:39:09 elfi kernel: nve0: Ethernet address 00:13:d4:bf:5b:79 > Feb 2 19:39:09 elfi kernel: miibus0: on nve0 > Feb 2 19:39:09 elfi kernel: rlphy0: interface> on miibus0 > Feb 2 19:39:09 elfi kernel: rlphy0: 10baseT, 10baseT-FDX, =20 > 100baseTX, 100baseTX-FDX, auto > Feb 2 19:39:09 elfi kernel: nve0: Ethernet address: 00:13:d4:bf:5b:79 > Feb 2 19:39:09 elfi kernel: nve0: [GIANT-LOCKED] > Feb 2 19:39:09 elfi kernel: pci0: at device =20 > 6.0 (no driver attached) > Feb 2 19:39:09 elfi kernel: pcib1: at device =20= > 8.0 on pci0 > Feb 2 19:39:09 elfi kernel: pci_link0: BIOS IRQ 22 for 0.11.INTA =20 > is invalid > Feb 2 19:39:09 elfi kernel: pci_link2: BIOS IRQ 21 for 0.6.INTA is =20= > invalid > Feb 2 19:39:09 elfi kernel: pci2: on pcib1 > Feb 2 19:39:09 elfi kernel: pci2: at device 6.0 (no =20= > driver attached) > Feb 2 19:39:09 elfi kernel: xl0: <3Com 3c905C-TX Fast Etherlink =20 > XL> port 0xcc00-0xcc7f mem 0xfeafec00-0xfeafec7f irq 17 at device =20 > 9.0 on pci2 > Feb 2 19:39:09 elfi kernel: miibus1: on xl0 > Feb 2 19:39:09 elfi kernel: xlphy0: <3c905C 10/100 internal PHY> =20 > on miibus1 > Feb 2 19:39:09 elfi kernel: xlphy0: 10baseT, 10baseT-FDX, =20 > 100baseTX, 100baseTX-FDX, auto > Feb 2 19:39:09 elfi kernel: xl0: Ethernet address: 00:04:76:ef:c6:36 > Feb 2 19:39:09 elfi kernel: atapci0: controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xffa0-0xffaf =20 > at device 9.0 on pci0 > Feb 2 19:39:09 elfi kernel: ata0: on atapci0 > Feb 2 19:39:09 elfi kernel: ata1: on atapci0 > Feb 2 19:39:09 elfi kernel: atapci1: controller> port =20 > 0xec00-0xec07,0xe880-0xe883,0xe800-0xe807,0xe480-0xe483,0x7f00-0x7f0f,=20= > 0x7c00- > 0x7c7f irq 22 at device 11.0 on pci0 > Feb 2 19:39:09 elfi kernel: ata2: on atapci1 > Feb 2 19:39:09 elfi kernel: ata3: on atapci1 > Feb 2 19:39:09 elfi kernel: pcib2: at device =20= > 30.0 on pci0 > Feb 2 19:39:09 elfi kernel: pci1: on pcib2 > Feb 2 19:39:09 elfi kernel: acpi_button0: on acpi0 > Feb 2 19:39:09 elfi kernel: fdc0: =20 > port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0 > Feb 2 19:39:09 elfi kernel: fdc0: [FAST] > Feb 2 19:39:09 elfi kernel: ppc0: port =20= > 0x378-0x37f,0x778-0x77f irq 7 drq 3 on acpi0 > Feb 2 19:39:09 elfi kernel: ppc0: SMC-like chipset (ECP/EPP/PS2/=20 > NIBBLE) in COMPATIBLE mode > Feb 2 19:39:09 elfi kernel: ppc0: FIFO with 16/16/9 bytes threshold > Feb 2 19:39:09 elfi kernel: ppbus0: on ppc0 > Feb 2 19:39:09 elfi kernel: plip0: on ppbus0 > Feb 2 19:39:09 elfi kernel: lpt0: on ppbus0 > Feb 2 19:39:09 elfi kernel: lpt0: Interrupt-driven port > Feb 2 19:39:09 elfi kernel: ppi0: on ppbus0 > Feb 2 19:39:09 elfi kernel: atkbdc0: =20= > port 0x60,0x64 irq 1 on acpi0 > Feb 2 19:39:09 elfi kernel: atkbd0: irq 1 on atkbdc0 > Feb 2 19:39:09 elfi kernel: kbd0 at atkbd0 > Feb 2 19:39:09 elfi kernel: atkbd0: [GIANT-LOCKED] > Feb 2 19:39:09 elfi kernel: sio0: <16550A-compatible COM port> =20 > port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 > Feb 2 19:39:09 elfi kernel: sio0: type 16550A > Feb 2 19:39:09 elfi kernel: pmtimer0 on isa0 > Feb 2 19:39:09 elfi kernel: orm0: at iomem =20 > 0xc0000-0xc7fff,0xc8000-0xc87ff on isa0 > Feb 2 19:39:09 elfi kernel: sc0: at flags 0x100 =20 > on isa0 > Feb 2 19:39:09 elfi kernel: sc0: VGA <16 virtual consoles, =20 > flags=3D0x300> > Feb 2 19:39:09 elfi kernel: sio1: configured irq 3 not in bitmap =20 > of probed irqs 0 > Feb 2 19:39:09 elfi kernel: sio1: port may not be enabled > Feb 2 19:39:09 elfi kernel: vga0: at port =20 > 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 > Feb 2 19:39:09 elfi kernel: Timecounter "TSC" frequency 1200006671 =20= > Hz quality 800 > Feb 2 19:39:09 elfi kernel: Timecounters tick every 1.000 msec > Feb 2 19:39:09 elfi kernel: acd0: CDROM at =20 > ata0-master PIO4 > Feb 2 19:39:09 elfi kernel: ad4: 286188MB BANC1G10> at ata2-master SATA150 > Feb 2 19:39:09 elfi kernel: ad6: 286188MB BANC1G10> at ata3-master SATA150 > Feb 2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1 created =20 > (id=3D4118114647). > Feb 2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider =20 > ad4s1 detected. > Feb 2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider =20 > ad6s1 detected. > Feb 2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider =20 > ad6s1 activated. > Feb 2 19:39:09 elfi kernel: Root mount waiting for: GMIRROR > Feb 2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider =20 > ad4s1 activated. > Feb 2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider =20 > mirror/gm0s1 launched. > > > There we go..: ) The last was from a previous boot before i pulled =20 > the promise card out... Has worked fine since (7 days uptime). > >> >> I could try to move the disks to my promise sata2 tx4 card i =20 >> bought for the old mobo (which didnt have sata)... But i'd rather =20 >> find the problem ;) >> >> Hope someone can help. >> Thanks >> Johan >> > I tried to do some more revival of the disconnected disk, no success. =20= pulled it out and plugged it back in again, still not detected.. =20 tried all sorts of combinations of reinit attach detach etc with =20 atacontrol... Finnaly I gave up and rebooted the box and now it's =20 rebuilding again... Does anyone have any clue why this is happening? Okay its better than =20= before, no crashing.. but loosing one drive in a gmirror and having =20 to reboot to fix it is not good. Thanks= --Apple-Mail-3-381948905--