Date: Sun, 12 Feb 2006 14:20:22 +0100
From: Johan Ström <johan@stromnet.org>
To: freebsd-stable@freebsd.org
Subject: Re: gmirror/disk problems!
Message-ID: <287751E3-AFFC-4ECA-B887-1E8F85943FA6@stromnet.org>
In-Reply-To: <B6B1F7EE-83A3-4CD4-8343-3DCEFA0F95AA@stromnet.org>
References: <24ECF01E-9881-41F3-A1D9-4C258489D41F@stromnet.org> <B6B1F7EE-83A3-4CD4-8343-3DCEFA0F95AA@stromnet.org>

On 10 feb 2006, at 07.43, Johan Ström wrote:

>
> On 10 feb 2006, at 07.15, Johan Ström wrote:
>
>> Hi list!
>>
>> I've been experiencing problems earlier with gmirror (thread "Page
>> fault, GEOM problem??"). My gmirror crashed, and the box completely
>> froze.
>> Now I got a new mobo, and it has been working great since (no
>> crashes, and I get a decent 40-50 MB/s read/write instead of ~10-20).
>> This morning I woke up to this:
>>
>>
>> subdisk4: detached
>> ad4: detached
>> unknown: TIMEOUT - READ_DMA retrying (1 retry left) LBA=187595536
>> unknown: timeout waiting to issue command
>> unknown: error issueing READ_DMA command
>> GEOM_MIRROR: Device gm0s1: provider ad4s1 disconnected.
>> GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE(offset=134373376, length=16384)]
>> GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE(offset=134438912, length=16384)]
>> GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE(offset=268591104, length=16384)]
>> GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE(offset=268607488, length=16384)]
>> GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE(offset=268656640, length=16384)]
>> GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE(offset=5966399488, length=2048)]
>> GEOM_MIRROR: Request failed (error=5). ad4s1[READ(offset=96048882176, length=32768)]
>>
>> Just like "old times"... However, no page faults! Yay.. But.. what
>> is going on here?? Why does the ATA controller, or whatever it is,
>> think it needs to detach my disk?? And how do I reattach it? I have
>> tried some stuff with atacontrol:
>>
>> $ atacontrol list
>> ATA channel 0:
>>     Master: acd0 <CD-ROM CDU701-F/1.0q> ATA/ATAPI revision 0
>>     Slave: no device present
>> ATA channel 1:
>>     Master: no device present
>>     Slave: no device present
>> ATA channel 2:
>>     Master: no device present
>>     Slave: no device present
>> ATA channel 3:
>>     Master: ad6 <Maxtor 7L300S0/BANC1G10> Serial ATA v1.0
>>     Slave: no device present
>> $ atacontrol attach ata2
>> atacontrol: ioctl(IOCATAATTACH): File exists
>> $ atacontrol reinit ata2
>> < here I get a long system-wide block >
>> Master: no device present
>> Slave: no device present
>> $
>>
>> Okay, so no luck reinitializing it.. I don't really want to reboot
>> the box (each time this might happen).. But I'm happy that it doesn't
>> crash totally anymore, heh...
>>
>> dmesg of current system:
>
> Feb 2 19:39:09 elfi syslogd: kernel boot file is /boot/kernel/kernel
> Feb 2 19:39:09 elfi kernel: Copyright (c) 1992-2005 The FreeBSD Project.
> Feb 2 19:39:09 elfi kernel: Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
> Feb 2 19:39:09 elfi kernel: The Regents of the University of California. All rights reserved.
> Feb 2 19:39:09 elfi kernel: FreeBSD 6.0-RELEASE #2: Thu Dec 1 20:18:30 CET 2005
> Feb 2 19:39:09 elfi kernel: johan@elfi.stromnet.org:/usr/obj/usr/src/sys/GENERIC
> Feb 2 19:39:09 elfi kernel: ACPI APIC Table: <A M I OEMAPIC >
> Feb 2 19:39:09 elfi kernel: Timecounter "i8254" frequency 1193182 Hz quality 0
> Feb 2 19:39:09 elfi kernel: CPU: AMD Athlon(tm) XP (1200.01-MHz 686-class CPU)
> Feb 2 19:39:09 elfi kernel: Origin = "AuthenticAMD" Id = 0x662 Stepping = 2
> Feb 2 19:39:09 elfi kernel: Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
> Feb 2 19:39:09 elfi kernel: AMD Features=0xc0480800<SYSCALL,MP,MMX+,3DNow+,3DNow>
> Feb 2 19:39:09 elfi kernel: real memory = 536674304 (511 MB)
> Feb 2 19:39:09 elfi kernel: avail memory = 515833856 (491 MB)
> Feb 2 19:39:09 elfi kernel: ioapic0 <Version 1.1> irqs 0-23 on motherboard
> Feb 2 19:39:09 elfi kernel: npx0: [FAST]
> Feb 2 19:39:09 elfi kernel: npx0: <math processor> on motherboard
> Feb 2 19:39:09 elfi kernel: npx0: INT 16 interface
> Feb 2 19:39:09 elfi kernel: acpi0: <A M I OEMRSDT> on motherboard
> Feb 2 19:39:09 elfi kernel: acpi0: Power Button (fixed)
> Feb 2 19:39:09 elfi kernel: pci_link0: <ACPI PCI Link LNKA> irq 0 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link1: <ACPI PCI Link LNKB> irq 5 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link2: <ACPI PCI Link LNKC> irq 0 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link3: <ACPI PCI Link LNKD> irq 0 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link4: <ACPI PCI Link LNKE> irq 11 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link5: <ACPI PCI Link LUS0> irq 5 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link6: <ACPI PCI Link LUS1> irq 5 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link7: <ACPI PCI Link LUS2> irq 3 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link8: <ACPI PCI Link LKLN> irq 5 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link9: <ACPI PCI Link LAPU> irq 0 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link10: <ACPI PCI Link LAUI> irq 11 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link11: <ACPI PCI Link LKMO> irq 0 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link12: <ACPI PCI Link LKSM> irq 5 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link13: <ACPI PCI Link LFWR> irq 0 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link14: <ACPI PCI Link LETH> irq 0 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link15: <ACPI PCI Link LATA> irq 10 on acpi0
> Feb 2 19:39:09 elfi kernel: pci_link16: <ACPI PCI Link LSHD> irq 0 on acpi0
> Feb 2 19:39:09 elfi kernel: Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
> Feb 2 19:39:09 elfi kernel: acpi_timer0: <24-bit timer at 3.579545MHz> port 0x4008-0x400b on acpi0
> Feb 2 19:39:09 elfi kernel: cpu0: <ACPI CPU> on acpi0
> Feb 2 19:39:09 elfi kernel: acpi_throttle0: <ACPI CPU Throttling> on cpu0
> Feb 2 19:39:09 elfi kernel: pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
> Feb 2 19:39:09 elfi kernel: pci0: <ACPI PCI bus> on pcib0
> Feb 2 19:39:09 elfi kernel: agp0: <NVIDIA nForce2 AGP Controller> mem 0xf8000000-0xfbffffff at device 0.0 on pci0
> Feb 2 19:39:09 elfi kernel: pci0: <memory, RAM> at device 0.1 (no driver attached)
> Feb 2 19:39:09 elfi kernel: pci0: <memory, RAM> at device 0.2 (no driver attached)
> Feb 2 19:39:09 elfi kernel: pci0: <memory, RAM> at device 0.3 (no driver attached)
> Feb 2 19:39:09 elfi kernel: pci0: <memory, RAM> at device 0.4 (no driver attached)
> Feb 2 19:39:09 elfi kernel: pci0: <memory, RAM> at device 0.5 (no driver attached)
> Feb 2 19:39:09 elfi kernel: isab0: <PCI-ISA bridge> at device 1.0 on pci0
> Feb 2 19:39:09 elfi kernel: isa0: <ISA bus> on isab0
> Feb 2 19:39:09 elfi kernel: pci0: <serial bus, SMBus> at device 1.1 (no driver attached)
> Feb 2 19:39:09 elfi kernel: ohci0: <OHCI (generic) USB controller> mem 0xfebfb000-0xfebfbfff irq 20 at device 2.0 on pci0
> Feb 2 19:39:09 elfi kernel: ohci0: [GIANT-LOCKED]
> Feb 2 19:39:09 elfi kernel: usb0: OHCI version 1.0, legacy support
> Feb 2 19:39:09 elfi kernel: usb0: <OHCI (generic) USB controller> on ohci0
> Feb 2 19:39:09 elfi kernel: usb0: USB revision 1.0
> Feb 2 19:39:09 elfi kernel: uhub0: nVidia OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
> Feb 2 19:39:09 elfi kernel: uhub0: 4 ports with 4 removable, self powered
> Feb 2 19:39:09 elfi kernel: ohci1: <OHCI (generic) USB controller> mem 0xfebfc000-0xfebfcfff irq 21 at device 2.1 on pci0
> Feb 2 19:39:09 elfi kernel: ohci1: [GIANT-LOCKED]
> Feb 2 19:39:09 elfi kernel: usb1: OHCI version 1.0, legacy support
> Feb 2 19:39:09 elfi kernel: usb1: <OHCI (generic) USB controller> on ohci1
> Feb 2 19:39:09 elfi kernel: usb1: USB revision 1.0
> Feb 2 19:39:09 elfi kernel: uhub1: nVidia OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
> Feb 2 19:39:09 elfi kernel: uhub1: 4 ports with 4 removable, self powered
> Feb 2 19:39:09 elfi kernel: ehci0: <EHCI (generic) USB 2.0 controller> mem 0xfebfdc00-0xfebfdcff irq 22 at device 2.2 on pci0
> Feb 2 19:39:09 elfi kernel: ehci0: [GIANT-LOCKED]
> Feb 2 19:39:09 elfi kernel: usb2: EHCI version 1.0
> Feb 2 19:39:09 elfi kernel: usb2: companion controllers, 4 ports each: usb0 usb1
> Feb 2 19:39:09 elfi kernel: usb2: <EHCI (generic) USB 2.0 controller> on ehci0
> Feb 2 19:39:09 elfi kernel: usb2: USB revision 2.0
> Feb 2 19:39:09 elfi kernel: uhub2: nVidia EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
> Feb 2 19:39:09 elfi kernel: uhub2: 8 ports with 8 removable, self powered
> Feb 2 19:39:09 elfi kernel: nve0: <NVIDIA nForce MCP5 Networking Adapter> port 0xdc00-0xdc07 mem 0xfebfe000-0xfebfefff irq 20 at device 4.0 on pci0
> Feb 2 19:39:09 elfi kernel: nve0: Ethernet address 00:13:d4:bf:5b:79
> Feb 2 19:39:09 elfi kernel: miibus0: <MII bus> on nve0
> Feb 2 19:39:09 elfi kernel: rlphy0: <RTL8201L 10/100 media interface> on miibus0
> Feb 2 19:39:09 elfi kernel: rlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
> Feb 2 19:39:09 elfi kernel: nve0: Ethernet address: 00:13:d4:bf:5b:79
> Feb 2 19:39:09 elfi kernel: nve0: [GIANT-LOCKED]
> Feb 2 19:39:09 elfi kernel: pci0: <multimedia, audio> at device 6.0 (no driver attached)
> Feb 2 19:39:09 elfi kernel: pcib1: <ACPI PCI-PCI bridge> at device 8.0 on pci0
> Feb 2 19:39:09 elfi kernel: pci_link0: BIOS IRQ 22 for 0.11.INTA is invalid
> Feb 2 19:39:09 elfi kernel: pci_link2: BIOS IRQ 21 for 0.6.INTA is invalid
> Feb 2 19:39:09 elfi kernel: pci2: <ACPI PCI bus> on pcib1
> Feb 2 19:39:09 elfi kernel: pci2: <display, VGA> at device 6.0 (no driver attached)
> Feb 2 19:39:09 elfi kernel: xl0: <3Com 3c905C-TX Fast Etherlink XL> port 0xcc00-0xcc7f mem 0xfeafec00-0xfeafec7f irq 17 at device 9.0 on pci2
> Feb 2 19:39:09 elfi kernel: miibus1: <MII bus> on xl0
> Feb 2 19:39:09 elfi kernel: xlphy0: <3c905C 10/100 internal PHY> on miibus1
> Feb 2 19:39:09 elfi kernel: xlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
> Feb 2 19:39:09 elfi kernel: xl0: Ethernet address: 00:04:76:ef:c6:36
> Feb 2 19:39:09 elfi kernel: atapci0: <nVidia nForce2 MCP UDMA133 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xffa0-0xffaf at device 9.0 on pci0
> Feb 2 19:39:09 elfi kernel: ata0: <ATA channel 0> on atapci0
> Feb 2 19:39:09 elfi kernel: ata1: <ATA channel 1> on atapci0
> Feb 2 19:39:09 elfi kernel: atapci1: <nVidia nForce2 MCP SATA150 controller> port 0xec00-0xec07,0xe880-0xe883,0xe800-0xe807,0xe480-0xe483,0x7f00-0x7f0f,0x7c00-0x7c7f irq 22 at device 11.0 on pci0
> Feb 2 19:39:09 elfi kernel: ata2: <ATA channel 0> on atapci1
> Feb 2 19:39:09 elfi kernel: ata3: <ATA channel 1> on atapci1
> Feb 2 19:39:09 elfi kernel: pcib2: <ACPI PCI-PCI bridge> at device 30.0 on pci0
> Feb 2 19:39:09 elfi kernel: pci1: <ACPI PCI bus> on pcib2
> Feb 2 19:39:09 elfi kernel: acpi_button0: <Power Button> on acpi0
> Feb 2 19:39:09 elfi kernel: fdc0: <floppy drive controller (FDE)> port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0
> Feb 2 19:39:09 elfi kernel: fdc0: [FAST]
> Feb 2 19:39:09 elfi kernel: ppc0: <ECP parallel printer port> port 0x378-0x37f,0x778-0x77f irq 7 drq 3 on acpi0
> Feb 2 19:39:09 elfi kernel: ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
> Feb 2 19:39:09 elfi kernel: ppc0: FIFO with 16/16/9 bytes threshold
> Feb 2 19:39:09 elfi kernel: ppbus0: <Parallel port bus> on ppc0
> Feb 2 19:39:09 elfi kernel: plip0: <PLIP network interface> on ppbus0
> Feb 2 19:39:09 elfi kernel: lpt0: <Printer> on ppbus0
> Feb 2 19:39:09 elfi kernel: lpt0: Interrupt-driven port
> Feb 2 19:39:09 elfi kernel: ppi0: <Parallel I/O> on ppbus0
> Feb 2 19:39:09 elfi kernel: atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
> Feb 2 19:39:09 elfi kernel: atkbd0: <AT Keyboard> irq 1 on atkbdc0
> Feb 2 19:39:09 elfi kernel: kbd0 at atkbd0
> Feb 2 19:39:09 elfi kernel: atkbd0: [GIANT-LOCKED]
> Feb 2 19:39:09 elfi kernel: sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
> Feb 2 19:39:09 elfi kernel: sio0: type 16550A
> Feb 2 19:39:09 elfi kernel: pmtimer0 on isa0
> Feb 2 19:39:09 elfi kernel: orm0: <ISA Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xc87ff on isa0
> Feb 2 19:39:09 elfi kernel: sc0: <System console> at flags 0x100 on isa0
> Feb 2 19:39:09 elfi kernel: sc0: VGA <16 virtual consoles, flags=0x300>
> Feb 2 19:39:09 elfi kernel: sio1: configured irq 3 not in bitmap of probed irqs 0
> Feb 2 19:39:09 elfi kernel: sio1: port may not be enabled
> Feb 2 19:39:09 elfi kernel: vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
> Feb 2 19:39:09 elfi kernel: Timecounter "TSC" frequency 1200006671 Hz quality 800
> Feb 2 19:39:09 elfi kernel: Timecounters tick every 1.000 msec
> Feb 2 19:39:09 elfi kernel: acd0: CDROM <CD-ROM CDU701-F/1.0q> at ata0-master PIO4
> Feb 2 19:39:09 elfi kernel: ad4: 286188MB <Maxtor 7L300S0 BANC1G10> at ata2-master SATA150
> Feb 2 19:39:09 elfi kernel: ad6: 286188MB <Maxtor 7L300S0 BANC1G10> at ata3-master SATA150
> Feb 2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1 created (id=4118114647).
> Feb 2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider ad4s1 detected.
> Feb 2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider ad6s1 detected.
> Feb 2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider ad6s1 activated.
> Feb 2 19:39:09 elfi kernel: Root mount waiting for: GMIRROR
> Feb 2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider ad4s1 activated.
> Feb 2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider mirror/gm0s1 launched.
>
>
> There we go.. :) The last was from a previous boot, before I pulled
> the Promise card out... Has worked fine since (7 days uptime).
>
>>
>> I could try to move the disks to the Promise sata2 tx4 card I
>> bought for the old mobo (which didn't have SATA)... But I'd rather
>> find the problem ;)
>>
>> Hope someone can help.
>> Thanks
>> Johan
>>
>

I tried to do some more revival of the disconnected disk, no success.
Pulled it out and plugged it back in again, still not detected..
Tried all sorts of combinations of reinit, attach, detach etc. with
atacontrol... Finally I gave up and rebooted the box, and now it's
rebuilding again...

Does anyone have any clue why this is happening? Okay, it's better than
before, no crashing.. but losing one drive in a gmirror and having
to reboot to fix it is not good.

Thanks
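
For anyone trying to read the GEOM_MIRROR "Request failed" lines quoted above, here is a minimal sketch in Python that tallies them per provider, operation and error. It assumes the kernel messages look like the ones in this mail; the names summarize, LINE_RE and SAMPLE are only illustrative and not part of any FreeBSD tool. On FreeBSD, errno 6 is ENXIO (device not configured) and errno 5 is EIO (input/output error).

```python
import re
from collections import Counter

# errno values that appear in the quoted log (FreeBSD numbering).
ERRNO_NAMES = {5: "EIO", 6: "ENXIO"}

# Matches lines such as:
#   GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE(offset=1, length=2)]
# \s* tolerates a stray space between the operation and the parenthesis.
LINE_RE = re.compile(
    r"GEOM_MIRROR: Request failed \(error=(\d+)\)\.\s*"
    r"(\w+)\[(READ|WRITE)\s*\(offset=(\d+), length=(\d+)\)\]"
)


def summarize(log_text):
    """Count failed requests per (provider, operation, errno name)."""
    counts = Counter()
    for match in LINE_RE.finditer(log_text):
        err = int(match.group(1))
        provider = match.group(2)
        operation = match.group(3)
        counts[(provider, operation, ERRNO_NAMES.get(err, str(err)))] += 1
    return counts


# Three lines copied from the report, used here as sample input.
SAMPLE = """\
GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE(offset=134373376, length=16384)]
GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE(offset=134438912, length=16384)]
GEOM_MIRROR: Request failed (error=5). ad4s1[READ(offset=96048882176, length=32768)]
"""

if __name__ == "__main__":
    for key, count in summarize(SAMPLE).items():
        print(key, count)
```

In the log from this report every failure is on ad4s1, which is consistent with the whole device having been detached rather than with isolated bad sectors, though the sketch itself only counts lines and cannot confirm the cause.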