From owner-freebsd-stable@FreeBSD.ORG Sat Aug 5 13:35:49 2006
Return-Path:
X-Original-To: freebsd-stable@freebsd.org
Delivered-To: freebsd-stable@freebsd.org
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 193A816A4DD for ; Sat, 5 Aug 2006 13:35:49 +0000 (UTC) (envelope-from sumirati@yahoo.de)
Received: from smtp005.mail.ukl.yahoo.com (smtp005.mail.ukl.yahoo.com [217.12.11.36]) by mx1.FreeBSD.org (Postfix) with SMTP id 4099F43D49 for ; Sat, 5 Aug 2006 13:35:48 +0000 (GMT) (envelope-from sumirati@yahoo.de)
Received: (qmail 6302 invoked from network); 5 Aug 2006 13:35:47 -0000
DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.de; h=Received:Mime-Version:To:Message-Id:Content-Type:From:Subject:Date:X-Mailer; b=g2IXF4eWcC0vLqp2VxupCcczDSmhzaQPNiYeiWxQr84+ooIyvxiP2fKOa0MuaAh0mvVyYXRcUE0CvWAYB1Pjs8XIAofLaK2FBdGEOJZJcXb+1yWL6KP6AejEk86gs01wBsJOD7OseVXd701cUvSPq8vN0SEOPHcWRikJvj7fV7k= ;
Received: from unknown (HELO ?192.168.66.66?) (sumirati@84.176.112.70 with plain) by smtp005.mail.ukl.yahoo.com with SMTP; 5 Aug 2006 13:35:46 -0000
Mime-Version: 1.0 (Apple Message framework v752.2)
To: freebsd-stable@freebsd.org
Message-Id: <25D490BF-F24D-4A3E-8A31-B352DC42DAD3@yahoo.de>
Content-Type: multipart/mixed; boundary=Apple-Mail-5-384085040
From: Marc Perisa
Date: Sat, 5 Aug 2006 15:35:44 +0200
X-Mailer: Apple Mail (2.752.2)
Subject: Panic with gvinum rebuildparity
X-BeenThere: freebsd-stable@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Production branch of FreeBSD source code
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
X-List-Received-Date: Sat, 05 Aug 2006 13:35:49 -0000

--Apple-Mail-5-384085040
Content-Transfer-Encoding: 7bit
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed

Hello there,

I have had problems with my RAID-5 under gvinum since a power outage caused by a lightning strike.
If I mount the volume and the automatic background fsck runs, or if I check or rebuild the parity, I get a panic.

Shortly before the lightning strike I had begun upgrading from 5.4-stable to 6.1-stable. Because of that I ignored the flaws on my data disk and continued the upgrade. The error first occurred on 5.4-stable without debugging enabled. The machine now runs the following, with a dump device configured:

FreeBSD titan.home.sumirati.net 6.1-STABLE FreeBSD 6.1-STABLE #0: Sun Jul 30 21:19:10 CEST 2006 root@titan.home.sumirati.net:/usr/obj/usr/src/sys/GENERIC i386

The stable sources are from Saturday, 29.07.2006, around 22:00 MET, from cvsup2.de.freebsd.org.

The message in /var/crash/info.* looks like this every time:

Panic String: kmem_malloc(262144): kmem_map too small: 172404736 total allocated

The dmesg and the vinum configuration are attached. I also attached a backtrace, but I think it is not that useful.

I found nothing in the archives about a problem like this. Some PRs show kmem_map errors, but not with (g)vinum.

My questions are:

Is there any known bug or problem with gvinum when rebuilding the parity?

and

How can I get my data back? (of course :))

I'm in the process of making sure the hardware is OK by dd'ing the disks to /dev/null.

Thanks a lot for your time

Marc

--Apple-Mail-5-384085040
Content-Transfer-Encoding: 7bit
Content-Type: application/octet-stream; x-unix-mode=0644; name=dmesg
Content-Disposition: attachment; filename=dmesg

Copyright (c) 1992-2006 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 6.1-STABLE #0: Sun Jul 30 21:19:10 CEST 2006
    root@titan.home.sumirati.net:/usr/obj/usr/src/sys/GENERIC
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: AMD Athlon(tm) processor (902.05-MHz 686-class CPU)
  Origin = "AuthenticAMD" Id = 0x642 Stepping = 2
  Features=0x183fbff
  AMD Features=0xc0440800,MMX+,3DNow+,3DNow>
real memory = 536805376 (511 MB)
avail memory = 515915776 (492 MB)
mptable_probe: MP Config Table has bad signature: POWE
ACPI APIC Table:
ioapic0 irqs 0-23 on motherboard
kbd1 at kbdmux0
ath_hal: 0.9.17.2 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413)
acpi0: on motherboard
acpi0: Power Button (fixed)
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x4008-0x400b on acpi0
cpu0: on acpi0
acpi_button0: on acpi0
pcib0: port 0xcf8-0xcff on acpi0
pci0: on pcib0
Correcting nForce2 C1 CPU disconnect hangs
agp0: mem 0xd0000000-0xd7ffffff at device 0.0 on pci0
pci0: at device 0.1 (no driver attached)
pci0: at device 0.2 (no driver attached)
pci0: at device 0.3 (no driver attached)
pci0: at device 0.4 (no driver attached)
pci0: at device 0.5 (no driver attached)
isab0: at device 1.0 on pci0
isa0: on isab0
pci0: at device 1.1 (no driver attached)
ohci0: mem 0xe2086000-0xe2086fff irq 20 at device 2.0 on pci0
ohci0: [GIANT-LOCKED]
usb0: OHCI version 1.0, legacy support
usb0: SMM does not respond, resetting
usb0: on ohci0
usb0: USB revision 1.0
uhub0: nVidia OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 3 ports with 3 removable, self powered
ohci1: mem 0xe2081000-0xe2081fff irq 21 at device 2.1 on pci0
ohci1: [GIANT-LOCKED]
usb1: OHCI version 1.0, legacy support
usb1: SMM does not respond, resetting
usb1: on ohci1
usb1: USB revision 1.0
uhub1: nVidia OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 3 ports with 3 removable, self powered
ehci0: mem 0xe2082000-0xe20820ff irq 22 at device 2.2 on pci0
ehci0: [GIANT-LOCKED]
usb2: EHCI version 1.0
usb2: companion controllers, 4 ports each: usb0 usb1
usb2: on ehci0
usb2: USB revision 2.0
uhub2: nVidia EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub2: 6 ports with 6 removable, self powered
pci0: at device 4.0 (no driver attached)
pci0: at device 5.0 (no driver attached)
pci0: at device 6.0 (no driver attached)
pcib1: at device 8.0 on pci0
pci1: on pcib1
fxp0: port 0x9000-0x901f mem 0xda000000-0xda000fff,0xd8000000-0xd80fffff irq 18 at device 6.0 on pci1
miibus0: on fxp0
nsphy0: on miibus0
nsphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:a0:c9:00:a8:1f
ahc0: port 0x9400-0x94ff mem 0xd8200000-0xd8200fff irq 19 at device 7.0 on pci1
ahc0: [GIANT-LOCKED]
aic7892: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs
atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xf000-0xf00f at device 9.0 on pci0
ata0: on atapci0
ata1: on atapci0
fwohci0: <1394 Open Host Controller Interface> mem 0xe2084000-0xe20847ff,0xe2085000-0xe208503f irq 20 at device 13.0 on pci0
fwohci0: OHCI version 1.10 (ROM=0)
fwohci0: No. of Isochronous channels is 4.
fwohci0: EUI64 00:01:00:30:04:04:91:3f
fwohci0: Phy 1394a available S400, 3 ports.
fwohci0: Link S400, max_rec 2048 bytes.
firewire0: on fwohci0
fwe0: on firewire0
if_fwe0: Fake Ethernet address: 02:01:00:04:91:3f
fwe0: Ethernet address: 02:01:00:04:91:3f
fwe0: if_start running deferred for Giant
sbp0: on firewire0
fwohci0: Initiate bus reset
fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode
firewire0: 1 nodes, maxhop <= 0, cable IRM = 0 (me)
firewire0: bus manager 0 (me)
pcib2: at device 30.0 on pci0
pci3: on pcib2
pci3: at device 0.0 (no driver attached)
acpi_tz0: on acpi0
fdc0: port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fdc0: [FAST]
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
ppc0: port 0x378-0x37f,0x778-0x77b irq 7 on acpi0
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
ppbus0: on ppc0
plip0: on ppbus0
lpt0: on ppbus0
lpt0: Interrupt-driven port
ppi0: on ppbus0
atkbdc0: port 0x60,0x64 irq 1 on acpi0
atkbd0: irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
pmtimer0 on isa0
orm0: at iomem 0xc0000-0xc9fff,0xcc000-0xcffff,0xd0000-0xd17ff on isa0
sc0: at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounter "TSC" frequency 902050500 Hz quality 800
Timecounters tick every 1.000 msec
Waiting 5 seconds for SCSI devices to settle
ad0: 114473MB at ata0-master UDMA100
ad1: 114473MB at ata0-slave UDMA100
ad2: 114440MB at ata1-master UDMA100
ad3: 114473MB at ata1-slave UDMA100
da0 at ahc0 bus 0 target 8 lun 0
da0: Fixed Direct Access SCSI-3 device
da0: 160.000MB/s transfers (80.000MHz, offset 127, 16bit), Tagged Queueing Enabled
da0: 34732MB (71132000 512 byte sectors: 255H 63S/T 4427C)
cd0 at ahc0 bus 0 target 0 lun 0
cd0: Removable CD-ROM SCSI-2 device
cd0: 20.000MB/s transfers (20.000MHz, offset 16)
cd0: Attempt to query device size failed: NOT READY, Medium not present
cd1 at ahc0 bus 0 target 4 lun 0
cd1: Removable CD-ROM SCSI-2 device
cd1: 20.000MB/s transfers (20.000MHz, offset 8)
cd1: Attempt to query device size failed: NOT READY, Medium not present - tray closed
Trying to mount root from ufs:/dev/da0s1a
WARNING: / was not properly dismounted
WARNING: /usr was not properly dismounted
WARNING: /var was not properly dismounted
fxp0: link state changed to UP

--Apple-Mail-5-384085040
Content-Transfer-Encoding: 7bit
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed

titan# gvinum list
4 drives:
D wd120a         State: up    /dev/ad0s1    A: 33/114470 MB (0%)
D wd120b         State: up    /dev/ad1s1    A: 33/114470 MB (0%)
D wd120c         State: up    /dev/ad2s1    A: 2/114439 MB (0%)
D wd120d         State: up    /dev/ad3s1    A: 33/114470 MB (0%)

1 volume:
V daten          State: up    Plexes: 1     Size: 335 GB

1 plex:
P daten.p0    R5 State: up    Subdisks: 4   Size: 335 GB

4 subdisks:
S daten.p0.s3    State: up    D: wd120d     Size: 111 GB
S daten.p0.s2    State: up    D: wd120c     Size: 111 GB
S daten.p0.s1    State: up    D: wd120b     Size: 111 GB
S daten.p0.s0    State: up    D: wd120a     Size: 111 GB

This GDB was configured as "i386-marcel-freebsd".

Unread portion of the kernel message buffer:
panic: kmem_malloc(262144): kmem_map too small: 172400640 total allocated
Uptime: 18h21m45s
Dumping 511 MB (2 chunks)
  chunk 0: 1MB (159 pages) ... ok
  chunk 1: 511MB (130800 pages) 495 479 463 447 431 415 399 383 367 351 335 319 303 287 271 255 239 223 207 191 175 159 143 127 111 95 79 63 47 31 15

#0 doadump () at pcpu.h:165
165 __asm __volatile("movl %%fs:0,%0" : "=r" (td));
(kgdb) bt
#0 doadump () at pcpu.h:165
#1 0xc0665e0e in boot (howto=260) at /usr/src/sys/kern/kern_shutdown.c:409
#2 0xc06660a4 in panic (fmt=0xc08f0533 "kmem_malloc(%ld): kmem_map too small: %ld total allocated") at /usr/src/sys/kern/kern_shutdown.c:565
#3 0xc07d2f1d in kmem_malloc (map=0xc10430c0, size=262144, flags=258) at /usr/src/sys/vm/vm_kern.c:299
#4 0xc07ca852 in page_alloc (zone=0x0, bytes=262144, pflag=0x0, wait=258) at /usr/src/sys/vm/uma_core.c:958
#5 0xc07ccbab in uma_large_malloc (size=262144, wait=258) at /usr/src/sys/vm/uma_core.c:2701
#6 0xc065b605 in malloc (size=262144, mtp=0xc0948ce0, flags=258) at /usr/src/sys/kern/kern_malloc.c:329
#7 0xc57907ea in ?? ()
#8 0x00040000 in ?? ()
#9 0xc0948ce0 in g_gptmodule_sys_init ()
#10 0x00000102 in ?? ()
#11 0xc389f200 in ?? ()
#12 0x4b2c0000 in ?? ()
#13 0x00000001 in ?? ()
#14 0x00040000 in ?? ()
#15 0x00000000 in ?? ()
#16 0x00000000 in ?? ()
#17 0xe1880000 in ?? ()
#18 0xc5783380 in ?? ()
#19 0xc4b79e70 in ?? ()
#20 0xdeb71d10 in ?? ()
#21 0xc578d985 in ?? ()
#22 0xc3620600 in ?? ()
#23 0xc5783380 in ?? ()
#24 0xc4b8f108 in ?? ()
#25 0xcd5f6000 in ?? ()
#26 0xe1880000 in ?? ()
#27 0x00000003 in ?? ()
#28 0x00040000 in ?? ()
#29 0x00000000 in ?? ()
#30 0x00040000 in ?? ()
#31 0x00000000 in ?? ()
#32 0x00000000 in ?? ()
#33 0x00040000 in ?? ()
#34 0x00000000 in ?? ()
#35 0x00000000 in ?? ()
#36 0xcd6f4000 in ?? ()
#37 0xcd5f4000 in ?? ()
#38 0xc0667124 in sigprocmask (td=0xc3620600, uap=0xc4b96c60) at /usr/src/sys/kern/kern_sig.c:700
Previous frame inner to this frame (corrupt stack?)
(kgdb)

--Apple-Mail-5-384085040--
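
To make the numbers in the panic message easier to compare with the dmesg and the backtrace, here is a minimal sketch, not part of the original report, that parses the panic string quoted in the message body and converts the byte counts to KiB and MiB. The helper name `parse_kmem_panic` and the regular expression are illustrative assumptions, not a FreeBSD tool; only the panic text and the hex value from the backtrace come from the report.

```python
import re

# Panic string exactly as quoted from /var/crash/info.* in the message body.
PANIC = "kmem_malloc(262144): kmem_map too small: 172404736 total allocated"

# Hypothetical pattern for this one message format (an assumption for illustration).
_PATTERN = re.compile(
    r"kmem_malloc\((\d+)\): kmem_map too small: (\d+) total allocated"
)

MIB = 1024 * 1024


def parse_kmem_panic(text):
    """Return (requested_bytes, allocated_bytes) from a kmem_map panic string."""
    match = _PATTERN.search(text)
    if match is None:
        raise ValueError("not a kmem_map panic string")
    return int(match.group(1)), int(match.group(2))


if __name__ == "__main__":
    requested, allocated = parse_kmem_panic(PANIC)
    print(f"requested: {requested} bytes = {requested // 1024} KiB")
    print(f"allocated: {allocated} bytes = {allocated / MIB:.1f} MiB")
    # Several unresolved frames in the attached backtrace (for example #8)
    # show 0x00040000, which is the same request size written in hex.
    print("matches 0x00040000:", requested == 0x00040000)
```

Run as a script, this shows the failing request was 256 KiB at a point where roughly 164 MiB of kernel memory was already allocated, on a machine that the dmesg reports as having 511 MB of real memory.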