From owner-freebsd-stable@FreeBSD.ORG Mon Apr 7 14:08:00 2003 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 02E8A37B407 for ; Mon, 7 Apr 2003 14:08:00 -0700 (PDT) Received: from clavin.cluepon.com (clavin.cluepon.com [64.154.215.6]) by mx1.FreeBSD.org (Postfix) with ESMTP id 4F5D843FCB for ; Mon, 7 Apr 2003 14:07:58 -0700 (PDT) (envelope-from lamont@cluepon.com) Received: from lamont by clavin.cluepon.com with local (Exim 3.03 #1) id 192dqP-0009Ls-00 for stable@freebsd.org; Mon, 07 Apr 2003 14:07:57 -0700 Date: Mon, 7 Apr 2003 14:07:57 -0700 From: Lamont Lucas To: stable@freebsd.org Message-ID: <20030407210757.GE70647@clavin.cluepon.com> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="NzB8fVQJ5HfG6fxh" Content-Disposition: inline Organization: Cluepon Consulting, Inc. User-Agent: Mutt/1.5.4i Subject: kmem_malloc crash with 4.8 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 07 Apr 2003 21:08:00 -0000 --NzB8fVQJ5HfG6fxh Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Hi. I've recently borrowed a SuperMicro 6013P-8 from a vendor for the purposes of testing it to see if FreeBSD and these machines will make a good replacements for our current flock of 11 Netra T1s running solaris. Things are going very well with the tests, with the glaring exception of the following crash: While doing some significant disk operations: du, deleting a different part of the tree and finally running "sync" the machine locked hard with the following errors: panic: kmem_malloc(4096): kmem_map too small: 230162432 total allocated mp_lock = 03000001; cpuid = 3; lapic.id = 07000000 boot() called on cpu#3 Then "Synching disks" but no futher output is printed, nor is any further progress made. This machine was installed off of 4.8-release and was rebuilt under -stable as of april 4th, late in the afternoon. The kernel conf file is attached, but biggest changes I made were to enable SMP as well as hyperthreading. It has 2 gigs of ram and 2 2.4 ghz xeon processors capable of hyperthreading. The drive is a single controlled by a AIC7902 Ultra 320 scsi adapter. I saw the previous discussion about the adaptec U320 controllers and scott long's advice about "lower[ing] the tag depth to 32" using camcontrol tags. I'm currently planning on removing hyperthread support and rerunning some of my tests to see if I can reproduce the error. I'm also going to try some bonnie tests. If I can consistantly reproduce the error I'll report back, but I'd appreciate knowing what additional debug info I can give back. I'm not clear what would cause this type of error. Attached are dmesg as well as the kernel config file. Can anybody recommend any other steps to try and eliminate or fix this problem? -- - Lamont "I am not an atomic playboy." --NzB8fVQJ5HfG6fxh Content-Type: text/plain; charset=us-ascii Content-Disposition: attachment; filename="dmesg.20030407" Copyright (c) 1992-2003 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 4.8-RELEASE #1: Sat Apr 5 03:51:24 PST 2003 root@sm0test.shockwave.com:/usr/obj/usr/src/sys/SM6013P-8 Timecounter "i8254" frequency 1193182 Hz CPU: Intel(R) Xeon(TM) CPU 2.40GHz (2399.33-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0xf27 Stepping = 7 Features=0xbfebfbff Hyperthreading: 2 logical CPUs real memory = 2146959360 (2096640K bytes) avail memory = 2085826560 (2036940K bytes) Programming 24 pins in IOAPIC #0 IOAPIC #0 intpin 2 -> irq 0 Programming 24 pins in IOAPIC #1 Programming 24 pins in IOAPIC #2 FreeBSD/SMP: Multiprocessor motherboard cpu0 (BSP): apic id: 0, version: 0x00050014, at 0xfee00000 cpu1 (AP): apic id: 6, version: 0x00050014, at 0xfee00000 cpu2 (AP): apic id: 1, version: 0x00050014, at 0xfee00000 cpu3 (AP): apic id: 7, version: 0x00050014, at 0xfee00000 io0 (APIC): apic id: 2, version: 0x00178020, at 0xfec00000 io1 (APIC): apic id: 3, version: 0x00178020, at 0xfec80000 io2 (APIC): apic id: 4, version: 0x00178020, at 0xfec80400 Preloaded elf kernel "kernel" at 0xc052c000. Pentium Pro MTRR support enabled md0: Malloc disk Using $PIR table, 24 entries at 0xc00fde40 npx0: on motherboard npx0: INT 16 interface pcib0: on motherboard IOAPIC #0 intpin 16 -> irq 2 IOAPIC #0 intpin 19 -> irq 10 IOAPIC #0 intpin 18 -> irq 11 pci0: on pcib0 pcib1: at device 2.0 on pci0 pci1: on pcib1 pci1: (vendor=0x8086, dev=0x1461) at 28.0 pcib2: at device 29.0 on pci1 IOAPIC #2 intpin 6 -> irq 16 IOAPIC #2 intpin 7 -> irq 17 pci2: on pcib2 em0: port 0x3000-0x303f mem 0xfc200000-0xfc21ffff irq 16 at device 3.0 on pci2 em0: Speed:100 Mbps Duplex:Full em1: port 0x3040-0x307f mem 0xfc220000-0xfc23ffff irq 17 at device 3.1 on pci2 em1: Speed:N/A Duplex:N/A pci1: (vendor=0x8086, dev=0x1461) at 30.0 pcib3: at device 31.0 on pci1 IOAPIC #1 intpin 4 -> irq 18 IOAPIC #1 intpin 5 -> irq 19 pci3: on pcib3 ahd0: port 0x4000-0x40ff,0x4400-0x44ff mem 0xfc300000-0xfc301fff irq 18 at device 2.0 on pci3 aic7902: Ultra320 Wide Channel A, SCSI Id=7, PCI-X 101-133Mhz, 512 SCBs ahd1: port 0x4800-0x48ff,0x4c00-0x4cff mem 0xfc302000-0xfc303fff irq 19 at device 2.1 on pci3 aic7902: Ultra320 Wide Channel B, SCSI Id=7, PCI-X 101-133Mhz, 512 SCBs uhci0: port 0x2000-0x201f irq 2 at device 29.0 on pci0 usb0: on uhci0 usb0: USB revision 1.0 uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered uhci1: port 0x2020-0x203f irq 10 at device 29.1 on pci0 usb1: on uhci1 usb1: USB revision 1.0 uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub1: 2 ports with 2 removable, self powered uhci2: port 0x2040-0x205f irq 11 at device 29.2 on pci0 usb2: on uhci2 usb2: USB revision 1.0 uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub2: 2 ports with 2 removable, self powered pcib4: at device 30.0 on pci0 pci4: on pcib4 pci4: at 1.0 irq 2 isab0: at device 31.0 on pci0 isa0: on isab0 atapci0: port 0x2060-0x206f,0x374-0x377,0x170-0x177,0x3f4-0x3f7,0x1f0-0x1f7 mem 0xfc000000-0xfc0003ff irq 0 at device 31.1 on pci0 ata0: at 0x1f0 irq 14 on atapci0 ata1: at 0x170 irq 15 on atapci0 pci0: (vendor=0x8086, dev=0x2483) at 31.3 irq 0 eisa0: on motherboard eisa0: unknown card @@@0000 (0x00000000) at slot 4 orm0: