Date: Sun, 02 Oct 2005 19:54:25 +0200 From: Jurij Kovacic <jurij.kovacic@gmail.com> To: freebsd-stable@freebsd.org Subject: Stability problems vith FreeBSD 5.2.1-RELEASE-p14 Message-ID: <43401ED1.6060404@gmail.com>
next in thread | raw e-mail | index | archive | help
Hello! We are running FreeBSD 5.2.1-RELEASE-p14 with SMP kernell on one of our servers and are experiencing stability problems; the server has the tendency to reboot itself for no apparent reason at least once per month. Dmesg: Copyright (c) 1992-2004 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 5.2.1-RELEASE-p14 #0: Tue May 10 18:06:08 CEST 2005 root@host:/usr/obj/usr/src/sys/CUSTOM_521 Preloaded elf kernel "/boot/kernel/kernel" at 0xc0a3f000. Preloaded elf module "/boot/kernel/acpi.ko" at 0xc0a3f294. ACPI APIC Table: <PTLTD APIC > Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: AMD Opteron(tm) Processor 246 (1993.80-MHz 686-class CPU) Origin = "AuthenticAMD" Id = 0xf58 Stepping = 8 Features=0x78bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2> AMD Features=0xe0500000<NX,AMIE,LM,DSP,3DNow!> real memory = 4227268608 (4031 MB) avail memory = 4109025280 (3918 MB) FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): APIC ID: 0 cpu1 (AP): APIC ID: 1 ioapic0 <Version 1.1> irqs 0-23 on motherboard ioapic1 <Version 1.1> irqs 24-27 on motherboard ioapic2 <Version 1.1> irqs 28-31 on motherboard Pentium Pro MTRR support enabled npx0: [FAST] npx0: <math processor> on motherboard npx0: INT 16 interface acpi0: <PTLTD XSDT > on motherboard pcibios: BIOS version 2.10 Using $PIR table, 12 entries at 0xc00fdf00 acpi0: Power Button (fixed) Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 unknown: I/O range not supported unknown: I/O range not supported acpi_timer0: <24-bit timer at 3.579545MHz> port 0x8008-0x800b on acpi0 acpi_cpu0: <CPU> on acpi0 acpi_cpu1: <CPU> on acpi0 acpi_button0: <Power Button> on acpi0 pcib0: <ACPI Host-PCI bridge> port 0x8080-0x80ff,0x8000-0x807f,0xcf8-0xcff iomem 0xd8000-0xdbfff on acpi0 pcib0: could not get PCI interrupt routing table for \\_SB_.PCI0 - AE_NOT_FOUND pci0: <ACPI PCI bus> on pcib0 pcib1: <ACPI PCI-PCI bridge> at device 6.0 on pci0 pci1: <ACPI PCI bus> on pcib1 ohci0: <OHCI (generic) USB controller> mem 0xfc100000-0xfc100fff irq 19 at device 0.0 on pci1 usb0: OHCI version 1.0, legacy support usb0: SMM does not respond, resetting usb0: <OHCI (generic) USB controller> on ohci0 usb0: USB revision 1.0 uhub0: AMD OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 3 ports with 3 removable, self powered ukbd0: Belkin Components KVM USB Keyboard/Mouse, rev 1.00/2.01, addr 2, iclass 3/1 kbd1 at ukbd0 ums0: Belkin Components KVM USB Keyboard/Mouse, rev 1.00/2.01, addr 2, iclass 3/1 ums0: 5 buttons and Z dir. ohci1: <OHCI (generic) USB controller> mem 0xfc101000-0xfc101fff irq 19 at device 0.1 on pci1 usb1: OHCI version 1.0, legacy support usb1: SMM does not respond, resetting usb1: <OHCI (generic) USB controller> on ohci1 usb1: USB revision 1.0 uhub1: AMD OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub1: 3 ports with 3 removable, self powered pci1: <display, VGA> at device 5.0 (no driver attached) isab0: <PCI-ISA bridge> at device 7.0 on pci0 isa0: <ISA bus> on isab0 atapci0: <AMD 8111 UDMA133 controller> port 0x1020-0x102f at device 7.1 on pci0 ata0: at 0x1f0 irq 14 on atapci0 ata0: [MPSAFE] ata1: at 0x170 irq 15 on atapci0 ata1: [MPSAFE] pci0: <bridge, PCI-unknown> at device 7.3 (no driver attached) pcib2: <ACPI PCI-PCI bridge> at device 10.0 on pci0 pci2: <ACPI PCI bus> on pcib2 bge0: <Broadcom BCM5704C Dual Gigabit Ethernet, ASIC rev. 0x2003> mem 0xfe000000-0xfe00ffff,0xfe010000-0xfe01ffff irq 24 at device 1.0 on pci2 bge0: Ethernet address: 00:0d:60:14:86:16 miibus0: <MII bus> on bge0 brgphy0: <BCM5704 10/100/1000baseTX PHY> on miibus0 brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto bge1: <Broadcom BCM5704C Dual Gigabit Ethernet, ASIC rev. 0x2003> mem 0xfe020000-0xfe02ffff,0xfe030000-0xfe03ffff irq 25 at device 1.1 on pci2 bge1: Ethernet address: 00:0d:60:14:86:17 miibus1: <MII bus> on bge1 brgphy1: <BCM5704 10/100/1000baseTX PHY> on miibus1 brgphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto pci0: <base peripheral, interrupt controller> at device 10.1 (no driver attached) pcib3: <ACPI PCI-PCI bridge> at device 11.0 on pci0 pci3: <ACPI PCI bus> on pcib3 pcib4: <PCI-PCI bridge> at device 3.0 on pci3 pci4: <PCI bus> on pcib4 pcib4: slot 8 INTA is routed to irq 28 ips0: <IBM ServeRAID Adapter> mem 0xfe100000-0xfe100fff irq 28 at device 8.0 on pci4 pci0: <base peripheral, interrupt controller> at device 11.1 (no driver attached) sio0 port 0x3f8-0x3ff irq 4 on acpi0 sio0: type 16550A orm0: <Option ROMs> at iomem 0xc9800-0xcafff,0xc8000-0xc97ff on isa0 pmtimer0 on isa0 fdc0: ready for input in output fdc0: cmd 3 failed at out byte 1 of 3 ppc0: parallel port not found. sc0: <System console> at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> sio1: configured irq 3 not in bitmap of probed irqs 0 sio1: port may not be enabled vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 Timecounters tick every 10.000 msec acd0: CDROM <CD-224E> at ata1-master PIO4 ips0: adapter type: ServeRAID 6M (marco) ips0: logical drives: 1 ips0: Logical Drive 0: RAID1 sectors: 143372288, state OK ipsd0: <Logical Drive> on ips0 GEOM: create disk ipsd0 dp=0xc7418790 ipsd0: Logical Drive (70006MB) SMP: AP CPU #1 Launched! Mounting root from ufs:/dev/ipsd0s1a bge1: gigabit link up netsmb_dev: loaded Memtest reveales no problems with RAM. One sure way to crash the server is to delete a large number of files - /usr/ports for example. The panic message is ussually somewhere along these lines: panic: kmem_malloc(4096) kmem map too small: 48496066400 total allocated cpuid =0 boot() called on cpu#0 ... Has anyone experienced similar problems? Would upgrading to FreeBSD 5.4 fix the problem? Any and all will be deeply appreciated! With regards, Jurij Kovacic
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?43401ED1.6060404>