Date: Mon, 2 Jul 2007 17:59:18 -0700
From: "Yong Rao" <yrao@force10networks.com>
To: <questions@FreeBSD.org>
Subject: SMP options and core dump failure
Message-ID: <9E2742C54E161041A53F36F9A8DC31BE070236@EXCH-CLUSTER-04.force10networks.com>

Hello,

We have a problem with the SMP kernel: it cannot write a core dump when a crash happens.

I was able to isolate the problem to kernel configurations that have SMP enabled and run on 2 CPUs. With one CPU the core dump works fine.

I built the kernel from GENERIC and deliberately crashed it (for testing purposes); the core dump worked fine. I then added only "options SMP" and crashed the kernel again. This time it hung before any pages were dumped out.

Has anyone successfully taken a core dump on a system running an SMP kernel with multiple CPUs?

I tried on two different boxes (different motherboards, CPUs and hard disks). Both failed.

I tried enabling DDB, but I don't know what to look for once it drops into ddb. I would appreciate any pointers.

a) The CPU information is:

CPU: Dual Core AMD Opteron(tm) Processor 280 (2405.47-MHz 686-class CPU)
  Origin = "AuthenticAMD"  Id = 0x20f12  Stepping = 2
  Features=0x178bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2,HTT>
  Features2=0x1<SSE3>
  AMD Features=0xe2500800<SYSCALL,NX,MMX+,FFXSR,LM,3DNow+,3DNow>
  AMD Features2=0x3<LAHF,CMP>
  Cores per package: 2

b) We also tried another motherboard, which has 2 CPUs. The CPU information is below.

CPU: Intel(R) Xeon(TM) CPU 2.80GHz (2800.11-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0xf29  Stepping = 9
  Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
  Features2=0x4400<CNTX-ID,<b14>>
real memory  = 2147418112 (2047 MB)
avail memory = 2096300032 (1999 MB)
ACPI APIC Table: <A M I OEMAPIC >
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  6

c) The following was printed when the dump hung.

mem dump: start address = 0x4352, len=0x30

Fatal trap 12: page fault while in kernel mode
cpuid = 1; apic id = 01
fault virtual address   = 0x4352
fault code              = supervisor read, page not present
instruction pointer     = 0x20:0xc9e9fc92
stack pointer           = 0x28:0xebdbdbdc
frame pointer           = 0x28:0xebdbdbf8
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 74231 (pnicdbg)
trap number             = 12
panic: page fault
cpuid = 1
Uptime: 1d18h27m42s
Dumping 4030 MB (2 chunks)
  chunk 0: 1MB (154 pages) ... ok
  chunk 1: 4031MB (1031776 pages)   (stopped and hung here)

Thanks,

Yong Rao
Force10 Networks Inc.
350 Holger Way
San Jose, CA 95132
408 571 6317
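
For reference, the failing configuration described above (GENERIC plus "options SMP", with the kernel debugger enabled) could be sketched as the config fragment below. This is only an illustration under stated assumptions: the ident name SMP-DEBUG is hypothetical, and the actual config file used on the two test boxes is not shown in this message.

```
# Hypothetical kernel config reproducing the setup described above.
# Assumes it sits in the same conf directory as the stock GENERIC file.
include GENERIC
ident   SMP-DEBUG       # hypothetical name, not from the original report

options SMP             # the only change that triggers the hung dump
options KDB             # kernel debugger framework, required by DDB
options DDB             # interactive in-kernel debugger
```

Removing the "options SMP" line gives the single-CPU baseline on which the dump completed.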