From owner-freebsd-current@FreeBSD.ORG Mon Nov 22 07:29:06 2004 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 64B3616A4CE; Mon, 22 Nov 2004 07:29:06 +0000 (GMT) Received: from publicd.ub.mng.net (publicd.ub.mng.net [202.179.0.88]) by mx1.FreeBSD.org (Postfix) with ESMTP id ABC8743D58; Mon, 22 Nov 2004 07:28:19 +0000 (GMT) (envelope-from ganbold@micom.mng.net) Received: from [202.179.0.164] (helo=ganbold.micom.mng.net) by publicd.ub.mng.net with esmtpa (Exim 4.43 (FreeBSD)) id 1CW8cc-000G7I-Up; Mon, 22 Nov 2004 15:28:27 +0800 Message-Id: <6.2.0.14.2.20041122151958.0303be20@202.179.0.80> X-Mailer: QUALCOMM Windows Eudora Version 6.2.0.14 Date: Mon, 22 Nov 2004 15:27:56 +0800 To: Scott Long From: Ganbold Mime-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1"; format=flowed Content-Transfer-Encoding: quoted-printable cc: tomaz.borstnar@over.net cc: cguttesen@yahoo.dk cc: freebsd-current@freebsd.org cc: mhunter@ack.Berkeley.EDU cc: rwatson@freebsd.org Subject: Re: Page fault in FreeBSD 5.3 on IBM e325, Dual AMD64 2.2GHz, 4GB RAM, ServeRAID 6M - debug logs X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 22 Nov 2004 07:29:06 -0000 Scott, I didn't put debug.mpsafenet=3D0 to /boot/loader.conf. I compiled kernel= with: options DDB options KDB Here is crash time debug output and dmesg: ----------------------------------------------------------------------------= ---------------------------- Console: serial port BIOS drive C: is disk0 BIOS 614kB/4127168kB available memory FreeBSD/i386 bootstrap loader, Revision 1.1 (root@publica.ub.mng.net, Sun Nov 21 18:24:58 ULAT 2004) Loading /boot/defaults/loader.conf /boot/kernel/kernel text=3D0x2fa370 data=3D0x7a920+0x3b9d0=20 -syms=3D[0x8+0x58278-+0x8+0x4b956| KDB: debugger backends: ddb KDB: current backend: ddb Copyright (c) 1992-2004 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 5.3-STABLE #12: Mon Nov 22 12:04:57 ULAT 2004 tsgan@publicc.ub.mng.net:/usr/obj/usr/src/sys/AMD Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: AMD Opteron(tm) Processor 248 (2193.17-MHz K8-class CPU) Origin =3D "AuthenticAMD" Id =3D 0xf58 Stepping =3D 8 = Features=3D0x78bfbff AMD Features=3D0xe0500800 real memory =3D 4227268608 (4031 MB) avail memory =3D 4083208192 (3894 MB) ACPI APIC Table: FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): APIC ID: 0 cpu1 (AP): APIC ID: 1 MADT: Forcing active-low polarity and level trigger for SCI ioapic0 irqs 0-23 on motherboard ioapic1 irqs 24-27 on motherboard ioapic2 irqs 28-31 on motherboard acpi0: on motherboard acpi0: Power Button (fixed) unknown: I/O range not supported unknown: I/O range not supported Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x8008-0x800b on acpi0 cpu0: on acpi0 cpu1: on acpi0 acpi_button0: on acpi0 pcib0: port 0x8080-0x80ff,0x8000-0x807f,0xcf8-0xcff= =20 iomem 0xd8000-0xdbfff on acpi0 pci0: on pcib0 pcib1: at device 6.0 on pci0 pci1: on pcib1 ohci0: mem 0xfc100000-0xfc100fff irq 19 at= =20 device 0.0 on pci1 ohci0: [GIANT-LOCKED] usb0: OHCI version 1.0, legacy support usb0: SMM does not respond, resetting usb0: on ohci0 usb0: USB revision 1.0 uhub0: AMD OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 3 ports with 3 removable, self powered ohci1: mem 0xfc101000-0xfc101fff irq 19 at= =20 device 0.1 on pci1 ohci1: [GIANT-LOCKED] usb1: OHCI version 1.0, legacy support usb1: SMM does not respond, resetting usb1: on ohci1 usb1: USB revision 1.0 uhub1: AMD OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub1: 3 ports with 3 removable, self powered ukbd0: Silitek IBM USB Keyboard, rev 1.10/1.00, addr 2, iclass 3/1 kbd1 at ukbd0 uhid0: Silitek IBM USB Keyboard, rev 1.10/1.00, addr 2, iclass 3/1 pci1: at device 5.0 (no driver attached) isab0: at device 7.0 on pci0 isa0: on isab0 pci0: at device 7.1 (no driver attached) pci0: at device 7.3 (no driver attached) pcib2: at device 10.0 on pci0 pci2: on pcib2 bge0: mem=20 0xfe000000-0xfe00ffff,0xfe010000-0xfe01ffff irq 24 at device 1.0 on pci2 miibus0: on bge0 brgphy0: on miibus0 brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX,=20 1000baseTX-FDX, auto bge0: Ethernet address: 00:0d:60:14:cc:9e bge1: mem=20 0xfe020000-0xfe02ffff,0xfe030000-0xfe03ffff irq 25 at device 1.1 on pci2 miibus1: on bge1 brgphy1: on miibus1 brgphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX,=20 1000baseTX-FDX, auto bge1: Ethernet address: 00:0d:60:14:cc:9f pci2: at device 2.0 (no driver attached) pci0: at device 10.1 (no driver=20 attached) pcib3: at device 11.0 on pci0 pci3: on pcib3 pcib4: at device 3.0 on pci3 pci4: on pcib4 ips0: mem 0xfe100000-0xfe100fff irq 28 at=20 device 8.0 on pci4 ips0: [GIANT-LOCKED] pci0: at device 11.1 (no driver=20 attached) sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on= acpi0 sio0: type 16550A, console orm0: at iomem=20 0xcb000-0xcb7ff,0xc9800-0xcafff,0xc8000-0xc97ff,0xc0000-0xc7fff on isa0 atkbdc0: at port 0x64,0x60 on isa0 atkbd0: flags 0x1 irq 1 on atkbdc0 device_attach: atkbd0 attach returned 6 sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=3D0x100> sio1: configured irq 3 not in bitmap of probed irqs 0 sio1: port may not be enabled vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 Timecounters tick every 0.976 msec ips0: adapter type: ServeRAID 6M (marco) ips0: logical drives: 1 ips0: Logical Drive 0: RAID5 sectors: 355481600, state OK ipsd0: on ips0 ipsd0: Logical Drive (173575MB) SMP: AP CPU #1 Launched! Mounting root from ufs:/dev/ipsd0s1a Pre-seeding PRNG: kickstart. Loading configuration files. Entropy harvesting: interrupts ethernet point_to_point kickstart. swapon: adding /dev/ipsd0s1b as swap device Starting file system checks: /dev/ipsd0s1a: FILE SYSTEM CLEAN; SKIPPING CHECKS /dev/ipsd0s1a: clean, 507157 free (7565 frags, 62449 blocks, 0.7%=20 fragmentation) /dev/ipsd0s1f: FILE SYSTEM CLEAN; SKIPPING CHECKS /dev/ipsd0s1f: clean, 1012200 free (40 frags, 126520 blocks, 0.0%=20 fragmentation) /dev/ipsd0s1d: FILE SYSTEM CLEAN; SKIPPING CHECKS /dev/ipsd0s1d: clean, 4876181 free (45669 frags, 603814 blocks, 0.4%=20 fragmentation) /dev/ipsd0s1e: FILE SYSTEM CLEAN; SKIPPING CHECKS /dev/ipsd0s1e: clean, 4965575 free (1087 frags, 620561 blocks, 0.0%=20 fragmentation) /dev/ipsd0s1g: FILE SYSTEM CLEAN; SKIPPING CHECKS /dev/ipsd0s1g: clean, 53622407 free (671 frags, 6702717 blocks, 0.0%=20 fragmentation) Setting hostname: publica.ub.mng.net. bge0: flags=3D8843 mtu 1500 options=3D1a inet 202.179.0.80 netmask 0xffffffe0 broadcast 202.179.0.95 ether 00:0d:60:14:cc:9e media: Ethernet 100baseTX status: active lo0: flags=3D8049 mtu 16384 inet 127.0.0.1 netmask 0xff000000 add net default: gateway 202.179.0.65 Additional routing options:. Starting devd. kbdcontrol: cannot open /dev/kbd1: Device busy Mounting NFS file systems:. Starting syslogd. Nov 22 13:49:36 publica syslogd: kernel boot file is /boot/kernel/kernel ELF ldconfig path: /lib /usr/lib /usr/lib/compat /usr/X11R6/lib= /usr/local/lib Starting usbd. Starting local daemons:. Updating motd. Configuring syscons: blanktime. Starting sshd. Initial amd64 initialization:. Additional ABI support:. Starting cron. Local package initialization:Starting apache. Starting clamav_clamd. Starting clamav_freshclam. Starting exim. Starting mysql. Starting spamd. . Additional TCP options:. Starting inetd. Starting background file system checks in 60 seconds. Mon Nov 22 13:49:41 ULAT 2004 =E0=E0=E0=E0=E0=E0N=E0=E0=E0=E0=E0=E0N=E0Nov =E0=E0N=E0=E0=E0=E0No=E0N=E0Nov= 22 14:08:=E0N=E0=E0=E0=E0=E0=E0=E0=E0=E0=E0=E0=E0=E0=E0=E0=E0=E0=E0=E0=E0= =E0=E0=E0=E0=E0=E0=E0=E0 Fatal trap 12: page fault while in kernel mode cpuid =3D 1; apic id =3D 01 fault virtual address =3D 0x18 fault code =3D supervisor read, page not present instruction pointer =3D 0x8:0xffffffff80277fc0 stack pointer =3D 0x10:0xffffffffb36ab830 frame pointer =3D 0x10:0xffffffffb36ab890 code segment =3D base 0x0, limit 0xfffff, type 0x1b =3D DPL 0, pres 1, long 1, def32 0, gran 1 processor eflags =3D interrupt enabled, resume, IOPL =3D 0 current process =3D 44 (swi1: net) [thread 100044] Stopped at m_copym+0x190: incl %ecx db> bt No such command db> help print p examine x search set = write w delete d break dwatch watch = dhwatch hwatch step s continue c until next match trace where call show ps gdb reset kill watchdog thread panic db> trace m_copym() at m_copym+0x190 tcp_output() at tcp_output+0xe91 tcp_input() at tcp_input+0x2f80 ip_input() at ip_input+0xff netisr_processqueue() at netisr_processqueue+0x79 swi_net() at swi_net+0x14d ithread_loop() at ithread_loop+0xde fork_exit() at fork_exit+0x8f fork_trampoline() at fork_trampoline+0xe --- trap 0, rip =3D 0, rsp =3D 0xffffffffb36abd00, rbp =3D 0 --- db> trace m_copym() at m_copym+0x190 tcp_output() at tcp_output+0xe91 tcp_input() at tcp_input+0x2f80 ip_input() at ip_input+0xff netisr_processqueue() at netisr_processqueue+0x79 swi_net() at swi_net+0x14d ithread_loop() at ithread_loop+0xde fork_exit() at fork_exit+0x8f fork_trampoline() at fork_trampoline+0xe --- trap 0, rip =3D 0, rsp =3D 0xffffffffb36abd00, rbp =3D 0 --- db> ----------------------------------------------------------------------------= ---------------------------- It seems to me the problem is related to network stack and threading. Am I right? How to solve this problem? thanks in advance, Ganbold