From owner-freebsd-stable@FreeBSD.ORG Mon Jun 12 20:27:30 2006 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 53F6416A477 for ; Mon, 12 Jun 2006 20:27:30 +0000 (UTC) (envelope-from anthonyv@brainlink.com) Received: from mta10.srv.hcvlny.cv.net (mta10.srv.hcvlny.cv.net [167.206.4.205]) by mx1.FreeBSD.org (Postfix) with ESMTP id A59A143D6A for ; Mon, 12 Jun 2006 20:27:24 +0000 (GMT) (envelope-from anthonyv@brainlink.com) Received: from [192.168.1.30] (ool-18b917b1.dyn.optonline.net [24.185.23.177]) by mta10.srv.hcvlny.cv.net (Sun Java System Messaging Server 6.2-6.01 (built Apr 3 2006)) with ESMTP id <0J0R00HAIKSUK4A0@mta10.srv.hcvlny.cv.net> for freebsd-stable@freebsd.org; Mon, 12 Jun 2006 16:26:55 -0400 (EDT) Date: Mon, 12 Jun 2006 16:27:10 -0400 From: Anthony Volodkin To: freebsd-stable@freebsd.org Message-id: <448DCE1E.2090000@brainlink.com> MIME-version: 1.0 Content-type: text/plain; charset=ISO-8859-1; format=flowed Content-transfer-encoding: 7BIT User-Agent: Thunderbird 1.4 (Windows/20050908) Subject: Intermittent Kernel Panics on Disk Activity/cvsup/make on 6.1-RELEASE X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 12 Jun 2006 20:27:30 -0000 Hi, On my Athlon XP 1800 / Abit KX7-333R system, I've been encountering intermittent kernel panics during periods of high disk activity or when running cvsup or make buildworld. What's notable is that at this point the motherboard, CPU, and power supply have been replaced and the system has no difficulties running Memtest86 for several hours. Additionally, this problem does not happen EACH time I generate a lot of disk load or run cvsup, but occasionally. Per the handbook, I've built a debug kernel and captured crash data. Below are my dmesg, and the output of several kgdb, list *instruction pointer and backtrace. What are some of the next steps I can take to help resolve this or find the cause of these crashes? Any help is highly appreciated. Please CC me when responding as I am not subscribed to this list. Thank you, Anthony Volodkin ---------------------------------------- kgdb kernel.debug /var/crash/vmcore.1 ---------------------------------------- [GDB will not be able to debug user-mode threads: /usr/lib/libthread_db.so: Undefined symbol "ps_pglobal_lookup"] GNU gdb 6.1.1 [FreeBSD] ... [snip] ... This GDB was configured as "i386-marcel-freebsd". Unread portion of the kernel message buffer: Fatal trap 9: general protection fault while in kernel mode instruction pointer = 0x20:0xc06aacfd stack pointer = 0x28:0xe981ac10 frame pointer = 0x28:0xe981acdc code segment = base 0x0, limit 0xfffff, type 0x1b = DPL 0, pres 1, def32 1, gran 1 processor eflags = interrupt enabled, resume, IOPL = 0 current process = 18257 (cvsup) trap number = 9 panic: general protection fault Uptime: 3d15h50m31s Dumping 2047 MB (2 chunks) chunk 0: 1MB (159 pages) ... ok chunk 1: 2047MB (524016 pages) 2031 2015 1999 1983 1967 1951 1935 1919 1903 1887 1871 1855 1839 1823 1807 1791 1775 1759 1743 1727 1711 1695 1679 1663 1647 1631 1615 1599 1583 1567 1551 1535 1519 1503 1487 1471 1455 1439 1423 1407 1391 1375 1359 1343 1327 1311 1295 1279 1263 1247 1231 1215 1199 1183 1167 1151 1135 1119 1103 1087 1071 1055 1039 1023 1007 991 975 959 943 927 911 895 879 863 847 831 815 799 783 767 751 735 719 703 687 671 655 639 623 607 591 575 559 543 527 511 495 479 463 447 431 415 399 383 367 351 335 319 303 287 271 255 239 223 207 191 175 159 143 127 111 95 79 63 47 31 15 #0 doadump () at pcpu.h:165 165 __asm __volatile("movl %%fs:0,%0" : "=r" (td)); ---------------------------------------- (kgdb) list *0xc06aacfd ---------------------------------------- 0xc06aacfd is in lseek (/usr/src/sys/kern/vfs_syscalls.c:1788). 1783 goto drop; 1784 fp->f_offset = offset; 1785 *(off_t *)(td->td_retval) = fp->f_offset; 1786 drop: 1787 fdrop(fp, td); 1788 VFS_UNLOCK_GIANT(vfslocked); 1789 return (error); 1790 } 1791 1792 #if defined(COMPAT_43) ---------------------------------------- (kgdb) bt ---------------------------------------- #0 doadump () at pcpu.h:165 #1 0xc064dee1 in boot (howto=260) at /usr/src/sys/kern/kern_shutdown.c:402 #2 0xc064e178 in panic (fmt=0xc088cb0e "%s") at /usr/src/sys/kern/kern_shutdown.c:558 #3 0xc0841d94 in trap_fatal (frame=0xe981abd0, eva=0) at /usr/src/sys/i386/i386/trap.c:836 #4 0xc08418bc in trap (frame= {tf_fs = 8, tf_es = 40, tf_ds = 40, tf_edi = -377377532, tf_esi = -948068912, tf_ebp = -377377572, tf_isp = -377377796, tf_ebx = 0, tf_edx = -969420352, tf_ecx = -911557632, tf_eax = 0, tf_trapno = 9, tf_err = 48128, tf_eip = -1066750723, tf_cs = 32, tf_eflags = 66050, tf_esp = -960093848, tf_ss = -911557632}) at /usr/src/sys/i386/i386/trap.c:631 #5 0xc0830c9a in calltrap () at /usr/src/sys/i386/i386/exception.s:139 #6 0xc06aacfd in lseek (td=0xc9aabc00, uap=0xe981ad04) at /usr/src/sys/kern/vfs_syscalls.c:1787 #7 0xc08420ab in syscall (frame= {tf_fs = 59, tf_es = 59, tf_ds = 59, tf_edi = 672019104, tf_esi = -1077940896, tf_ebp = 136321580, tf_isp = -377377436, tf_ebx = 673236232, tf_edx = 0, tf_ecx = 118, tf_eax = 198, tf_trapno = 0, tf_err = 2, tf_eip = 673179699, tf_cs = 51, tf_eflags = 514, tf_esp = 136321536, tf_ss = 59}) at /usr/src/sys/i386/i386/trap.c:981 #8 0xc0830cef in Xint0x80_syscall () at /usr/src/sys/i386/i386/exception.s:200 #9 0x00000033 in ?? () Previous frame inner to this frame (corrupt stack?) ---------------------------------------- dmesg ---------------------------------------- Copyright (c) 1992-2006 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 6.1-RELEASE #0: Fri May 26 03:01:47 EDT 2006 root@superior.local.non-standard.net:/usr/obj/usr/src/sys/SUPERIOR mptable_probe: MP Config Table has bad signature: \M^D\^A\^A Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: AMD Athlon(tm) XP 1800+ (1533.99-MHz 686-class CPU) Origin = "AuthenticAMD" Id = 0x662 Stepping = 2 Features=0x383fbff AMD Features=0xc0480800 real memory = 2147418112 (2047 MB) avail memory = 2096558080 (1999 MB) kbd1 at kbdmux0 acpi0: on motherboard acpi0: Power Button (fixed) Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x4008-0x400b on acpi0 cpu0: on acpi0 acpi_throttle0: on cpu0 acpi_button0: on acpi0 pcib0: port 0xcf8-0xcff,0x4000-0x407f,0x4080-0x40ff,0x5000-0x500f on acpi0 pci0: on pcib0 agp0: mem 0xe4000000-0xe47fffff at device 0.0 on pci0 pcib1: at device 1.0 on pci0 pci1: on pcib1 pci1: at device 0.0 (no driver attached) fxp0: port 0xb000-0xb03f mem 0xe4904000-0xe4904fff,0xe4800000-0xe48fffff irq 10 at device 9.0 on pci0 miibus0: on fxp0 inphy0: on miibus0 inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto fxp0: Ethernet address: 00:90:27:a7:39:6e atapci0: port 0xb400-0xb407,0xb800-0xb803,0xbc00-0xbc07,0xc000-0xc003,0xc400-0xc40f mem 0xe4900000-0xe4903fff irq 5 at device 15.0 on pci0 ata2: on atapci0 ata3: on atapci0 isab0: at device 17.0 on pci0 isa0: on isab0 atapci1: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xc800-0xc80f at device 17.1 on pci0 ata0: on atapci1 ata1: on atapci1 uhci0: port 0xcc00-0xcc1f irq 11 at device 17.2 on pci0 uhci0: [GIANT-LOCKED] usb0: on uhci0 usb0: USB revision 1.0 uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered uhci1: port 0xd000-0xd01f irq 11 at device 17.3 on pci0 uhci1: [GIANT-LOCKED] usb1: on uhci1 usb1: USB revision 1.0 uhub1: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub1: 2 ports with 2 removable, self powered atapci2: port 0xd400-0xd407,0xd800-0xd803,0xdc00-0xdc07,0xe000-0xe003,0xe400-0xe4ff irq 10 at device 19.0 on pci0 ata4: on atapci2 ata5: on atapci2 acpi_tz0: on acpi0 fdc0: port 0x3f2-0x3f5,0x3f7 irq 6 drq 2 on acpi0 fdc0: [FAST] sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0 sio1: type 16550A ppc0: port 0x378-0x37f,0x778-0x77b irq 7 on acpi0 ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode ppbus0: on ppc0 plip0: on ppbus0 lpt0: on ppbus0 lpt0: Interrupt-driven port ppi0: on ppbus0 atkbdc0: port 0x60,0x64 irq 1 on acpi0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] pmtimer0 on isa0 orm0: at iomem 0xc0000-0xc7fff,0xc8000-0xca7ff,0xcb000-0xcc7ff on isa0 sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 Timecounter "TSC" frequency 1533986228 Hz quality 800 Timecounters tick every 1.000 msec ad0: 57259MB at ata0-master UDMA133 ad2: 190782MB at ata1-master UDMA100 ad3: 78167MB at ata1-slave UDMA133 ad4: 239372MB at ata2-master UDMA133 ad5: 238475MB at ata2-slave UDMA100 ad6: 194481MB at ata3-master UDMA133 ad7: 76345MB at ata3-slave UDMA133 Trying to mount root from ufs:/dev/ad0s1a