From owner-freebsd-bugs Tue May 2 19:41:13 2000
Delivered-To: freebsd-bugs@freebsd.org
Received: from ns.itga.com.au (ns.itga.com.au [202.53.40.210])
	by hub.freebsd.org (Postfix) with ESMTP id 9868037BCDA
	for ; Tue, 2 May 2000 19:41:04 -0700 (PDT)
	(envelope-from gnb@itga.com.au)
Received: from lightning.itga.com.au (lightning.itga.com.au [192.168.71.20])
	by ns.itga.com.au (8.9.3/8.9.3) with ESMTP id MAA56595;
	Tue, 2 May 2000 12:34:37 +1000 (EST)
	(envelope-from gnb@itga.com.au)
Received: from itga.com.au (lightning.itga.com.au [192.168.71.20])
	by lightning.itga.com.au (8.9.3/8.9.3) with ESMTP id MAA02873;
	Tue, 2 May 2000 12:34:36 +1000 (EST)
Message-Id: <200005020234.MAA02873@lightning.itga.com.au>
X-Mailer: exmh version 2.0.1 12/23/97
From: Gregory Bond
To: Rahul Dhesi
Cc: freebsd-bugs@FreeBSD.ORG
Subject: Re: kern/17122: crash due to: softdep_disk_write_com
In-reply-to: Your message of Mon, 01 May 2000 16:20:15 -0700.
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Tue, 02 May 2000 12:34:36 +1000
Sender: owner-freebsd-bugs@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

I have caught 2 vmcore's from these otherwise-unexplained crashes. No
obvious similarities between them, nor any link in my crash dumps to
softdep_disk_write_com as originally reported by Rahul.

(I thought my crashes and Rahul's crashes might be related, because we
are both seeing them on boxes doing nothing but Squid caching on
Softupdates-enabled partitions.)

Currently seems to be happening once every week or two.

I have vmcores (100Mb unzipped/37Mb gzipped each) and debug kernels if
anyone is interested in having a look. Or I can run up the debugger and
chase info if anyone has hints on what to look for.

My machine is fairly vanilla - HP Vectra, PPro with 96Mb & 2xIDE drives,
running 3.4-Release off the CDs.
We have a number of other FreeBSD boxes, same and different hardware
types, running various versions from 3.2-R to late 3.X-Stable, most with
Softupdates, running various things (but not Squid); we have never seen
panic/traps with any of them.

Of course, being 3.x the answer might be "forget it, upgrade to 4.x and
see if it still happens." I might just try that next....

dmesg and kernel backtraces appended below.

Copyright (c) 1992-1999 FreeBSD Inc.
Copyright (c) 1982, 1986, 1989, 1991, 1993
	The Regents of the University of California. All rights reserved.
FreeBSD 3.4-RELEASE #0: Thu Feb 17 17:35:48 EST 2000
    toor@proxy.itga.com.au:/usr/src/sys/compile/Squid
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium Pro (199.74-MHz 686-class CPU)
  Origin = "GenuineIntel" Id = 0x619 Stepping = 9
  Features=0xf9ff
real memory = 100663296 (98304K bytes)
avail memory = 95264768 (93032K bytes)
Preloaded elf kernel "kernel" at 0xc025b000.
Pentium Pro MTRR support enabled
Probing for devices on PCI bus 0:
Correcting Natoma config for non-SMP
chip0: rev 0x02 on pci0.0.0
chip1: rev 0x01 on pci0.4.0
ide_pci0: rev 0x00 on pci0.4.1
vga0: rev 0x43 int a irq 9 on pci0.7.0
fxp0: rev 0x02 int a irq 11 on pci0.10.0
fxp0: Ethernet address 00:60:b0:68:4d:04
lnc1: rev 0x16 int a irq 10 on pci0.11.0
lnc1: PCnet-PCI II address 00:60:b0:7b:5b:43
Probing for devices on the ISA bus:
sc0 on isa
sc0: VGA color <16 virtual consoles, flags=0x0>
atkbdc0 at 0x60-0x6f on motherboard
atkbd0 irq 1 on isa
psm0 not found
sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa
sio0: type 16550A
sio1 at 0x2f8-0x2ff irq 3 on isa
sio1: type 16550A
ppc0 at 0x378 irq 7 flags 0x40 on isa
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
lpt0: on ppbus 0
lpt0: Interrupt-driven port
fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
fdc0: FIFO enabled, 8 bytes threshold
fd0: 1.44MB 3.5in
wdc0 at 0x1f0-0x1f7 irq 14 flags 0xa0ffa0ff on isa
wdc0: unit 0 (wd0): , DMA, 32-bit, multi-block-16
wd0: 2445MB (5008752 sectors), 4969 cyls, 16 heads, 63 S/T, 512 B/S
wdc0: unit 1 (atapi): , removable, accel, dma, iordis
acd0: drive speed 1378KB/sec, 128KB cache
acd0: supported read types: CD-R, CD-DA
acd0: Audio: play, 256 volume levels
acd0: Mechanism: ejectable tray
acd0: Medium: no/blank disc inside, unlocked
wdc1 at 0x170-0x177 irq 15 flags 0xa0ffa0ff on isa
wdc1: unit 0 (wd2): , DMA, 32-bit, multi-block-16
wd2: 19574MB (40088160 sectors), 39770 cyls, 16 heads, 63 S/T, 512 B/S
vga0 at 0x3b0-0x3df maddr 0xa0000 msize 131072 on isa
npx0 on motherboard
npx0: INT 16 interface
IP packet filtering initialized, divert disabled, rule-based forwarding disabled, unlimited logging
changing root device to wd0s1a
WARNING: / was not properly dismounted

Script started on Tue May 2 11:59:36 2000
proxy## gdb -k kernel.debug.0 vmcore.0
GNU gdb 4.18
Copyright 1998 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "i386-unknown-freebsd"...
IdlePTD 2543616
initial pcb at 203ce4
panicstr: page fault
panic messages:
---
Fatal trap 12: page fault while in kernel mode
fault virtual address = 0xbfc21a60
fault code = supervisor read, page not present
instruction pointer = 0x8:0xc01b6b0f
stack pointer = 0x10:0xc5948f08
frame pointer = 0x10:0xc5948f18
code segment = base 0x0, limit 0xfffff, type 0x1b
    = DPL 0, pres 1, def32 1, gran 1
processor eflags = interrupt enabled, resume, IOPL = 0
current process = 255 (squid)
interrupt mask = net bio cam
trap number = 12
panic: page fault

syncing disks... 19 19 19 19 19 19 19 19 19 19 19 19 19 19 19 19 19 19 19 19 giving up

dumping to dev 20001, offset 65536
dump 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1
---
#0 boot (howto=256) at ../../kern/kern_shutdown.c:285
285 dumppcb.pcb_cr3 = rcr3();
(kgdb) where
#0 boot (howto=256) at ../../kern/kern_shutdown.c:285
#1 0xc0130591 in panic (fmt=0xc01e6892 "page fault")
    at ../../kern/kern_shutdown.c:446
#2 0xc01b852e in trap_fatal (frame=0xc5948ecc, eva=3217168992)
    at ../../i386/i386/trap.c:942
#3 0xc01b81e7 in trap_pfault (frame=0xc5948ecc, usermode=0, eva=3217168992)
    at ../../i386/i386/trap.c:835
#4 0xc01b7e46 in trap (frame={tf_es = -980156400, tf_ds = -1071972336,
      tf_edi = 0, tf_esi = -994412912, tf_ebp = -980119784,
      tf_isp = -980119820, tf_ebx = -994412912, tf_edx = 1073741824,
      tf_ecx = -1077798304, tf_eax = 137824, tf_trapno = 12, tf_err = 0,
      tf_eip = -1071944945, tf_cs = 8, tf_eflags = 66054,
      tf_esp = -980189760, tf_ss = 0})
    at ../../i386/i386/trap.c:437
#5 0xc01b6b0f in pmap_remove_pages (pmap=0xc5937e24, sva=0, eva=3217022976)
    at ../../i386/i386/pmap.c:2913
#6 0xc012a61a in exit1 (p=0xc5934780, rv=0) at ../../kern/kern_exit.c:216
#7 0xc012a42c in exit1 (p=0xc5934780, rv=-980119660)
    at ../../kern/kern_exit.c:104
#8 0xc01b8777 in syscall (frame={tf_es = 39, tf_ds = -1078001625,
      tf_edi = 0, tf_esi = -1, tf_ebp = -1077944944, tf_isp = -980119580,
      tf_ebx = 672448200, tf_edx = 0, tf_ecx = 136851456, tf_eax = 1,
      tf_trapno = 12, tf_err = 2, tf_eip = 672192924, tf_cs = 31,
      tf_eflags = 598, tf_esp = -1077944964, tf_ss = 39})
    at ../../i386/i386/trap.c:1100
#9 0xc01ae0dc in Xint0x80_syscall ()
Cannot access memory at address 0xbfbfdd90.
(kgdb) quit
proxy##
proxy## gdb -k kernel.debug.1 vmcore.1
GNU gdb 4.18
Copyright 1998 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "i386-unknown-freebsd"...
IdlePTD 2543616
initial pcb at 203ce4
panicstr: page fault
panic messages:
---
Fatal trap 12: page fault while in kernel mode
fault virtual address = 0xc078c5e4
fault code = supervisor read, page not present
instruction pointer = 0x8:0xc012d2a3
stack pointer = 0x10:0xc5a2ebf8
frame pointer = 0x10:0xc5a2ec10
code segment = base 0x0, limit 0xfffff, type 0x1b
    = DPL 0, pres 1, def32 1, gran 1
processor eflags = interrupt enabled, resume, IOPL = 0
current process = 16796 (find)
interrupt mask = bio
trap number = 12
panic: page fault

syncing disks...

Fatal trap 12: page fault while in kernel mode
fault virtual address = 0x30
fault code = supervisor read, page not present
instruction pointer = 0x8:0xc0186523
stack pointer = 0x10:0xc5a2eaac
frame pointer = 0x10:0xc5a2eadc
code segment = base 0x0, limit 0xfffff, type 0x1b
    = DPL 0, pres 1, def32 1, gran 1
processor eflags = interrupt enabled, resume, IOPL = 0
current process = 16796 (find)
interrupt mask = bio
trap number = 12
panic: page fault

dumping to dev 20001, offset 65536
dump 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1
---
#0 boot (howto=260) at ../../kern/kern_shutdown.c:285
285 dumppcb.pcb_cr3 = rcr3();
(kgdb) where
#0 boot (howto=260) at ../../kern/kern_shutdown.c:285
#1 0xc0130591 in panic (fmt=0xc01e6892 "page fault")
    at ../../kern/kern_shutdown.c:446
#2 0xc01b852e in trap_fatal (frame=0xc5a2ea70, eva=48)
    at ../../i386/i386/trap.c:942
#3 0xc01b81e7 in trap_pfault (frame=0xc5a2ea70, usermode=0, eva=48)
    at ../../i386/i386/trap.c:835
#4 0xc01b7e46 in trap (frame={tf_es = -978190320, tf_ds = 16842768,
      tf_edi = -1071544632, tf_esi = -978999616, tf_ebp = -979178788,
      tf_isp = -979178856, tf_ebx = -978186112, tf_edx = 16842770,
      tf_ecx = -1060093696, tf_eax = 0, tf_trapno = 12, tf_err = 0,
      tf_eip = -1072143069, tf_cs = 8, tf_eflags = 66050,
      tf_esp = -1062670336, tf_ss = 0})
    at ../../i386/i386/trap.c:437
#5 0xc0186523 in ffs_sync (mp=0xc0a8f000, waitfor=2, cred=0xc0697600,
      p=0xc02186c8)
    at ../../ufs/ffs/ffs_vfsops.c:977
#6 0xc0156f4b in sync (p=0xc02186c8, uap=0x0)
    at ../../kern/vfs_syscalls.c:549
#7 0xc0130139 in boot (howto=256) at ../../kern/kern_shutdown.c:203
#8 0xc0130591 in panic (fmt=0xc01e6892 "page fault")
    at ../../kern/kern_shutdown.c:446
#9 0xc01b852e in trap_fatal (frame=0xc5a2ebbc, eva=3229140452)
    at ../../i386/i386/trap.c:942
#10 0xc01b81e7 in trap_pfault (frame=0xc5a2ebbc, usermode=0, eva=3229140452)
    at ../../i386/i386/trap.c:835
#11 0xc01b7e46 in trap (frame={tf_es = -1072562160, tf_ds = -1060503536,
      tf_edi = -980214208, tf_esi = -1065826844, tf_ebp = -979178480,
      tf_isp = -979178524, tf_ebx = -1071692168, tf_edx = 0,
      tf_ecx = 1418496, tf_eax = 1041892, tf_trapno = 12, tf_err = 0,
      tf_eip = -1072508253, tf_cs = 8, tf_eflags = 66182,
      tf_esp = -978186112, tf_ss = 0})
    at ../../i386/i386/trap.c:437
#12 0xc012d2a3 in free (addr=0x10000, type=0xc01f4678)
    at ../../kern/kern_malloc.c:271
#13 0xc01552b7 in vclean (vp=0xc5b21080, flags=8, p=0xc5931e40)
    at ../../kern/vfs_subr.c:1608
#14 0xc0155447 in vgonel (vp=0xc5b21080, p=0xc5931e40)
    at ../../kern/vfs_subr.c:1757
#15 0xc0154089 in getnewvnode (tag=VT_UFS, mp=0xc0a8f000, vops=0xc0a67200,
      vpp=0xc5a2ed04)
    at ../../kern/vfs_subr.c:464
#16 0xc0186759 in ffs_vget (mp=0xc0a8f000, ino=86273, vpp=0xc5a2ed84)
    at ../../ufs/ffs/ffs_vfsops.c:1083
#17 0xc018a2de in ufs_lookup (ap=0xc5a2eddc)
    at ../../ufs/ufs/ufs_lookup.c:546
#18 0xc018ea35 in ufs_vnoperate (ap=0xc5a2eddc)
    at ../../ufs/ufs/ufs_vnops.c:2300
#19 0xc01510c8 in vfs_cache_lookup (ap=0xc5a2ee38) at vnode_if.h:55
#20 0xc018ea35 in ufs_vnoperate (ap=0xc5a2ee38)
    at ../../ufs/ufs/ufs_vnops.c:2300
#21 0xc0153599 in lookup (ndp=0xc5a2eeb8) at vnode_if.h:31
#22 0xc015306c in namei (ndp=0xc5a2eeb8) at ../../kern/vfs_lookup.c:152
#23 0xc01587d4 in lstat (p=0xc5931e40, uap=0xc5a2ef94)
    at ../../kern/vfs_syscalls.c:1649
#24 0xc01b8777 in syscall (frame={tf_es = 134545447, tf_ds = -1078001625,
      tf_edi = 134583040, tf_esi = 134583120, tf_ebp = -1077945476,
      tf_isp = -979177500, tf_ebx = 671964872, tf_edx = 134583104,
      tf_ecx = 134541312, tf_eax = 190, tf_trapno = 7, tf_err = 2,
      tf_eip = 671711496, tf_cs = 31, tf_eflags = 582,
      tf_esp = -1077945604, tf_ss = 39})
    at ../../i386/i386/trap.c:1100
#25 0xc01ae0dc in Xint0x80_syscall ()
#26 0x28094eb5 in ?? ()
#27 0x280947a6 in ?? ()
#28 0x804919f in ?? ()
#29 0x804af96 in ?? ()
#30 0x804904d in ?? ()
(kgdb) quit
proxy## exit

Script done on Tue May 2 12:00:35 2000

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-bugs" in the body of the message
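
A note for anyone cross-checking the backtraces against the panic messages: kgdb prints the trap frame registers (tf_eip and friends) as signed decimals and eva as an unsigned decimal, while the panic messages give the same values in hex. The following is only a minimal sketch of the conversion; the helper name to_hex32 is made up for illustration and is not part of gdb or FreeBSD.

```python
def to_hex32(value: int) -> str:
    """Render a signed or unsigned 32-bit decimal value as 8-digit hex."""
    # Masking with 0xFFFFFFFF maps negative values onto their
    # two's-complement 32-bit representation.
    return f"0x{value & 0xFFFFFFFF:08x}"


if __name__ == "__main__":
    # tf_eip from frame #4 of the first dump (vmcore.0)
    print(to_hex32(-1071944945))
    # eva from frames #2 and #3 of the first dump
    print(to_hex32(3217168992))
```

For the first dump this gives 0xc01b6b0f for tf_eip, which is the reported instruction pointer and the address of frame #5 (pmap_remove_pages), and 0xbfc21a60 for eva, which is the reported fault virtual address.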