From owner-freebsd-hackers Mon Sep 16 20:27:36 2002
Date: Tue, 17 Sep 2002 13:27:28 +1000
From: Greg Lane
To: freebsd-hackers@FreeBSD.org
Subject: Kernel panic under heavy disk load - trace enclosed
Message-ID: <20020917032728.GA458@nucl03.anu.edu.au>
Reply-To: gregory.lane@anu.edu.au

G'day all,

A few months ago my workstation began to crash periodically. Eventually
I realised that the crashes happened when it was put under heavy disk
load. Something like "make clean" in the ports tree, "rm -rf *" in a
directory with many files, or (sometimes) a "make buildworld" would
bring it down. Recently I finally got around to compiling a debug kernel
and saving a crash dump.
Running "rm -rf /usr/ports" brought it down three times, and each trace
is identical. One is shown below, followed by a dmesg.

I have been tracking stable on this machine since the beginning of 2001.
The trouble started, I am guessing, around April this year (2002).

Can anyone tell me whether this is a hardware problem or a real bug? Is
there anything else I could supply that would help in understanding the
problem?

Cheers,
Greg

(kgdb) where
#0  dumpsys () at /usr/src/sys/kern/kern_shutdown.c:487
#1  0xc017caab in boot (howto=260)
    at /usr/src/sys/kern/kern_shutdown.c:316
#2  0xc017cee9 in panic (fmt=0xc02c8300 "lockmgr: locking against myself")
    at /usr/src/sys/kern/kern_shutdown.c:595
#3  0xc017739b in lockmgr (lkp=0xcbccdeb8, flags=33620002,
    interlkp=0xc034e094, p=0xd2de45e0)
    at /usr/src/sys/kern/kern_lock.c:337
#4  0xc01a47e8 in getblk (vp=0xd391aa80, blkno=8519728, size=6144,
    slpflag=0, slptimeo=0) at /usr/src/sys/sys/buf.h:305
#5  0xc01a29a6 in bread (vp=0xd391aa80, blkno=8519728, size=6144,
    cred=0x0, bpp=0xd392ab30) at /usr/src/sys/kern/vfs_bio.c:508
#6  0xc0226c79 in ffs_freefile (pvp=0xd392ac54, ino=1032215, mode=33188)
    at /usr/src/sys/ufs/ffs/ffs_alloc.c:1592
#7  0xc022be24 in handle_workitem_freefile (freefile=0xc2143860)
    at /usr/src/sys/ufs/ffs/ffs_softdep.c:2913
#8  0xc02293c3 in process_worklist_item (matchmnt=0x0, flags=0)
    at /usr/src/sys/ufs/ffs/ffs_softdep.c:737
#9  0xc022922e in softdep_process_worklist (matchmnt=0x0)
    at /usr/src/sys/ufs/ffs/ffs_softdep.c:622
#10 0xc017c909 in boot (howto=256)
    at /usr/src/sys/kern/kern_shutdown.c:261
#11 0xc017cee9 in panic (fmt=0xc02d8dc0 "ffs_blkfree: freeing free frag")
    at /usr/src/sys/kern/kern_shutdown.c:595
#12 0xc02269bf in ffs_blkfree (ip=0xd392ae0c, bno=8103, size=1024)
    at /usr/src/sys/ufs/ffs/ffs_alloc.c:1470
#13 0xc022af1e in handle_workitem_freeblocks (freeblks=0xc202df00)
    at /usr/src/sys/ufs/ffs/ffs_softdep.c:2146
#14 0xc022939b in process_worklist_item (matchmnt=0x0, flags=0)
    at /usr/src/sys/ufs/ffs/ffs_softdep.c:723
#15 0xc022922e in softdep_process_worklist (matchmnt=0x0)
    at /usr/src/sys/ufs/ffs/ffs_softdep.c:622
#16 0xc01aae87 in sched_sync () at /usr/src/sys/kern/vfs_subr.c:1177
(kgdb) up 3
#3  0xc017739b in lockmgr (lkp=0xcbccdeb8, flags=33620002,
    interlkp=0xc034e094, p=0xd2de45e0)
    at /usr/src/sys/kern/kern_lock.c:337
337                     panic("lockmgr: locking against myself");
(kgdb) up 1
#4  0xc01a47e8 in getblk (vp=0xd391aa80, blkno=8519728, size=6144,
    slpflag=0, slptimeo=0) at /usr/src/sys/sys/buf.h:305
305             ret = lockmgr(&(bp)->b_lock, (locktype), &buftimelock, curproc);
(kgdb) up 8
#12 0xc02269bf in ffs_blkfree (ip=0xd392ae0c, bno=8103, size=1024)
    at /usr/src/sys/ufs/ffs/ffs_alloc.c:1470
1470            panic("ffs_blkfree: freeing free frag");
(kgdb) up 1
#13 0xc022af1e in handle_workitem_freeblocks (freeblks=0xc202df00)
    at /usr/src/sys/ufs/ffs/ffs_softdep.c:2146
2146                    ffs_blkfree(&tip, bn, bsize);

Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD 4.7-PRERELEASE #0: Mon Sep 16 10:20:40 EST 2002
    root@backup.lane.network:/usr/obj/usr/src/sys/NUCL03
Timecounter "i8254" frequency 1193182 Hz
Timecounter "TSC" frequency 730950415 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (730.95-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x686  Stepping = 6
  Features=0x383fbff
real memory  = 536739840 (524160K bytes)
avail memory = 518791168 (506632K bytes)
Preloaded elf kernel "kernel" at 0xc03bc000.
netsmb_dev: loaded
Pentium Pro MTRR support enabled
Using $PIR table, 10 entries at 0xc00f3ad0
npx0: on motherboard
npx0: INT 16 interface
pcib0: on motherboard
pci0: on pcib0
pcib1: at device 1.0 on pci0
pci1: on pcib1
pci1: at 0.0 irq 11
pcib2: at device 30.0 on pci0
pci2: on pcib2
ahc_pci0: port 0xd800-0xd8ff mem 0xfeaff000-0xfeafffff irq 11 at device 7.0 on pci2
aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
vr0: port 0xdc00-0xdc7f mem 0xfeafec00-0xfeafec7f irq 5 at device 11.0 on pci2
vr0: Ethernet address: 00:80:c8:da:0b:69
miibus0: on vr0
amphy0: on miibus0
amphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
xl0: <3Com 3c905C-TX Fast Etherlink XL> port 0xd480-0xd4ff mem 0xfeafe800-0xfeafe87f irq 11 at device 15.0 on pci2
xl0: Ethernet address: 00:01:02:80:d0:e5
miibus1: on xl0
xlphy0: <3c905C 10/100 internal PHY> on miibus1
xlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
isab0: at device 31.0 on pci0
isa0: on isab0
atapci0: port 0xffa0-0xffaf at device 31.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
pci0: at 31.2 irq 9
pci0: (vendor=0x8086, dev=0x2413) at 31.3 irq 10
pcm0: port 0xef00-0xef3f,0xe800-0xe8ff irq 10 at device 31.5 on pci0
orm0: