From owner-freebsd-stable Tue Jan 8 3:51:45 2002 Delivered-To: freebsd-stable@freebsd.org Received: from mcqueen.wolfsburg.de (pns.wobline.de [212.68.68.5]) by hub.freebsd.org (Postfix) with ESMTP id E345837B416 for ; Tue, 8 Jan 2002 03:51:35 -0800 (PST) Received: from colt.ncptiddische.net (ppp-219.wobline.de [212.68.69.230]) by mcqueen.wolfsburg.de (8.11.3/8.11.3/tw-20010821) with ESMTP id g08BpRM24357 for ; Tue, 8 Jan 2002 12:51:27 +0100 Received: from tisys.org (jodie.ncptiddische.net [192.168.0.2]) by colt.ncptiddische.net (8.11.6/8.11.6) with ESMTP id g08BqIX42792 for ; Tue, 8 Jan 2002 12:52:20 +0100 (CET) (envelope-from nils@tisys.org) Received: (from nils@localhost) by tisys.org (8.11.6/8.11.6) id g08Bpe600197 for freebsd-stable@freebsd.org; Tue, 8 Jan 2002 12:51:40 +0100 (CET) (envelope-from nils) Date: Tue, 8 Jan 2002 12:51:04 +0100 From: Nils Holland To: freebsd-stable@freebsd.org Subject: 4.5-PRERELEASE: VIA chases again... Message-ID: <20020108125104.A172@tisys.org> Mail-Followup-To: freebsd-stable@freebsd.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.2.5i X-Operating-System: FreeBSD jodie.ncptiddische.net 4.5-PRERELEASE FreeBSD 4.5-PRERELEASE X-Machine-Uptime: 12:35PM up 58 secs, 1 user, load averages: 0.07, 0.03, 0.01 Sender: owner-freebsd-stable@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.ORG Hi folks, I guess that this is slowly getting boring, but it seems as if I have found one more crash. First of all, Matt and Soren will remember my "famous" NFs crash. Since Soren's 686B fix has been in RELENG_4, this problem is gone, as a long test proved. However, there's something new now. Here are the details: I just wanted to do a "make release". During that process, my machine crashed. The first time, the crash occured while it was checking out the sources from my local (i.e. I'm connected to it via Ethernet) CVS mirror. When I tried again, it'd check out the sources fine, but crash duwing the following "make buildworld" process. The *third* time, it crashed right after the reboot when I tried to rm -R the directory in which I planned to make the release. I have tried to reproduce this problem on two other machines, one is KT266A based and the other uses the MVP3. However, these machines don't seem to be affected. In the end, it seems as if this is the KT133 / 686B bug again. Note that I have already tried about *every* setting in the BIOS and swapped about *every* component in the system. It all makes no difference. I don't know how well others KT133 / 686B machines are working, but I have the feeling that Chaintech, my mainboard's manufacturer, did a really *bad* job in making this board and it's BIOS, and therefore it is even possible (I guess) that this problem cannot be fixed easily, if at all. Since my crappy Chaintech board seems to be the only thing that's affected, I'm at least glad that it doesn't seem that there'ssomething important broken in 4.5-PRERELEASE. Anyway, I'd like to present you some information about my latest crash. Unluckily, these information were not gathered with a debug kernel, but something is already visible right now: It seems to be a *different* crash than the one I had in conjunction with NFS: Script started on Tue Jan 8 12:21:05 2002 root@jodie> gdb -k kernel.0 vmcore.0 This GDB was configured as "i386-unknown-freebsd"... (no debugging symbols found)... IdlePTD at phsyical address 0x00386000 initial pcb at physical address 0x002e85a0 panicstr: softdep_lock: locking against myself panic messages: --- Fatal trap 12: page fault while in kernel mode fault virtual address = 0x38 fault code = supervisor read, page not present instruction pointer = 0x8:0xc01fc645 stack pointer = 0x10:0xe07c0bc8 frame pointer = 0x10:0xe07c0bd4 code segment = base 0x0, limit 0xfffff, type 0x1b = DPL 0, pres 1, def32 1, gran 1 processor eflags = interrupt enabled, resume, IOPL = 0 current process = 35488 (mkdir) interrupt mask = bio trap number = 12 panic: page fault syncing disks... panic: softdep_lock: locking against myself Uptime: 2h27m12s dumping to dev #ad/0x20001, offset 1580448 dump ata0: resetting devices .. done --- #0 0xc0143706 in dumpsys () (kgdb) where #0 0xc0143706 in dumpsys () #1 0xc0143527 in boot () #2 0xc01438fc in poweroff_wait () #3 0xc01fbd6a in acquire_lock () #4 0xc01ffde8 in softdep_update_inodeblock () #5 0xc01faecd in ffs_update () #6 0xc0204801 in ffs_fsync () #7 0xc020310f in ffs_sync () #8 0xc0172e8f in sync () #9 0xc01432da in boot () #10 0xc01438fc in poweroff_wait () #11 0xc0270232 in trap_fatal () #12 0xc026ff05 in trap_pfault () #13 0xc026faef in trap () #14 0xc01fc645 in pagedep_lookup () #15 0xc01fcf29 in softdep_setup_allocdirect () #16 0xc01fa70f in ffs_balloc () #17 0xc0209a80 in ufs_mkdir () #18 0xc020aa19 in ufs_vnoperate () #19 0xc0175dbe in mkdir () #20 0xc02704dd in syscall2 () #21 0xc0264b55 in Xint0x80_syscall () #22 0x80482d2 in ?? () #23 0x8048135 in ?? () (kgdb) quit root@jodie> exit exit As you can see, this time the crash doesn't seem to take place in some part of the NFS code, but in the UFS/FFS code. Once again, I guess the code is perfectly fine, but my board sucks. Oh, here's also another /var/run/dmesg.boot: Copyright (c) 1992-2002 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 4.5-PRERELEASE #0: Sat Jan 5 11:41:47 CET 2002 root@poison.ncptiddische.net:/usr/obj/usr/src/sys/JODIE Timecounter "i8254" frequency 1193182 Hz Timecounter "TSC" frequency 996634421 Hz CPU: AMD Athlon(tm) processor (996.63-MHz 686-class CPU) Origin = "AuthenticAMD" Id = 0x642 Stepping = 2 Features=0x183f9ff AMD Features=0xc0440000<,AMIE,DSP,3DNow!> real memory = 805240832 (786368K bytes) avail memory = 779370496 (761104K bytes) Preloaded elf kernel "kernel" at 0xc0367000. Pentium Pro MTRR support enabled Using $PIR table, 8 entries at 0xc00fdba0 npx0: on motherboard npx0: INT 16 interface pcib0: on motherboard pci0: on pcib0 pcib1: at device 1.0 on pci0 pci1: on pcib1 pci1: at 0.0 irq 11 isab0: at device 7.0 on pci0 isa0: on isab0 atapci0: port 0xd000-0xd00f at device 7.1 on pci0 atapci0: Correcting VIA config for southbridge data corruption bug ata0: at 0x1f0 irq 14 on atapci0 ata1: at 0x170 irq 15 on atapci0 pci0: at 7.2 irq 10 pci0: at 7.3 irq 10 chip1: at device 7.4 on pci0 pcm0: port 0xe400-0xe403,0xe000-0xe003,0xdc00-0xdcff irq 5 at device 7.5 on pci0 bktr0: mem 0xdc001000-0xdc001fff irq 10 at device 16.0 on pci0 iicbb0: on bti2c0 iicbus0: on iicbb0 master-only smbus0: on bti2c0 bktr0: Hauppauge Model 44354 C221 bktr0: Detected a MSP3415D-B3 at 0x80 bktr0: Hauppauge WinCast/TV, Philips FR1216 PAL FM tuner, msp3400c stereo, remote control. pci0: (vendor=0x109e, dev=0x0878) at 16.1 irq 10 ed0: port 0xe800-0xe81f irq 11 at device 17.0 on pci0 ed0: address 00:20:18:2f:42:2d, type NE2000 (16 bit) orm0: