Date: Tue, 8 Jan 2002 12:13:20 -0000
From: "Lawrence Farr" <freebsd-net@epcdirect.co.uk>
To: <freebsd-stable@freebsd.org>
Cc: "'Nils Holland'" <nils@tisys.org>
Subject: RE: 4.5-PRERELEASE: VIA chases again...
Message-ID: <000801c1983d$dd7dca00$c80fa8c0@lfarr>
In-Reply-To: <20020108125104.A172@tisys.org>

Hi,

Have a look at Message-ID: <000901c1944f$9854dc30$c806a8c0@lfarr>,
titled "Fatal trap 12 when running mergemaster."

I was having a lot of strange crashes on a previously rock-stable
machine. After swapping every component, I gave up and put in a SCSI
disk. It ran as solid as a rock again through about 6 or 7 makeworlds,
X11 and KDE builds. I then moved the install back to another disk (I ran
the IBM low-level disk tools on the disk first, and it passed). Within an
hour I had a panic. It was a corrupt directory error, but the machine
rebooted before I had a chance to write down the error. I newfs'd it last
night and left it running with no problems, and it's building world again
now.

It's an Asus A7V133 with Crucial RAM and a decent 300W supply.
Here's a dmesg:

Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 4.5-PRERELEASE #0: Thu Jan 3 15:17:56 GMT 2002
root@frogger.epcdirect.co.uk:/usr/obj/usr/src/sys/P6
Timecounter "i8254" frequency 1193182 Hz
CPU: AMD Athlon(tm) Processor (1208.75-MHz 686-class CPU)
Origin = "AuthenticAMD" Id = 0x642 Stepping = 2
Features=0x183f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR>
AMD Features=0xc0440000<<b18>,AMIE,DSP,3DNow!>
real memory = 1073659904 (1048496K bytes)
avail memory = 1041154048 (1016752K bytes)
Preloaded elf kernel "kernel" at 0xc03ad000.
Preloaded elf module "vesa.ko" at 0xc03ad09c.
VESA: v3.0, 16384k memory, flags:0x1, mode table:0xc03aa377 (1000117)
VESA: 3dfx Interactive, Inc.
Pentium Pro MTRR support enabled
md0: Malloc disk
Using $PIR table, 9 entries at 0xc00f1720
apm0: <APM BIOS> on motherboard
apm: found APM BIOS v1.2, connected at v1.2
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <Host to PCI bridge> on motherboard
pci0: <PCI bus> on pcib0
pcib1: <VIA 8363 (Apollo KT133) PCI-PCI (AGP) bridge> at device 1.0 on pci0
pci1: <PCI bus> on pcib1
pci1: <3Dfx Voodoo 3 graphics accelerator> at 0.0 irq 11
isab0: <VIA 82C686 PCI-ISA bridge> at device 4.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <VIA 82C686 ATA100 controller> port 0xb800-0xb80f at device 4.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
uhci0: <VIA 83C572 USB controller> port 0xb400-0xb41f irq 3 at device 4.2 on pci0
usb0: <VIA 83C572 USB controller> on uhci0
usb0: USB revision 1.0
uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1: <VIA 83C572 USB controller> port 0xb000-0xb01f irq 3 at device 4.3 on pci0
usb1: <VIA 83C572 USB controller> on uhci1
usb1: USB revision 1.0
uhub1: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
chip1: <VIA 82C686 ACPI interface> at device 4.4 on pci0
pcm0: <VIA VT82C686A> port 0xa000-0xa003,0xa400-0xa403,0xa800-0xa8ff irq 15 at device 4.5 on pci0
fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0x9400-0x941f mem 0xdd800000-0xdd8fffff,0xe1000000-0xe1000fff irq 15 at device 10.0 on pci0
fxp0: Ethernet address 00:90:27:ac:c5:47
inphy0: <i82555 10/100 media interface> on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
ahc0: <Adaptec 2940A Ultra SCSI adapter> port 0x9000-0x90ff mem 0xdd000000-0xdd000fff irq 3 at device 13.0 on pci0
aic7860: Ultra Single Channel A, SCSI Id=7, 3/255 SCBs
atapci1: <Promise ATA100 controller> port 0x7400-0x743f,0x7800-0x7803,0x8000-0x8007,0x8400-0x8403,0x8800-0x8807 mem 0xdc800000-0xdc81ffff irq 10 at device 17.0 on pci0
ata2: at 0x8800 on atapci1
ata3: at 0x8000 on atapci1
orm0: <Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xc87ff,0xcc000-0xcdfff on isa0
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: model IntelliMouse, device ID 3
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1: configured irq 3 not in bitmap of probed irqs 0
pcf0: <PCF8584 I2C bus controller> at port 0x320-0x321 irq 5 on isa0
iicbus0: <Philips I2C bus> on pcf0 addr 0xaa
iicsmb0: <I2C to SMB bridge> on iicbus0
smbus0: <System Management Bus> on iicsmb0
smb0: <SMBus general purpose I/O> on smbus0
iic0: <I2C general purpose I/O> on iicbus0
ad4: 29314MB <IBM-DTLA-307030> [59560/16/63] at ata2-master UDMA100
acd0: CDROM <CD-540E> at ata0-slave using PIO4
Waiting 2 seconds for SCSI devices to settle
Mounting root from ufs:/dev/ad4s1a
cd0 at ahc0 bus 0 target 2 lun 0
cd0: <MATSHITA PD-2 LF-D100 A108> Removable CD-ROM SCSI-2 device
cd0: 10.000MB/s transfers (10.000MHz, offset 15)
cd0: cd present [272705 x 2048 byte records]

Lawrence Farr
EPC Direct Limited

> -----Original Message-----
> From: owner-freebsd-stable@FreeBSD.ORG
> [mailto:owner-freebsd-stable@FreeBSD.ORG] On Behalf Of Nils Holland
> Sent: 08 January 2002 11:51
> To: freebsd-stable@FreeBSD.ORG
> Subject: 4.5-PRERELEASE: VIA chases again...
>
>
> Hi folks,
>
> I guess that this is slowly getting boring, but it seems as if I have found
> one more crash. First of all, Matt and Soren will remember my "famous" NFS
> crash. Since Soren's 686B fix has been in RELENG_4, this problem is gone,
> as a long test proved. However, there's something new now.
> Here are the details:
>
> I just wanted to do a "make release". During that process, my machine
> crashed. The first time, the crash occurred while it was checking out the
> sources from my local (i.e. I'm connected to it via Ethernet) CVS mirror.
> When I tried again, it checked out the sources fine, but crashed during the
> following "make buildworld" process. The *third* time, it crashed right
> after the reboot when I tried to rm -R the directory in which I planned to
> make the release.
>
> I have tried to reproduce this problem on two other machines; one is KT266A
> based and the other uses the MVP3. However, these machines don't seem to be
> affected. In the end, it seems as if this is the KT133 / 686B bug again.
> Note that I have already tried about *every* setting in the BIOS and
> swapped about *every* component in the system. It all makes no difference.
>
> I don't know how well others' KT133 / 686B machines are working, but I have
> the feeling that Chaintech, my mainboard's manufacturer, did a
> really *bad* job in making this board and its BIOS, and therefore it is
> even possible (I guess) that this problem cannot be fixed easily, if at
> all. Since my crappy Chaintech board seems to be the only thing that's
> affected, I'm at least glad that it doesn't seem that there's something
> important broken in 4.5-PRERELEASE.
>
> Anyway, I'd like to present you some information about my latest crash.
> Unluckily, this information was not gathered with a debug kernel, but
> something is already visible right now: It seems to be a *different* crash
> than the one I had in conjunction with NFS:
>
>
> Script started on Tue Jan 8 12:21:05 2002
> root@jodie> gdb -k kernel.0 vmcore.0
> This GDB was configured as "i386-unknown-freebsd"...
> (no debugging symbols found)...
> IdlePTD at phsyical address 0x00386000
> initial pcb at physical address 0x002e85a0
> panicstr: softdep_lock: locking against myself
> panic messages:
> ---
> Fatal trap 12: page fault while in kernel mode
> fault virtual address = 0x38
> fault code = supervisor read, page not present
> instruction pointer = 0x8:0xc01fc645
> stack pointer = 0x10:0xe07c0bc8
> frame pointer = 0x10:0xe07c0bd4
> code segment = base 0x0, limit 0xfffff, type 0x1b
> = DPL 0, pres 1, def32 1, gran 1
> processor eflags = interrupt enabled, resume, IOPL = 0
> current process = 35488 (mkdir)
> interrupt mask = bio
> trap number = 12
> panic: page fault
>
> syncing disks... panic: softdep_lock: locking against myself
> Uptime: 2h27m12s
>
> dumping to dev #ad/0x20001, offset 1580448
> dump ata0: resetting devices .. done
> ---
> #0 0xc0143706 in dumpsys ()
> (kgdb) where
> #0 0xc0143706 in dumpsys ()
> #1 0xc0143527 in boot ()
> #2 0xc01438fc in poweroff_wait ()
> #3 0xc01fbd6a in acquire_lock ()
> #4 0xc01ffde8 in softdep_update_inodeblock ()
> #5 0xc01faecd in ffs_update ()
> #6 0xc0204801 in ffs_fsync ()
> #7 0xc020310f in ffs_sync ()
> #8 0xc0172e8f in sync ()
> #9 0xc01432da in boot ()
> #10 0xc01438fc in poweroff_wait ()
> #11 0xc0270232 in trap_fatal ()
> #12 0xc026ff05 in trap_pfault ()
> #13 0xc026faef in trap ()
> #14 0xc01fc645 in pagedep_lookup ()
> #15 0xc01fcf29 in softdep_setup_allocdirect ()
> #16 0xc01fa70f in ffs_balloc ()
> #17 0xc0209a80 in ufs_mkdir ()
> #18 0xc020aa19 in ufs_vnoperate ()
> #19 0xc0175dbe in mkdir ()
> #20 0xc02704dd in syscall2 ()
> #21 0xc0264b55 in Xint0x80_syscall ()
> #22 0x80482d2 in ?? ()
> #23 0x8048135 in ?? ()
> (kgdb) quit
> root@jodie> exit
> exit
>
> As you can see, this time the crash doesn't seem to take place in some part
> of the NFS code, but in the UFS/FFS code. Once again, I guess the code is
> perfectly fine, but my board sucks. Oh, here's also another
> /var/run/dmesg.boot:
>
>
> Copyright (c) 1992-2002 The FreeBSD Project.
> Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
> The Regents of the University of California. All rights reserved.
> FreeBSD 4.5-PRERELEASE #0: Sat Jan 5 11:41:47 CET 2002
> root@poison.ncptiddische.net:/usr/obj/usr/src/sys/JODIE
> Timecounter "i8254" frequency 1193182 Hz
> Timecounter "TSC" frequency 996634421 Hz
> CPU: AMD Athlon(tm) processor (996.63-MHz 686-class CPU)
> Origin = "AuthenticAMD" Id = 0x642 Stepping = 2
> Features=0x183f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR>
> AMD Features=0xc0440000<<b18>,AMIE,DSP,3DNow!>
> real memory = 805240832 (786368K bytes)
> avail memory = 779370496 (761104K bytes)
> Preloaded elf kernel "kernel" at 0xc0367000.
> Pentium Pro MTRR support enabled
> Using $PIR table, 8 entries at 0xc00fdba0
> npx0: <math processor> on motherboard
> npx0: INT 16 interface
> pcib0: <Host to PCI bridge> on motherboard
> pci0: <PCI bus> on pcib0
> pcib1: <VIA 8363 (Apollo KT133) PCI-PCI (AGP) bridge> at device 1.0 on pci0
> pci1: <PCI bus> on pcib1
> pci1: <ATI Mach64-GM graphics accelerator> at 0.0 irq 11
> isab0: <VIA 82C686 PCI-ISA bridge> at device 7.0 on pci0
> isa0: <ISA bus> on isab0
> atapci0: <VIA 82C686 ATA100 controller> port 0xd000-0xd00f at device 7.1 on pci0
> atapci0: Correcting VIA config for southbridge data corruption bug
> ata0: at 0x1f0 irq 14 on atapci0
> ata1: at 0x170 irq 15 on atapci0
> pci0: <VIA 83C572 USB controller> at 7.2 irq 10
> pci0: <VIA 83C572 USB controller> at 7.3 irq 10
> chip1: <VIA 82C686 ACPI interface> at device 7.4 on pci0
> pcm0: <VIA VT82C686A> port 0xe400-0xe403,0xe000-0xe003,0xdc00-0xdcff irq 5 at device 7.5 on pci0
> bktr0: <BrookTree 878> mem 0xdc001000-0xdc001fff irq 10 at device 16.0 on pci0
> iicbb0: <I2C generic bit-banging driver> on bti2c0
> iicbus0: <Philips I2C bus> on iicbb0 master-only
> smbus0: <System Management Bus> on bti2c0
> bktr0: Hauppauge Model 44354 C221
> bktr0: Detected a MSP3415D-B3 at 0x80
> bktr0: Hauppauge WinCast/TV, Philips FR1216 PAL FM tuner, msp3400c stereo, remote control.
> pci0: <unknown card> (vendor=0x109e, dev=0x0878) at 16.1 irq 10
> ed0: <NE2000 PCI Ethernet (RealTek 8029)> port 0xe800-0xe81f irq 11 at device 17.0 on pci0
> ed0: address 00:20:18:2f:42:2d, type NE2000 (16 bit)
> orm0: <Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xcbfff on isa0
> fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
> fdc0: FIFO enabled, 8 bytes threshold
> fd0: <1440-KB 3.5" drive> on fdc0 drive 0
> atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
> atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
> kbd0 at atkbd0
> psm0: <PS/2 Mouse> irq 12 on atkbdc0
> psm0: model Generic PS/2 mouse, device ID 0
> vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
> sc0: <System console> at flags 0x100 on isa0
> sc0: VGA <16 virtual consoles, flags=0x300>
> ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0
> ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
> ppc0: FIFO with 16/16/8 bytes threshold
> ppbus0: IEEE1284 device found /NIBBLE/ECP
> Probing for PnP devices on ppbus0:
> ppbus0: <HEWLETT-PACKARD DESKJET 930C> MLC,PCL,PML
> lpt0: <Printer> on ppbus0
> lpt0: Interrupt-driven port
> ad0: 57241MB <WDC WD600AB-32CZA0> [116301/16/63] at ata0-master UDMA100
> acd0: CDROM <CD-ROM 52X/AKH> at ata1-master using PIO4
> acd1: CD-RW <LITE-ON LTR-12101B> at ata1-slave using PIO4
> Mounting root from ufs:/dev/ad0s1a
>
>
> Ok, that's it. I'd be thankful for any hints, but I guess I'm going to have
> to wait and track Soren's 686B patch and see if a future update possibly
> makes my problems go away. I might also try flashing an older version of my
> BIOS and see if that helps. Oh my, I guess I'm never going to buy Chaintech
> again...
>
> Greetings
> Nils
>
> --
> Nils Holland
> Ti Systems - FreeBSD in Tiddische, Germany
> http://www.tisys.org * nils@tisys.org
>
> To Unsubscribe: send mail to majordomo@FreeBSD.org
> with "unsubscribe freebsd-stable" in the body of the message
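
For anyone comparing backtraces like the one quoted above, here is a minimal Python sketch of how to pick out the function that actually faulted. It assumes the frame format printed by the kgdb "where" command in the quoted session ("#N 0xADDR in name ()") and treats the frame directly below trap() as the faulting one. The helper names parse_frames and faulting_frame are made up for illustration; they are not part of gdb or FreeBSD.

```python
import re

# Frames 11-17 copied from the quoted kgdb "where" output.
BACKTRACE = """\
#11 0xc0270232 in trap_fatal ()
#12 0xc026ff05 in trap_pfault ()
#13 0xc026faef in trap ()
#14 0xc01fc645 in pagedep_lookup ()
#15 0xc01fcf29 in softdep_setup_allocdirect ()
#16 0xc01fa70f in ffs_balloc ()
#17 0xc0209a80 in ufs_mkdir ()
"""

# Assumed frame layout: "#<number> <hex address> in <function> ()".
FRAME_RE = re.compile(r"#(\d+)\s+(0x[0-9a-f]+) in (\w+) \(\)")


def parse_frames(text):
    """Return (frame number, address, function) tuples in stack order."""
    return [(int(n), int(addr, 16), func)
            for n, addr, func in FRAME_RE.findall(text)]


def faulting_frame(frames):
    """Return the frame directly below trap(), i.e. the code that faulted."""
    for i, (_, _, func) in enumerate(frames):
        if func == "trap" and i + 1 < len(frames):
            return frames[i + 1]
    return None


frames = parse_frames(BACKTRACE)
fault = faulting_frame(frames)
print(fault[2], hex(fault[1]))
```

The address of the frame it picks (0xc01fc645, in pagedep_lookup) matches the "instruction pointer" line of the panic message, which fits Nils's reading that this crash is in the soft updates / FFS path rather than in NFS.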