From owner-freebsd-stable Wed Oct 4 16:20:43 2000 Delivered-To: freebsd-stable@freebsd.org Received: from pathlink.net (linda.i1.pathlink.com [209.155.56.197]) by hub.freebsd.org (Postfix) with ESMTP id E76B837B502 for ; Wed, 4 Oct 2000 16:20:37 -0700 (PDT) Received: from dvl-1 (dvl-1.pathlink.com [209.155.56.211]) by pathlink.net (8.9.3/8.9.3) with SMTP id QAA27753 for ; Wed, 4 Oct 2000 16:20:37 -0700 (PDT) (envelope-from kachun@pathlink.com) Message-Id: <200010042320.QAA27753@pathlink.net> To: freebsd-stable@freebsd.org Subject: Re: More panics (different hardware) From: kachun@pathlink.com (Kachun Lee) Date: Wed, 04 Oct 2000 16:20:29 -0700 X-Mailer: WinVN 0.99.8 (x86 32bit) In-Reply-To: <200010041006.e94A6CE56504@karma.freebsd-uk.eu.org> References: <200010011925.MAA45944@pathlink.net> <200010041006.e94A6CE56504@karma.freebsd-uk.eu.org> MIME-Version: 1.0 Content-Type: Text/Plain; charset=US-ASCII Sender: owner-freebsd-stable@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG I have set up a test server, P2B/DS PII 400Mhz and couple SCSI drives, and it has been running buildworld for over a day without a panic. With the 3 inn's crashed in this thread, I think we can rule out the type of ahc (W, U2 and U160), CPU (PII/III/speed) or SMP. However, all 3 have an ata/ahc combination and more than one ahc. I will try to dig up another ahc and see if I can duplicate the problem with the test server. Of course, the other thing missing is network traffic. In article <200010041006.e94A6CE56504@karma.freebsd-uk.eu.org>, you say... > >kachun@pathlink.com (Kachun Lee) wrote: > >> I upgraded one of our news servers from 4.0-S to 4.1.1 Release. >> [...] >> Fatal trap 12: page fault while in kernel mode >> mp_lock = 00000002; cpuid = 0; lapic.id = 01000000 >> fault virtual address = 0x14 >> fault code = supervisor read, page not present >> instruction pointer = 0x8:0xc0134bb3 >> [...] >> c01349c4 t ahc_action <----------------------------- > >Same here; upgraded two news boxes from 4.1-STABLE to 4.1.1-RELEASE and they >were both crashing regularly. Here's three panics from the worst culprit: > > *** #1 *** > Fatal trap 12: page fault while in kernel mode > fault virtual address = 0x14 > fault code = supervisor read, page not present > instruction pointer = 0x8:0xc012f69f > stack pointer = 0x10:0xc7d6ea18 > frame pointer = 0x10:0xc7d6ea80 > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, def32 1, gran 1 > processor eflags = interrupt enabled, resume, IOPL = 0 > current process = 5 (syncer) > interrupt mask = > trap number = 12 > panic: page fault > > syncing disks... panic: lockmgr: locking against myself > > *** #2 *** > > Fatal trap 12: page fault while in kernel mode > fault virtual address = 0x14 > fault code = supervisor read, page not present > instruction pointer = 0x8:0xc012f69f > stack pointer = 0x10:0xc7d6ccc8 > frame pointer = 0x10:0xc7d6cd30 > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, def32 1, gran 1 > processor eflags = interrupt enabled, resume, IOPL = 0 > current process = 4 (bufdaemon) > interrupt mask = bio > trap number = 12 > panic: page fault > > syncing disks... 397 397 397 397 397 397 397 397 397 397 397 397 397 397 397 397 397 397 397 397 > giving up on 345 buffers > > *** #3 *** > > Fatal trap 12: page fault while in kernel mode > fault virtual address = 0x14 > fault code = supervisor read, page not present^ > instruction pointer = 0x8:0xc012f69f > stack pointer = 0x10:0xc7d6cc94 > frame pointer = 0x10:0xc7d6ccfc > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, def32 1, gran 1 > processor eflags = interrupt enabled, resume, IOPL = 0 > current process = 4 (bufdaemon) > interrupt mask = bio > trap number = 12 > panic: page fault > > syncing disks... > >Spot the pattern: > > utserv% grep ^instr nakaleen > instruction pointer = 0x8:0xc012f69f > instruction pointer = 0x8:0xc012f69f > instruction pointer = 0x8:0xc012f69f > instruction pointer = 0x8:0xc012f69f > instruction pointer = 0x8:0xc012f69f > instruction pointer = 0x8:0xc012f6a3 > >The last one is running a debug kernel, but I didn't manage to either get a >dump or get into the debugger. :-( > > nakaleen% nm /kernel.debug | grep ^c012f6a3 > nakaleen% nm /kernel.debug | grep ^c012f6a > nakaleen% nm /kernel.debug | grep ^c012f6 > nakaleen% nm /kernel.debug | grep ^c012f > c012f4b4 t ahc_action > >I've reverted to 4.1-STABLE from around 7th August, which is fine, except for >the kvm change between the two. Bah. > >Copyright (c) 1992-2000 The FreeBSD Project. >Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 > The Regents of the University of California. All rights reserved. >FreeBSD 4.1.1-RELEASE #0: Tue Oct 3 12:54:30 BST 2000 > ip@karma:/usr/src/sys/compile/NAKALEEN >Timecounter "i8254" frequency 1193182 Hz >Timecounter "TSC" frequency 400911107 Hz >CPU: Pentium II/Pentium II Xeon/Celeron (400.91-MHz 686-class CPU) > Origin = "GenuineIntel" Id = 0x651 Stepping = 1 > Features=0x183f9ff >real memory = 134217728 (131072K bytes) >avail memory = 127995904 (124996K bytes) >Preloaded elf kernel "kernel" at 0xc02b5000. >Pentium Pro MTRR support enabled >npx0: on motherboard >npx0: INT 16 interface >pcib0: on motherboard >pci0: on pcib0 >pcib1: at device 1.0 on pci0 >pci1: on pcib1 >isab0: at device 7.0 on pci0 >isa0: on isab0 >atapci0: port 0xf000-0xf00f at device 7.1 on pci0 >ata0: at 0x1f0 irq 14 on atapci0 >ata1: at 0x170 irq 15 on atapci0 >pci0: at 7.2 >chip1: port 0x5000-0x500f at device 7.3 on pci0 >fxp0: port 0x6800-0x683f mem 0xe4000000-0xe40fffff,0xe4100000-0xe4100fff irq 12 at device 11.0 on pci0 >fxp0: Ethernet address 00:d0:b7:09:07:3f >ahc0: port 0x6c00-0x6cff mem 0xe4101000-0xe4101fff irq 10 at device 13.0 on pci0 >aic7892: Wide Channel A, SCSI Id=7, 32/255 SCBs >ahc1: port 0x7000-0x70ff mem 0xe4102000-0xe4102fff irq 11 at device 15.0 on pci0 >aic7892: Wide Channel A, SCSI Id=7, 32/255 SCBs >fdc0: at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0 >fdc0: FIFO enabled, 8 bytes threshold >fd0: <1440-KB 3.5" drive> on fdc0 drive 0 >atkbdc0: at port 0x60,0x64 on isa0 >atkbd0: irq 1 on atkbdc0 >kbd0 at atkbd0 >vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 >sc0: on isa0 >sc0: VGA <16 virtual consoles, flags=0x0> >sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0 >sio0: type 16550A, console >sio1 at port 0x2f8-0x2ff irq 3 on isa0 >sio1: type 16550A >ppc0: at port 0x378-0x37f irq 7 on isa0 >ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode >lpt0: on ppbus0 >lpt0: Interrupt-driven port >ad0: 124MB [1001/15/17] at ata0-master using PIO0 >ad1: 325MB [768/14/62] at ata0-slave using PIO0 >ad2: 405MB [989/15/56] at ata1-master using PIO3 >Waiting 5 seconds for SCSI devices to settle >da0 at ahc0 bus 0 target 0 lun 0 >da0: Fixed Direct Access SCSI-2 device >da0: 80.000MB/s transfers (40.000MHz, offset 15, 16bit) >da0: 47702MB (97693755 512 byte sectors: 255H 63S/T 6081C) >da2 at ahc0 bus 0 target 2 lun 0 >da2: Fixed Direct Access SCSI-2 device >da2: 80.000MB/s transfers (40.000MHz, offset 15, 16bit) >da2: 47702MB (97693755 512 byte sectors: 255H 63S/T 6081C) >da4 at ahc1 bus 0 target 0 lun 0 >da4: Fixed Direct Access SCSI-2 device >da4: 80.000MB/s transfers (40.000MHz, offset 15, 16bit) >da4: 47702MB (97693755 512 byte sectors: 255H 63S/T 6081C) >da3 at ahc0 bus 0 target 3 lun 0 >da3: Fixed Direct Access SCSI-2 device >da3: 80.000MB/s transfers (40.000MHz, offset 15, 16bit) >da3: 47702MB (97693755 512 byte sectors: 255H 63S/T 6081C) >da6 at ahc1 bus 0 target 2 lun 0 >da6: Fixed Direct Access SCSI-2 device >da6: 80.000MB/s transfers (40.000MHz, offset 15, 16bit) >da6: 47702MB (97693755 512 byte sectors: 255H 63S/T 6081C) >da1 at ahc0 bus 0 target 1 lun 0 >da1: Fixed Direct Access SCSI-2 device >da1: 80.000MB/s transfers (40.000MHz, offset 15, 16bit) >da1: 47702MB (97693755 512 byte sectors: 255H 63S/T 6081C) >da7 at ahc1 bus 0 target 3 lun 0 >da7: Fixed Direct Access SCSI-2 device >da7: 80.000MB/s transfers (40.000MHz, offset 15, 16bit) >da7: 47702MB (97693755 512 byte sectors: 255H 63S/T 6081C) >da5 at ahc1 bus 0 target 1 lun 0 >da5: Fixed Direct Access SCSI-2 device >da5: 80.000MB/s transfers (40.000MHz, offset 15, 16bit) >da5: 47702MB (97693755 512 byte sectors: 255H 63S/T 6081C) >Mounting root from ufs:/dev/ad0s1a >WARNING: / was not properly dismounted >vinum: loaded >vinum: reading configuration from /dev/da7s1e >vinum: updating configuration from /dev/da4s1e >vinum: updating configuration from /dev/da3s1e >vinum: updating configuration from /dev/da5s1e >vinum: updating configuration from /dev/da6s1e >vinum: updating configuration from /dev/da2s1e >vinum: updating configuration from /dev/da1s1e >vinum: updating configuration from /dev/da0s1e > >FWIW, > >Ian. > > >To Unsubscribe: send mail to majordomo@FreeBSD.org >with "unsubscribe freebsd-stable" in the body of the message To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-stable" in the body of the message