From owner-freebsd-stable@FreeBSD.ORG Fri Sep 19 06:45:49 2003 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id F3AAE16A4B3 for ; Fri, 19 Sep 2003 06:45:48 -0700 (PDT) Received: from imf17aec.mail.bellsouth.net (imf17aec.mail.bellsouth.net [205.152.59.65]) by mx1.FreeBSD.org (Postfix) with ESMTP id 98C3843F75 for ; Fri, 19 Sep 2003 06:45:47 -0700 (PDT) (envelope-from dngor@bellsouth.net) Received: from eyrie.homenet ([68.213.211.142]) by imf17aec.mail.bellsouth.netESMTP <20030919134546.YWIQ1821.imf17aec.mail.bellsouth.net@eyrie.homenet> for ; Fri, 19 Sep 2003 09:45:46 -0400 Received: from eyrie.homenet (abuse@localhost [127.0.0.1]) by eyrie.homenet (8.12.9/8.12.9) with ESMTP id h8JDjic1008648 for ; Fri, 19 Sep 2003 09:45:44 -0400 (EDT) (envelope-from troc@eyrie.homenet) Received: (from troc@localhost) by eyrie.homenet (8.12.9/8.12.9/Submit) id h8JDjiXq008647 for stable@freebsd.org; Fri, 19 Sep 2003 09:45:44 -0400 (EDT) (envelope-from troc) Date: Fri, 19 Sep 2003 09:45:44 -0400 From: Rocco Caputo To: stable@freebsd.org Message-ID: <20030919134544.GC490@eyrie.homenet> References: <3F6A98F3.7080801@freebsd.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <3F6A98F3.7080801@freebsd.org> User-Agent: Mutt/1.4.1i Subject: Re: 4.9 stability update X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 19 Sep 2003 13:45:49 -0000 On Thu, Sep 18, 2003 at 11:49:39PM -0600, Scott Long wrote: > > We'd like to get a new poll on the stability and readiness of 4.9. [...] Since July, my machine has been "spontaneously" rebooting about once every day or two. It started with the addition of a second NIC (LNE 100TX, dc0 driver). Updating the kernel to post-PAE versions has not resolved the issue. Neither has replacing the NIC. The machine did perform a full panic once, before I turned on crash dumps. It hasn't properly panicked since. Aug 8 19:40:05 eyrie /kernel: dc0: TX underrun -- increasing TX threshold Aug 8 19:51:20 eyrie /kernel: Aug 8 19:51:20 eyrie /kernel: Aug 8 19:51:20 eyrie /kernel: Fatal trap 12: page fault while in kernel mode Aug 8 19:51:20 eyrie /kernel: fault virtual address = 0x70080 Aug 8 19:51:20 eyrie /kernel: fault code = supervisor read, page not present Aug 8 19:51:20 eyrie /kernel: instruction pointer = 0x8:0xc0312902 Aug 8 19:51:20 eyrie /kernel: stack pointer = 0x10:0xdb2f6c5c Aug 8 19:51:20 eyrie /kernel: frame pointer = 0x10:0xdb2f6c84 Aug 8 19:51:20 eyrie /kernel: code segment = base 0x0, limit 0xfffff, type 0x1b Aug 8 19:51:20 eyrie /kernel: = DPL 0, pres 1, def32 1, gran 1 Aug 8 19:51:20 eyrie /kernel: processor eflags = interrupt enabled, resume, IOPL = 0 Aug 8 19:51:20 eyrie /kernel: current process = 92 (ppp) Aug 8 19:51:20 eyrie /kernel: interrupt mask = net tty Aug 8 19:51:20 eyrie /kernel: trap number = 12 Aug 8 19:51:20 eyrie /kernel: panic: page fault Aug 8 19:51:20 eyrie /kernel: Aug 8 19:51:20 eyrie /kernel: syncing disks... 52 1 Aug 8 19:51:20 eyrie /kernel: done Aug 8 19:51:20 eyrie /kernel: Uptime: 16h57m35s Aug 8 19:51:20 eyrie /kernel: Automatic reboot in 15 seconds - press a key on the console to abort Aug 8 19:51:20 eyrie /kernel: --> Press a key on the console to reboot, Aug 8 19:51:20 eyrie /kernel: --> or switch off the system now. Aug 8 19:51:20 eyrie /kernel: Rebooting... At one point I noticed my machine running sluggishly. systat/vmstat showed 50000+ interrupts/second. The number calmed down after resetting dc0 with ifconfig down/up. Recently I was looking at X when a reboot happened. It was immediately preceded by a band of red pixels across the top of the screen, as if memory were being improperly written. That prompted me to rebuild a fresh kernel, X, and everything related. The reboots continue. I'm not sure whether the problem is hardware or software. I assumed it was the NIC, but replacing it hasn't helped. I suspect the driver, but so far I don't have a crash dump to help you out. Obligatory dmesg: Sep 17 23:26:45 eyrie /kernel: Copyright (c) 1992-2003 The FreeBSD Project. Sep 17 23:26:45 eyrie /kernel: Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 Sep 17 23:26:45 eyrie /kernel: The Regents of the University of California. All rights reserved. Sep 17 23:26:45 eyrie /kernel: FreeBSD 4.9-PRERELEASE #0: Thu Sep 4 18:48:38 EDT 2003 Sep 17 23:26:45 eyrie /kernel: troc@eyrie.homenet:/usr/obj/usr/src/sys/RC20030904 Sep 17 23:26:45 eyrie /kernel: Timecounter "i8254" frequency 1193182 Hz Sep 17 23:26:45 eyrie /kernel: Timecounter "TSC" frequency 1000041536 Hz Sep 17 23:26:45 eyrie /kernel: CPU: AMD Athlon(tm) Processor (1000.04-MHz 686-class CPU) Sep 17 23:26:45 eyrie /kernel: Origin = "AuthenticAMD" Id = 0x642 Stepping = 2 Sep 17 23:26:45 eyrie /kernel: Features=0x183f9ff Sep 17 23:26:45 eyrie /kernel: AMD Features=0xc0440000 Sep 17 23:26:45 eyrie /kernel: real memory = 536805376 (524224K bytes) Sep 17 23:26:45 eyrie /kernel: avail memory = 517525504 (505396K bytes) Sep 17 23:26:45 eyrie /kernel: Preloaded elf kernel "kernel" at 0xc04ad000. Sep 17 23:26:45 eyrie /kernel: Preloaded elf module "vn.ko" at 0xc04ad09c. Sep 17 23:26:45 eyrie /kernel: Preloaded elf module "linux.ko" at 0xc04ad138. Sep 17 23:26:45 eyrie /kernel: Preloaded elf module "agp.ko" at 0xc04ad1d8. Sep 17 23:26:45 eyrie /kernel: VESA: v2.0, 65536k memory, flags:0x1, mode table:0xc03f3242 (1000022) Sep 17 23:26:45 eyrie /kernel: VESA: ATI RADEON Sep 17 23:26:45 eyrie /kernel: Pentium Pro MTRR support enabled Sep 17 23:26:45 eyrie /kernel: md0: Malloc disk Sep 17 23:26:45 eyrie /kernel: Using $PIR table, 10 entries at 0xc00fdf20 Sep 17 23:26:45 eyrie /kernel: npx0: on motherboard Sep 17 23:26:45 eyrie /kernel: npx0: INT 16 interface Sep 17 23:26:45 eyrie /kernel: pcib0: on motherboard Sep 17 23:26:45 eyrie /kernel: pci0: on pcib0 Sep 17 23:26:45 eyrie /kernel: agp0: mem 0xec000000-0xefffffff at device 0.0 on pci0 Sep 17 23:26:45 eyrie /kernel: pcib1: at device 1.0 on pci0 Sep 17 23:26:45 eyrie /kernel: pci1: on pcib1 Sep 17 23:26:45 eyrie /kernel: pci1: at 0.0 irq 11 Sep 17 23:26:45 eyrie /kernel: isab0: at device 7.0 on pci0 Sep 17 23:26:45 eyrie /kernel: isa0: on isab0 Sep 17 23:26:45 eyrie /kernel: atapci0: port 0x1c40-0x1c4f at device 7.1 on pci0 Sep 17 23:26:45 eyrie /kernel: ata0: at 0x1f0 irq 14 on atapci0 Sep 17 23:26:45 eyrie /kernel: ata1: at 0x170 irq 15 on atapci0 Sep 17 23:26:45 eyrie /kernel: uhci0: port 0x1c00-0x1c1f irq 9 at device 7.2 on pci0 Sep 17 23:26:45 eyrie /kernel: usb0: on uhci0 Sep 17 23:26:45 eyrie /kernel: usb0: USB revision 1.0 Sep 17 23:26:45 eyrie /kernel: uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 Sep 17 23:26:45 eyrie /kernel: uhub0: 2 ports with 2 removable, self powered Sep 17 23:26:45 eyrie /kernel: uhci1: port 0x1c20-0x1c3f irq 9 at device 7.3 on pci0 Sep 17 23:26:45 eyrie /kernel: usb1: on uhci1 Sep 17 23:26:45 eyrie /kernel: usb1: USB revision 1.0 Sep 17 23:26:45 eyrie /kernel: uhub1: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 Sep 17 23:26:45 eyrie /kernel: uhub1: 2 ports with 2 removable, self powered Sep 17 23:26:45 eyrie /kernel: viapropm0: SMBus I/O base at 0x400 Sep 17 23:26:45 eyrie /kernel: viapropm0: port 0x400-0x40f at device 7.4 on pci0 Sep 17 23:26:45 eyrie /kernel: viapropm0: SMBus revision code 0x0 Sep 17 23:26:45 eyrie /kernel: smb0: on smbus0 Sep 17 23:26:45 eyrie /kernel: pcm0: port 0x1c50-0x1c53,0x1c54-0x1c57,0x1000-0x10ff irq 10 at device 7.5 on pci0 Sep 17 23:26:45 eyrie /kernel: pcm0: Sep 17 23:26:45 eyrie /kernel: dc0: port 0x1400-0x14ff mem 0xe8000000-0xe80003ff irq 10 at device 15.0 on pci0 Sep 17 23:26:45 eyrie /kernel: dc0: Ethernet address: 00:04:5a:63:94:cc Sep 17 23:26:45 eyrie /kernel: miibus0: on dc0 Sep 17 23:26:45 eyrie /kernel: ukphy0: on miibus0 Sep 17 23:26:45 eyrie /kernel: ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto Sep 17 23:26:45 eyrie /kernel: rl0: port 0x1800-0x18ff mem 0xe8000400-0xe80004ff irq 5 at device 18.0 on pci0 Sep 17 23:26:45 eyrie /kernel: rl0: Ethernet address: 00:e0:18:30:68:32 Sep 17 23:26:45 eyrie /kernel: miibus1: on rl0 Sep 17 23:26:45 eyrie /kernel: rlphy0: on miibus1 Sep 17 23:26:45 eyrie /kernel: rlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto Sep 17 23:26:45 eyrie /kernel: orm0: