From owner-freebsd-hackers Fri Jun 7 9:33:34 2002 Delivered-To: freebsd-hackers@freebsd.org Received: from tinker.exit.com (tinker.exit.com [206.223.0.1]) by hub.freebsd.org (Postfix) with ESMTP id 27C7637B400; Fri, 7 Jun 2002 09:33:10 -0700 (PDT) Received: from realtime.exit.com (realtime [206.223.0.5]) by tinker.exit.com (8.12.3/8.12.3) with ESMTP id g57GWncn039525; Fri, 7 Jun 2002 09:32:51 -0700 (PDT) (envelope-from frank@exit.com) Received: from realtime.exit.com (localhost [127.0.0.1]) by realtime.exit.com (8.12.3/8.12.2) with ESMTP id g57GWng0099537; Fri, 7 Jun 2002 09:32:49 -0700 (PDT) (envelope-from frank@realtime.exit.com) Received: (from frank@localhost) by realtime.exit.com (8.12.3/8.12.3/Submit) id g57GWlFU099531; Fri, 7 Jun 2002 09:32:47 -0700 (PDT) From: Frank Mayhar Message-Id: <200206071632.g57GWlFU099531@realtime.exit.com> Subject: Re: Numerous hard hangs on TWO different ASUS P4T-E w/P4 1.6G In-Reply-To: <20020607180239.C5061@cscoms.net> To: stable@freebsd.org Date: Fri, 7 Jun 2002 09:32:47 -0700 (PDT) Cc: hackers@freebsd.org Reply-To: frank@exit.com X-Copyright0: Copyright 2002 Frank Mayhar. All Rights Reserved. X-Copyright1: Permission granted for electronic reproduction as Usenet News or email only. X-Mailer: ELM [version 2.4ME+ PL98b (25)] MIME-Version: 1.0 Content-Type: multipart/mixed; boundary=ELM1023467567-97776-0_ Content-Transfer-Encoding: 7bit Sender: owner-freebsd-hackers@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.ORG --ELM1023467567-97776-0_ Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset=US-ASCII I'm experiencing hangs as well. At first I thought it was the fxp0/sym driver thing, but I've since changed hardware almost completely and the hangs persis. I'm now strongly suspecting some kind of interrupt problem. For the record, I've attached my dmesg output. This is a dual AMD MP 1900+ (1.6 GHz) Tyan 2466N-4M system. 3Com xl0 ethernet, Adaptec 39160 and 3940 SCSI, Creative Soundblaster Live! audio, Radeon 8500 128MB video (XFree86 4.2). 2GB DDR memory. I see very common short-term hangs, a few seconds to less than a minute. The mouse and keyboard stop responding, X stops updating and everything just pauses, the whole system (including the network). It then starts back up, often dropping keyboard or mouse data. Once in a while (not sure how often, but at least every couple of days) it hangs solid and never comes back. Totally unresponsive. This invariably requires a hard reset. This is a busy system. Near-constant load on the network (some 40-100KB/s), lots of disk accesses. Seems worse when network and/or SCSI load is high. Dmesg output follows. If there's anything I can do to help diagnose this problem, please let me know... -- Frank Mayhar frank@exit.com http://www.exit.com/ Exit Consulting http://www.gpsclock.com/ --ELM1023467567-97776-0_ Content-Transfer-Encoding: 7bit Content-Type: text/plain Content-Disposition: attachment; filename=dmesg.boot Content-Description: Copyright (c) 1992-2002 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 4.6-RC #21: Thu May 23 15:24:24 PDT 2002 frank@realtime.exit.com:/usr/src/sys/compile/REALTIME Timecounter "i8254" frequency 1193182 Hz CPU: AMD Athlon(tm) MP 1900+ (1600.07-MHz 686-class CPU) Origin = "AuthenticAMD" Id = 0x662 Stepping = 2 Features=0x383fbff AMD Features=0xc0480000<,AMIE,DSP,3DNow!> real memory = 2146959360 (2096640K bytes) config> q avail memory = 2086846464 (2037936K bytes) Programming 24 pins in IOAPIC #0 IOAPIC #0 intpin 2 -> irq 0 FreeBSD/SMP: Multiprocessor motherboard cpu0 (BSP): apic id: 1, version: 0x00040010, at 0xfee00000 cpu1 (AP): apic id: 0, version: 0x00040010, at 0xfee00000 io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000 Preloaded elf kernel "kernel" at 0xc042a000. Preloaded userconfig_script "/boot/kernel.conf" at 0xc042a09c. Preloaded elf module "umap.ko" at 0xc042a0ec. link_elf: symbol null_bypass undefined Pentium Pro MTRR support enabled Using $PIR table, 12 entries at 0xc00fdf00 apm0: on motherboard apm: found APM BIOS v1.2, connected at v1.2 npx0: on motherboard npx0: INT 16 interface pcib0: on motherboard pci0: on pcib0 pcib1: at device 1.0 on pci0 pci1: on pcib1 pci1: at 5.0 irq 11 isab0: at device 7.0 on pci0 isa0: on isab0 atapci0: port 0xf000-0xf00f at device 7.1 on pci0 ata0: at 0x1f0 irq 14 on atapci0 ata1: at 0x170 irq 15 on atapci0 chip1: at device 7.3 on pci0 ahc0: port 0x1000-0x10ff mem 0xc0000000-0xc0000fff irq 5 at device 9.0 on pci0 aic7899: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs ahc1: port 0x1400-0x14ff mem 0xc0001000-0xc0001fff irq 11 at device 9.1 on pci0 aic7899: Ultra160 Wide Channel B, SCSI Id=7, 32/253 SCBs pcib2: at device 16.0 on pci0 pci2: on pcib2 ohci0: mem 0xc0200000-0xc0200fff irq 10 at device 0.0 on pci2 usb0: OHCI version 1.0, legacy support usb0: SMM does not respond, resetting usb0: on ohci0 usb0: USB revision 1.0 uhub0: (unknown) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 4 ports with 4 removable, self powered pcib3: at device 6.0 on pci2 pci3: on pcib3 ahc2: port 0x4000-0x40ff mem 0xc0300000-0xc0300fff irq 11 at device 4.0 on pci3 aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs ahc3: port 0x4400-0x44ff mem 0xc0301000-0xc0301fff irq 10 at device 5.0 on pci3 aic7880: Ultra Wide Channel B, SCSI Id=7, 16/253 SCBs pcm0: port 0x3080-0x309f irq 10 at device 7.0 on pci2 xl0: <3Com 3c905C-TX Fast Etherlink XL> port 0x3000-0x307f mem 0xc0201000-0xc020107f irq 10 at device 8.0 on pci2 xl0: Ethernet address: 00:e0:81:20:cc:de miibus0: on xl0 ukphy0: on miibus0 ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto orm0: