From owner-freebsd-amd64@FreeBSD.ORG Thu Feb 16 21:07:12 2006 Return-Path: X-Original-To: amd64@FreeBSD.org Delivered-To: freebsd-amd64@FreeBSD.ORG Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id B149C16A422 for ; Thu, 16 Feb 2006 21:07:12 +0000 (GMT) (envelope-from eol1@yahoo.com) Received: from web51904.mail.yahoo.com (web51904.mail.yahoo.com [206.190.48.67]) by mx1.FreeBSD.org (Postfix) with SMTP id 747F743D5A for ; Thu, 16 Feb 2006 21:07:10 +0000 (GMT) (envelope-from eol1@yahoo.com) Received: (qmail 85960 invoked by uid 60001); 16 Feb 2006 21:07:07 -0000 DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=Message-ID:Received:Date:From:Reply-To:Subject:To:MIME-Version:Content-Type:Content-Transfer-Encoding; b=XxAAdBXhPcjSZhOQBTUnYsoPm1t7U+igGZdgJdZLAMkgONx3GNp/Qs9ptY6S8pjJxCZuU29fy9M1mNP9KfqgYeM8h59FF4n00zcEK+W2G40CZTJBTx+vSRmUTZUQ6lmlT75A35cCM+5TeOJ45c1WEM44zbunZQ781KwN8XWDFWk= ; Message-ID: <20060216210707.85958.qmail@web51904.mail.yahoo.com> Received: from [62.68.76.227] by web51904.mail.yahoo.com via HTTP; Thu, 16 Feb 2006 13:07:07 PST Date: Thu, 16 Feb 2006 13:07:07 -0800 (PST) From: Peter Thoenen To: amd64@FreeBSD.org MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit Cc: Subject: Unexplained panic? on Sun X2100 - help / pointers? X-BeenThere: freebsd-amd64@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: eol1@yahoo.com List-Id: Porting FreeBSD to the AMD64 platform List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 16 Feb 2006 21:07:13 -0000 Purchased an Sun X2100 a couple weeks ago and been experiencing unexplained panics?* Narrowed it down to what I think is the bge driver but unsure how to confirm / prove this. Help would be appreciated as I would like to get this fixed (mainly if its bad hardware, get replacement parts, if bad software, hope that FBSD 6.1 fixes). No point to owning a server if you can't use it :) * Its a remote colo so I don't actually see a panic screen nor does logging serial console (or syslog) report a panic. Assume panic as the box just arbitrarily dies after a couple hours. * Not seeing anything is /var/crashes though I have enough space on my dumpdir and swap. Logic on bge driver (though might be other network related): - Installed 6.0 w/ GENERIC kernel. Runs fine. (72 hour test, no crash) - Installed custom kernel. Runs fine. (72 hour test, no crash) - Installed dns, httpd, mail, ssh, couple other low bandwidth items. Runs fine (72 hour test, no crash) - Install tor, i2p, freenet. Run ANY ONE (or multiples) of these items and box dies after about 3 to 4 hours of max pf queue (450kbs per each of the 3 items). - Reboot repeat bandwidth intensive test, repeat box dies. - Do not run of those 3 bandwidth intensive item. Box up for 72 hours. - Run any one of the 3 items and once again box dies with 3 to 4 hours. - Rebuild GENERIC kernel (though maybe custom kernel issue) - Repeat run tests, same issue, box dies after 3 or 4 hours of constant 450kbs+ traffic. Might be the bge driver, might be pf altq releated, might be *other*. Lost where to proceed from here. No longer think its hardware related (as in bad hardware) as I can run it for 72+ hours on low bandwidth and no crash. Only crashes on a (albeit minor) load. Not sure if this is of any use but below is my DMESG: --------------------------------------------------------- Copyright (c) 1992-2005 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 6.0-RELEASE-p4 #1: Mon Feb 13 20:48:35 EST 2006 root@nan-elmoth:/usr/obj/usr/src/sys/CUSTOM ACPI APIC Table: Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: AMD Opteron(tm) Processor 148 (2211.34-MHz K8-class CPU) Origin = "AuthenticAMD" Id = 0x20f71 Stepping = 1 Features=0x78bfbff Features2=0x1 AMD Features=0xe2500800,LM,3DNow+,3DNow> real memory = 2146304000 (2046 MB) avail memory = 2063179776 (1967 MB) ioapic0 irqs 0-23 on motherboard acpi0: on motherboard acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi0: Power Button (fixed) acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR pci_link0: on acpi0 pci_link1: on acpi0 pci_link2: on acpi0 pci_link3: irq 5 on acpi0 pci_link4: on acpi0 pci_link5: irq 10 on acpi0 pci_link6: on acpi0 pci_link7: irq 3 on acpi0 pci_link8: irq 7 on acpi0 pci_link9: on acpi0 pci_link10: irq 5 on acpi0 pci_link11: irq 11 on acpi0 pci_link12: on acpi0 pci_link13: irq 11 on acpi0 pci_link14: irq 10 on acpi0 pci_link15: on acpi0 pci_link16: irq 0 on acpi0 pci_link17: irq 0 on acpi0 pci_link18: irq 0 on acpi0 pci_link19: irq 0 on acpi0 pci_link20: irq 16 on acpi0 pci_link21: irq 0 on acpi0 pci_link22: irq 0 on acpi0 pci_link23: irq 0 on acpi0 pci_link24: irq 0 on acpi0 pci_link25: irq 0 on acpi0 pci_link26: irq 0 on acpi0 pci_link27: irq 0 on acpi0 pci_link28: irq 0 on acpi0 pci_link29: irq 0 on acpi0 pci_link30: irq 0 on acpi0 pci_link31: irq 0 on acpi0 unknown: I/O range not supported Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x1008-0x100b on acpi0 cpu0: on acpi0 acpi_button0: on acpi0 pcib0: port 0xcf8-0xcff on acpi0 pci_link26: BIOS IRQ 5 for -2145826954.1.INTA is invalid pci_link21: BIOS IRQ 10 for -2145826954.2.INTA is invalid pci_link27: BIOS IRQ 11 for -2145826954.2.INTB is invalid pci_link23: BIOS IRQ 3 for -2145826954.10.INTA is invalid pci_link29: BIOS IRQ 11 for -2145826954.7.INTA is invalid pci_link30: BIOS IRQ 10 for -2145826954.8.INTA is invalid pci0: on pcib0 pci0: at device 0.0 (no driver attached) isab0: at device 1.0 on pci0 isa0: on isab0 pci0: at device 1.1 (no driver attached) ohci0: mem 0xfe02f000-0xfe02ffff irq 21 at device 2.0 on pci0 ohci0: [GIANT-LOCKED] usb0: OHCI version 1.0, legacy support usb0: SMM does not respond, resetting usb0: on ohci0 usb0: USB revision 1.0 uhub0: nVidia OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 8 ports with 8 removable, self powered ehci0: mem 0xfeb00000-0xfeb000ff irq 22 at device 2.1 on pci0 ehci0: [GIANT-LOCKED] usb1: EHCI version 1.0 usb1: companion controller, 4 ports each: usb0 usb1: on ehci0 usb1: USB revision 2.0 uhub1: nVidia EHCI root hub, class 9/0, rev 2.00/1.00, addr 1 uhub1: 8 ports with 8 removable, self powered atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xe800-0xe80f at device 6.0 on pci0 ata0: on atapci0 ata1: on atapci0 atapci1: port 0x9f0-0x9f7,0xbf0-0xbf3,0x970-0x977,0xb70-0xb73,0xd400-0xd40f mem 0xfe02c000-0xfe02cfff irq 23 at device 7.0 on pci0 ata2: on atapci1 ata3: on atapci1 atapci2: port 0x9e0-0x9e7,0xbe0-0xbe3,0x960-0x967,0xb60-0xb63,0xc000-0xc00f mem 0xfe02b000-0xfe02bfff irq 21 at device 8.0 on pci0 ata4: on atapci2 ata5: on atapci2 pcib1: at device 9.0 on pci0 pci_link16: BIOS IRQ 23 for 0.7.INTA is invalid pci_link19: BIOS IRQ 21 for 0.8.INTA is invalid pci_link17: BIOS IRQ 22 for 0.10.INTA is invalid pci1: on pcib1 pci1: at device 5.0 (no driver attached) nve0: port 0xbc00-0xbc07 mem 0xfe02a000-0xfe02afff irq 22 at device 10.0 on pci0 nve0: Ethernet address 00:e0:81:59:33:88 miibus0: on nve0 ukphy0: on miibus0 ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto nve0: Ethernet address: 00:e0:81:59:33:88 nve0: [GIANT-LOCKED] pcib2: at device 11.0 on pci0 pci2: on pcib2 pcib3: at device 12.0 on pci0 pci3: on pcib3 pcib4: at device 13.0 on pci0 pci4: on pcib4 bge0: mem 0xfdaf0000-0xfdafffff irq 19 at device 0.0 on pci4 miibus1: on bge0 brgphy0: on miibus1 brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto bge0: Ethernet address: 00:e0:81:59:33:89 pcib5: at device 14.0 on pci0 pci5: on pcib5 sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A, console orm0: at iomem 0xc0000-0xc7fff,0xc8000-0xcbfff,0xce000-0xcf7ff on isa0 atkbdc0: at port 0x60,0x64 on isa0 atkbd0: flags 0x1 irq 1 on atkbdc0 device_attach: atkbd0 attach returned 6 sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x100> sio1: configured irq 3 not in bitmap of probed irqs 0 sio1: port may not be enabled vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 ukbd0: GM-TEK USB Composite Device, rev 1.01/0.01, addr 2, iclass 3/1 kbd0 at ukbd0 uhid0: GM-TEK USB Composite Device, rev 1.01/0.01, addr 2, iclass 3/1 Timecounter "TSC" frequency 2211343601 Hz quality 800 Timecounters tick every 1.000 msec Fast IPsec: Initialized Security Association Processing. acd0: DVDROM at ata0-master UDMA66 ad4: 238475MB at ata2-master SATA150 Trying to mount root from ufs:/dev/ad4s1a bge0: link state changed to UP