From owner-freebsd-stable@FreeBSD.ORG Mon Oct 23 18:17:28 2006 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 11AD116A587 for ; Mon, 23 Oct 2006 18:17:28 +0000 (UTC) (envelope-from sbesnard@laposte.net) Received: from dominoext.ville-sevran.fr (dominoext.ville-sevran.fr [81.255.128.12]) by mx1.FreeBSD.org (Postfix) with ESMTP id 607E243D7B for ; Mon, 23 Oct 2006 18:16:47 +0000 (GMT) (envelope-from sbesnard@laposte.net) Message-ID: <453D070D.7000707@laposte.net> Date: Mon, 23 Oct 2006 20:16:45 +0200 From: Stephane Besnard User-Agent: Thunderbird 1.5.0.7 (Windows/20060909) MIME-Version: 1.0 To: freebsd-stable@freebsd.org Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Subject: em, bge, network problems survey. X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 23 Oct 2006 18:17:28 -0000 Hi, We do experience some trouble with em. 1. em0: watchdog timeout resetting Network hangs, server becomes unreachable, and after a few minutes, console doesn't respond anymore. We tried to type something at the console prompt and we got some "chinese" characters :) We had to reboot badly. twice. Then we switched to an xl network card. The problem occurred under load (heavy for our use) : nightly nfs backup that wasn't completed, samba home directory for 300 users, squid just 5 minutes after users began to work at the office. Really bad. 2. We are using a 6.1 release. We were using an old 5.2.1 before that. And we already had some network hang followed by server hang that's why we upgraded to 6.1. really bad idea. The problem occurs once a week now. 3. em cards are experiencing the problem. We bought a brand new one, and it stills occurs. So we are now using the integrated xl card. It works fine, but at 100Mbps. 4. It is an SMP machine. We are using the default kernel. 5. machine not hyperthreading capable 6.ioapic0 irqs 0-23 on motherboard 7. no code patch, default configuration, default kernel 8. 4BSD scheduler I can't do a lot of test with this server, as it is a production one. Hope this will help you to fix. See below, vmwtat, dmesg and pciconf Regards Stephane # vmstat -i interrupt total rate irq1: atkbd0 606 0 irq6: fdc0 3 0 irq14: ata0 47 0 irq19: xl0 19094529 540 irq20: em0 0 0 irq21: em1 twe0 654923 18 cpu0: timer 70651494 1999 cpu1: timer 70651376 1999 Total 161052978 4558 #dmesg Copyright (c) 1992-2006 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 6.1-RELEASE #0: Sun May 7 04:42:56 UTC 2006 root@opus.cse.buffalo.edu:/usr/obj/usr/src/sys/SMP Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: AMD Athlon(tm) MP 2600+ (2133.42-MHz 686-class CPU) Origin = "AuthenticAMD" Id = 0x681 Stepping = 1 Features=0x383fbff AMD Features=0xc0480800 real memory = 536346624 (511 MB) avail memory = 515244032 (491 MB) ACPI APIC Table: FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): APIC ID: 1 cpu1 (AP): APIC ID: 0 MADT: Forcing active-low polarity and level trigger for SCI ioapic0 irqs 0-23 on motherboard kbd1 at kbdmux0 acpi0: on motherboard acpi0: Power Button (fixed) acpi0: Sleep Button (fixed) Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x8008-0x800b on acpi0 cpu0: on acpi0 cpu1: on acpi0 acpi_button0: on acpi0 pcib0: port 0xcf8-0xcff,0x8000-0x807f,0x8080-0x80ff iomem 0xd8000-0xdbfff on acpi0 pci0: on pcib0 agp0: port 0x10a0-0x10a3 mem 0xd2000000-0xd3ffffff,0xd0d00000-0xd0d00fff at device 0.0 on pci0 pcib1: at device 1.0 on pci0 pci1: on pcib1 pci1: at device 5.0 (no driver attached) isab0: at device 7.0 on pci0 isa0: on isab0 atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xf000-0xf00f at device 7.1 on pci0 ata0: on atapci0 ata1: on atapci0 pci0: at device 7.3 (no driver attached) em0: port 0x1000-0x103f mem 0xd0880000-0xd089ffff,0xd0800000-0xd083ffff irq 20 at device 8.0 on pci0 em0: Ethernet address: 00:04:23:d5:40:80 em1: port 0x1040-0x107f mem 0xd08a0000-0xd08bffff,0xd0840000-0xd087ffff irq 21 at device 8.1 on pci0 em1: Ethernet address: 00:04:23:d5:40:81 twe0: <3ware Storage Controller. Driver version 1.50.01.002> port 0x1090-0x109f mem 0xd0000000-0xd07fffff irq 21 at device 9.0 on pci0 twe0: [GIANT-LOCKED] twe0: 8 ports, Firmware FE7X 1.05.00.056, BIOS BE7X 1.08.00.046 pcib2: at device 16.0 on pci0 pci2: on pcib2 xl0: <3Com 3c905C-TX Fast Etherlink XL> port 0x3000-0x307f mem 0xd0a01000-0xd0a0107f irq 19 at device 8.0 on pci2 miibus0: on xl0 ukphy0: on miibus0 ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto xl0: Ethernet address: 00:e0:81:21:81:33 atkbdc0: port 0x60,0x64 irq 1 on acpi0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] psm0: irq 12 on atkbdc0 psm0: [GIANT-LOCKED] psm0: model IntelliMouse, device ID 3 sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0 sio1: type 16550A ppc0: port 0x378-0x37f,0x778-0x77f irq 7 drq 3 on acpi0 ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode ppc0: FIFO with 16/16/9 bytes threshold ppbus0: on ppc0 plip0: on ppbus0 lpt0: on ppbus0 lpt0: Interrupt-driven port ppi0: on ppbus0 fdc0: port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0 fdc0: [FAST] pmtimer0 on isa0 orm0: at iomem 0xc0000-0xcffff,0xd0000-0xd07ff,0xd0800-0xd17ff,0xe0000-0xe3fff on isa0 sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 Timecounters tick every 1.000 msec acd0: CDROM at ata0-master UDMA33 twed0: on twe0 twed0: 39265MB (80416192 sectors) twed1: on twe0 twed1: 471192MB (965002240 sectors) SMP: AP CPU #1 Launched! # pciconf -lv agp0@pci0:0:0: class=0x060000 card=0x00000000 chip=0x700c1022 rev=0x11 hdr=0x00 vendor = 'Advanced Micro Devices (AMD)' device = 'AMD-762 CPU to PCI Bridge (SMP chipset)' class = bridge subclass = HOST-PCI pcib1@pci0:1:0: class=0x060400 card=0x00000000 chip=0x700d1022 rev=0x00 hdr=0x01 vendor = 'Advanced Micro Devices (AMD)' device = 'AMD-762 CPU to PCI Bridge (AGP 4x)' class = bridge subclass = PCI-PCI isab0@pci0:7:0: class=0x060100 card=0x00000000 chip=0x74401022 rev=0x05 hdr=0x00 vendor = 'Advanced Micro Devices (AMD)' device = 'AMD-768 PCI to ISA/LPC Bridge' class = bridge subclass = PCI-ISA atapci0@pci0:7:1: class=0x01018a card=0x74411022 chip=0x74411022 rev=0x04 hdr=0x00 vendor = 'Advanced Micro Devices (AMD)' device = 'AMD-768 EIDE Controller' class = mass storage subclass = ATA none0@pci0:7:3: class=0x068000 card=0x74431022 chip=0x74431022 rev=0x03 hdr=0x00 vendor = 'Advanced Micro Devices (AMD)' device = 'AMD-768 System Management' class = bridge em0@pci0:8:0: class=0x020000 card=0x11798086 chip=0x10798086 rev=0x03 hdr=0x00 vendor = 'Intel Corporation' device = '82546EB Dual Port Gigabit Ethernet Controller' class = network subclass = ethernet em1@pci0:8:1: class=0x020000 card=0x11798086 chip=0x10798086 rev=0x03 hdr=0x00 vendor = 'Intel Corporation' device = '82546EB Dual Port Gigabit Ethernet Controller' class = network subclass = ethernet twe0@pci0:9:0: class=0x010400 card=0x100113c1 chip=0x100113c1 rev=0x01 hdr=0x00 vendor = '3ware Inc.' device = '7000/8000 series ATA-133 Storage Controller' class = mass storage subclass = RAID pcib2@pci0:16:0: class=0x060400 card=0x00000000 chip=0x74481022 rev=0x05 hdr=0x01 vendor = 'Advanced Micro Devices (AMD)' device = 'AMD-768 PCI Bridge' class = bridge subclass = PCI-PCI none1@pci1:5:0: class=0x030000 card=0x03251039 chip=0x03251039 rev=0x00 hdr=0x00 vendor = 'Silicon Integrated Systems (SiS)' device = 'SiS325 2D/3D Accelerator' class = display subclass = VGA xl0@pci2:8:0: class=0x020000 card=0x246610f1 chip=0x920010b7 rev=0x78 hdr=0x00 vendor = '3COM Corp, Networking Division' device = '3C905C-TX Fast EtherLink for PC Management NIC' class = network subclass = ethernet