Date: Mon, 7 Feb 2011 17:38:41 -0800
From: Pyun YongHyeon <pyunyh@gmail.com>
To: Peter Lai <cowbert@gmail.com>
Cc: freebsd-net@freebsd.org
Subject: Re: bge wedging 8.2-RC1
Message-ID: <20110208013841.GB1306@michelle.cdnetworks.com>
In-Reply-To: <AANLkTikQuaz1JSuFt=p49HTkqxBm7FaNeTmb9LXJU8Kg@mail.gmail.com>
References: <AANLkTikQuaz1JSuFt=p49HTkqxBm7FaNeTmb9LXJU8Kg@mail.gmail.com>

On Mon, Feb 07, 2011 at 06:09:16PM -0600, Peter Lai wrote:
> Hello
>
> I've got a new Dell Precision workstation here with a BCM5761 on intel
> mobo for westmere xeons that is wedging with interrupt storm and will
> lockup the system randomly. I have turned HTT and auto powermanagement
> off in bios (system cannot sleep), lowest cpu acpi state is C1.
>
> Here is dmesg:
> bge0: <Broadcom NetXtreme Gigabit Ethernet Controller, ASIC rev.
> 0x5761100> mem 0xf3be0000-0xf3beffff,0xf3bf0000-0xf3bfffff irq 17 at
> device 0.0 on pci6
> bge0: CHIP ID 0x05761100; ASIC REV 0x5761; CHIP REV 0x57611; PCI-E
> miibus0: <MII bus> on bge0
> brgphy0: <BCM5761 10/100/1000baseTX PHY> PHY 1 on miibus0
> brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT,
> 1000baseT-master, 1000baseT-FDX, 1000baseT-FDX-master, auto, auto-flow
>
> Here is pciconf -lv:
> bge0@pci0:6:0:0: class=0x020000 card=0x026d1028 chip=0x168114e4
> rev=0x10 hdr=0x00
> vendor = 'Broadcom Corporation'
> device = 'Broadcom 57XX Gigabit Integrated Controller (BCM5761)'
> class = network
> subclass = ethernet
>
> here is the setup in rc.conf:
>
> ifconfig_bge0="polling -tso -vlanhwtso -vlanhwtag -vlanmtu inet
> 192.168.123.124 netmask 255.255.255.0"
>
> I have the card plugged into a dlink DSS8 100mbps switch with one
> other 100mbps device on it (rich man's crossover cable).
>
> Before turning off TSO4 and VLAN tagging (because I don't use them),
> the card would do several things:
> 1. 1 out of 3 reboots: Fail to bring interface up. ifconfig would hang
> and systat/vmstat showed 800+ interrupts per second on IRQ256

This is strange. bge(4) does not use MSI if you build bge(4) with
DEVICE_POLLING, so seeing IRQ256 interrupts looks odd to me.
Are you sure bge(4) is using IRQ256?

> 2. After a few hours lock up the system, requiring hard reboot
>
> After disabling TSO4 and VLAN stuff:
> bge0: flags=8802<BROADCAST,SIMPLEX,MULTICAST> metric 0 mtu 1500
> options=80083<RXCSUM,TXCSUM,VLAN_HWCSUM,LINKSTATE>
> media: Ethernet autoselect (100baseTX
> <full-duplex,flowcontrol,rxpause,txpause>)
>
> Everything seemed fine for about two weeks and then suddenly started
> acting up again, locked up, after hard reboot, soft reboot, link will
> not come up and I see interrupt storm again....
>

If you don't use DEVICE_POLLING, rebuild bge(4) without DEVICE_POLLING.
For most cases, you don't need to enable polling on intelligent
controllers like bge(4).
I also have a BCM5761 PCIe controller, which shows no such issues.
I know there is an edge case (send BD corruption) for
BCM5761/BCM5784/BCM57780 which needs to be investigated. I'm not sure
you're seeing that edge case though.

> I am close to buying an intel card to replace the bcm, but then I
> noticed that the main intel desktop PCI-E card is 82574L-based and
> people are having em driver wedging on that too. So now I have broken
> ethernet on this box; my primary link is atheros 5212 pci card and I
> may be out of pci slots (or else I might try a pci intel card).
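
On the IRQ256 question raised in the reply: one way to confirm which interrupt line bge0 is really attached to is to read the per-source counters from `vmstat -i`. The following is a minimal sketch, assuming the usual FreeBSD `vmstat -i` layout of `irqN: device  total  rate`. The sample text, its numbers, and the helper name `irqs_for_device` are illustrative assumptions, not output from the machine in this thread.

```python
import re

# Hypothetical sample in the usual FreeBSD `vmstat -i` layout; the numbers
# are made up and are NOT from the machine discussed in this thread.
SAMPLE = """\
interrupt                          total       rate
irq17: bge0                       812345        801
irq19: ath0                        40211          3
cpu0: timer                     20483921       2000
Total                           21336477       2804
"""

# irq name, device name(s), total count, rate per second
LINE_RE = re.compile(r"^(irq\d+):\s+(.*?)\s+(\d+)\s+(\d+)\s*$")


def irqs_for_device(vmstat_text, device):
    """Return (irq, total, rate) tuples for lines that name `device`."""
    hits = []
    for line in vmstat_text.splitlines():
        m = LINE_RE.match(line)
        if m is None:
            continue  # header, cpu timer, and Total lines are skipped
        irq, names, total, rate = m.groups()
        # One interrupt line may be shared by several devices.
        if device in names.split():
            hits.append((irq, int(total), int(rate)))
    return hits


if __name__ == "__main__":
    for irq, total, rate in irqs_for_device(SAMPLE, "bge0"):
        print(f"{irq}: total={total} rate={rate}/s")
```

On a live system the text would come from running `vmstat -i` rather than from the sample. If bge0 shows up on a legacy line such as irq17 (as in the dmesg above) and not on irq256, the storm on IRQ256 belongs to some other device.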