Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 7 Feb 2011 17:38:41 -0800
From:      Pyun YongHyeon <pyunyh@gmail.com>
To:        Peter Lai <cowbert@gmail.com>
Cc:        freebsd-net@freebsd.org
Subject:   Re: bge wedging 8.2-RC1
Message-ID:  <20110208013841.GB1306@michelle.cdnetworks.com>
In-Reply-To: <AANLkTikQuaz1JSuFt=p49HTkqxBm7FaNeTmb9LXJU8Kg@mail.gmail.com>
References:  <AANLkTikQuaz1JSuFt=p49HTkqxBm7FaNeTmb9LXJU8Kg@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
On Mon, Feb 07, 2011 at 06:09:16PM -0600, Peter Lai wrote:
> Hello
> 
> I've got a new Dell Precision workstation here with a BCM5761 on intel
> mobo for westmere xeons that is wedging with interrupt storm and will
> lockup the system randomly. I have turned HTT and auto powermanagement
> off in bios (system cannot sleep), lowest cpu acpi state is C1.
> 
> Here is dmesg:
> bge0: <Broadcom NetXtreme Gigabit Ethernet Controller, ASIC rev.
> 0x5761100> mem 0xf3be0000-0xf3beffff,0xf3bf0000-0xf3bfffff irq 17 at
> device 0.0 on pci6
> bge0: CHIP ID 0x05761100; ASIC REV 0x5761; CHIP REV 0x57611; PCI-E
> miibus0: <MII bus> on bge0
> brgphy0: <BCM5761 10/100/1000baseTX PHY> PHY 1 on miibus0
> brgphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT,
> 1000baseT-master, 1000baseT-FDX, 1000baseT-FDX-master, auto, auto-flow
> 
> Here is pciconf -lv:
> bge0@pci0:6:0:0:	class=0x020000 card=0x026d1028 chip=0x168114e4
> rev=0x10 hdr=0x00
>     vendor     = 'Broadcom Corporation'
>     device     = 'Broadcom 57XX Gigabit Integrated Controller  (BCM5761)'
>     class      = network
>     subclass   = ethernet
> 
> here is the setup in rc.conf:
> 
> ifconfig_bge0="polling -tso -vlanhwtso -vlanhwtag -vlanmtu inet
> 192.168.123.124 netmask 255.255.255.0"
> 
> I have the card plugged into a dlink DSS8 100mbps switch with one
> other 100mbps device on it (rich man's crossover cable).
> 
> Before turning off TSO4 and VLAN tagging (because I don't use them),
> the card would do several things:
> 1. 1 out of 3 reboots: Fail to bring interface up. ifconfig would hang
> and systat/vmstat showed 800+ interrupts per second on IRQ256

This is strange. bge(4) does not use MSI if you build bge(4) with
DEVICE_POLLING so seeing IRQ256 interrupts looks odd to me.
Are you sure bge(4) is using IRQ256?

> 2. After a few hours lock up the system, requiring hard reboot
> 
> After disabling TSO4 and VLAN stuff:
> bge0: flags=8802<BROADCAST,SIMPLEX,MULTICAST> metric 0 mtu 1500
> 	options=80083<RXCSUM,TXCSUM,VLAN_HWCSUM,LINKSTATE>
> 	media: Ethernet autoselect (100baseTX
> <full-duplex,flowcontrol,rxpause,txpause>)
> 
> Everything seemed fine for about two weeks and then suddenly started
> acting up again, locked up, after hard reboot, soft reboot, link will
> not come up and I see interrupt storm again....
> 

If you don't use DEVICE_POLLING, rebuild bge(4) with
DEVICE_POLLING. For most cases, you don't need to enable polling on
intelligent controllers like bge(4).

I also have BCM5761 PCIe controller which shows no such issues. I
know there is an edge case(send BD corruption) for BCM5761/BCM5784/
BCM57780 which needs to be investigated. I'm not sure you're seeing
that edge case though.

> I am close to buying an intel card to replace the bcm, but then I
> noticed that the main intel desktop PCI-E card is 82574L-based and
> people are having em driver wedging on that too. So now I have broken
> ethernet on this box; my primary link is atheros 5212 pci card and I
> may be out of pci slots (or else I might try a pci intel card).



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20110208013841.GB1306>