Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 27 Sep 2006 09:15:39 +0200
From:      Oliver Brandmueller <ob@e-Gitt.NET>
To:        freebsd-stable@freebsd.org
Subject:   Re: 6.2 SHOWSTOPPER - em completely unusable on 6.2
Message-ID:  <20060927071538.GF22229@e-Gitt.NET>
In-Reply-To: <451A1375.5080202@gneto.com>
References:  <451A1375.5080202@gneto.com>

next in thread | previous in thread | raw e-mail | index | archive | help

[-- Attachment #1 --]
Hi,

On Wed, Sep 27, 2006 at 08:00:21AM +0200, Martin Nilsson wrote:
> I get tons of these:
> em0: watchdog timeout -- resetting
> em0: link state changed to DOWN
> em0: link state changed to UP
> 
> mailbox# pciconf -lv
> em0@pci13:0:0:  class=0x020000 card=0x108c15d9 chip=0x108c8086 rev=0x03 
> hdr=0x00
>     vendor   = 'Intel Corporation'
>     device   = 'PRO/1000 PM'
>     class    = network
>     subclass = ethernet
> em1@pci14:0:0:  class=0x020000 card=0x109a15d9 chip=0x109a8086 rev=0x00 
> hdr=0x00
>     vendor   = 'Intel Corporation'
>     class    = network
>     subclass = ethernet
> 
[...]
> I have only seen them on em0. Yesterday I tried sysutils/cpuburn on 
> similar boxes that are netbooted with NFS mounted drives and everytime I 
> loaded the two CPU cores the network went down.

I see the same.

Very much on this one, where I workaround the problem by using polling,
it's a UP machine.

FreeBSD nessie 6.2-PRERELEASE FreeBSD 6.2-PRERELEASE #3: Fri Sep 15 09:48:36 CEST 2006     root@nessie:/usr/obj/usr/src/sys/NESSIE  i386

em0@pci2:1:0:   class=0x020000 card=0x10198086 chip=0x10198086 rev=0x00 hdr=0x00
    vendor   = 'Intel Corporation'
    device   = '82547EI Gigabit Ethernet Controller (LOM)'
    class    = network
    subclass = ethernet

irq18: em0 uhci2                    3319          0


Another machine, also UP, but with two interfaces. The problem is not as 
apparent as on the first machine, but it's there. This machine is not as 
loaded usually (CPU wise) as the first machine. The problem is ONLY on 
em1:

FreeBSD hudson 6.2-PRERELEASE FreeBSD 6.2-PRERELEASE #48: Thu Sep 14 10:19:46 CEST 2006     root@hudson:/usr/obj/usr/src/sys/NFS-32-FBSD6  i386

em0@pci1:1:0:   class=0x020000 card=0x10758086 chip=0x10758086 rev=0x00 hdr=0x00
    vendor   = 'Intel Corporation'
    device   = '82547EI Gigabit Ethernet Controller'
    class    = network
    subclass = ethernet

em1@pci3:2:0:   class=0x020000 card=0x10768086 chip=0x10768086 rev=0x00 hdr=0x00
    vendor   = 'Intel Corporation'
    device   = '82547EI Gigabit Ethernet Controller'
    class    = network
    subclass = ethernet

irq17: em1 ichsmb0             950121879        855
irq18: em0                      71437344         64


The problem appeared after the em updates during the last weeks in the
kernel and has not been observed before this. em is always loaded as a 
module in my kernels. The problem seems to occur more often if the 
machine's CPU is busy.


I have several SMP machines with the following em interfaces, which 
DON'T show the problem, but they also have different chipset on the em 
interface. Most of the kernels were built between Sep 7 and Sep 19.

3 times this:
em0@pci4:5:0:   class=0x020000 card=0x34248086 chip=0x10108086 rev=0x01 hdr=0x00
em1@pci4:5:1:   class=0x020000 card=0x34248086 chip=0x10108086 rev=0x01 hdr=0x00
irq23: em0                     970303432        750



3 times this:
em0@pci4:5:0:   class=0x020000 card=0x34258086 chip=0x100e8086 rev=0x02 hdr=0x00
irq23: em0                     292477376        435


So I can observe at least 3 interesting differences:

- the interface showing the problems shares the interrupt
- for me it happens on UP machines only
- the chips are different

What I can't do: moving the interfaces between machines, these are 
                 onboard interfaces.

What I could do: I could try to unload the USB driver or the ichsmb 
driver on the machiens, where the interrupts are shared. Anyway, the USB 
is not used currently (I have it enabled to be prepared to hook up a USB 
Mass Storage device, which never happend since the problem occured). The 
ichsmb also is usually not queried.

Any suggestions on how I could help?

- Olli


-- 
| Oliver Brandmueller | Offenbacher Str. 1  | Germany       D-14197 Berlin |
| Fon +49-172-3130856 | Fax +49-172-3145027 | WWW:   http://the.addict.de/ |
|               Ich bin das Internet. Sowahr ich Gott helfe.               |
| Eine gewerbliche Nutzung aller enthaltenen Adressen ist nicht gestattet! |

[-- Attachment #2 --]
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (FreeBSD)

iD8DBQFFGiUaiqtMdzjafykRAle5AJ9OQMWWJMEffZNYLN+z/JrI8OCphQCgxVaH
jb9oTMzYrXEOBjvwenFkhtI=
=GKtS
-----END PGP SIGNATURE-----

Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20060927071538.GF22229>