Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 28 Sep 2006 22:29:41 +0200
From:      Philippe Pegon <Philippe.Pegon@crc.u-strasbg.fr>
To:        Mike Jakubik <mikej@rogers.com>
Cc:        stable@freebsd.org
Subject:   Re: CALL FOR TESTERS! [Re: 6.2 SHOWSTOPPER - em completely unusable on 6.2]
Message-ID:  <451C30B5.6080901@crc.u-strasbg.fr>
In-Reply-To: <451C2995.5080303@rogers.com>
References:  <451A1375.5080202@gneto.com>	<20060927071538.GF22229@e-Gitt.NET><451A4189.5020906@samsco.org>	<20060927152824.GJ22229@e-Gitt.NET><20060927155553.GB14563@icarus.home.lan><20060927155904.GM22229@e-Gitt.NET>	<451AA7B1.5080202@samsco.org><20060927191402.GB932@turion.vk2pj.dyndns.org><20060927210349.GG14975@tnn.dglawrence.com>	<451AEB02.2090806@samsco.org>	<002201c6e290$45ece980$b3db87d4@multiplay.co.uk>	<451BD89F.8080203@samsco.org> <451C2995.5080303@rogers.com>

next in thread | previous in thread | raw e-mail | index | archive | help
Mike Jakubik wrote:
> Scott Long wrote:
>> All,
>>
>> Attached is my first cut at addressing the problems described in this 
>> thread.  As I discussed earlier, the VM syncer thread is likely starving
>> the USB interrupt thread.  This causes the shared usb+network 
>> interrupt to remain masked, preventing network interrupts from being 
>> delivered,
>> and thus triggering watchdog timeouts.
> 
> Just to be clear, has it been established that the problem only occurs 
> when em is sharing an interrupt? I have a lot of production machines 
> using the PDSMi board, which is one of the boards that the problem was 
> noticed on, however i do not share any irqs, i always disable USB in the 
> BIOS.

On many of our servers, we have bge cards and I can see a lot of 
watchdog timeouts. We always disable USB in the bios and they didn't 
share irq.

> 
> # vmstat -i
> interrupt                          total       rate
> irq16: em0                      13001181          7
> irq19: atapci0                  76559511         42
> cpu0: timer                   3643365617       1999
> cpu1: timer                   3643365610       1999
> Total                         7376291919       4048

example with our ftp server (ftp8.fr.freebsd.org), a HP DL360 G4 SMP :

# vmstat -i
interrupt                          total       rate
irq1: atkbd0                        1576          0
irq4: sio0                             3          0
irq6: fdc0                            12          0
irq14: ata0                           57          0
irq24: ciss1                    17181184          8
irq25: bge0                    841821262        402
irq26: bge1                    674342644        322
irq72: ciss0                    24194679         11
cpu0: timer                   4180478365       1999
cpu1: timer                   4180886439       1999
Total                         9918906221       4743

# bzgrep watchdog /var/log/messages*
/var/log/messages:Sep 23 02:47:06 anubis kernel: bge1: watchdog timeout 
-- resetting
/var/log/messages.0.bz2:Sep 12 22:22:48 anubis kernel: bge1: watchdog 
timeout -- resetting
/var/log/messages.0.bz2:Sep 17 15:22:01 anubis kernel: bge1: watchdog 
timeout -- resetting
/var/log/messages.0.bz2:Sep 20 12:13:07 anubis kernel: bge1: watchdog 
timeout -- resetting
/var/log/messages.1.bz2:Sep  6 08:33:54 anubis kernel: bge1: watchdog 
timeout -- resetting
/var/log/messages.3.bz2:Aug 29 12:09:36 anubis kernel: bge0: watchdog 
timeout -- resetting
/var/log/messages.4.bz2:Aug 22 15:44:00 anubis kernel: bge0: watchdog 
timeout -- resetting

# pciconf -lv
hostb0@pci0:0:0:        class=0x060000 card=0x32000e11 chip=0x35908086 
rev=0x0a hdr=0x00
     vendor   = 'Intel Corporation'
     device   = 'E752x Server Memory Controller Hub'
     class    = bridge
     subclass = HOST-PCI
pcib1@pci0:2:0: class=0x060400 card=0x00000050 chip=0x35958086 rev=0x0a 
hdr=0x01
     vendor   = 'Intel Corporation'
     device   = 'E752x Memory Controller Hub PCI Express Port A0'
     class    = bridge
     subclass = PCI-PCI
pcib2@pci0:4:0: class=0x060400 card=0x00000050 chip=0x35978086 rev=0x0a 
hdr=0x01
     vendor   = 'Intel Corporation'
     device   = 'E752x Memory Controller Hub PCI Express Port B0'
     class    = bridge
     subclass = PCI-PCI
pcib5@pci0:6:0: class=0x060400 card=0x00000050 chip=0x35998086 rev=0x0a 
hdr=0x01
     vendor   = 'Intel Corporation'
     device   = 'E752x Memory Controller Hub PCI Express Port C0'
     class    = bridge
     subclass = PCI-PCI
pcib6@pci0:28:0:        class=0x060400 card=0x00000050 chip=0x25ae8086 
rev=0x02 hdr=0x01
     vendor   = 'Intel Corporation'
     device   = '6300ESB Hub Interface to PCI-X Bridge'
     class    = bridge
     subclass = PCI-PCI
pcib7@pci0:30:0:        class=0x060400 card=0x00000000 chip=0x244e8086 
rev=0x0a hdr=0x01
     vendor   = 'Intel Corporation'
     device   = '82801BA/CA/DB/DBL/EB/ER/FB (ICH2/3/4/4/5/5/6), 6300ESB 
Hub Interface to PCI Bridge'
     class    = bridge
     subclass = PCI-PCI
isab0@pci0:31:0:        class=0x060100 card=0x00000000 chip=0x25a18086 
rev=0x02 hdr=0x00
     vendor   = 'Intel Corporation'
     device   = '6300ESB LPC Interface Bridge'
     class    = bridge
     subclass = PCI-ISA
atapci0@pci0:31:1:      class=0x01018a card=0x32010e11 chip=0x25a28086 
rev=0x02 hdr=0x00
     vendor   = 'Intel Corporation'
     device   = '6300ESB IDE Controller'
     class    = mass storage
     subclass = ATA
pcib3@pci6:0:0: class=0x060400 card=0x00000044 chip=0x03298086 rev=0x09 
hdr=0x01
     vendor   = 'Intel Corporation'
     device   = '6700PXH PCI Express-to-PCI Express Bridge A'
     class    = bridge
     subclass = PCI-PCI
pcib4@pci6:0:2: class=0x060400 card=0x00000044 chip=0x032a8086 rev=0x09 
hdr=0x01
     vendor   = 'Intel Corporation'
     device   = '6700PXH PCI Express-to-PCI Express Bridge B'
     class    = bridge
     subclass = PCI-PCI
ciss0@pci10:1:0:        class=0x010400 card=0x409b0e11 chip=0x00460e11 
rev=0x01 hdr=0x00
     vendor   = 'Compaq Computer Corp (Now owned by Hewlett-Packard)'
     device   = 'Smart Array 64xx Controller'
     class    = mass storage
     subclass = RAID
ciss1@pci2:1:0: class=0x010400 card=0x40910e11 chip=0x00460e11 rev=0x01 
hdr=0x00
     vendor   = 'Compaq Computer Corp (Now owned by Hewlett-Packard)'
     device   = 'Smart Array 64xx Controller'
     class    = mass storage
     subclass = RAID
bge0@pci2:2:0:  class=0x020000 card=0x00d00e11 chip=0x164814e4 rev=0x10 
hdr=0x00
     vendor   = 'Broadcom Corporation'
     device   = 'BCM5704 NetXtreme Dual Gigabit Adapter'
     class    = network
     subclass = ethernet
bge1@pci2:2:1:  class=0x020000 card=0x00d00e11 chip=0x164814e4 rev=0x10 
hdr=0x00
     vendor   = 'Broadcom Corporation'
     device   = 'BCM5704 NetXtreme Dual Gigabit Adapter'
     class    = network
     subclass = ethernet
none0@pci1:3:0: class=0x030000 card=0x001e0e11 chip=0x47521002 rev=0x27 
hdr=0x00
     vendor   = 'ATI Technologies Inc'
     device   = 'Rage XL PCI'
     class    = display
     subclass = VGA
none1@pci1:4:0: class=0x088000 card=0xb2060e11 chip=0xb2030e11 rev=0x01 
hdr=0x00
     vendor   = 'Compaq Computer Corp (Now owned by Hewlett-Packard)'
     device   = 'iLo Integrated Lights Out Processor'
     class    = base peripheral
none2@pci1:4:2: class=0x088000 card=0xb2060e11 chip=0xb2040e11 rev=0x01 
hdr=0x00
     vendor   = 'Compaq Computer Corp (Now owned by Hewlett-Packard)'
     device   = 'iLo Integrated Lights Out Processor'
     class    = base peripheral

--
Philippe Pegon



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?451C30B5.6080901>