Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 9 Apr 2003 14:57:36 -0700
From:      "Aleksandr Melentiev" <tzap@pacbell.net>
To:        "John Baldwin" <jhb@FreeBSD.org>, <freebsd-smp@freebsd.org>
Subject:   Re: System freezes with SMP support enabled
Message-ID:  <001b01c2fee3$16e1e710$0300a8c0@kronos>
References:  <XFMail.20030409120027.jhb@FreeBSD.org>

next in thread | previous in thread | raw e-mail | index | archive | help
This is a multi-part message in MIME format.

------=_NextPart_000_0018_01C2FEA8.5B9C5C90
Content-Type: text/plain;
	charset="iso-8859-1"
Content-Transfer-Encoding: 7bit

Here's an exact behavior when SMP support is enabled:

Hence, one network card is onboard which is an fxp0.

5.0-RELEASE:
Often spontaneous and constant 'watchdog timeouts' with different network
cards that I have tried including SMC 1255 and 3Com 3C905-TX. An Intel
Pro/100+ showed 'device timeout'. In all cases, system would stop
responding. In most cases network connection would dissapear. Sometimes it
would happen again on the first second after reboot. Onboard network card
didnt show any errors! Only the PCI cards did.

4.8-RELEASE:
Same behavior as above, only no error messages whatsoever, network
connection goes down and system locks up. It is not spontaneous, happens
only when I try to transfer >1MB files locally via ftp (tried several
different ftpd and clients too), maybe other ways of high-speed transfering
are affected too. However, if I throttle my ftp client's upload speed to
15KBytes/sec, it transfers without a problem and systems stays stable.

None of the above happens when SMP support is disabled.
I am attaching dmesg where SMP support is enabled. Might it be because of
the PCI bridge?

Regards,
Alex



----- Original Message -----
From: "John Baldwin" <jhb@FreeBSD.org>
To: "Terry Lambert" <tlambert2@mindspring.com>
Cc: <freebsd-smp@freebsd.org>; "Aleksandr Melentiev" <tzap@pacbell.net>
Sent: Wednesday, April 09, 2003 9:00 AM
Subject: Re: System freezes with SMP support enabled


>
> On 09-Apr-2003 Terry Lambert wrote:
> > What are both network cards?  Are they fxp?
> >
> > There are a couple of possibilities to consider...
> >
> > The first is that if both cards are identical (same vendor, etc.),
> > you might want to use a network card from a different vendor, to
> > make sure it's not the network card driver.
>
> This sounds like a sensible possibility and a worthy test.
>
> > Second, it seems to me that there's a possibility for a deadlock
> > if an interrupt comes in on one CPU, and an ithread to handle it
> > is scheduled to run on a different CPU.  You may want to try using
> > SCHED_4BSD to see if that changes anything.
>
> Huh?  Where in the code do you see this happening exactly?  All the
> bits you should need to look at for this are in ithread_schedule()
> and ithread_loop() in sys/kern/kern_intr.c.  Not only that, but 4.8
> doesn't have ithreads so I doubt seriously that this is causing the
> lockups on 4.x.
>
> --
>
> John Baldwin <jhb@FreeBSD.org>  <><  http://www.FreeBSD.org/~jhb/
> "Power Users Use the Power to Serve!"  -  http://www.FreeBSD.org/

------=_NextPart_000_0018_01C2FEA8.5B9C5C90
Content-Type: application/octet-stream;
	name="dmesg"
Content-Transfer-Encoding: quoted-printable
Content-Disposition: attachment;
	filename="dmesg"

Apr  8 23:40:12 kronos /kernel: Copyright (c) 1992-2003 The FreeBSD =
Project.
Apr  8 23:40:12 kronos /kernel: Copyright (c) 1979, 1980, 1983, 1986, =
1988, 1989, 1991, 1992, 1993, 1994
Apr  8 23:40:12 kronos /kernel: The Regents of the University of =
California. All rights reserved.
Apr  8 23:40:12 kronos /kernel: FreeBSD 4.8-RELEASE #4: Tue Apr  8 =
22:14:12 PDT 2003
Apr  8 23:40:12 kronos /kernel: =
alex@kronos.homeunix.org:/usr/obj/usr/src/sys/KRONOS
Apr  8 23:40:12 kronos /kernel: Timecounter "i8254"  frequency 1193182 =
Hz
Apr  8 23:40:12 kronos /kernel: CPU: Pentium III/Pentium III =
Xeon/Celeron (498.75-MHz 686-class CPU)
Apr  8 23:40:12 kronos /kernel: Origin =3D "GenuineIntel"  Id =3D 0x672  =
Stepping =3D 2
Apr  8 23:40:12 kronos /kernel: =
Features=3D0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE=
,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
Apr  8 23:40:12 kronos /kernel: real memory  =3D 536805376 (524224K =
bytes)
Apr  8 23:40:12 kronos /kernel: avail memory =3D 518299648 (506152K =
bytes)
Apr  8 23:40:12 kronos /kernel: Programming 24 pins in IOAPIC #0
Apr  8 23:40:12 kronos /kernel: IOAPIC #0 intpin 2 -> irq 0
Apr  8 23:40:12 kronos /kernel: FreeBSD/SMP: Multiprocessor motherboard
Apr  8 23:40:12 kronos /kernel: cpu0 (BSP): apic id:  1, version: =
0x00040011, at 0xfee00000
Apr  8 23:40:12 kronos /kernel: cpu1 (AP):  apic id:  0, version: =
0x00040011, at 0xfee00000
Apr  8 23:40:12 kronos /kernel: io0 (APIC): apic id:  2, version: =
0x00170011, at 0xfec00000
Apr  8 23:40:12 kronos /kernel: Preloaded elf kernel "kernel" at =
0xc03e4000.
Apr  8 23:40:12 kronos /kernel: VESA: v2.0, 2048k memory, flags:0x0, =
mode table:0xc036a882 (1000022)
Apr  8 23:40:12 kronos /kernel: VESA: Cirrus Logic GD-5480 VGA
Apr  8 23:40:12 kronos /kernel: Pentium Pro MTRR support enabled
Apr  8 23:40:12 kronos /kernel: md0: Malloc disk
Apr  8 23:40:12 kronos /kernel: Using $PIR table, 8 entries at =
0xc00fdf40
Apr  8 23:40:12 kronos /kernel: npx0: <math processor> on motherboard
Apr  8 23:40:12 kronos /kernel: npx0: INT 16 interface
Apr  8 23:40:12 kronos /kernel: pcib0: <Intel 82443BX host to PCI bridge =
(AGP disabled)> on motherboard
Apr  8 23:40:12 kronos /kernel: pci0: <PCI bus> on pcib0
Apr  8 23:40:12 kronos /kernel: sym0: <875> port 0x1400-0x14ff mem =
0xfa200000-0xfa200fff,0xfa204000-0xfa2040ff irq 11 at device 13.0 on =
pci0
Apr  8 23:40:12 kronos /kernel: sym0: No NVRAM, ID 7, Fast-20, SE, =
parity checking
Apr  8 23:40:12 kronos /kernel: sym1: <875> port 0x1800-0x18ff mem =
0xfa201000-0xfa201fff,0xfa204400-0xfa2044ff irq 10 at device 13.1 on =
pci0
Apr  8 23:40:12 kronos /kernel: sym1: No NVRAM, ID 7, Fast-20, SE, =
parity checking
Apr  8 23:40:12 kronos /kernel: fxp0: <Intel Pro 10/100B/100+ Ethernet> =
port 0x1060-0x107f mem 0xfa000000-0xfa0fffff,0xfa205000-0xfa205fff irq 5 =
at device 15.0 on pci0
Apr  8 23:40:12 kronos /kernel: fxp0: Ethernet address 00:90:27:73:5c:04
Apr  8 23:40:12 kronos /kernel: inphy0: <i82555 10/100 media interface> =
on miibus0
Apr  8 23:40:12 kronos /kernel: inphy0:  10baseT, 10baseT-FDX, =
100baseTX, 100baseTX-FDX, auto
Apr  8 23:40:12 kronos /kernel: fxp1: <Intel Pro 10/100B/100+ Ethernet> =
port 0x1080-0x10bf mem 0xfa100000-0xfa1fffff,0xfa202000-0xfa202fff irq =
11 at device 16.0 on pci0
Apr  8 23:40:12 kronos /kernel: fxp1: Ethernet address 00:d0:b7:53:ed:3c
Apr  8 23:40:12 kronos /kernel: inphy1: <i82555 10/100 media interface> =
on miibus1
Apr  8 23:40:12 kronos /kernel: inphy1:  10baseT, 10baseT-FDX, =
100baseTX, 100baseTX-FDX, auto
Apr  8 23:40:12 kronos /kernel: isab0: <Intel 82371AB PCI to ISA bridge> =
at device 18.0 on pci0
Apr  8 23:40:12 kronos /kernel: isa0: <ISA bus> on isab0
Apr  8 23:40:12 kronos /kernel: atapci0: <Intel PIIX4 ATA33 controller> =
port 0x1050-0x105f at device 18.1 on pci0
Apr  8 23:40:12 kronos /kernel: ata0: at 0x1f0 irq 14 on atapci0
Apr  8 23:40:12 kronos /kernel: ata1: at 0x170 irq 15 on atapci0
Apr  8 23:40:12 kronos /kernel: uhci0: <Intel 82371AB/EB (PIIX4) USB =
controller> port 0x10c0-0x10df irq 10 at device 18.2 on pci0
Apr  8 23:40:12 kronos /kernel: usb0: <Intel 82371AB/EB (PIIX4) USB =
controller> on uhci0
Apr  8 23:40:12 kronos /kernel: usb0: USB revision 1.0
Apr  8 23:40:12 kronos /kernel: uhub0: Intel UHCI root hub, class 9/0, =
rev 1.00/1.00, addr 1
Apr  8 23:40:12 kronos /kernel: uhub0: 2 ports with 2 removable, self =
powered
Apr  8 23:40:13 kronos /kernel: Timecounter "PIIX"  frequency 3579545 Hz
Apr  8 23:40:13 kronos /kernel: chip1: <Intel 82371AB Power management =
controller> port 0x1040-0x104f at device 18.3 on pci0
Apr  8 23:40:13 kronos /kernel: pci0: <Cirrus Logic GD5480 SVGA =
controller> at 20.0
Apr  8 23:40:13 kronos /kernel: orm0: <Option ROMs> at iomem =
0xc0000-0xc7fff,0xc8000-0xcbfff,0xcc000-0xcc7ff,0xcc800-0xcdfff on isa0
Apr  8 23:40:13 kronos /kernel: fdc0: <NEC 72065B or clone> at port =
0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
Apr  8 23:40:13 kronos /kernel: fdc0: FIFO enabled, 8 bytes threshold
Apr  8 23:40:13 kronos /kernel: fd0: <1440-KB 3.5" drive> on fdc0 drive =
0
Apr  8 23:40:13 kronos /kernel: atkbdc0: <Keyboard controller (i8042)> =
at port 0x60,0x64 on isa0
Apr  8 23:40:13 kronos /kernel: atkbd0: <AT Keyboard> flags 0x1 irq 1 on =
atkbdc0
Apr  8 23:40:13 kronos /kernel: kbd0 at atkbd0
Apr  8 23:40:13 kronos /kernel: vga0: <Generic ISA VGA> at port =
0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Apr  8 23:40:13 kronos /kernel: sc0: <System console> at flags 0x100 on =
isa0
Apr  8 23:40:13 kronos /kernel: sc0: VGA <16 virtual consoles, =
flags=3D0x300>
Apr  8 23:40:13 kronos /kernel: sio0 at port 0x3f8-0x3ff irq 4 flags =
0x10 on isa0
Apr  8 23:40:13 kronos /kernel: sio0: type 16550A
Apr  8 23:40:13 kronos /kernel: sio1: configured irq 3 not in bitmap of =
probed irqs 0
Apr  8 23:40:13 kronos /kernel: ppc0: <Parallel port> at port =
0x378-0x37f irq 7 on isa0
Apr  8 23:40:13 kronos /kernel: ppc0: Generic chipset (ECP/PS2/NIBBLE) =
in COMPATIBLE mode
Apr  8 23:40:13 kronos /kernel: ppc0: FIFO with 16/16/8 bytes threshold
Apr  8 23:40:13 kronos /kernel: plip0: <PLIP network interface> on =
ppbus0
Apr  8 23:40:13 kronos /kernel: lpt0: <Printer> on ppbus0
Apr  8 23:40:13 kronos /kernel: lpt0: Interrupt-driven port
Apr  8 23:40:13 kronos /kernel: ppi0: <Parallel I/O> on ppbus0
Apr  8 23:40:13 kronos /kernel: APIC_IO: Testing 8254 interrupt delivery
Apr  8 23:40:13 kronos /kernel: APIC_IO: routing 8254 via IOAPIC #0 =
intpin 2
Apr  8 23:40:13 kronos /kernel: SMP: AP CPU #1 Launched!
Apr  8 23:40:13 kronos /kernel: acd0: CDROM <TOSHIBA CD-ROM XM-6502B> at =
ata0-master PIO4
Apr  8 23:40:13 kronos /kernel: Waiting 15 seconds for SCSI devices to =
settle
Apr  8 23:40:13 kronos /kernel: Mounting root from ufs:/dev/da0s1a
Apr  8 23:40:13 kronos /kernel: da0 at sym0 bus 0 target 0 lun 0
Apr  8 23:40:13 kronos /kernel: da0: <QUANTUM ATLAS IV 9 WLS 0808> Fixed =
Direct Access SCSI-3 device=20
Apr  8 23:40:13 kronos /kernel: da0: 40.000MB/s transfers (20.000MHz, =
offset 16, 16bit), Tagged Queueing Enabled
Apr  8 23:40:13 kronos /kernel: da0: 8761MB (17942584 512 byte sectors: =
255H 63S/T 1116C)
Apr  8 23:40:13 kronos /kernel: da1 at sym0 bus 0 target 1 lun 0
Apr  8 23:40:13 kronos /kernel: da1: <QUANTUM ATLAS IV 9 WLS 0808> Fixed =
Direct Access SCSI-3 device=20
Apr  8 23:40:13 kronos /kernel: da1: 40.000MB/s transfers (20.000MHz, =
offset 16, 16bit), Tagged Queueing Enabled
Apr  8 23:40:13 kronos /kernel: da1: 8761MB (17942584 512 byte sectors: =
255H 63S/T 1116C)
------=_NextPart_000_0018_01C2FEA8.5B9C5C90--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?001b01c2fee3$16e1e710$0300a8c0>