Date: Wed, 9 Apr 2003 14:57:36 -0700
From: "Aleksandr Melentiev" <tzap@pacbell.net>
To: "John Baldwin" <jhb@FreeBSD.org>, <freebsd-smp@freebsd.org>
Subject: Re: System freezes with SMP support enabled
Message-ID: <001b01c2fee3$16e1e710$0300a8c0@kronos>
References: <XFMail.20030409120027.jhb@FreeBSD.org>

This is a multi-part message in MIME format.

------=_NextPart_000_0018_01C2FEA8.5B9C5C90
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: 7bit

Here is the exact behavior when SMP support is enabled. Note that one
network card is onboard, and it is fxp0.

5.0-RELEASE:
Frequent, spontaneous and persistent 'watchdog timeouts' with the different
network cards I have tried, including an SMC 1255 and a 3Com 3C905-TX. An
Intel Pro/100+ showed 'device timeout'. In all cases the system would stop
responding. In most cases the network connection would disappear. Sometimes
it would happen again within the first second after reboot. The onboard
network card didn't show any errors! Only the PCI cards did.

4.8-RELEASE:
Same behavior as above, only with no error messages whatsoever: the network
connection goes down and the system locks up. It is not spontaneous; it
happens only when I try to transfer files larger than 1 MB locally via ftp
(I tried several different ftpd servers and clients too). Other kinds of
high-speed transfer may be affected as well. However, if I throttle my ftp
client's upload speed to 15 KBytes/sec, the transfer completes without a
problem and the system stays stable.

None of the above happens when SMP support is disabled. I am attaching a
dmesg from a boot with SMP support enabled. Might it be because of the PCI
bridge?

Regards,
Alex

----- Original Message -----
From: "John Baldwin" <jhb@FreeBSD.org>
To: "Terry Lambert" <tlambert2@mindspring.com>
Cc: <freebsd-smp@freebsd.org>; "Aleksandr Melentiev" <tzap@pacbell.net>
Sent: Wednesday, April 09, 2003 9:00 AM
Subject: Re: System freezes with SMP support enabled

>
> On 09-Apr-2003 Terry Lambert wrote:
> > What are both network cards? Are they fxp?
> >
> > There are a couple of possibilities to consider...
> >
> > The first is that if both cards are identical (same vendor, etc.),
> > you might want to use a network card from a different vendor, to
> > make sure it's not the network card driver.
>
> This sounds like a sensible possibility and a worthy test.
>
> > Second, it seems to me that there's a possibility for a deadlock
> > if an interrupt comes in on one CPU, and an ithread to handle it
> > is scheduled to run on a different CPU. You may want to try using
> > SCHED_4BSD to see if that changes anything.
>
> Huh? Where in the code do you see this happening exactly? All the
> bits you should need to look at for this are in ithread_schedule()
> and ithread_loop() in sys/kern/kern_intr.c. Not only that, but 4.8
> doesn't have ithreads so I doubt seriously that this is causing the
> lockups on 4.x.
>
> --
>
> John Baldwin <jhb@FreeBSD.org> <>< http://www.FreeBSD.org/~jhb/
> "Power Users Use the Power to Serve!" - http://www.FreeBSD.org/

------=_NextPart_000_0018_01C2FEA8.5B9C5C90
Content-Type: application/octet-stream; name="dmesg"
Content-Disposition: attachment; filename="dmesg"

Apr 8 23:40:12 kronos /kernel: Copyright (c) 1992-2003 The FreeBSD Project.
Apr 8 23:40:12 kronos /kernel: Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
Apr 8 23:40:12 kronos /kernel: The Regents of the University of California. All rights reserved.
Apr 8 23:40:12 kronos /kernel: FreeBSD 4.8-RELEASE #4: Tue Apr 8 22:14:12 PDT 2003
Apr 8 23:40:12 kronos /kernel: alex@kronos.homeunix.org:/usr/obj/usr/src/sys/KRONOS
Apr 8 23:40:12 kronos /kernel: Timecounter "i8254" frequency 1193182 Hz
Apr 8 23:40:12 kronos /kernel: CPU: Pentium III/Pentium III Xeon/Celeron (498.75-MHz 686-class CPU)
Apr 8 23:40:12 kronos /kernel: Origin = "GenuineIntel" Id = 0x672 Stepping = 2
Apr 8 23:40:12 kronos /kernel: Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
Apr 8 23:40:12 kronos /kernel: real memory = 536805376 (524224K bytes)
Apr 8 23:40:12 kronos /kernel: avail memory = 518299648 (506152K bytes)
Apr 8 23:40:12 kronos /kernel: Programming 24 pins in IOAPIC #0
Apr 8 23:40:12 kronos /kernel: IOAPIC #0 intpin 2 -> irq 0
Apr 8 23:40:12 kronos /kernel: FreeBSD/SMP: Multiprocessor motherboard
Apr 8 23:40:12 kronos /kernel: cpu0 (BSP): apic id: 1, version: 0x00040011, at 0xfee00000
Apr 8 23:40:12 kronos /kernel: cpu1 (AP): apic id: 0, version: 0x00040011, at 0xfee00000
Apr 8 23:40:12 kronos /kernel: io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000
Apr 8 23:40:12 kronos /kernel: Preloaded elf kernel "kernel" at 0xc03e4000.
Apr 8 23:40:12 kronos /kernel: VESA: v2.0, 2048k memory, flags:0x0, mode table:0xc036a882 (1000022)
Apr 8 23:40:12 kronos /kernel: VESA: Cirrus Logic GD-5480 VGA
Apr 8 23:40:12 kronos /kernel: Pentium Pro MTRR support enabled
Apr 8 23:40:12 kronos /kernel: md0: Malloc disk
Apr 8 23:40:12 kronos /kernel: Using $PIR table, 8 entries at 0xc00fdf40
Apr 8 23:40:12 kronos /kernel: npx0: <math processor> on motherboard
Apr 8 23:40:12 kronos /kernel: npx0: INT 16 interface
Apr 8 23:40:12 kronos /kernel: pcib0: <Intel 82443BX host to PCI bridge (AGP disabled)> on motherboard
Apr 8 23:40:12 kronos /kernel: pci0: <PCI bus> on pcib0
Apr 8 23:40:12 kronos /kernel: sym0: <875> port 0x1400-0x14ff mem 0xfa200000-0xfa200fff,0xfa204000-0xfa2040ff irq 11 at device 13.0 on pci0
Apr 8 23:40:12 kronos /kernel: sym0: No NVRAM, ID 7, Fast-20, SE, parity checking
Apr 8 23:40:12 kronos /kernel: sym1: <875> port 0x1800-0x18ff mem 0xfa201000-0xfa201fff,0xfa204400-0xfa2044ff irq 10 at device 13.1 on pci0
Apr 8 23:40:12 kronos /kernel: sym1: No NVRAM, ID 7, Fast-20, SE, parity checking
Apr 8 23:40:12 kronos /kernel: fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0x1060-0x107f mem 0xfa000000-0xfa0fffff,0xfa205000-0xfa205fff irq 5 at device 15.0 on pci0
Apr 8 23:40:12 kronos /kernel: fxp0: Ethernet address 00:90:27:73:5c:04
Apr 8 23:40:12 kronos /kernel: inphy0: <i82555 10/100 media interface> on miibus0
Apr 8 23:40:12 kronos /kernel: inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
Apr 8 23:40:12 kronos /kernel: fxp1: <Intel Pro 10/100B/100+ Ethernet> port 0x1080-0x10bf mem 0xfa100000-0xfa1fffff,0xfa202000-0xfa202fff irq 11 at device 16.0 on pci0
Apr 8 23:40:12 kronos /kernel: fxp1: Ethernet address 00:d0:b7:53:ed:3c
Apr 8 23:40:12 kronos /kernel: inphy1: <i82555 10/100 media interface> on miibus1
Apr 8 23:40:12 kronos /kernel: inphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
Apr 8 23:40:12 kronos /kernel: isab0: <Intel 82371AB PCI to ISA bridge> at device 18.0 on pci0
Apr 8 23:40:12 kronos /kernel: isa0: <ISA bus> on isab0
Apr 8 23:40:12 kronos /kernel: atapci0: <Intel PIIX4 ATA33 controller> port 0x1050-0x105f at device 18.1 on pci0
Apr 8 23:40:12 kronos /kernel: ata0: at 0x1f0 irq 14 on atapci0
Apr 8 23:40:12 kronos /kernel: ata1: at 0x170 irq 15 on atapci0
Apr 8 23:40:12 kronos /kernel: uhci0: <Intel 82371AB/EB (PIIX4) USB controller> port 0x10c0-0x10df irq 10 at device 18.2 on pci0
Apr 8 23:40:12 kronos /kernel: usb0: <Intel 82371AB/EB (PIIX4) USB controller> on uhci0
Apr 8 23:40:12 kronos /kernel: usb0: USB revision 1.0
Apr 8 23:40:12 kronos /kernel: uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
Apr 8 23:40:12 kronos /kernel: uhub0: 2 ports with 2 removable, self powered
Apr 8 23:40:13 kronos /kernel: Timecounter "PIIX" frequency 3579545 Hz
Apr 8 23:40:13 kronos /kernel: chip1: <Intel 82371AB Power management controller> port 0x1040-0x104f at device 18.3 on pci0
Apr 8 23:40:13 kronos /kernel: pci0: <Cirrus Logic GD5480 SVGA controller> at 20.0
Apr 8 23:40:13 kronos /kernel: orm0: <Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xcbfff,0xcc000-0xcc7ff,0xcc800-0xcdfff on isa0
Apr 8 23:40:13 kronos /kernel: fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
Apr 8 23:40:13 kronos /kernel: fdc0: FIFO enabled, 8 bytes threshold
Apr 8 23:40:13 kronos /kernel: fd0: <1440-KB 3.5" drive> on fdc0 drive 0
Apr 8 23:40:13 kronos /kernel: atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
Apr 8 23:40:13 kronos /kernel: atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
Apr 8 23:40:13 kronos /kernel: kbd0 at atkbd0
Apr 8 23:40:13 kronos /kernel: vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Apr 8 23:40:13 kronos /kernel: sc0: <System console> at flags 0x100 on isa0
Apr 8 23:40:13 kronos /kernel: sc0: VGA <16 virtual consoles, flags=0x300>
Apr 8 23:40:13 kronos /kernel: sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
Apr 8 23:40:13 kronos /kernel: sio0: type 16550A
Apr 8 23:40:13 kronos /kernel: sio1: configured irq 3 not in bitmap of probed irqs 0
Apr 8 23:40:13 kronos /kernel: ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0
Apr 8 23:40:13 kronos /kernel: ppc0: Generic chipset (ECP/PS2/NIBBLE) in COMPATIBLE mode
Apr 8 23:40:13 kronos /kernel: ppc0: FIFO with 16/16/8 bytes threshold
Apr 8 23:40:13 kronos /kernel: plip0: <PLIP network interface> on ppbus0
Apr 8 23:40:13 kronos /kernel: lpt0: <Printer> on ppbus0
Apr 8 23:40:13 kronos /kernel: lpt0: Interrupt-driven port
Apr 8 23:40:13 kronos /kernel: ppi0: <Parallel I/O> on ppbus0
Apr 8 23:40:13 kronos /kernel: APIC_IO: Testing 8254 interrupt delivery
Apr 8 23:40:13 kronos /kernel: APIC_IO: routing 8254 via IOAPIC #0 intpin 2
Apr 8 23:40:13 kronos /kernel: SMP: AP CPU #1 Launched!
Apr 8 23:40:13 kronos /kernel: acd0: CDROM <TOSHIBA CD-ROM XM-6502B> at ata0-master PIO4
Apr 8 23:40:13 kronos /kernel: Waiting 15 seconds for SCSI devices to settle
Apr 8 23:40:13 kronos /kernel: Mounting root from ufs:/dev/da0s1a
Apr 8 23:40:13 kronos /kernel: da0 at sym0 bus 0 target 0 lun 0
Apr 8 23:40:13 kronos /kernel: da0: <QUANTUM ATLAS IV 9 WLS 0808> Fixed Direct Access SCSI-3 device
Apr 8 23:40:13 kronos /kernel: da0: 40.000MB/s transfers (20.000MHz, offset 16, 16bit), Tagged Queueing Enabled
Apr 8 23:40:13 kronos /kernel: da0: 8761MB (17942584 512 byte sectors: 255H 63S/T 1116C)
Apr 8 23:40:13 kronos /kernel: da1 at sym0 bus 0 target 1 lun 0
Apr 8 23:40:13 kronos /kernel: da1: <QUANTUM ATLAS IV 9 WLS 0808> Fixed Direct Access SCSI-3 device
Apr 8 23:40:13 kronos /kernel: da1: 40.000MB/s transfers (20.000MHz, offset 16, 16bit), Tagged Queueing Enabled
Apr 8 23:40:13 kronos /kernel: da1: 8761MB (17942584 512 byte sectors: 255H 63S/T 1116C)

------=_NextPart_000_0018_01C2FEA8.5B9C5C90--