Date: Mon, 29 Dec 2003 11:58:25 -0500
From: "Alan Lew" <alan@canweb.ca>
To: <freebsd-bugs@freebsd.org>
Cc: alan@canweb.ca
Subject: Re: kern/37043: Latest stable causes SCSI bus freeze on sym0 when running SMP
Message-ID: <000001c3ce2c$f99091b0$162da8c0@gemini>

Gentlemen,

Since upgrading our kernels to 4.x, we’ve noticed this problem. After installing Gérard’s Sym workaround (http://docs.freebsd.org/cgi/mid.cgi?200208222210.g7MMABwT084798), the issue seems to have vanished on 3 of our 5 affected servers.

All 5 servers are Asus AP1400 boxes (CUR-DLSR mobo, ServerWorks 3 LE chipset) running various stable 4.x kernels, and all contain the LSI Logic 53C1010-33 chipset.

Of the two remaining affected boxes, one runs merrily for a few weeks with minimal load (serving two static web pages) and then dies, spitting out messages like “sym0:0:control msgout 80 22 25d”. The other box runs with no load (completely idle) and hard locks after 2-3 days with no error or panic messages written anywhere. The latter box was recently sent to Asus in Taiwan for repairs to the SCSI backplane board, apparently a known Asus AP1400 issue (don’t know if this info helps, but...).

Below is the dmesg output of the two affected boxes (in the order described above), as well as the “pciconf -l -v” output. I hope this helps. If anyone wishes to mess around with one of these affected boxes, we will make it available remotely for testing.

Any thoughts?

Regards,
...alan lew
alan@canweb.ca

-- SNIP --

[ Box #1 DMESG ]
Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD 4.7-RELEASE-p3 #3: Tue Jan 7 11:33:58 EST 2003
    alan@rigel.canweb.ca:/usr/obj/usr/src/sys/RIGEL
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (1000.04-MHz 686-class CPU)
  Origin = "GenuineIntel" Id = 0x686 Stepping = 6
  Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory = 1073721344 (1048556K bytes)
avail memory = 1041727488 (1017312K bytes)
Changing APIC ID for IO APIC #1 from 3 to 1 in MP table
Changing APIC ID for IO APIC #1 from 3 to 1 on chip
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id: 3, version: 0x00040011, at 0xfee00000
 cpu1 (AP): apic id: 0, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id: 2, version: 0x000f0011, at 0xfec00000
 io1 (APIC): apic id: 1, version: 0x000f0011, at 0xfec01000
Preloaded elf kernel "kernel" at 0xc0356000.
Pentium Pro MTRR support enabled
md0: Malloc disk
Using $PIR table, 8 entries at 0xc00f1010
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
IOAPIC #1 intpin 6 -> irq 2
IOAPIC #1 intpin 7 -> irq 10
IOAPIC #1 intpin 22 -> irq 11
pci0: <PCI bus> on pcib0
fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xd800-0xd83f mem 0xfd800000-0xfd8fffff,0xfe000000-0xfe000fff irq 2 at device 2.0 on pci0
fxp0: Ethernet address 00:e0:18:0a:b1:14
inphy0: <i82555 10/100 media interface> on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pci0: <ATI Mach64-GR graphics accelerator> at 7.0
fxp1: <Intel Pro 10/100B/100+ Ethernet> port 0xd000-0xd03f mem 0xfa800000-0xfa8fffff,0xfb000000-0xfb000fff irq 10 at device 8.0 on pci0
fxp1: Ethernet address 00:e0:18:0a:b1:15
inphy1: <i82555 10/100 media interface> on miibus1
inphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
isab0: <ServerWorks IB6566 PCI to ISA bridge> at device 15.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <ServerWorks ROSB4 ATA33 controller> port 0xb800-0xb80f at device 15.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
ohci0: <OHCI (generic) USB controller> mem 0xfa000000-0xfa000fff irq 11 at device 15.2 on pci0
usb0: OHCI version 1.0, legacy support
usb0: <OHCI (generic) USB controller> on ohci0
usb0: USB revision 1.0
uhub0: (0x1166) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 4 ports with 4 removable, self powered
ufm0: GemTek Corp USB FM Radio, rev 1.00/4.10, addr 2
pcib1: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
IOAPIC #1 intpin 8 -> irq 12
IOAPIC #1 intpin 9 -> irq 15
pci1: <PCI bus> on pcib1
sym0: <1010-33> port 0xb400-0xb4ff mem 0xf9000000-0xf9001fff,0xf9800000-0xf98003ff irq 12 at device 5.0 on pci1
sym0: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking
sym0: open drain IRQ line driver, using on-chip SRAM
sym0: using LOAD/STORE-based firmware.
sym0: handling phase mismatch from SCRIPTS.
sym1: <1010-33> port 0xb000-0xb0ff mem 0xf8000000-0xf8001fff,0xf8800000-0xf88003ff irq 15 at device 5.1 on pci1
sym1: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking
sym1: open drain IRQ line driver, using on-chip SRAM
sym1: using LOAD/STORE-based firmware.
sym1: handling phase mismatch from SCRIPTS.
orm0: <Option ROMs> at iomem 0xc0000-0xca7ff,0xcc000-0xcffff on isa0
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio0: configured irq 4 not in bitmap of probed irqs 0
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 8250
sio1: configured irq 3 not in bitmap of probed irqs 0
ppc0: parallel port not found.
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 intpin 2
APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
IP packet filtering initialized, divert disabled, rule-based forwarding enabled, default to deny, logging disabled
ata0-slave: ATAPI identify retries exceeded
SMP: AP CPU #1 Launched!
acd0: CDROM <CD-224E> at ata0-master PIO4
Waiting 15 seconds for SCSI devices to settle
(noperiph:sym0:0:-1:-1): SCSI BUS reset delivered.
(noperiph:sym1:0:-1:-1): SCSI BUS reset delivered.
(probe10:sym0:0:11:0): phase change 6-7 6@0c918d8c resid=4.
pass1 at sym0 bus 0 target 11 lun 0
pass1: <SDR GEM318 0> Fixed Processor SCSI-2 device
pass1: 3.300MB/s transfers
da0 at sym0 bus 0 target 0 lun 0
da0: <FUJITSU MAN3184MC 0107> Fixed Direct Access SCSI-3 device
da0: 160.000MB/s transfers (80.000MHz, offset 62, 16bit), Tagged Queueing Enabled
da0: 17522MB (35885448 512 byte sectors: 255H 63S/T 2233C)
Mounting root from ufs:/dev/da0s1a

[ Box #2 DMESG ]
Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD 4.7-RELEASE-p17 #2: Mon Sep 22 14:16:48 EDT 2003
    root@newaltair.canweb.ca:/usr/obj/usr/src/sys/ALTAIR
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (1000.04-MHz 686-class CPU)
  Origin = "GenuineIntel" Id = 0x686 Stepping = 6
  Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory = 1073721344 (1048556K bytes)
avail memory = 1041502208 (1017092K bytes)
Changing APIC ID for IO APIC #1 from 3 to 1 in MP table
Changing APIC ID for IO APIC #1 from 3 to 1 on chip
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id: 3, version: 0x00040011, at 0xfee00000
 cpu1 (AP): apic id: 0, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id: 2, version: 0x000f0011, at 0xfec00000
 io1 (APIC): apic id: 1, version: 0x000f0011, at 0xfec01000
Preloaded elf kernel "kernel" at 0xc038d000.
Pentium Pro MTRR support enabled
md0: Malloc disk
Using $PIR table, 8 entries at 0xc00f1010
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
IOAPIC #1 intpin 6 -> irq 2
IOAPIC #1 intpin 7 -> irq 3
pci0: <PCI bus> on pcib0
fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xd800-0xd83f mem 0xfd800000-0xfd8fffff,0xfe000000-0xfe000fff irq 2 at device 2.0 on pci0
fxp0: Ethernet address 00:e0:18:0a:89:88
inphy0: <i82555 10/100 media interface> on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pci0: <ATI Mach64-GR graphics accelerator> at 7.0
fxp1: <Intel Pro 10/100B/100+ Ethernet> port 0xd000-0xd03f mem 0xfa800000-0xfa8fffff,0xfb000000-0xfb000fff irq 3 at device 8.0 on pci0
fxp1: Ethernet address 00:e0:18:0a:89:89
inphy1: <i82555 10/100 media interface> on miibus1
inphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
isab0: <ServerWorks IB6566 PCI to ISA bridge> at device 15.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <ServerWorks ROSB4 ATA33 controller> port 0xb800-0xb80f at device 15.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
pcib1: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
IOAPIC #1 intpin 0 -> irq 7
IOAPIC #1 intpin 8 -> irq 10
IOAPIC #1 intpin 9 -> irq 11
pci1: <PCI bus> on pcib1
pci1: <unknown card> (vendor=0x9004, dev=0x7178) at 2.0 irq 7
sym0: <1010-33> port 0xb000-0xb0ff mem 0xf8800000-0xf8801fff,0xf9000000-0xf90003ff irq 10 at device 5.0 on pci1
sym0: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking
sym0: open drain IRQ line driver, using on-chip SRAM
sym0: using LOAD/STORE-based firmware.
sym0: handling phase mismatch from SCRIPTS.
sym1: <1010-33> port 0xa800-0xa8ff mem 0xf7800000-0xf7801fff,0xf8000000-0xf80003ff irq 11 at device 5.1 on pci1
sym1: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking
sym1: open drain IRQ line driver, using on-chip SRAM
sym1: using LOAD/STORE-based firmware.
sym1: handling phase mismatch from SCRIPTS.
orm0: <Option ROMs> at iomem 0xc0000-0xca7ff,0xcc000-0xcffff,0xd0000-0xd27ff on isa0
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1: configured irq 3 not in bitmap of probed irqs 0
ppc0: parallel port not found.
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 intpin 2
APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
IP packet filtering initialized, divert disabled, rule-based forwarding enabled, default to deny, logging limited to 100 packets/entry by default
ata0-slave: ATAPI identify retries exceeded
acd0: MODE_SENSE_BIG trying to write on read buffer
SMP: AP CPU #1 Launched!
acd0: MODE_SENSE_BIG - NO SENSE asc=0x00 ascq=0x00 error=0x04
acd0: CDROM <CD-224E> at ata0-master PIO4
Waiting 15 seconds for SCSI devices to settle
(noperiph:sym0:0:-1:-1): SCSI BUS reset delivered.
(noperiph:sym1:0:-1:-1): SCSI BUS reset delivered.
(probe10:sym0:0:11:0): phase change 6-7 6@0c936d8c resid=4.
pass1 at sym0 bus 0 target 11 lun 0
pass1: <SDR GEM318 0> Fixed Processor SCSI-2 device
pass1: 3.300MB/s transfers
da0 at sym0 bus 0 target 0 lun 0
da0: <FUJITSU MAN3184MC 0107> Fixed Direct Access SCSI-3 device
da0: 160.000MB/s transfers (80.000MHz, offset 62, 16bit), Tagged Queueing Enabled
da0: 17522MB (35885448 512 byte sectors: 255H 63S/T 2233C)
Mounting root from ufs:/dev/da0s1a

[ PCICONF -l -V output ]
chip0@pci0:0:0: class=0x060000 card=0x00000000 chip=0x00091166 rev=0x05 hdr=0x00
    vendor = 'Reliance Computer Corp./ServerWorks'
    device = 'NB6536 (CNB20HE) AGP interface'
    class = bridge
    subclass = HOST-PCI
chip1@pci0:0:1: class=0x060000 card=0x00000000 chip=0x00091166 rev=0x05 hdr=0x00
    vendor = 'Reliance Computer Corp./ServerWorks'
    device = 'NB6536 (CNB20HE) AGP interface'
    class = bridge
    subclass = HOST-PCI
fxp0@pci0:2:0: class=0x020000 card=0x100c8086 chip=0x12298086 rev=0x08 hdr=0x00
    vendor = 'Intel Corporation'
    device = '82557/8/9 EtherExpress PRO/100(B) Ethernet Adapter'
    class = network
    subclass = ethernet
none0@pci0:7:0: class=0x030000 card=0x802b1043 chip=0x47521002 rev=0x27 hdr=0x00
    vendor = 'ATI Technologies'
    device = 'Rage XL PCI'
    class = display
    subclass = VGA
fxp1@pci0:8:0: class=0x020000 card=0x100c8086 chip=0x12298086 rev=0x08 hdr=0x00
    vendor = 'Intel Corporation'
    device = '82557/8/9 EtherExpress PRO/100(B) Ethernet Adapter'
    class = network
    subclass = ethernet
isab0@pci0:15:0: class=0x060100 card=0x02001166 chip=0x02001166 rev=0x4f hdr=0x00
    vendor = 'Reliance Computer Corp./ServerWorks'
    device = 'OSB4 PCI to ISA Bridge'
    class = bridge
    subclass = PCI-ISA
atapci0@pci0:15:1: class=0x01018a card=0x00000000 chip=0x02111166 rev=0x00 hdr=0x00
    vendor = 'Reliance Computer Corp./ServerWorks'
    device = 'OSB4 PCI EIDE Controller'
    class = mass storage
    subclass = ATA
ohci0@pci0:15:2: class=0x0c0310 card=0x02201166 chip=0x02201166 rev=0x04 hdr=0x00
    vendor = 'Reliance Computer Corp./ServerWorks'
    device = 'OSB4 OpenHCI Compliant USB Controller'
    class = serial bus
    subclass = USB
sym0@pci1:5:0: class=0x010000 card=0x00000000 chip=0x00201000 rev=0x01 hdr=0x00
    vendor = 'LSI Logic (Was: Symbios Logic, NCR)'
    device = 'LSI53C1010-33 PCI to Dual Channel Ultra160 SCSI Multifunction Controller'
    class = mass storage
    subclass = SCSI
sym1@pci1:5:1: class=0x010000 card=0x00000000 chip=0x00201000 rev=0x01 hdr=0x00
    vendor = 'LSI Logic (Was: Symbios Logic, NCR)'
    device = 'LSI53C1010-33 PCI to Dual Channel Ultra160 SCSI Multifunction Controller'
    class = mass storage
    subclass = SCSI
-- SNIP --
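
For anyone wanting to check whether another box carries the same controller, here is a rough sketch of how the pciconf lines above could be scanned. It assumes the usual layout of the chip= field (device ID in the upper 16 bits, vendor ID in the lower 16 bits), so chip=0x00201000 reads as vendor 0x1000, device 0x0020. The names parse_pciconf and find_devices are made up for this illustration and are not part of any FreeBSD tool.

```python
import re

# One pciconf -l entry header, e.g.
#   sym0@pci1:5:0: class=0x010000 card=0x00000000 chip=0x00201000 rev=0x01 hdr=0x00
PCICONF_ENTRY = re.compile(
    r"(?P<name>\w+)@pci(?P<bus>\d+):(?P<slot>\d+):(?P<func>\d+):\s+"
    r"class=0x(?P<class>[0-9a-f]+)\s+"
    r"card=0x(?P<card>[0-9a-f]+)\s+"
    r"chip=0x(?P<chip>[0-9a-f]+)\s+"
    r"rev=0x(?P<rev>[0-9a-f]+)",
    re.IGNORECASE,
)


def parse_pciconf(text):
    """Return one dict per device entry found in pciconf -l output."""
    devices = []
    for match in PCICONF_ENTRY.finditer(text):
        chip = int(match.group("chip"), 16)
        devices.append({
            "name": match.group("name"),
            "bus": int(match.group("bus")),
            "slot": int(match.group("slot")),
            "func": int(match.group("func")),
            # Assumption: low 16 bits = vendor ID, high 16 bits = device ID.
            "vendor_id": chip & 0xFFFF,
            "device_id": chip >> 16,
            "rev": int(match.group("rev"), 16),
        })
    return devices


def find_devices(text, vendor_id, device_id):
    """Return the driver/unit names of all entries matching the given IDs."""
    return [
        dev["name"]
        for dev in parse_pciconf(text)
        if dev["vendor_id"] == vendor_id and dev["device_id"] == device_id
    ]


# Three entry headers copied from the output above.
SAMPLE = """\
fxp0@pci0:2:0: class=0x020000 card=0x100c8086 chip=0x12298086 rev=0x08 hdr=0x00
sym0@pci1:5:0: class=0x010000 card=0x00000000 chip=0x00201000 rev=0x01 hdr=0x00
sym1@pci1:5:1: class=0x010000 card=0x00000000 chip=0x00201000 rev=0x01 hdr=0x00
"""

if __name__ == "__main__":
    # Look for the controller reported as chip=0x00201000 in this post.
    print(find_devices(SAMPLE, vendor_id=0x1000, device_id=0x0020))
```

Feeding it the saved output of "pciconf -l" from another machine in place of SAMPLE should list any functions with the same vendor/device pair; it says nothing about whether that machine is affected by the freeze.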