Date: Mon, 2 Jul 2001 12:03:13 +0200 (CEST) From: "Hartmann, O." <ohartman@klima.physik.uni-mainz.de> To: Pete French <pfrench@firstcallgroup.co.uk> Cc: <freebsd-stable@FreeBSD.ORG>, <freebsd-questions@FreeBSD.ORG> Subject: Re: HELP! Server crashes since last cvsupdate! Message-ID: <Pine.BSF.4.33.0107021200440.829-100000@klima.physik.uni-mainz.de> In-Reply-To: <E15H0Kk-000344-00@dilbert.firstcallgroup.co.uk>
next in thread | previous in thread | raw e-mail | index | archive | help
On Mon, 2 Jul 2001, Pete French wrote: Both machines are SMP systems and here I send the dmesg output. They use both an AMI MegaRAID U160 SCSI controller, the big on uses the Enterprise 1600, the smaller on the Elite 1600. ATMOS (Enterprise1600) Copyright (c) 1992-2001 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 4.3-STABLE #104: Mon Jul 2 11:32:08 CEST 2001 root@atmos.physik.uni-mainz.de:/usr/obj/usr/src/sys/ATMOS Timecounter "i8254" frequency 1193182 Hz CPU: Pentium III/Pentium III Xeon/Celeron (868.57-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x683 Stepping = 3 Features=0x387fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,PN,MMX,FXSR,SSE> real memory = 2147483648 (2097152K bytes) avail memory = 2087956480 (2039020K bytes) Programming 16 pins in IOAPIC #0 IOAPIC #0 intpin 2 -> irq 0 Programming 16 pins in IOAPIC #1 FreeBSD/SMP: Multiprocessor motherboard cpu0 (BSP): apic id: 1, version: 0x00040011, at 0xfee00000 cpu1 (AP): apic id: 0, version: 0x00040011, at 0xfee00000 io0 (APIC): apic id: 2, version: 0x000f0011, at 0xfec00000 io1 (APIC): apic id: 3, version: 0x000f0011, at 0xfec01000 Preloaded elf kernel "kernel" at 0xc0394000. Pentium Pro MTRR support enabled npx0: <math processor> on motherboard npx0: INT 16 interface pcib0: <Host to PCI bridge> on motherboard IOAPIC #1 intpin 13 -> irq 2 IOAPIC #1 intpin 12 -> irq 16 IOAPIC #1 intpin 7 -> irq 17 pci0: <PCI bus> on pcib0 pcib3: <PCI to PCI bridge (vendor=1166 device=0005)> at device 0.1 on pci0 IOAPIC #1 intpin 1 -> irq 18 pci1: <PCI bus> on pcib3 pci1: <NVidia Riva TNT2 graphics accelerator> at 0.0 irq 18 sym0: <896> port 0xf800-0xf8ff mem 0xfeafe000-0xfeafffff,0xfeafac00-0xfeafafff irq 2 at device 1.0 on pci0 sym0: Symbios NVRAM, ID 7, Fast-40, LVD, parity checking sym0: open drain IRQ line driver, using on-chip SRAM sym0: using LOAD/STORE-based firmware. sym0: handling phase mismatch from SCRIPTS. sym1: <896> port 0xf400-0xf4ff mem 0xfeafc000-0xfeafdfff,0xfeafa800-0xfeafabff irq 16 at device 1.1 on pci0 sym1: Symbios NVRAM, ID 7, Fast-40, SE, parity checking sym1: open drain IRQ line driver, using on-chip SRAM sym1: using LOAD/STORE-based firmware. sym1: handling phase mismatch from SCRIPTS. pcib5: <DEC 21154 PCI-PCI bridge> at device 3.0 on pci0 IOAPIC #1 intpin 2 -> irq 19 pci2: <PCI bus> on pcib5 pcib6: <DEC 21154 PCI-PCI bridge> at device 0.0 on pci2 IOAPIC #1 intpin 0 -> irq 20 pci3: <PCI bus> on pcib6 amr0: <AMI MegaRAID> mem 0xf4000000-0xf7ffffff irq 20 at device 0.0 on pci3 amr0: <Series 471 40 Logical Drive Firmware> Firmware A159, BIOS 3.11, 64MB RAM pci2: <unknown card> (vendor=0x1077, dev=0x1216) at 1.0 irq 18 pci2: <unknown card> (vendor=0x1077, dev=0x1216) at 2.0 irq 19 fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xfcc0-0xfcff mem 0xfe900000-0xfe9fffff,0xfeaf9000-0xfeaf9fff irq 17 at device 7.0 on pci0 fxp0: Ethernet address 00:e0:81:00:f0:d7 inphy0: <i82555 10/100 media interface> on miibus0 inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto isab0: <ServerWorks IB6566 PCI to ISA bridge> at device 15.0 on pci0 isa0: <ISA bus> on isab0 pci0: <Unknown PCI ATA controller> at 15.1 pcib1: <ServerWorks NB6536 2.0HE host to PCI bridge> on motherboard pci4: <PCI bus> on pcib1 pcib2: <ServerWorks host to PCI bridge> on motherboard pci5: <PCI bus> on pcib2 pcib4: <ServerWorks host to PCI bridge> on motherboard pci6: <PCI bus> on pcib4 orm0: <Option ROMs> at iomem 0xc0000-0xc9fff,0xca000-0xcdfff on isa0 atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0 atkbd0: <AT Keyboard> irq 1 on atkbdc0 kbd0 at atkbd0 psm0: <PS/2 Mouse> irq 12 on atkbdc0 psm0: model IntelliMouse, device ID 3 vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 sc0: <System console> on isa0 sc0: VGA <6 virtual consoles, flags=0x200> fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0 fdc0: FIFO enabled, 8 bytes threshold fd0: <1440-KB 3.5" drive> on fdc0 drive 0 sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0 sio0: type 16550A sio1 at port 0x2f8-0x2ff irq 3 flags 0x10 on isa0 sio1: type 16550A ppc0: <Parallel port> at port 0x378-0x37f irq 7 drq 1 flags 0x8 on isa0 ppc0: SMC-like chipset (ECP-only) in ECP mode ppc0: FIFO with 16/16/8 bytes threshold lpt0: <Printer> on ppbus0 lpt0: Interrupt-driven port APIC_IO: Testing 8254 interrupt delivery APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 intpin 2 APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0 IP packet filtering initialized, divert enabled, rule-based forwarding enabled, default to deny, unlimited logging DUMMYNET initialized (010124) IPsec: Initialized Security Association Processing. Waiting 4 seconds for SCSI devices to settle (noperiph:sym0:0:-1:-1): SCSI BUS reset delivered. (noperiph:sym1:0:-1:-1): SCSI BUS reset delivered. amrd0: <MegaRAID logical drive> on amr0 amrd0: 245014MB (501788672 sectors) RAID 5 (optimal) SMP: AP CPU #1 Launched! sa0 at sym1 bus 0 target 5 lun 0 sa0: <HP C5713A H910> Removable Sequential Access SCSI-2 device sa0: 40.000MB/s transfers (20.000MHz, offset 31, 16bit) Mounting root from ufs:/dev/amrd0s1a cd0 at sym1 bus 0 target 3 lun 0 cd0: <TEAC CD-ROM CD-532S 1.0A> Removable CD-ROM SCSI-2 device cd0: 20.000MB/s transfers (20.000MHz, offset 16) cd0: cd present [275963 x 2048 byte records] ch0 at sym1 bus 0 target 5 lun 1 ch0: <HP C5713A H910> Removable Changer SCSI-2 device ch0: 40.000MB/s transfers (20.000MHz, offset 31, 16bit) ch0: 6 slots, 1 drive, 0 pickers, 0 portals link_elf: symbol splash_register undefined fxp0: promiscuous mode enabled KLIMA (Elite1600): Copyright (c) 1992-2001 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 4.3-STABLE #63: Mon Jul 2 11:46:19 CEST 2001 root@klima.physik.uni-mainz.de:/usr/obj/usr/src/sys/KLIMA Timecounter "i8254" frequency 1193182 Hz CPU: Pentium III/Pentium III Xeon/Celeron (803.60-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x686 Stepping = 6 Features=0x387fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,PN,MMX,FXSR,SSE> real memory = 1073725440 (1048560K bytes) avail memory = 1041309696 (1016904K bytes) Programming 24 pins in IOAPIC #0 IOAPIC #0 intpin 2 -> irq 0 FreeBSD/SMP: Multiprocessor motherboard cpu0 (BSP): apic id: 3, version: 0x00040011, at 0xfee00000 cpu1 (AP): apic id: 0, version: 0x00040011, at 0xfee00000 io0 (APIC): apic id: 2, version: 0x00178011, at 0xfec00000 Preloaded elf kernel "kernel" at 0xc03ef000. Pentium Pro MTRR support enabled npx0: <math processor> on motherboard npx0: INT 16 interface pcib0: <Host to PCI bridge> on motherboard IOAPIC #0 intpin 18 -> irq 2 IOAPIC #0 intpin 19 -> irq 10 IOAPIC #0 intpin 16 -> irq 11 pci0: <PCI bus> on pcib0 pcib2: <VIA 82C598MVP (Apollo MVP3) PCI-PCI (AGP) bridge> at device 1.0 on pci0 pci1: <PCI bus> on pcib2 pci1: <Matrox MGA G200 AGP graphics accelerator> at 0.0 irq 11 isab0: <VIA 82C686 PCI-ISA bridge> at device 4.0 on pci0 isa0: <ISA bus> on isab0 pci0: <VIA 85C586 ATA controller> at 4.1 pci0: <VIA 83C572 USB controller> at 4.2 irq 2 pci0: <VIA 83C572 USB controller> at 4.3 irq 2 fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xb800-0xb83f mem 0xf1800000-0xf18fffff,0xf2000000-0xf2000fff irq 10 at device 9.0 on pci0 fxp0: Ethernet address 00:d0:b7:06:6e:78 inphy0: <i82555 10/100 media interface> on miibus0 inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto pcib3: <DEC 21154 PCI-PCI bridge> at device 11.0 on pci0 pci2: <PCI bus> on pcib3 pcib4: <DEC 21154 PCI-PCI bridge> at device 0.0 on pci2 IOAPIC #0 intpin 17 -> irq 13 pci3: <PCI bus> on pcib4 amr0: <AMI MegaRAID> mem 0xf4000000-0xf5ffffff irq 13 at device 0.0 on pci3 amr0: <Series 493 40 Logical Drive Firmware> Firmware A159, BIOS 3.11, 32MB RAM pci2: <unknown card> (vendor=0x1077, dev=0x1216) at 1.0 irq 2 ahc0: <Adaptec 2940 Ultra SCSI adapter> port 0x9800-0x98ff mem 0xf0800000-0xf0800fff irq 11 at device 12.0 on pci0 aic7880: Wide Channel A, SCSI Id=7, 16/255 SCBs pcib1: <Host to PCI bridge> on motherboard pci4: <PCI bus> on pcib1 orm0: <Option ROM> at iomem 0xc0000-0xc7fff on isa0 atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0 atkbd0: <AT Keyboard> irq 1 on atkbdc0 kbd0 at atkbd0 psm0: <PS/2 Mouse> irq 12 on atkbdc0 psm0: model IntelliMouse, device ID 3 vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 sc0: <System console> on isa0 sc0: VGA <8 virtual consoles, flags=0x200> fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0 fdc0: FIFO enabled, 8 bytes threshold fd0: <1440-KB 3.5" drive> on fdc0 drive 0 sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0 sio0: type 16550A sio1 at port 0x2f8-0x2ff irq 3 flags 0x10 on isa0 sio1: type 16550A ppc0: <Parallel port> at port 0x378-0x37f irq 7 drq 1 flags 0x8 on isa0 ppc0: SMC-like chipset (ECP-only) in ECP mode ppc0: FIFO with 16/16/8 bytes threshold lpt0: <Printer> on ppbus0 lpt0: Interrupt-driven port APIC_IO: Testing 8254 interrupt delivery APIC_IO: routing 8254 via IOAPIC #0 intpin 2 DUMMYNET initialized (010124) IP packet filtering initialized, divert enabled, rule-based forwarding enabled, default to deny, unlimited logging IPsec: Initialized Security Association Processing. Waiting 4 seconds for SCSI devices to settle amrd0: <MegaRAID logical drive> on amr0 amrd0: 43735MB (89569280 sectors) RAID 5 (optimal) SMP: AP CPU #1 Launched! sa0 at ahc0 bus 0 target 4 lun 0 sa0: <HP C5683A C005> Removable Sequential Access SCSI-2 device sa0: 40.000MB/s transfers (20.000MHz, offset 8, 16bit) Mounting root from ufs:/dev/amrd0s1a cd0 at ahc0 bus 0 target 3 lun 0 cd0: <TEAC CD-ROM CD-532S 1.0A> Removable CD-ROM SCSI-2 device cd0: 20.000MB/s transfers (20.000MHz, offset 15) cd0: Attempt to query device size failed: NOT READY, Medium not present link_elf: symbol splash_register undefined fxp0: promiscuous mode enabled :>> Since our last update Friday, 29th June, both SMP machines run :>> into a "stuck" condition after a while. This happened now two times :>> and I do not know what happens. :> :>I've been seeing this effect since 4.3-RELEASE actually. WIth pretty much :>identical symptoms to the ones you descibe. Asking here earlier people :>seemed to think that it was the disc controllers getting locked up as this :>will lead to the effects described. Sometimes the machine will run :>for weeks at a time, sometimes it will freeze after a few hours. The :>easiest way I can make it lockup is to try and access a very large :>file from two processes at once. :> :>I'm currently trying to find time to work out how to use the kernel :>debugging stuff to connect over the network and see what sort of :>state the kernels in (which it is apparently posssible to do). But :>not really got anywhere with that yet. I'd be intyerested in knowing :>what sort of machine you have and what the components are to see if :>theres anything that both systems have in common (other than the SMP bits). :> :>cheers, :> :>-pcf. :> :>To Unsubscribe: send mail to majordomo@FreeBSD.org :>with "unsubscribe freebsd-stable" in the body of the message :> -- MfG O. Hartmann ohartman@klima.physik.uni-mainz.de ---------------------------------------------------------------- IT-Administration des Institut fuer Physik der Atmosphaere (IPA) ---------------------------------------------------------------- Johannes Gutenberg Universitaet Mainz Becherweg 21 55099 Mainz Tel: +496131/3924662 (Maschinenraum) Tel: +496131/3924144 FAX: +496131/3923532 To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-stable" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?Pine.BSF.4.33.0107021200440.829-100000>