Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 2 Jul 2001 12:03:13 +0200 (CEST)
From:      "Hartmann, O." <ohartman@klima.physik.uni-mainz.de>
To:        Pete French <pfrench@firstcallgroup.co.uk>
Cc:        <freebsd-stable@FreeBSD.ORG>, <freebsd-questions@FreeBSD.ORG>
Subject:   Re: HELP! Server crashes since last cvsupdate!
Message-ID:  <Pine.BSF.4.33.0107021200440.829-100000@klima.physik.uni-mainz.de>
In-Reply-To: <E15H0Kk-000344-00@dilbert.firstcallgroup.co.uk>

next in thread | previous in thread | raw e-mail | index | archive | help
On Mon, 2 Jul 2001, Pete French wrote:

Both machines are SMP systems and here I send the dmesg output. They use
both an AMI MegaRAID U160 SCSI controller, the big on uses the Enterprise 1600,
the smaller on the Elite 1600.

ATMOS (Enterprise1600)


Copyright (c) 1992-2001 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD 4.3-STABLE #104: Mon Jul  2 11:32:08 CEST 2001
    root@atmos.physik.uni-mainz.de:/usr/obj/usr/src/sys/ATMOS
Timecounter "i8254"  frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (868.57-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x683  Stepping = 3
  Features=0x387fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,PN,MMX,FXSR,SSE>
real memory  = 2147483648 (2097152K bytes)
avail memory = 2087956480 (2039020K bytes)
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id:  1, version: 0x00040011, at 0xfee00000
 cpu1 (AP):  apic id:  0, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id:  2, version: 0x000f0011, at 0xfec00000
 io1 (APIC): apic id:  3, version: 0x000f0011, at 0xfec01000
Preloaded elf kernel "kernel" at 0xc0394000.
Pentium Pro MTRR support enabled
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <Host to PCI bridge> on motherboard
IOAPIC #1 intpin 13 -> irq 2
IOAPIC #1 intpin 12 -> irq 16
IOAPIC #1 intpin 7 -> irq 17
pci0: <PCI bus> on pcib0
pcib3: <PCI to PCI bridge (vendor=1166 device=0005)> at device 0.1 on pci0
IOAPIC #1 intpin 1 -> irq 18
pci1: <PCI bus> on pcib3
pci1: <NVidia Riva TNT2 graphics accelerator> at 0.0 irq 18
sym0: <896> port 0xf800-0xf8ff mem 0xfeafe000-0xfeafffff,0xfeafac00-0xfeafafff irq 2 at device 1.0 on pci0
sym0: Symbios NVRAM, ID 7, Fast-40, LVD, parity checking
sym0: open drain IRQ line driver, using on-chip SRAM
sym0: using LOAD/STORE-based firmware.
sym0: handling phase mismatch from SCRIPTS.
sym1: <896> port 0xf400-0xf4ff mem 0xfeafc000-0xfeafdfff,0xfeafa800-0xfeafabff irq 16 at device 1.1 on pci0
sym1: Symbios NVRAM, ID 7, Fast-40, SE, parity checking
sym1: open drain IRQ line driver, using on-chip SRAM
sym1: using LOAD/STORE-based firmware.
sym1: handling phase mismatch from SCRIPTS.
pcib5: <DEC 21154 PCI-PCI bridge> at device 3.0 on pci0
IOAPIC #1 intpin 2 -> irq 19
pci2: <PCI bus> on pcib5
pcib6: <DEC 21154 PCI-PCI bridge> at device 0.0 on pci2
IOAPIC #1 intpin 0 -> irq 20
pci3: <PCI bus> on pcib6
amr0: <AMI MegaRAID> mem 0xf4000000-0xf7ffffff irq 20 at device 0.0 on pci3
amr0: <Series 471 40 Logical Drive Firmware> Firmware A159, BIOS 3.11, 64MB RAM
pci2: <unknown card> (vendor=0x1077, dev=0x1216) at 1.0 irq 18
pci2: <unknown card> (vendor=0x1077, dev=0x1216) at 2.0 irq 19
fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xfcc0-0xfcff mem 0xfe900000-0xfe9fffff,0xfeaf9000-0xfeaf9fff irq 17 at device 7.0 on pci0
fxp0: Ethernet address 00:e0:81:00:f0:d7
inphy0: <i82555 10/100 media interface> on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
isab0: <ServerWorks IB6566 PCI to ISA bridge> at device 15.0 on pci0
isa0: <ISA bus> on isab0
pci0: <Unknown PCI ATA controller> at 15.1
pcib1: <ServerWorks NB6536 2.0HE host to PCI bridge> on motherboard
pci4: <PCI bus> on pcib1
pcib2: <ServerWorks host to PCI bridge> on motherboard
pci5: <PCI bus> on pcib2
pcib4: <ServerWorks host to PCI bridge> on motherboard
pci6: <PCI bus> on pcib4
orm0: <Option ROMs> at iomem 0xc0000-0xc9fff,0xca000-0xcdfff on isa0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: model IntelliMouse, device ID 3
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> on isa0
sc0: VGA <6 virtual consoles, flags=0x200>
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1 at port 0x2f8-0x2ff irq 3 flags 0x10 on isa0
sio1: type 16550A
ppc0: <Parallel port> at port 0x378-0x37f irq 7 drq 1 flags 0x8 on isa0
ppc0: SMC-like chipset (ECP-only) in ECP mode
ppc0: FIFO with 16/16/8 bytes threshold
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 intpin 2
APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
IP packet filtering initialized, divert enabled, rule-based forwarding enabled, default to deny, unlimited logging
DUMMYNET initialized (010124)
IPsec: Initialized Security Association Processing.
Waiting 4 seconds for SCSI devices to settle
(noperiph:sym0:0:-1:-1): SCSI BUS reset delivered.
(noperiph:sym1:0:-1:-1): SCSI BUS reset delivered.
amrd0: <MegaRAID logical drive> on amr0
amrd0: 245014MB (501788672 sectors) RAID 5 (optimal)
SMP: AP CPU #1 Launched!
sa0 at sym1 bus 0 target 5 lun 0
sa0: <HP C5713A H910> Removable Sequential Access SCSI-2 device
sa0: 40.000MB/s transfers (20.000MHz, offset 31, 16bit)
Mounting root from ufs:/dev/amrd0s1a
cd0 at sym1 bus 0 target 3 lun 0
cd0: <TEAC CD-ROM CD-532S 1.0A> Removable CD-ROM SCSI-2 device
cd0: 20.000MB/s transfers (20.000MHz, offset 16)
cd0: cd present [275963 x 2048 byte records]
ch0 at sym1 bus 0 target 5 lun 1
ch0: <HP C5713A H910> Removable Changer SCSI-2 device
ch0: 40.000MB/s transfers (20.000MHz, offset 31, 16bit)
ch0: 6 slots, 1 drive, 0 pickers, 0 portals
link_elf: symbol splash_register undefined
fxp0: promiscuous mode enabled

KLIMA (Elite1600):


Copyright (c) 1992-2001 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD 4.3-STABLE #63: Mon Jul  2 11:46:19 CEST 2001
    root@klima.physik.uni-mainz.de:/usr/obj/usr/src/sys/KLIMA
Timecounter "i8254"  frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (803.60-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x686  Stepping = 6
  Features=0x387fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,PN,MMX,FXSR,SSE>
real memory  = 1073725440 (1048560K bytes)
avail memory = 1041309696 (1016904K bytes)
Programming 24 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id:  3, version: 0x00040011, at 0xfee00000
 cpu1 (AP):  apic id:  0, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id:  2, version: 0x00178011, at 0xfec00000
Preloaded elf kernel "kernel" at 0xc03ef000.
Pentium Pro MTRR support enabled
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <Host to PCI bridge> on motherboard
IOAPIC #0 intpin 18 -> irq 2
IOAPIC #0 intpin 19 -> irq 10
IOAPIC #0 intpin 16 -> irq 11
pci0: <PCI bus> on pcib0
pcib2: <VIA 82C598MVP (Apollo MVP3) PCI-PCI (AGP) bridge> at device 1.0 on pci0
pci1: <PCI bus> on pcib2
pci1: <Matrox MGA G200 AGP graphics accelerator> at 0.0 irq 11
isab0: <VIA 82C686 PCI-ISA bridge> at device 4.0 on pci0
isa0: <ISA bus> on isab0
pci0: <VIA 85C586 ATA controller> at 4.1
pci0: <VIA 83C572 USB controller> at 4.2 irq 2
pci0: <VIA 83C572 USB controller> at 4.3 irq 2
fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xb800-0xb83f mem 0xf1800000-0xf18fffff,0xf2000000-0xf2000fff irq 10 at device 9.0 on pci0
fxp0: Ethernet address 00:d0:b7:06:6e:78
inphy0: <i82555 10/100 media interface> on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pcib3: <DEC 21154 PCI-PCI bridge> at device 11.0 on pci0
pci2: <PCI bus> on pcib3
pcib4: <DEC 21154 PCI-PCI bridge> at device 0.0 on pci2
IOAPIC #0 intpin 17 -> irq 13
pci3: <PCI bus> on pcib4
amr0: <AMI MegaRAID> mem 0xf4000000-0xf5ffffff irq 13 at device 0.0 on pci3
amr0: <Series 493 40 Logical Drive Firmware> Firmware A159, BIOS 3.11, 32MB RAM
pci2: <unknown card> (vendor=0x1077, dev=0x1216) at 1.0 irq 2
ahc0: <Adaptec 2940 Ultra SCSI adapter> port 0x9800-0x98ff mem 0xf0800000-0xf0800fff irq 11 at device 12.0 on pci0
aic7880: Wide Channel A, SCSI Id=7, 16/255 SCBs
pcib1: <Host to PCI bridge> on motherboard
pci4: <PCI bus> on pcib1
orm0: <Option ROM> at iomem 0xc0000-0xc7fff on isa0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: model IntelliMouse, device ID 3
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> on isa0
sc0: VGA <8 virtual consoles, flags=0x200>
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1 at port 0x2f8-0x2ff irq 3 flags 0x10 on isa0
sio1: type 16550A
ppc0: <Parallel port> at port 0x378-0x37f irq 7 drq 1 flags 0x8 on isa0
ppc0: SMC-like chipset (ECP-only) in ECP mode
ppc0: FIFO with 16/16/8 bytes threshold
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: routing 8254 via IOAPIC #0 intpin 2
DUMMYNET initialized (010124)
IP packet filtering initialized, divert enabled, rule-based forwarding enabled, default to deny, unlimited logging
IPsec: Initialized Security Association Processing.
Waiting 4 seconds for SCSI devices to settle
amrd0: <MegaRAID logical drive> on amr0
amrd0: 43735MB (89569280 sectors) RAID 5 (optimal)
SMP: AP CPU #1 Launched!
sa0 at ahc0 bus 0 target 4 lun 0
sa0: <HP C5683A C005> Removable Sequential Access SCSI-2 device
sa0: 40.000MB/s transfers (20.000MHz, offset 8, 16bit)
Mounting root from ufs:/dev/amrd0s1a
cd0 at ahc0 bus 0 target 3 lun 0
cd0: <TEAC CD-ROM CD-532S 1.0A> Removable CD-ROM SCSI-2 device
cd0: 20.000MB/s transfers (20.000MHz, offset 15)
cd0: Attempt to query device size failed: NOT READY, Medium not present
link_elf: symbol splash_register undefined
fxp0: promiscuous mode enabled
:>> Since our last update Friday, 29th June, both SMP machines run
:>> into a "stuck" condition after a while. This happened now two times
:>> and I do not know what happens.
:>
:>I've been seeing this effect since  4.3-RELEASE actually. WIth pretty much
:>identical symptoms to the ones you descibe. Asking here earlier people
:>seemed to think that it was the disc controllers getting locked up as this
:>will lead to the effects described. Sometimes the machine will run
:>for weeks at a time, sometimes it will freeze after a few hours. The
:>easiest way I can make it lockup is to try and access a very large
:>file from two processes at once.
:>
:>I'm currently trying to find time to work out how to use the kernel
:>debugging stuff to connect over the network and see what  sort of
:>state the kernels in (which it is apparently posssible to do). But
:>not really got anywhere with that yet. I'd be intyerested in knowing
:>what sort of machine you have and what the components are to see if
:>theres anything that both systems have in common (other than the SMP bits).
:>
:>cheers,
:>
:>-pcf.
:>
:>To Unsubscribe: send mail to majordomo@FreeBSD.org
:>with "unsubscribe freebsd-stable" in the body of the message
:>

--
MfG
O. Hartmann

ohartman@klima.physik.uni-mainz.de
----------------------------------------------------------------
IT-Administration des Institut fuer Physik der Atmosphaere (IPA)
----------------------------------------------------------------
Johannes Gutenberg Universitaet Mainz
Becherweg 21
55099 Mainz

Tel: +496131/3924662 (Maschinenraum)
Tel: +496131/3924144
FAX: +496131/3923532


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-stable" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?Pine.BSF.4.33.0107021200440.829-100000>