Date: Thu, 19 Oct 2000 11:33:02 -0300 (ADT) From: The Hermit Hacker <scrappy@hub.org> To: Terry Lambert <tlambert@primenet.com> Cc: Kris Kirby <kris@catonic.net>, freebsd-stable@FreeBSD.ORG, freebsd-smp@FreeBSD.ORG Subject: Re: Dual CPU Xeon server hangs on SMP kernel ... Message-ID: <Pine.BSF.4.21.0010191112010.974-400000@thelab.hub.org> In-Reply-To: <200010182344.QAA18472@usr01.primenet.com>
index | next in thread | previous in thread | raw e-mail
[-- Attachment #1 --]
Okay, attached are dmesg outputs for the MP hang and the UP boot, as
wellas an mptable from the UP boot ...
Pulling out the points you brought up, things look the same in the dmesg
output:
new-demeter# grep Origin *.dmesg
MP.dmesg: Origin = "GenuineIntel" Id = 0x6a0 Stepping = 0
UP.dmesg: Origin = "GenuineIntel" Id = 0x6a0 Stepping = 0
grep Features *.dmesg
MP.dmesg: Features=0x387fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,PN,MMX,FXSR,SSE>
UP.dmesg: Features=0x387fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,PN,MMX,FXSR,SSE>
but, in mptable, my 'step' is different on this machine:
--
Processors: APIC ID Version State Family Model Step Flags
1 0x11 BSP, usable 6 10 0 0x0301
0 0x11 AP, usable 6 10 1 0x0301
where on another one of my Dual processor machines, the step's are equal?
Okay, try to put a bit more detail on the machine in here ... on booting,
it comes up with recognizing two CPUs (BSP is P2, AP is P1)
BIOS Build Level - MWE119AUS
BIOS Revision - 2
SP ROM Build Level - MWET14A
SP ROM Revision Level - 4
Machine Type (Netfinity 7100) - 86663RY
System Board ID - J1DD0039039
System Info shows:
Pentium III Xeon A1 Stepping
Pentium III Xeon A0 Stepping
And, goign through all the menus in the BIOS, I can't see any reference to
Unixware, or anything that I can change/scroll though that has it as an
option ...
When the system hangs, I can hit ctl-alt-del to reboot the machine, but,
with DDB enabled, I can't seem to get into there, but am not sure if that
is something to do with my console switch or not ... if there is something
I can try in DDB that will provide more information, please let me know
and I'll try and figure out how to get it to ddrop into it ...
There, I think that this pretty much answers every/anything you asked
about this time? :)
Thanks ...
On Wed, 18 Oct 2000, Terry Lambert wrote:
> > > > Okay, since I can't actually boot into SMP, how would you suggest I get
> > > > that information? MPtable I know how to get, and can do that without
> > > > being in SMP, so will get that one later tonight and send it on, but any
> > > > sugggestions on how to get the dmesg from an actual SMP boot is most
> > > > welcome?
> > >
> > > Pull the keyboard and connect to the box with a null-modem connected to
> > > COM1 of the Xeon box. Capture to logfile. cat ~/logfile | mail
> > > freebsd@lists.
> >
> > damn, I was afraid that was going to be the only solution :( ah well,
> > will try and get into office tonight and do this ...
>
> It's important to see the stuff that comes up differently
> when attempting an SMP boot.
>
> However, assumming you had a UP boot in hand, you could get
> most of the information by comparing it to a hung SMP boot,
> and manually filing in the differences.
>
> The first thing to check is that chipset features, Id, and
> stepping are identical; if they aren't, you could have
> problems.
>
> The other issues are MPtable version, interrupt routing, and
> BIOS programming of your chipset (in general, I think that
> selecting "UnixWare" has shown the most overall success, but
> ou may need to go into advanced BIOs options for specific
> settings).
>
> BIOS versions and other stuff are also useful.
>
> The reason you didn't get this shotgun to start with (the way
> you asked the questions is basically a request for a shotgun)
> is that it's more useful to the project as a whole to know
> exatly why you are failing, so that it can be surgically
> corrected. Giving you a shotgun, and getting back a "Thanks;
> it works now" message doesn't provide a way for changing the
> code to avoid your situation in the future, and handing out
> shotguns for all of eternity isn't really a tenable support
> situation.
>
> Now that I've given you a shotgun, at least fire off only one
> pellet at a time, and let us know which one, if any, hits the
> mark.
>
> Preferrably, you'll go to the trouble of capturing the data,
> so that we can give you a better diagnosis, and help avoid
> this pain for others (or your next system) in the future.
>
>
> Terry Lambert
> terry@lambert.org
> ---
> Any opinions in this posting are my own and not those of my present
> or previous employers.
>
>
Marc G. Fournier ICQ#7615664 IRC Nick: Scrappy
Systems Administrator @ hub.org
primary: scrappy@hub.org secondary: scrappy@{freebsd|postgresql}.org
[-- Attachment #2 --]
Console: serial port
BIOS drive A: is disk0
BIOS drive C: is disk1
BIOS drive D: is disk2
BIOS drive E: is disk3
BIOS drive F: is disk4
BIOS drive G: is disk5
BIOS 632kB/1047535kB available memory
FreeBSD/i386 bootstrap loader, Revision 0.8
(root@new-demeter.acadiau.ca, Tue Oct 17 12:25:08 ADT 2000)
Loading /boot/defaults/loader.conf
/kernel text=0x15807d data=0x1ce0c+0x1e9a8 syms=[0x4+0x22740+0x4+0x26b9f]
-
Hit [Enter] to boot immediately, or any other key for command prompt.
Booting [kernel]...
Copyright (c) 1992-2000 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 4.1.1-STABLE #0: Thu Oct 19 10:28:14 ADT 2000
root@new-demeter.acadiau.ca:/usr/obj/usr/src/sys/kernel
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (699.18-MHz 686-class CPU)
Origin = "GenuineIntel" Id = 0x6a0 Stepping = 0
Features=0x387fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,PN,MMX,FXSR,SSE>
real memory = 1073721344 (1048556K bytes)
avail memory = 1042911232 (1018468K bytes)
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
IOAPIC #1 intpin 1 -> irq 2
IOAPIC #1 intpin 2 -> irq 9
IOAPIC #1 intpin 6 -> irq 16
IOAPIC #1 intpin 7 -> irq 17
IOAPIC #1 intpin 8 -> irq 15
IOAPIC #1 intpin 9 -> irq 18
IOAPIC #1 intpin 10 -> irq 11
FreeBSD/SMP: Multiprocessor motherboard
cpu0 (BSP): apic id: 1, version: 0x00040011, at 0xfee00000
cpu1 (AP): apic id: 0, version: 0x00040011, at 0xfee00000
io0 (APIC): apic id: 14, version: 0x000f0011, at 0xfec00000
io1 (APIC): apic id: 13, version: 0x000f0011, at 0xfec01000
Preloaded elf kernel "kernel" at 0xc02df000.
Pentium Pro MTRR support enabled
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib1: <RCC host to PCI bridge> on motherboard
pci1: <PCI bus> on pcib1
pcib0: <RCC host to PCI bridge> on motherboard
pci0: <PCI bus> on pcib0
ahc0: <Adaptec aic7896/97 Ultra2 SCSI adapter> port 0x2000-0x20ff mem 0xfebff000-0xfebfffff irq 2 at device 1.0 on pci0
aic7896/97: Wide Channel A, SCSI Id=7, 32/255 SCBs
ahc1: <Adaptec aic7896/97 Ultra2 SCSI adapter> port 0x2100-0x21ff mem 0xfebfe000-0xfebfefff irq 2 at device 1.1 on pci0
aic7896/97: Wide Channel B, SCSI Id=7, 32/255 SCBs
pci0: <S3 Trio3D graphics accelerator> at 6.0
isab0: <ServerWorks IB6566 PCI to ISA bridge> at device 15.0 on pci0
isa0: <ISA bus> on isab0
pci0: <OHCI USB controller> at 15.2 irq 11
pcib2: <RCC host to PCI bridge> on motherboard
pci2: <PCI bus> on pcib2
rl0: <Accton MPX 5030/5038 10/100BaseTX> port 0x2200-0x22ff mem 0xf7fffc00-0xf7fffcff irq 15 at device 5.0 on pci2
rl0: Ethernet address: 00:e0:29:67:4e:d0
miibus0: <MII bus> on rl0
rlphy0: <RealTek internal media interface> on miibus0
rlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pcib3: <RCC host to PCI bridge> on motherboard
pci3: <PCI bus> on pcib3
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A, console
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: model GlidePoint, device ID 0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x100>
APIC_IO: routing 8254 via IOAPIC #0 intpin 2
Waiting 15 seconds for SCSI devices to settle
SMP: AP CPU #1 Launched!
[-- Attachment #3 --]
Copyright (c) 1992-2000 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 4.1.1-STABLE #0: Thu Oct 19 09:46:46 ADT 2000
root@new-demeter.acadiau.ca:/usr/obj/usr/src/sys/kernel
Timecounter "i8254" frequency 1193182 Hz
Timecounter "TSC" frequency 699180891 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (699.18-MHz 686-class CPU)
Origin = "GenuineIntel" Id = 0x6a0 Stepping = 0
Features=0x387fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,PN,MMX,FXSR,SSE>
real memory = 1073721344 (1048556K bytes)
avail memory = 1042964480 (1018520K bytes)
Preloaded elf kernel "kernel.old" at 0xc02d0000.
Pentium Pro MTRR support enabled
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib1: <RCC host to PCI bridge> on motherboard
pci1: <PCI bus> on pcib1
pcib0: <RCC host to PCI bridge> on motherboard
pci0: <PCI bus> on pcib0
ahc0: <Adaptec aic7896/97 Ultra2 SCSI adapter> port 0x2000-0x20ff mem 0xfebff000-0xfebfffff irq 9 at device 1.0 on pci0
aic7896/97: Wide Channel A, SCSI Id=7, 32/255 SCBs
ahc1: <Adaptec aic7896/97 Ultra2 SCSI adapter> port 0x2100-0x21ff mem 0xfebfe000-0xfebfefff irq 9 at device 1.1 on pci0
aic7896/97: Wide Channel B, SCSI Id=7, 32/255 SCBs
pci0: <S3 Trio3D graphics accelerator> at 6.0
isab0: <ServerWorks IB6566 PCI to ISA bridge> at device 15.0 on pci0
isa0: <ISA bus> on isab0
pci0: <OHCI USB controller> at 15.2 irq 15
pcib2: <RCC host to PCI bridge> on motherboard
pci2: <PCI bus> on pcib2
rl0: <Accton MPX 5030/5038 10/100BaseTX> port 0x2200-0x22ff mem 0xf7fffc00-0xf7fffcff irq 11 at device 5.0 on pci2
rl0: Ethernet address: 00:e0:29:67:4e:d0
miibus0: <MII bus> on rl0
rlphy0: <RealTek internal media interface> on miibus0
rlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pcib3: <RCC host to PCI bridge> on motherboard
pci3: <PCI bus> on pcib3
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: model GlidePoint, device ID 0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
Waiting 15 seconds for SCSI devices to settle
pass5 at ahc1 bus 0 target 15 lun 0
pass5: <IBM LnRv1.5V S 80 B004> Fixed Processor SCSI-2 device
pass5: 3.300MB/s transfers
da0 at ahc1 bus 0 target 0 lun 0
da0: <IBM-PSG DDYS-T09170M F S94S> Fixed Direct Access SCSI-3 device
da0: 80.000MB/s transfers (40.000MHz, offset 63, 16bit), Tagged Queueing Enabled
da0: 8678MB (17774160 512 byte sectors: 255H 63S/T 1106C)
da2 at ahc1 bus 0 target 3 lun 0
da2: <IBM-PSG ST318404LC !# 3146> Fixed Direct Access SCSI-3 device
da2: 80.000MB/s transfers (40.000MHz, offset 63, 16bit), Tagged Queueing Enabled
da2: 17357MB (35548320 512 byte sectors: 255H 63S/T 2212C)
da1 at ahc1 bus 0 target 1 lun 0
da1: <IBM-PSG ST318404LC !# 3146> Fixed Direct Access SCSI-3 device
da1: 80.000MB/s transfers (40.000MHz, offset 63, 16bit), Tagged Queueing Enabled
da1: 17357MB (35548320 512 byte sectors: 255H 63S/T 2212C)
da4 at ahc1 bus 0 target 5 lun 0
da4: <IBM-PSG ST318404LC !# 3146> Fixed Direct Access SCSI-3 device
da4: 80.000MB/s transfers (40.000MHz, offset 63, 16bit), Tagged Queueing Enabled
da4: 17357MB (35548320 512 byte sectors: 255H 63S/T 2212C)
da3 at ahc1 bus 0 target 4 lun 0
da3: <IBM-PSG ST318404LC !# 3146> Fixed Direct Access SCSI-3 device
da3: 80.000MB/s transfers (40.000MHz, offset 63, 16bit), Tagged Queueing Enabled
da3: 17357MB (35548320 512 byte sectors: 255H 63S/T 2212C)
Mounting root from ufs:/dev/da0s1a
ohci0: <OHCI (generic) USB controller> mem 0xfebfd000-0xfebfdfff irq 15 at device 15.2 on pci0
usb0: OHCI version 1.0, legacy support
usb0: <OHCI (generic) USB controller> on ohci0
usb0: USB revision 1.0
uhub0: (unknown) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 4 ports with 4 removable, self powered
[-- Attachment #4 --]
===============================================================================
MPTable, version 2.0.15
-------------------------------------------------------------------------------
MP Floating Pointer Structure:
location: EBDA
physical address: 0x0009e140
signature: '_MP_'
length: 16 bytes
version: 1.4
checksum: 0x22
mode: Virtual Wire
-------------------------------------------------------------------------------
MP Config Table Header:
physical address: 0x0009e590
signature: 'PCMP'
base table length: 404
version: 1.4
checksum: 0x7d
OEM ID: 'IBM ENSW'
Product ID: 'Mohawk SMP '
OEM table pointer: 0x00000000
OEM table size: 0
entry count: 42
local APIC address: 0xfee00000
extended table length: 168
extended table checksum: 251
-------------------------------------------------------------------------------
MP Config Base Table Entries:
--
Processors: APIC ID Version State Family Model Step Flags
1 0x11 BSP, usable 6 10 0 0x0301
0 0x11 AP, usable 6 10 1 0x0301
--
Bus: Bus ID Type
0 PCI
1 PCI
2 PCI
3 PCI
4 PCI
5 PCI
6 PCI
7 ISA
--
I/O APICs: APIC ID Version State Address
14 0x11 usable 0xfec00000
13 0x11 usable 0xfec01000
--
I/O Ints: Type Polarity Trigger Bus ID IRQ APIC ID PIN#
INT conforms conforms 7 1 14 1
INT conforms conforms 7 0 14 2
INT conforms conforms 7 4 14 4
INT active-lo level 7 5 14 5
INT conforms conforms 7 6 14 6
INT conforms conforms 7 8 14 8
INT conforms conforms 7 12 14 12
INT conforms conforms 7 13 14 13
INT conforms conforms 7 14 14 14
INT conforms conforms 0 1:A 13 1
INT conforms conforms 0 1:A 13 2
INT conforms conforms 0 15:A 13 10
INT conforms conforms 2 5:A 13 8
INT conforms conforms 2 3:A 13 6
INT conforms conforms 2 3:B 14 7
INT conforms conforms 2 3:C 13 6
INT conforms conforms 2 3:D 14 7
INT conforms conforms 2 4:A 13 7
INT conforms conforms 2 4:B 14 10
INT conforms conforms 2 4:C 13 7
INT conforms conforms 2 4:D 14 10
INT conforms conforms 2 5:B 14 3
INT conforms conforms 2 5:C 13 8
INT conforms conforms 2 5:D 14 3
INT conforms conforms 2 6:A 13 9
INT conforms conforms 2 6:B 14 7
INT conforms conforms 2 6:C 13 9
INT conforms conforms 2 6:D 14 7
--
Local Ints: Type Polarity Trigger Bus ID IRQ APIC ID PIN#
NMI conforms conforms 7 0 255 1
ExtINT conforms conforms 7 0 255 0
-------------------------------------------------------------------------------
MP Config Extended Table Entries:
--
System Address Space
bus ID: 0 address type: memory address
address base: 0xa0000
address range: 0x20000
--
System Address Space
bus ID: 0 address type: memory address
address base: 0xc0000
address range: 0x20000
--
System Address Space
bus ID: 0 address type: memory address
address base: 0xf8000000
address range: 0x8000000
--
System Address Space
bus ID: 2 address type: memory address
address base: 0x40000000
address range: 0x5b000000
--
System Address Space
bus ID: 2 address type: prefetch address
address base: 0x9b000000
address range: 0x5b000000
--
System Address Space
bus ID: 2 address type: memory address
address base: 0xf6000000
address range: 0x2000000
--
System Address Space
bus ID: 0 address type: I/O address
address base: 0x0
address range: 0x2200
--
System Address Space
bus ID: 2 address type: I/O address
address base: 0x2200
address range: 0xde00
--
Bus Heirarchy
bus ID: 7 bus info: 0x01 parent bus ID: 0
-------------------------------------------------------------------------------
# SMP kernel config file options:
# Required:
options SMP # Symmetric MultiProcessor Kernel
options APIC_IO # Symmetric (APIC) I/O
# Optional (built-in defaults will work in most cases):
#options NCPU=2 # number of CPUs
#options NBUS=8 # number of busses
#options NAPIC=2 # number of IO APICs
#options NINTR=28 # number of INTs
===============================================================================
home |
help
Want to link to this message? Use this
URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?Pine.BSF.4.21.0010191112010.974-400000>
