From owner-freebsd-current Wed Jun 9 17:23:59 1999
Delivered-To: freebsd-current@freebsd.org
Received: from argus.tfs.net (as2-p98.tfs.net [139.146.205.98])
    by hub.freebsd.org (Postfix) with ESMTP id A59D514BF5
    for ; Wed, 9 Jun 1999 17:23:52 -0700 (PDT)
    (envelope-from jbryant@argus.tfs.net)
Received: (from jbryant@localhost)
    by argus.tfs.net (8.9.3/8.8.5) id TAA27790;
    Wed, 9 Jun 1999 19:23:36 -0500 (CDT)
From: Jim Bryant
Message-Id: <199906100023.TAA27790@argus.tfs.net>
Subject: Re: 4-way SMP broken ?
In-Reply-To: <199906100001.UAA07014@lor.watermarkgroup.com> from Luoqi Chen at "Jun 9, 99 08:01:04 pm"
To: luoqi@watermarkgroup.com (Luoqi Chen)
Date: Wed, 9 Jun 1999 19:23:34 -0500 (CDT)
Cc: freebsd-current@freebsd.org
Reply-To: jbryant@unix.tfs.net
X-Windows: R00LZ!@# MS-Winbl0wz DR00LZ!@#
X-files: The truth is that the X-Files is fiction
X-Republican: The best kind!!!
X-Operating-System: FreeBSD 4.0-CURRENT #31: Thu Apr 8 10:40:17 CDT 1999
X-Mailer: ELM [version 2.4ME+ PL32 (25)]
MIME-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
Sender: owner-freebsd-current@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.ORG

In reply:
> > Hi,
> >
> > I've been trying to install 19990604-CURRENT on a couple of SC450NX
> > boxes. It works fine with 2 cpu's, but an SMP kernel with 4 cpu's
> > falls over very quickly (I think while it's setting up the APIC
> > stuff, or very shortly after - the messages about APIC bus ids appear
> > on the screen very briefly, then the machine reboots itself).
> >
> Do you mean messages like these?
> FreeBSD/SMP: Multiprocessor motherboard
>  cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfec08000
>  cpu1 (AP): apic id: 12, version: 0x00040011, at 0xfec08000
>  io0 (APIC): apic id: 13, version: 0x00170011, at 0xfec00000
> By the time you see these messages, all cpus should have been booted up
> successfully, any crash immediately follows is not likely to be SMP related.
> It's helpful to pinpoint the crash if you could include the last few lines
> from a verbose boot.

Interesting. Then why the delay in bringing up the AP? Note in the
dmesg output below that the AP only comes up during the SCSI delay.

I have also added other comments to the following output.

-----------------------------------------------------------------------
[Last night's kernel]

Copyright (c) 1992-1999 The FreeBSD Project.
Copyright (c) 1982, 1986, 1989, 1991, 1993
 The Regents of the University of California. All rights reserved.
FreeBSD 4.0-CURRENT #7: Wed Jun 9 16:10:23 CDT 1999
 jbryant@wahoo:/usr/src/sys/compile/WAHOO
Timecounter "i8254" frequency 1192990 Hz
CPU: Pentium II/Xeon/Celeron (686-class CPU)
 Origin = "GenuineIntel" Id = 0x650 Stepping=0
 Features=0x183fbff
real memory = 134217728 (131072K bytes)
avail memory = 126902272 (123928K bytes)
Programming 24 pins in IOAPIC #0
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000
 cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000
Preloaded elf kernel "kernel" at 0xc0393000.
Preloaded userconfig_script "/boot/kernel.conf" at 0xc039309c.
DEVFS: ready for devices
Pentium Pro MTRR support enabled, default memory type is uncacheable
ipl: ERROR: driver has bogus cdevsw->d_maj = -1
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ ??????
ccd0-3: Concatenated disk drivers
Probing for PnP devices:
CSN 1 Vendor ID: YMH0802 [0x0208a865] Serial 0xffffffff Comp ID: PNPb02f [0x2fb0d041]
npx0: on motherboard
npx0: INT 16 interface
pcib0: on motherboard
pci0: on pcib0
chip0: at device 0.0 on pci0
pcib1: at device 1.0 on pci0
pci1: on pcib1
vga-pci0: irq 2 at device 0.0 on pci1
isab0: at device 7.0 on pci0
chip1: at device 7.1 on pci0
uhci0: irq 19 at device 7.2 on pci0
usb0: on uhci0
uhub0 at usb0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
intpm0: at device 7.3 on pci0
intpm0: I/O mapped fcb0
intpm0: intr IRQ 9 enabled revision 0
intsmb0:
smbus0: on intsmb0
intpm0: PM I/O mapped fc00
ed0: irq 17 at device 12.0 on pci0
ed0: address 00:00:e8:4e:0e:16, type NE2000 (16 bit)
ahc0: irq 19 at device 15.0 on pci0
ahc0: Using left over BIOS settings
ahc0: aic7895 Wide Channel A, SCSI Id=7, 255 SCBs
ahc1: irq 16 at device 15.1 on pci0
ahc1: Using left over BIOS settings
ahc1: aic7895 Wide Channel B, SCSI Id=7, 255 SCBs
devclass_alloc_unit: ed0 already exists, using next available unit number
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ ??????
isa0: on motherboard
fdc0: at port 0x3f0-0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> at fdc0 drive 0
ppc0 at port 0x378-0x37f irq 7 on isa0
ppc0: Generic chipset (ECP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/8 bytes threshold
plip0: on ppbus 0
lpt0: on ppbus 0
lpt0: Interrupt-driven port
ppi0: on ppbus 0
lppps0: on ppbus 0
sio0 at port 0x3f8-0x3ff irq 4 on isa0
sio0: type 16550A
sio1: configured irq 3 not in bitmap of probed irqs 0
joy0 at port 0x201 on isa0
joy0: joystick
atkbdc0: at port 0x60-0x6f on isa0
atkbd0: irq 1 on atkbdc0
kbd0 at atkbd0
psm0: irq 12 on atkbdc0
psm0: model Generic PS/2 mouse, device ID 0
vga0: on isa0
sc0: at flags 0x6 on isa0
pca0 at port 0x40 on isa0
pca0: PC speaker audio driver
DEVFS: ready to run
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IO APIC int pin 2
APIC_IO: routing 8254 via 8259 on pin 0
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Tyan Thunder2 S1696DLUA Motherboard, Rogue?
IP packet filtering initialized, divert enabled, rule-based forwarding disabled, default to accept, unlimited logging
DUMMYNET initialized (990504)
ds0 XXX: driver didn't set ifq_maxlen
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ ???????
BRIDGE 981214, have 12 interfaces
-- index 1 ed0 type 6 phy 0 addrl 6 addr 00.00.e8.4e.0e.16
IP Filter: initialized. Default = pass all, Logging = enabled
Waiting 2 seconds for SCSI devices to settle
SMP: AP CPU #1 Launched!
^^^^^^^^^^^^^^^^^^^^^^^^ This is what I was talking about
(probe0:ahc0:0:0:0): data overrun detected in Data-In phase. Tag == 0xe.
(probe0:ahc0:0:0:0): Have seen Data Phase. Length = 36. NumSGs = 1.
sa0 at ahc0 bus 0 target 5 lun 0
sa0: Removable Sequential Access SCSI-2 device
sa0: 10.000MB/s transfers (10.000MHz, offset 8)
da0 at ahc0 bus 0 target 0 lun 0
da0: Fixed Direct Access SCSI-2 device
da0: 20.000MB/s transfers (10.000MHz, offset 8, 16bit), Tagged Queueing Enabled
da0: 4094MB (8386000 512 byte sectors: 64H 32S/T 4094C)
changing root device to da0s1a
cd0 at ahc0 bus 0 target 6 lun 0
cd0: Removable CD-ROM SCSI-2 device
cd0: 10.000MB/s transfers (10.000MHz, offset 15)
cd0: Attempt to query device size failed: NOT READY, Medium not present

jim
--
All opinions expressed are mine, if you    | "I will not be pushed, stamped,
think otherwise, then go jump into turbid  | briefed, debriefed, indexed, or
radioactive waters and yell WAHOO !!!      | numbered!" - #1, "The Prisoner"
------------------------------------------------------------------------------
Inet: jbryant@tfs.net AX.25: kc5vdj@wv0t.#neks.ks.usa.noam grid: EM28pw
voice: KC5VDJ - 6 & 2 Meters AM/FM/SSB, 70cm FM. http://www.tfs.net/~jbryant
------------------------------------------------------------------------------
HF/6M/2M: IC-706-MkII, 2M: HTX-212, 2M: HTX-202, 70cm: HTX-404, Packet: KPC-3+