Date: Mon, 10 Apr 2000 18:22:27 -0500 From: "Duncan, Eric A." <eduncan@idealmusic.com> To: <freebsd-questions@freebsd.org> Subject: Serious SMP Problems on Dell PE 2450. Message-ID: <NDBBIMCDELJLMJDJPOFIOENMCDAA.eduncan@idealmusic.com>
index | next in thread | raw e-mail
[-- Attachment #1 --]
Basically, I enable SMP support and the system hangs on the PCI probing on
boot up. I originally wanted to install 3.4. Here's the configuration:
Dell Power Edge 2450
Dual 733mhz 133mhz bus P-IIIs
512megs
Adaptec 7899 U2W SCSI (2940 ahc driver)
^- Important info below.
Here's the story:
FreeBSD 3.4
----------------
Upon installing 3.4 from the floppies (ftp install), the kernel never found
the SCSI controller (ahc0). It never even probed it during the PCI scan. I
even tried a 3.3-STABLE kernel without success. I built a custom kernel on
another box building in the ahc0 driver. I even tried wiring down devices,
calling scsibus, setting them manually, setting with '?'s. I tried it all
and built a new kernel every time. Even making sure the boot loader wasn't
di'ing it.
The ahc controller was never probed upon boot. It never even showed 'ahc0:
not found'. Nothing. Very odd!
FreeBSD 4.0
----------------
Immediately FBSD4 found the ahc0 controller and loaded the correct driver.
It found my tape backup devices, ata cdrom, and 2 da 18gig SCSI drives
without any problems.
I continued to install the kernel, ports, packages, etc. The system is
running great with all devices configured!
I tried to build a custom kernel. To disable drivers and clean up the
kernel, firewall, smp, etc. Everything was going great until the SMP part.
As I mentioned above, the kernel halts, with no errors, it simply locks the
machine up while booting. I tried even using the GENERIC kernel, which
enables a lot and works just fine. Until I simply enable SMP support.
And example SMP setup I tried is (which seems to be what the system asks
for):
options SMP
options APIC_IO
options NCPU=2
options NBUS=4
options NAPIC=2
options NINTR=28
Yes, I tried disabling all but the first two lines. The kernel would halt
immediately telling me I has more then 1 apic and I had to rebuild my kernel
(it numbered NAPIC=2 on the error). It also reported back errors that I had
4 buses and to increase my NBUS=4 and that I had 28 intr and to increase
NINTR to 28.
Now, I haven't tried OVER SIZING any of the above. I had to ship the
servers off today. But I will be back working on them in a few days and
would love any advice.
Another odd story. Just for argument sake, I compiled a 3.3-STABLE kernel
with SMP support with the params above, and the floppies loaded it correctly
enabling both CPU0 and CPU1 on boot. Just no SCSI card/Harddrives were
found because of the 3.x problem of not detecting my SCSI board.
So, I now have a 3.x kernel with SMP but no HD. And a 4.0 kernel with HDs,
but no SMP support.
Yes, I tried changing all sorts of settings in the BIOS and Utility config.
Changing mem IOs, IRQs, disabling the second bus, etc etc etc. I have been
up since 11am Sunday morning working on this. I just NOW boxes them up. I
haven't been to sleep yet.
HELP! A $1,500 second CPU laying waste in this extremely fast system isn't
too good at all.
Thanks in advance,
Eric Duncan
eduncan@idealmusic.com
[-- Attachment #2 --]
x>" IPM.Microsoft Mail.Note 1
# # &