From owner-freebsd-smp Sat Oct 5 2:29:20 2002
Date: Sat, 05 Oct 2002 02:28:04 -0700
From: Terry Lambert <tlambert2@mindspring.com>
To: Nate Lawson
Cc: David Francheski, freebsd-arch@FreeBSD.ORG, freebsd-smp@FreeBSD.org
Subject: Re: Running independent kernel instances on dual-Xeon/E7500 system
Message-ID: <3D9EB0A4.4CD09E20@mindspring.com>

Nate Lawson wrote:
> On Fri, 4 Oct 2002, David Francheski wrote:
> > I have a dual-Xeon processor (with E7500 chipset) motherboard.
> > Can anybody tell me what the development effort would be to
> > boot and run two independent copies of the FreeBSD kernel,
> > one on each Xeon processor?  By this I mean that an SMP
> > enabled kernel would not be utilized, each kernel would be UP.
> >
> > Regards,
> > David L. Francheski
>
> Not possible without another BIOS, PCI bus, and separate memory --
> i.e. another PC.

IPL'ing is not the same as "running".

So long as you craft the memory image of the second OS, its page
tables, etc., using the first processor, there should be no problem
running a second copy of an OS on an AP as a result of a START IPI
from the BP, once that image has been built.

Thus there is no need for a separate BIOS.

For running, there are two types of devices one cares about: devices
which can be duplicated, and therefore assigned as separate resources,
and devices which cannot.

For PCI devices, this comes down to an interrupt routing issue.  There
are four PCI interrupt lines: A, B, C, and D.  So long as no device
allocated to one processor shares an interrupt line with a device
allocated to the other, there is no problem.

Thus you do not need a separate PCI bus.

Note: for devices which cannot be duplicated, but which are required,
there are two approaches: the device may be virtualized, and access to
it then contended between the processors, or the device may be
virtualized in one instance and accessed by proxy from the other
processor (e.g. via IPI triggers for IPC).  VMware operates this way
for a number of its own devices, which cannot be physical devices,
since they must be shared with the host OS rather than assigned
directly to the VMware "machine" or to the host OS (both are available
options for many devices).

The memory can be separated logically, rather than physically.  In
fact, one could use either PAE exclusively in 4K page mode, or PSE-36
exclusively in 4M page mode, without significant changes to the VM
system, to permit motherboards that can handle it to support up to 4G
of physical RAM per CPU, for up to 16 CPUs (the practical limit, given
motherboard availability, is 4).

Thus there is no need for physically separate memory.

The 4K mode would require an additional layer of indirection (Peter
Wemm may actually have completed some or all of the code necessary for
PAE use already), and the 4M (PSE-36) mode would require hacking the
system to be able to use 4M pages rather than 4K (mostly, this affects
the paging paths themselves; you would likely get 2M pages (for PAE
large pages, which are 2M instead of 4M in size) out of this for free,
if you went to a "power of two multiple of 4K" size parameter for
paging operations).
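To make the PSE-36 variant concrete, here is a rough, untested sketch
of how the BP might build a 4M-page page directory that hands each CPU
its own 4G slice of physical RAM.  The PDE encoding (physical address
bits 35:32 in PDE bits 16:13, PS bit set) follows the PSE-36
definition; the helper names (pse36_encode, build_identity_pd) and the
simple "one 4G slice per CPU" layout are mine, for illustration only,
not existing FreeBSD code.  CR4.PSE must already be set, and the AP's
startup trampoline would load the directory into CR3.

#include <stdint.h>

#define PG_P	0x001u			/* present */
#define PG_RW	0x002u			/* writable */
#define PG_PS	0x080u			/* 4M page (requires CR4.PSE) */

typedef uint32_t pse36_pde_t;

/* Encode a 4M-aligned physical address (up to 64G) into a PSE-36 PDE:
 * address bits 31:22 go in PDE bits 31:22, bits 35:32 in PDE bits
 * 16:13. */
static pse36_pde_t
pse36_encode(uint64_t pa)
{
	uint32_t lo = (uint32_t)(pa & 0xFFC00000u);
	uint32_t hi = (uint32_t)((pa >> 32) & 0xFu) << 13;

	return (lo | hi | PG_PS | PG_RW | PG_P);
}

/* Fill a 1024-entry page directory so virtual 0-4G maps the 4G
 * physical slice belonging to 'cpu' (cpu 0 -> 0-4G, cpu 1 -> 4-8G,
 * and so on, up to 16 CPUs in 64G). */
void
build_identity_pd(pse36_pde_t pd[1024], unsigned cpu)
{
	uint64_t base = (uint64_t)cpu << 32;
	unsigned i;

	for (i = 0; i < 1024; i++)
		pd[i] = pse36_encode(base + ((uint64_t)i << 22));
}

Each kernel instance then sees a private 4G physical window, which is
all the "logical separation" above really means.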
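Similarly, the START IPI hand-off at the top of this message is just
the usual INIT/STARTUP sequence driven from the BP's local APIC.  The
sketch below is illustrative only: it assumes the local APIC is
reachable at its default physical base, delay() stands in for some
microsecond busy-wait rather than a real kernel symbol, and the INIT
level de-assert step needed by old 82489DX APICs is omitted.

#include <stdint.h>

#define LAPIC_BASE	0xFEE00000u	/* default local APIC base, assumed mapped 1:1 */
#define LAPIC_ICR_LO	(LAPIC_BASE + 0x300)
#define LAPIC_ICR_HI	(LAPIC_BASE + 0x310)

#define ICR_INIT	0x00000500u	/* delivery mode: INIT */
#define ICR_STARTUP	0x00000600u	/* delivery mode: STARTUP */
#define ICR_ASSERT	0x00004000u	/* level: assert */

extern void delay(int usec);		/* assumed busy-wait, microseconds */

static void
lapic_write(uint32_t reg, uint32_t val)
{
	*(volatile uint32_t *)(uintptr_t)reg = val;
}

/*
 * Kick the AP with local APIC id 'apic_id' into the real-mode
 * trampoline at physical address 'entry' (4K-aligned, below 1M);
 * the trampoline then switches to protected mode and jumps into the
 * second kernel image the BP built.
 */
void
start_ap(uint8_t apic_id, uint32_t entry)
{
	uint8_t vector = (uint8_t)(entry >> 12);	/* SIPI vector = page number */

	lapic_write(LAPIC_ICR_HI, (uint32_t)apic_id << 24);
	lapic_write(LAPIC_ICR_LO, ICR_INIT | ICR_ASSERT);
	delay(10000);					/* 10 ms after INIT */

	lapic_write(LAPIC_ICR_HI, (uint32_t)apic_id << 24);
	lapic_write(LAPIC_ICR_LO, ICR_STARTUP | vector);
	delay(200);

	lapic_write(LAPIC_ICR_HI, (uint32_t)apic_id << 24);
	lapic_write(LAPIC_ICR_LO, ICR_STARTUP | vector);
	delay(200);
}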
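And the interrupt routing constraint above boils down to a
disjointness check on the IRQ lines of the devices handed to each
kernel instance; something like the following (the helper name and the
flat IRQ arrays are invented, just to pin the rule down):

#include <stdbool.h>
#include <stddef.h>

/*
 * Return true if no IRQ line used by instance A's devices is also
 * used by one of instance B's devices; if this fails, the split of
 * devices between the two kernels has to be rearranged.
 */
bool
irqs_disjoint(const int *a, size_t na, const int *b, size_t nb)
{
	size_t i, j;

	for (i = 0; i < na; i++)
		for (j = 0; j < nb; j++)
			if (a[i] == b[j])
				return (false);
	return (true);
}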
--

I've personally considered pursuing the ability to run code
separately, though within the same 4G address space, partitioned so as
to permit running a debugger against a "crashed" FreeBSD "system"
running on an AP, doing the debugging from the BP, as a hosted system.
The cost in labor would be 2-3 months of continuous work, I think...
that is the estimate I arrived at when I considered the project
previously.  Doing this certainly beats the cost of buying an ICE to
get similar capability.

It would be interesting to see what other people have to say on this,
other than "can't be done" (not to pick on you in particular, here;
this is the knee-jerk reaction many people have to things like this).

-- Terry