From: John Baldwin
To: freebsd-current@freebsd.org
Cc: kib@freebsd.org, Luigi Rizzo, current@freebsd.org, Andrew Thompson
Date: Thu, 19 Mar 2009 11:04:48 -0400
Subject: Re: Recent versions of pxeboot hang/panic on AMD platform.

On Friday 21 November 2008 6:41:04 pm Luigi Rizzo wrote:
> On Fri, Nov 21, 2008 at 03:23:25PM -0800, Andrew Thompson wrote:
> > On Sat, Nov 22, 2008 at 12:14:00AM +0100, Luigi Rizzo wrote:
> > > [copying some people involved with recent related commits]
> > >
> > > As reported in kern/118222 recent versions of pxeboot hang/panic
> > > on AMD platform.
> > >
> > > Initial reports mentioned that the RELENG_6 versions worked well,
> > > however i found out that even the recent RELENG_6 code is problematic.
> > >
> > > Specifically, the problem i see on two machines with AMD CPU (one
> > > is an Asus M2N-VM) motherboard netbooting with PXEboot, is that the
> > > loading of config files or binary modules (kernel, etc.) randomly
> > > hangs with recent version of pxeboot (RELENG_6, RELENG_7 and HEAD
> > > all give the same behaviour).
> >
> > I have found that the kernel size can trigger this for me, after
> > reducing the size I didnt experience loader hangs with pxe. You may want
> > to experiment with this in your investigations.
>
> no luck - the hang often occurs as early as while reading loader.conf,
> which is way smaller than the kernel.
>
> Assuming that the interrupt thing is a possible cause for the bug,
> I can understand that the kernel size can affect the probability of
> getting an interrupt at the wrong time.
>
> There was one bug due to memory overflow that i fixed in a recent
> commit to head (see boot/common/interp.c),
> but this only helped on a different amd64 motherboard.

Is this now resolved with the latest pxeboot from HEAD?

-- 
John Baldwin