From owner-freebsd-current@FreeBSD.ORG Sat Jun 25 11:27:30 2011 Return-Path: Delivered-To: current@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 0CECC106566B; Sat, 25 Jun 2011 11:27:30 +0000 (UTC) (envelope-from gibbs@FreeBSD.org) Received: from aslan.scsiguy.com (ns1.scsiguy.com [70.89.174.89]) by mx1.freebsd.org (Postfix) with ESMTP id BF7168FC08; Sat, 25 Jun 2011 11:27:29 +0000 (UTC) Received: from Justins-MacBook-Pro.local (c-71-235-8-131.hsd1.nh.comcast.net [71.235.8.131]) (authenticated bits=0) by aslan.scsiguy.com (8.14.4/8.14.4) with ESMTP id p5PBSnxv091070 (version=TLSv1/SSLv3 cipher=DHE-RSA-CAMELLIA256-SHA bits=256 verify=NO); Sat, 25 Jun 2011 05:28:50 -0600 (MDT) (envelope-from gibbs@FreeBSD.org) Message-ID: <4E05C618.8070703@FreeBSD.org> Date: Sat, 25 Jun 2011 07:27:20 -0400 From: "Justin T. Gibbs" Organization: The FreeBSD Project User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.6; en-US; rv:1.9.2.18) Gecko/20110616 Thunderbird/3.1.11 MIME-Version: 1.0 To: Andrey Chernov , Scott Long , Kostik Belousov , Eir Nym , "Kenneth D. Merry" , current@FreeBSD.org, will@FreeBSD.org References: <20110621204934.GB9877@vniz.net> <20110622035404.GA38834@nargothrond.kdm.org> <20110622041325.GA13754@vniz.net> <20110622200919.GA72504@nargothrond.kdm.org> <4E03FDFD.70203@FreeBSD.org> <55FDA4B1-CA5E-4304-9239-3AAF0FC6FF5F@samsco.org> <4E04F188.9030105@FreeBSD.org> <20110624222645.GA75222@vniz.net> <4E053534.4080205@FreeBSD.org> <20110625043916.GA78847@vniz.net> In-Reply-To: <20110625043916.GA78847@vniz.net> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.2.6 (aslan.scsiguy.com [70.89.174.89]); Sat, 25 Jun 2011 05:28:52 -0600 (MDT) Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: Subject: Re: Exactly that commit (was Re: Latest -current 100% hang at the late boot stage) X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: gibbs@FreeBSD.org List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 25 Jun 2011 11:27:30 -0000 On 6/25/11 12:39 AM, Andrey Chernov wrote: > On Fri, Jun 24, 2011 at 09:09:08PM -0400, Justin T. Gibbs wrote: > >> No problem. I just set kern.geom.debugflags=4 in loader.conf and here is > >> new photo (with recent kernel, no patches): > >> http://img803.imageshack.us/img803/4679/25062011006.jpg > >> I skip all noisy parts related to ada0 and ada1 partitions probes. > >> As you can see, only 3 cd0-related geom call issued, right before cd1 > >> probe shown. Strange thing is that I see no single cd1-related geom > >> call, but it may be because of hang. > > > > The GEOM processing is serialized, so that is not unexpected. What your > > logs are telling me is that the probe for CD0 is hanging. I don't know > > why. > > Could you just postpone GEOM calls after any probe will be completed? It > seems GEOM goes here even before probe and waits for probe forever. What > probe waits in the same time is unclear for me (ccb_scan), but CD devices > are slow and may not survive such multisleeping, missing some responses in > the middle. The problem is not GEOM. It's not the thread waiting in ccb_scan - that thread is designed to wait there until an asynchronous device arrival/departure event occurs which is not the case here. The problem is in or below CAM, and that problem is causing the probe to never complete. > > Are you positive it is this specific SVN revision that prevents cd0 > > from probing properly and not one of my previous CAM commits? > I use splitting by half method to find exact date which boots, then see > the next commit above that date. Pre-commit kernel goes to multiuser and > network is alive. I don't test CDs are working, I'll do that later and > report it. So you know that revisions 223081, 223084, 223085, and 223089 all boot just fine? I committed five revisions on that date. 223099 just happens to be the last one for that day. -- Justin