From owner-freebsd-current@FreeBSD.ORG Sat Jun 25 01:09:18 2011 Return-Path: Delivered-To: current@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 59391106566C; Sat, 25 Jun 2011 01:09:18 +0000 (UTC) (envelope-from gibbs@FreeBSD.org) Received: from aslan.scsiguy.com (aslan.scsiguy.com [70.89.174.89]) by mx1.freebsd.org (Postfix) with ESMTP id 19D9D8FC12; Sat, 25 Jun 2011 01:09:17 +0000 (UTC) Received: from Justins-MacBook-Pro.local (c-71-235-8-131.hsd1.nh.comcast.net [71.235.8.131]) (authenticated bits=0) by aslan.scsiguy.com (8.14.4/8.14.4) with ESMTP id p5P1Aa8K088790 (version=TLSv1/SSLv3 cipher=DHE-RSA-CAMELLIA256-SHA bits=256 verify=NO); Fri, 24 Jun 2011 19:10:37 -0600 (MDT) (envelope-from gibbs@FreeBSD.org) Message-ID: <4E053534.4080205@FreeBSD.org> Date: Fri, 24 Jun 2011 21:09:08 -0400 From: "Justin T. Gibbs" Organization: The FreeBSD Project User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.6; en-US; rv:1.9.2.18) Gecko/20110616 Thunderbird/3.1.11 MIME-Version: 1.0 To: Andrey Chernov , Scott Long , Kostik Belousov , Eir Nym , "Kenneth D. Merry" , current@FreeBSD.org, will@FreeBSD.org References: <20110620114656.GA83524@vniz.net> <20110621161719.GA16166@nargothrond.kdm.org> <20110621204934.GB9877@vniz.net> <20110622035404.GA38834@nargothrond.kdm.org> <20110622041325.GA13754@vniz.net> <20110622200919.GA72504@nargothrond.kdm.org> <4E03FDFD.70203@FreeBSD.org> <55FDA4B1-CA5E-4304-9239-3AAF0FC6FF5F@samsco.org> <4E04F188.9030105@FreeBSD.org> <20110624222645.GA75222@vniz.net> In-Reply-To: <20110624222645.GA75222@vniz.net> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.2.6 (aslan.scsiguy.com [70.89.174.89]); Fri, 24 Jun 2011 19:10:40 -0600 (MDT) Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: Subject: Re: Exactly that commit (was Re: Latest -current 100% hang at the late boot stage) X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: gibbs@FreeBSD.org List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 25 Jun 2011 01:09:18 -0000 On 6/24/11 6:26 PM, Andrey Chernov wrote: > On Fri, Jun 24, 2011 at 04:20:24PM -0400, Justin T. Gibbs wrote: > > Instead, I believe that either one of the GEOM taste methods is leaking an > > access reference (so cdclose() is not called), or the CD driver is failing > > to release the hold semaphore during probing. Setting kern.geom.debugflags > > to '4' will trace the access calls and allow the GEOM side to be ruled out. > > If GEOM is exonerated, we can add tracing to cam_perihp_(un)hold to track > > this down further. > > No problem. I just set kern.geom.debugflags=4 in loader.conf and here is > new photo (with recent kernel, no patches): > http://img803.imageshack.us/img803/4679/25062011006.jpg > I skip all noisy parts related to ada0 and ada1 partitions probes. > As you can see, only 3 cd0-related geom call issued, right before cd1 > probe shown. Strange thing is that I see no single cd1-related geom > call, but it may be because of hang. The GEOM processing is serialized, so that is not unexpected. What your logs are telling me is that the probe for CD0 is hanging. I don't know why. Are you positive it is this specific SVN revision that prevents cd0 from probing properly and not one of my previous CAM commits? Just getting to multi-user doesn't mean we're ok here. My GEOM changes may make the system hang earlier, but you'll need to test access to cd0 even if you get to multi-user mode to be sure that the device is functioning correctly. I just want to be positive that we're barking up the right tree. -- Justin