From: Andriy Gapon
Date: Tue, 02 Oct 2012 16:51:25 +0300
To: geoffroy desvernay
Cc: Alexander Motin , freebsd-stable@FreeBSD.org
Subject: Re: ahcich reset -> cannot mount zfs root in 9.1-PRE
Message-ID: <506AF15D.1010707@FreeBSD.org>
In-Reply-To: <506AE944.3020806@centrale-marseille.fr>

on 02/10/2012 16:16 geoffroy desvernay said the following:
> Hi all,
>
> Trying to upgrade a system from 9.0-RELEASE to 9.1-PRE from yesterday on
> my machine (GEOM+ZFS mirror setup on ada[01]p3), the new kernel becomes
> unable to mount root... The only way to recover is to boot from 9.0 kernel.
> The disks were already named ada[01] in 9.0, so I suspect nothing there...
>
> I tried
> - disabling AHCI in bios (no change seen)
> - change cables, check PSU, test disks with smartctl
>
> Here are some bits (via serial console):
> ahci0: port
> 0xc000-0xc007,0xb000-0xb003,0xa000-0xa007,0x9000-0x9003,0x8000-0x800f
> mem 0xfe9ff800-0xfe9ffbff irq 22 at device 18.0 on pci0
> ahci0: AHCI v1.10 with 4 3Gbps ports, Port Multiplier supported
> ahci0: Caps: 64bit NCQ SNTF MPS AL CLO 3Gbps PM PMD SSC PSC 32cmd CCC 4ports
> ahcich0: at channel 0 on ahci0
> ahcich0: Caps: HPCP
> ahcich1: at channel 1 on ahci0
> ahcich1: Caps: HPCP
> ahcich2: at channel 2 on ahci0
> ahcich2: Caps: HPCP
> ahcich3: at channel 3 on ahci0
> ahcich3: Caps: HPCP
> ahcich0: AHCI reset...
> ahcich0: SATA connect time=100us status=00000123
> ahcich0: AHCI reset: device found
> ahcich0: AHCI reset: device ready after 0ms
>
> The difference with 9.0 is after that: here is 9.0's next lines: (same
> for ahcich1)
> (aprobe0:ahcich0:0:15:0): Command timed out
> (aprobe0:ahcich0:0:15:0): Error 5, Retries exhausted
> (aprobe0:ahcich0:0:0:0): SIGNATURE: 0000
>
> And 9.1-PRE's:
> (aprobe0:ahcich0:0:15:0): NOP. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
> (aprobe0:ahcich0:0:15:0): CAM status: Command timeout
> (aprobe0:ahcich0:0:15:0): Error 5, Retries exhausted
>
> In both cases ada[01] are detected and available, but with 9.1-PRE I see:
> GEOM_RAID: Promise: Disk ada0 state changed from NONE to SPARE.
> GEOM_RAID: Promise: Disk ada1 state changed from NONE to SPARE.
>
> (I see the same when I # kldload geom_raid # from running 9.0, doesn't
> breaks anything...)
>
> I attach the full boot log with 9.1-PRE (bios with NO-raid nor AHCI
> enabled, but this changes nothing in the output)
>
> I could test patches or try any command required to debug this… But for
> the moment I don't know where to search (and kernel code is far away
> from my current skills in debugging…)

You probably need to clear RAID metadata on the disks, as I think that
disabling geom_raid is not possible in 9.1-PRE.
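To confirm which disks geom_raid has picked up before touching any metadata, the GEOM_RAID lines in a saved boot log can be scanned mechanically. The following is only a rough sketch, not part of any FreeBSD tool: the function name raid_claimed_disks and the SAMPLE text are made up for illustration, and the pattern assumes the message wording is exactly as in the two lines quoted above.

```python
import re

# Matches console lines of the form quoted in this thread, e.g.
#   GEOM_RAID: Promise: Disk ada0 state changed from NONE to SPARE.
# The exact wording is an assumption based only on those two lines.
GEOM_RAID_DISK = re.compile(
    r"GEOM_RAID: (?P<fmt>\S+): Disk (?P<disk>\S+) "
    r"state changed from (?P<old>\w+) to (?P<new>\w+)\."
)


def raid_claimed_disks(log_text):
    """Return {disk: (metadata_format, latest_state)} for disks geom_raid reported."""
    claimed = {}
    for line in log_text.splitlines():
        m = GEOM_RAID_DISK.search(line)
        if m:
            # A later line for the same disk overwrites the earlier state.
            claimed[m.group("disk")] = (m.group("fmt"), m.group("new"))
    return claimed


# Hypothetical sample: the two lines from the quoted 9.1-PRE boot log.
SAMPLE = """\
GEOM_RAID: Promise: Disk ada0 state changed from NONE to SPARE.
GEOM_RAID: Promise: Disk ada1 state changed from NONE to SPARE.
"""

if __name__ == "__main__":
    for disk, (fmt, state) in sorted(raid_claimed_disks(SAMPLE).items()):
        print(f"{disk}: {fmt} metadata, state {state}")
```

Any disk listed here carries metadata that geom_raid recognises, which is what would need clearing.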
I think that Alexander can help you more here.

-- 
Andriy Gapon