From owner-freebsd-hardware@FreeBSD.ORG Sat Jul 23 22:29:22 2011 Return-Path: Delivered-To: freebsd-hardware@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id E743A106566C; Sat, 23 Jul 2011 22:29:22 +0000 (UTC) (envelope-from mavbsd@gmail.com) Received: from mail-fx0-f44.google.com (mail-fx0-f44.google.com [209.85.161.44]) by mx1.freebsd.org (Postfix) with ESMTP id 4196D8FC14; Sat, 23 Jul 2011 22:29:22 +0000 (UTC) Received: by fxe6 with SMTP id 6so5336171fxe.17 for ; Sat, 23 Jul 2011 15:29:21 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=sender:message-id:date:from:user-agent:mime-version:to:cc:subject :references:in-reply-to:x-enigmail-version:content-type :content-transfer-encoding; bh=j8UWQkZWBNzIOPG++EGZbAlHrVOkZ48j2N8YxW8VfSY=; b=xVnfbRmNvpaABhU91yGKFHEwuGvDWr7Gc9AJmfCJZdEvfMxS95b6oAue88ZitpounP 6npFJWkHXgGeBYU+r6g0rOrw9IkmSje/SbJLGa0CgGIisnNM2uV5iAQp5tagiXvC4nVq YXwjfVZIzdiHturB7W6rC2ZabW4dTvxwMZ9Uo= Received: by 10.223.68.22 with SMTP id t22mr4258090fai.145.1311460161235; Sat, 23 Jul 2011 15:29:21 -0700 (PDT) Received: from mavbook2.mavhome.dp.ua (pc.mavhome.dp.ua [212.86.226.226]) by mx.google.com with ESMTPS id f7sm2128263faa.8.2011.07.23.15.29.19 (version=SSLv3 cipher=OTHER); Sat, 23 Jul 2011 15:29:20 -0700 (PDT) Sender: Alexander Motin Message-ID: <4E2B4B38.70207@FreeBSD.org> Date: Sun, 24 Jul 2011 01:29:12 +0300 From: Alexander Motin User-Agent: Thunderbird 2.0.0.23 (X11/20091212) MIME-Version: 1.0 To: lev@FreeBSD.org References: <1981757790.20110720013856@serebryakov.spb.ru> <4E29A3D6.1080609@FreeBSD.org> <2710115660.20110723004620@serebryakov.spb.ru> In-Reply-To: <2710115660.20110723004620@serebryakov.spb.ru> X-Enigmail-Version: 0.96.0 Content-Type: text/plain; charset=windows-1251 Content-Transfer-Encoding: 8bit Cc: freebsd-hardware@freebsd.org Subject: Re: ahci.ko / geom_mirror / zfs hangs up system when one of HDDs fauilts. X-BeenThere: freebsd-hardware@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: General discussion of FreeBSD hardware List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 23 Jul 2011 22:29:23 -0000 Lev Serebryakov wrote: > Hello, Alexander. > You wrote 22 èþëÿ 2011 ã., 20:22:46: > >>> Screenshot of LARA console in such case is attached. >> Kernel messages look like if controller or device stuck, unable to >> complete some command and can't recover from that condition even after >> device hard reset. I don't see what driver can do about it, except being >> more aggressive in dropping faulty device after several consecutive >> timeouts. If that is not a wanted way out, start from updating card BIOS >> and devices firmware. > It is very common hardware: ICH10 on MS-7522 (MSI X58 Platinum) motherboard: > > ahci0: port 0xb000-0xb007,0xac00-0xac03,0xa880-0xa887,0xa800-0xa803,0xa480-0xa49f mem 0xf9ffa000-0xf9ffa7ff irq 19 at device 31.2 on pci0 > ahci0: [ITHREAD] > ahci0: AHCI v1.20 with 6 3Gbps ports, Port Multiplier not supported > > And Samsung F3 drive: > > ada1 at ahcich1 bus 0 scbus1 target 0 lun 0 > ada1: ATA-8 SATA 2.x device > ada1: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes) > ada1: Command Queueing enabled > ada1: 715404MB (1465149168 512 byte sectors: 16H 63S/T 16383C) > > I'm not sure, that it is possible to update firmware on these > drives. And MoBo BIOS looks like latest one. Then I have no idea what to do about the cause of errors. What's about consequences, I've tried to simulate alike problem (device detected, but doesn't respond). Recovery (dropping failed device) took a lot of time, but finally (after about 10 minutes) it succeeded and ZFS continued operation without that drive. After that I've just committed one patch to the HEAD and sent another one to freebsd-scsi@ for review. That, I hope, should significantly (down to 1-2 minutes) speedup that process. How long have you waited before and after making that screenshot? -- Alexander Motin