From owner-freebsd-current@FreeBSD.ORG Tue Apr 28 18:11:57 2009 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 536E51065670 for ; Tue, 28 Apr 2009 18:11:57 +0000 (UTC) (envelope-from artemb@gmail.com) Received: from mail-gx0-f176.google.com (mail-gx0-f176.google.com [209.85.217.176]) by mx1.freebsd.org (Postfix) with ESMTP id 028178FC24 for ; Tue, 28 Apr 2009 18:11:56 +0000 (UTC) (envelope-from artemb@gmail.com) Received: by gxk24 with SMTP id 24so366275gxk.19 for ; Tue, 28 Apr 2009 11:11:56 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:sender:received:in-reply-to :references:date:x-google-sender-auth:message-id:subject:from:to:cc :content-type:content-transfer-encoding; bh=DUwRPCc1/IWm23b2FwmfSXIxc0ss6OaG57mdBJeQdJ4=; b=FZBwSgXq2t6FNXmBYQ9HjF65Sy7bm3x5O2+DmtsYQFrD/tvAv48KJCPaLV5GnDQNIk A9qLyybAd62yfqqumksfeEESIugjEl1rUhKVTfHkqPob+qxWHtDaFPDmXX0aIXfx1+XC bjA3nMi3zao0eUUGW+fZfkr4pkXTVMyPTxnJY= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type :content-transfer-encoding; b=xziNP5AA6v/8V43Bx1nhiQwsesOnZshj0cgjpO8VEEdSgPbEfYtV6JMBuZFBtjfVT6 PwQvlIuuduhEj5bJ0a7fALJHNriWS03r4bWLvhpBB/pWwimVB02opo7BRkVZf19Bqa+N B9XRiwO2GHdjEgwPZ4TZM54qxBcrMPqB3tduY= MIME-Version: 1.0 Sender: artemb@gmail.com Received: by 10.90.75.13 with SMTP id x13mr8758659aga.102.1240942316247; Tue, 28 Apr 2009 11:11:56 -0700 (PDT) In-Reply-To: References: <08D7DC2A-68BE-47B6-8D5D-5DE6B48F87E5@wanderview.com> Date: Tue, 28 Apr 2009 11:11:56 -0700 X-Google-Sender-Auth: 9fdac5b7a2ee3a64 Message-ID: From: Artem Belevich To: Ben Kelly Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Cc: freebsd-current@freebsd.org, Ryan Stone Subject: Re: [patch] zfs livelock and thread priorities X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 28 Apr 2009 18:11:57 -0000 Ben, My system had eventually deadlocked overnight, though it took much longer than before to reach that point. In the end I've got many many processes sleeping in zio_wait with no disk activity whatsoever. I'm not sure if that's the same issue or not. Here are stack traces for all processes -- http://pastebin.com/f364e1452 I've got the core saved, so if you want me to dig out some more info, let me know if/how I could help. --Artem On Mon, Apr 27, 2009 at 6:39 PM, Artem Belevich wrote: >> I've updated the patch to use va_copy(). =A0Unfortunately I don't have a= n >> AMD64 machine to test on at the moment. =A0However, I did look at other = places >> in the kernel where va_copy() is used so I am hopeful I got it right. > > The latest diff does not crash on amd64. That's the good news. > > Even better news is that the hang that I had with enabled ZIL under > heavy load now seems to be gone. > The build that was usually triggering the hang after about an hour has > been going for about three hours now. > > Thanks a lot for the diff. It does help to bring ZFS on FreeBSD closer > to production quality. > > --Artem >