From owner-freebsd-current@FreeBSD.ORG Sat Nov 10 20:30:29 2007 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 302DB16A420; Sat, 10 Nov 2007 20:30:29 +0000 (UTC) (envelope-from hlh@restart.be) Received: from tignes.restart.be (unknown [IPv6:2001:41d0:1:2ad2::1]) by mx1.freebsd.org (Postfix) with ESMTP id 9FA2F13C4A7; Sat, 10 Nov 2007 20:30:28 +0000 (UTC) (envelope-from hlh@restart.be) Received: from restart.be (avoriaz.tunnel.bel [IPv6:2001:41d0:1:2ad2::fffe:0]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (Client CN "avoriaz.restart.be", Issuer "CA master" (verified OK)) by tignes.restart.be (Postfix) with ESMTP id 8D9001BAC24; Sat, 10 Nov 2007 21:30:27 +0100 (CET) Received: from morzine.restart.bel (morzine6.restart.bel [IPv6:2001:41d0:1:2ad2::1:2]) (authenticated bits=0) by restart.be (8.14.1/8.14.1) with ESMTP id lAAKUOai005957; Sat, 10 Nov 2007 21:30:24 +0100 (CET) (envelope-from hlh@restart.be) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=restart.be; s=avoriaz; t=1194726627; bh=tsDqkhtEz886sEAnvR37FRGXXMC7L10AQUucR/Q 9m7Y=; h=DomainKey-Signature:Message-ID:Date:From:Organization: User-Agent:MIME-Version:To:CC:Subject:References:In-Reply-To: Content-Type:Content-Transfer-Encoding:X-Scanned-By; b=a9syHIFEyOy ywhKa+MRUGBD/mv51Urwqn8+O73RQRD8xQLnBHnS3lpol8o4OrA72dLT2N4CSFTrqod KLuiC0dg== DomainKey-Signature: a=rsa-sha1; s=avoriaz; d=restart.be; c=nofws; q=dns; h=message-id:date:from:organization:user-agent:mime-version:to:cc: subject:references:in-reply-to:content-type: content-transfer-encoding:x-scanned-by; b=k6I+QyPJZGwn8yCmhVxeaOmiwwT0TYYLspoNqc773X4tHyu/zFPQHFSxCESzC0Ras XLcRurMkFSIJDo1CqAoyQ== Message-ID: <473614E0.8010401@restart.be> Date: Sat, 10 Nov 2007 21:30:24 +0100 From: Henri Hennebert Organization: RestartSoft User-Agent: Thunderbird 2.0.0.6 (X11/20071101) MIME-Version: 1.0 To: Pawel Jakub Dawidek References: <47348CAC.5020208@restart.be> <20071110085312.GG37371@garage.freebsd.pl> <4735986F.5000307@restart.be> <20071110124806.GI37371@garage.freebsd.pl> In-Reply-To: <20071110124806.GI37371@garage.freebsd.pl> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Scanned-By: MIMEDefang 2.63 on IPv6:2001:41d0:1:2ad2::1:1 Cc: freebsd-current@freebsd.org, freebsd-stable@freebsd.org Subject: Re: ZFS deadlock ? X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 10 Nov 2007 20:30:29 -0000 Pawel Jakub Dawidek wrote: > On Sat, Nov 10, 2007 at 12:39:27PM +0100, Henri Hennebert wrote: >> Pawel Jakub Dawidek wrote: >>> On Fri, Nov 09, 2007 at 05:37:00PM +0100, Henri Hennebert wrote: >>>> hello >>>> >>>> To push zfs, I launch 2 scrub at the same time, after ~20 seconds the >>>> system freeze: >>> [...] >>> >>> I found a deadlock too. If it's reproducable for you, can you try this >>> patch: >> I reproduce it after 30 minutes, si I try you patch. >> >>> http://people.freebsd.org/~pjd/patches/zgd_done.patch >> when I try to load zfs.ko I get: >> >> # kldload zfs >> link_elf: symbol kproc_create undefined >> kldload: can't load zfs: No such file or directory >> >> What must I add to my config to resolve this symbol / problem > > Ouch, you don't use HEAD. Try changing kproc_*() to kthread_*(). > It load correctly now... Moreover, no deadlock after multiple scrub in // and some buildworld to make sure... look fine for my config :-) Just to give credit to zfs, scrub encounter 2 IO errors without impact on my data :) /var/log/messages: Nov 10 15:33:00 morzine kernel: ad6: TIMEOUT - READ_DMA48 retrying (1 retry left) LBA=299882429 Nov 10 15:33:06 morzine kernel: ad6: TIMEOUT - READ_DMA48 retrying (0 retries left) LBA=299882429 Nov 10 15:33:12 morzine kernel: ad6: FAILURE - READ_DMA48 timed out LBA=299882429 Nov 10 16:55:53 morzine kernel: ad6: WARNING - SETFEATURES SET TRANSFER MODE taskqueue timeout - completing request directly Nov 10 16:55:53 morzine kernel: ad6: TIMEOUT - READ_DMA48 retrying (1 retry left) LBA=299883325 Nov 10 16:56:06 morzine kernel: ad6: TIMEOUT - READ_DMA48 retrying (0 retries left) LBA=299883325 Nov 10 16:56:13 morzine kernel: ad6: FAILURE - READ_DMA48 timed out LBA=299883325 ZFS is realy great! I will run more test tomorrow... and keep you posted Thanks Henri