From owner-freebsd-stable@freebsd.org Fri Aug 25 01:38:42 2017 Return-Path: Delivered-To: freebsd-stable@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 4C76BDEC8E9 for ; Fri, 25 Aug 2017 01:38:42 +0000 (UTC) (envelope-from FreeBSD@shaneware.biz) Received: from ipmail02.adl2.internode.on.net (ipmail02.adl2.internode.on.net [150.101.137.139]) by mx1.freebsd.org (Postfix) with ESMTP id C6BA36DEF1 for ; Fri, 25 Aug 2017 01:38:41 +0000 (UTC) (envelope-from FreeBSD@shaneware.biz) Received: from unknown (HELO leader.local) ([118.211.113.221]) by ipmail02.adl2.internode.on.net with ESMTP; 25 Aug 2017 11:03:23 +0930 Subject: Re: file system deadlock in RELENG_11 To: Mike Tancsa , FreeBSD-STABLE Mailing List References: <66b97b27-cbea-a3a8-374d-3f7c017b5418@sentex.net> <28c89f80-4797-7e95-a637-472ac7bc98a5@sentex.net> <4ec50e48-58a5-1f49-3acf-d3b21535c36e@sentex.net> From: Shane Ambler Message-ID: Date: Fri, 25 Aug 2017 11:03:21 +0930 User-Agent: Mozilla/5.0 (X11; FreeBSD amd64; rv:52.0) Gecko/20100101 Thunderbird/52.2.1 MIME-Version: 1.0 In-Reply-To: <4ec50e48-58a5-1f49-3acf-d3b21535c36e@sentex.net> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-AU Content-Transfer-Encoding: 7bit X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 25 Aug 2017 01:38:42 -0000 On 25/08/2017 05:40, Mike Tancsa wrote: > On 8/24/2017 4:01 PM, Mike Tancsa wrote: >> OK, this is fairly easy to repeat. If I start a sync of a snapshot via >> zrep, it hangs the box. CTRL+T shows >> >> >> DEBUG: overiding stale lock on zroot/chyves from pid 19378 >> sending zroot/chyves@zrep_000010 to 10.151.9.2:zroot/chyves >> cannot receive new filesystem stream: destination >> 'zroot/chyves/guests/resi/disk1' exists >> must specify -F to overwrite it Are you sending a snapshot from the host to the guest? or is the guest sending to the host? >> load: 0.48 cmd: zfs 29690 [tx->tx_sync_done_cv] 358.94r 0.00u 0.00s 0% >> 3476k >> load: 0.48 cmd: zfs 29690 [tx->tx_sync_done_cv] 360.42r 0.00u 0.00s 0% > > I was able to get ps to run and not sure if its helpful or not, but > these are the two unkillable zfs processes > > > root 5 0.0 0.0 0 128 - DL 10:09 0:05.58 > [zfskern] > root 29683 0.0 0.0 7752 3824 5 DE+ 15:53 0:00.32 zfs > send -R -I zroot/chyves@zrep_00000d zroot/chyves@zrep_000010 > root 29690 0.0 0.0 7752 3492 5 D+ 15:53 0:00.00 zfs > rename zroot/chyves@zrep_000010 zroot/chyves@zrep_000010_unsent > Should be unrelated but several years ago there was a deadlock issue when renaming a zvol, unless the fix got undone recently this may just be a distraction or a hint of a solution... https://svnweb.freebsd.org/base?view=revision&revision=272474 -- FreeBSD - the place to B...Storing Data Shane Ambler