From owner-freebsd-fs@freebsd.org Fri Oct 27 17:20:06 2017 Return-Path: Delivered-To: freebsd-fs@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id A1327E4CD89 for ; Fri, 27 Oct 2017 17:20:06 +0000 (UTC) (envelope-from ben.rubson@gmail.com) Received: from mail-wm0-x22a.google.com (mail-wm0-x22a.google.com [IPv6:2a00:1450:400c:c09::22a]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (Client CN "smtp.gmail.com", Issuer "Google Internet Authority G2" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 2FC0C70DD3 for ; Fri, 27 Oct 2017 17:20:06 +0000 (UTC) (envelope-from ben.rubson@gmail.com) Received: by mail-wm0-x22a.google.com with SMTP id r68so5134511wmr.3 for ; Fri, 27 Oct 2017 10:20:06 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to; bh=OZUtfqBVQ+LpG6iK6NDAB26y9WKXhxLxL1e8dbRVmC4=; b=GlfqDsDgMqFnDNrcpnU7VTZvSJlzMufUovo/s48RerUE2HUlXnHmKnZlq7AGdVj3zV 7Vx1ap+OGdVzRfF8n5sJEicw9lXUm5sTCg+FqXlyOMyszF/zNaqzIBrM6G1Wc1j7vk6V NUuQt4et3TrBY3lhNUx/atDBnNOFZZmbqN69WVzoOHYm9zKevoJ9NXCBpDBP2mO32GvY QUZJ8BprTJq+bHa0XxFHhlNFU3GSbLoZtEuAOaUK+EUuHBxTsIUzQuE2tmCB6qLFKtKO IFy9LmjJMGmBOL7/74rFgshfGjCWnYAX2Y+D0+gp8Z9Ag4scH4XATTsRKoTObV9+WnVU k9fw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to; bh=OZUtfqBVQ+LpG6iK6NDAB26y9WKXhxLxL1e8dbRVmC4=; b=OlAK+LvvvuHswJkXx1W6DINsVkHBaU3ArT/zDWnVh6275HN2BsAbV1KSOseLyr6FiE 457hS4I6JeT1iGIX+M6caOBbc2K/KTf6htZ6cKzKlgPam61l/lgyV3iV+yCg+OgPS89O qV324ya5TbbPPrMtO4bk1xnOJxWnminbnoeI/6VG+G/N/62zBMGNE7LJE+9BBzzbQ8Vr NLJDUztj2GdxcxNfZxJX9NC1gzWp9NOlq1E4Dq1gz5Wu6g97Wo5iwHinVwA+/zW9vQuJ LjyyJ/BR6JU9JhXeoHIZfmrj7YQZf4S9VbuAREbRSacrFxTM4Jc/Qpiuet9AdV8srHAE qT3A== X-Gm-Message-State: AMCzsaURQmHZfVmpjl5/2L1CmCbxayPtGs/a4OWq2IOe+DqIEapbKxpe +mylYRYTnLOo0MPLbALP7JMnX60o X-Google-Smtp-Source: ABhQp+T0aWyKaTzDLA50HeO1NFmVZHUcQQgSmC/rwvLybsQS/tXJ9yGBoYpsu1x44j1J5ObH2sa7tg== X-Received: by 10.28.154.137 with SMTP id c131mr1090507wme.142.1509124804333; Fri, 27 Oct 2017 10:20:04 -0700 (PDT) Received: from bens-mac.home (LFbn-MAR-1-416-163.w2-15.abo.wanadoo.fr. [2.15.241.163]) by smtp.gmail.com with ESMTPSA id w75sm1062680wmw.17.2017.10.27.10.20.03 for (version=TLS1 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Fri, 27 Oct 2017 10:20:03 -0700 (PDT) Content-Type: text/plain; charset=us-ascii; delsp=yes; format=flowed Mime-Version: 1.0 (Mac OS X Mail 9.3 \(3124\)) Subject: Re: ZFS stalled after some mirror disks were lost From: Ben RUBSON In-Reply-To: <13AF09F5-3ED9-4C81-A5E2-94BA770E991B@gmail.com> Date: Fri, 27 Oct 2017 19:20:01 +0200 Content-Transfer-Encoding: 7bit Message-Id: <84A3920F-9143-40E9-A91D-13E7B7FB733E@gmail.com> References: <4A0E9EB8-57EA-4E76-9D7E-3E344B2037D2@gmail.com> <82632887-E9D4-42D0-AC05-3764ABAC6B86@gmail.com> <20171007150848.7d50cad4@fabiankeil.de> <6d1c80df-7e9f-c891-31ae-74dad3f67985@internetx.com> <13AF09F5-3ED9-4C81-A5E2-94BA770E991B@gmail.com> To: Freebsd fs X-Mailer: Apple Mail (2.3124) X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 27 Oct 2017 17:20:06 -0000 On 13 Oct 2017 18:58, Ben RUBSON wrote: > The issue only happens when I disconnect iSCSI drives, it does not occurs > suddenly by itself. > So I would say the issue is on FreeBSD side, not network hardware :) > > 2 distinct behaviours/issues : > - 1 : when I disconnect iSCSI drives from the server running the pool > (iscsictl -Ra), some iSCSI drives remain on the system, leaving ZFS > stalled ; > - 2 : when I disconnect iSCSI drives from the target (shut NIC down / > shutdown ctld), server running the pool sometimes panics (traces in my > previous mail, 06/10). > > (...) > > Andriy, who took many debug traces from my system, managed to reproduce > the first issue locally, using a 3-way ZFS mirror with one local disk > plus two iSCSI disks. > Sounds like there is a deadlock issue on iSCSI initiator side. So, Andriy proposed a patch which solves this first issue : https://reviews.freebsd.org/D12652 > Regarding the second issue, I'm not able to reproduce it if I don't use > geom-labels. > There may then be an issue on geom-label side (which could then also > affect fully-local ZFS pools using geom-labels). and another one for the second issue : https://reviews.freebsd.org/D12809 Many thanks to the list, to Andriy for his nice & impressive work, Alexander & Edward for their reviews. Ben