From: Ben RUBSON
To: Freebsd fs
Subject: Re: ZFS stalled after some mirror disks were lost
Date: Mon, 2 Oct 2017 22:02:23 +0200
In-Reply-To: <7fb4c99b-f3a0-1dda-691c-35f25769ed5c@multiplay.co.uk>
References: <4A0E9EB8-57EA-4E76-9D7E-3E344B2037D2@gmail.com>
 <71d4416a-3454-df36-adae-34c0b70cd84e@multiplay.co.uk>
 <8A189756-028A-465E-9962-D0181FAEBB79@gmail.com>
 <953DD379-C03A-4737-BAD8-14BB2DB4AB05@gmail.com>
 <4f725113-bac3-64bb-9858-690811e73153@multiplay.co.uk>
 <54AD0000-AF0B-4682-9047-6E6C1B82506C@gmail.com>
 <7fb4c99b-f3a0-1dda-691c-35f25769ed5c@multiplay.co.uk>

> On 02 Oct 2017, at 21:47, Steven Hartland wrote:
>
> On 02/10/2017 20:10, Ben RUBSON wrote:
>>> On 02 Oct 2017, at 20:41, Steven Hartland wrote:
>>>
>>> I'm guessing that the devices haven't disconnected cleanly so are just stalling all requests to them and hence the pool.
>> I even tried to ifconfig down the network interface serving the iscsi targets, but it did not help.
>>
>>> I'm not that familiar with iscsi, does it still show under camcontrol or geom?
>> # geom disk list
>> (...)
>> Geom name: da13
>> Providers:
>> 1. Name: da13
>>    Mediasize: 3999688294912 (3.6T)
>>    Sectorsize: 512
>>    Mode: r1w1e2
>>    wither: (null)
>>
>> Geom name: da15
>> Providers:
>> 1. Name: da15
>>    Mediasize: 3999688294912 (3.6T)
>>    Sectorsize: 512
>>    Mode: r1w1e2
>>    wither: (null)
>>
>> Geom name: da16
>> Providers:
>> 1. Name: da16
>>    Mediasize: 3999688294912 (3.6T)
>>    Sectorsize: 512
>>    Mode: r1w1e2
>>    wither: (null)
>>
>> Geom name: da19
>> Providers:
>> 1. Name: da19
>>    Mediasize: 3999688294912 (3.6T)
>>    Sectorsize: 512
>>    Mode: r1w1e2
>>    wither: (null)
>>
>> # camcontrol devlist
>> // does not show the above disks
> So these daXX devices represent your iscsi devices?

Yes, and only one is still visible under /dev/, with its label under /dev/label/.
So I may have one problematic drive among 4.

> If so, it looks like your problem is at the iscsi layer, as it's not disconnected properly, so as far as ZFS is concerned it's still waiting for them.

Certainly, procstat will tell us more!

I have switched production to another server, so feel free to ask if any other trace is needed.

Thank you again,

Ben
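
The comparison made above (disks still listed by `geom disk list` but missing from `camcontrol devlist`) can be sketched as a small script. This is only an illustration under assumptions: the function names are hypothetical, the input is the raw, unquoted text of the two commands, and it assumes each `camcontrol devlist` line ends with a parenthesised, comma-separated list of peripheral names such as `(pass0,da0)`.

```python
import re


def geom_disk_names(geom_output):
    """Collect the names that follow 'Geom name:' in `geom disk list` output."""
    return set(re.findall(r"Geom name:\s*(\S+)", geom_output))


def cam_device_names(devlist_output):
    """Collect peripheral names from the trailing (...) of each devlist line.

    Assumption: each line ends with something like '(pass0,da0)'.
    """
    names = set()
    for line in devlist_output.splitlines():
        match = re.search(r"\(([^()]*)\)\s*$", line)
        if match:
            for part in match.group(1).split(","):
                part = part.strip()
                if part:
                    names.add(part)
    return names


def stale_disks(geom_output, devlist_output):
    """Return a sorted list of disks GEOM still lists but CAM no longer reports."""
    return sorted(geom_disk_names(geom_output) - cam_device_names(devlist_output))
```

Fed with the output quoted above, where `camcontrol devlist` shows none of the four disks, `stale_disks` would report da13, da15, da16 and da19 as still held by GEOM. That is consistent with devices that have not been torn down cleanly, though it does not by itself say which layer is holding them.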