From: Pascal Jürgens
Date: Tue, 2 Oct 2007 12:13:07 +0200
To: freebsd-current@freebsd.org
Subject: Re: Accessing bad hard drive causes panic
Message-Id: <8C71B0BA-25D1-4B27-B7C9-788277B28B2F@googlemail.com>
In-Reply-To: <20071002075824.C5CEF16A4FA@hub.freebsd.org>

Dan,

I had similar problems on a system with a cursed VIA chipset (K400).
When I tried accessing a 5-disk RAIDZ from a PCI controller, it would
give me dozens of errors and then fail on large files. So the problem
you have might not necessarily be related to the disk.

Before sending a healthy HD back (and maybe losing your data), please
check:

- memory (memtest / Ultimate Boot CD)
- cables
- BIOS updates
- IDE controllers (esp. PCI-based additional controllers; my mainboard
  tolerates none of them)
- did you check the console when the system was down?
  It might not be drive errors crashing your machine, but too little
  memory (less than 1G) causing your kernel to panic under ZFS's
  memory demands (kmem_malloc: kmem_map too small, discussion here:
  http://kerneltrap.org/mailarchive/freebsd-current/2007/9/21/271557).

On my machine with 512M RAM, the system also hangs under heavy load
after some time, despite tuning along the lines of
http://wiki.freebsd.org/ZFSTuningGuide, and I cannot under any
circumstances get it to scrub without dying.

Hope this helps for further investigation,
Pascal Juergens

On 02.10.2007, at 09:58, freebsd-current-request@freebsd.org wrote:

> A few months ago I installed 7.0-CURRENT in order to migrate to zfs.
> At the time when I was copying files from a geo concat volume to the
> zfs pool the server would freeze (unresponsive to pings). I figured
> this was the nature of CURRENT and moved on. Yesterday I recompiled
> the kernel from the latest source and the issue persists.
>
> The issue is that one of the old drives is experiencing a hardware
> failure. Whenever it is accessed (from geo concat or added to the
> zfs pool and scrubbed) the server freezes, requiring a power cycle.
>
> I know that current isn't for the average user, which I am. But I
> figured I would report this and am willing to help diagnose the
> issue.
>
> The drive passes SMART selftests and returns healthy status, but has
> reported over 500 errors. I am going to send the drive in for
> replacement soon.
>
> Let me know what I can do to help.
>
> -Dan
>
> --
> Dan Borello
> dborello@uiuc.edu
> Structural Engineering Graduate Student
> University of Illinois - Urbana Champaign
> P: 847-877-6287
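
The triage order suggested in the reply above (look at the console first, then consider whether the box simply has too little RAM for ZFS, and only then blame the drive) can be sketched roughly as follows. This is an illustration only: the function name `triage`, the returned labels, and the sample console line are hypothetical. The only details taken from the mail are the "kmem_map too small" message and the 1G threshold.

```python
import re

# 1 GiB: the reply above treats less than 1G of RAM as risky for ZFS.
ONE_GIB = 1024 ** 3

# The panic text quoted in the reply; the exact surrounding wording of a
# real console message is an assumption here.
KMEM_PANIC = re.compile(r"kmem_malloc.*kmem_map too small")


def triage(console_text: str, physmem_bytes: int) -> str:
    """Return a rough hint about what to investigate first.

    console_text  -- whatever was visible on the console when the box died
    physmem_bytes -- installed physical memory in bytes
    """
    # A kmem_map panic on the console points at kernel memory, not the disk.
    if KMEM_PANIC.search(console_text):
        return "kmem"
    # No panic text captured, but very little RAM: memory is still suspect.
    if physmem_bytes < ONE_GIB:
        return "low-memory"
    # Otherwise fall back to the hardware checklist (RAM test, cables,
    # BIOS, controllers, and finally the drive itself).
    return "hardware"
```

For example, `triage("panic: kmem_malloc(131072): kmem_map too small", 512 * 1024 ** 2)` returns `"kmem"`, which would suggest looking at the ZFS tuning guide before sending the drive back.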