From: "R. B. Riddick"
To: CyberLeo Kitsana
Cc: FreeBSD Geom
Date: Sat, 13 Jan 2007 17:26:20 -0800 (PST)
Subject: Re: geom_raid5 livelock?
Message-ID: <20070114012620.95885.qmail@web30305.mail.mud.yahoo.com>
In-Reply-To: <45A9836B.5030802@cyberleo.net>
List-Id: GEOM-specific discussions and implementations

Thank you for the tests... :-)

--- CyberLeo Kitsana wrote:
> I haven't forgotten.
> I was just trying to figure out how to build and
> use the new module without having to reboot the box. Guess there's no
> safe way.
>
Yup... I always remove all disks when I am testing (then I can unload the module)... But then you would have to do a rebuild, which is not so good... I do not know why GEOM tastes the disks again after the device has been stopped... I haven't found the trick yet...

> http://home.cyberleo.net/cyberleo/workspace/Zip/graid5-all3.log
> That's the latest log, with the debug lines inserted.
>
OK - So we have a false parity at offset 264438038528 (= 16140017 x 16KB).
Now we could
1. (hard way) Look at the data (16KB) at that offset on all disks and see where exactly the parity mismatches ("dd if=/dev/adXs2 bs=16k iseek=16140017 of=/tmp/adXs2.test..."; then we could XOR them all together and see where we have non-zero bits)
or
2. (easy way) Do a re-sync (graid5 configure -R raid5) and look at kern.geom.raid5.veri_w (the number of bad parity blocks).

The re-sync seems to be necessary, because it is very likely that your graid5 device has bad parity blocks... It is difficult to say how that happened (could be an unintentional "graid5 configure -R" or a bug in graid5 or...)...

If you can reproduce it (maybe with a smaller device?), I would be glad if you could let me know how you did it...

-Arne
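
P.S. For the hard way (option 1), the XOR step could be sketched roughly like this in Python. This is only an illustration, not part of graid5: the function names are made up, and it assumes each dump file holds exactly one 16KB block read from the same offset on every member disk (data and parity alike). With correct parity, the XOR of all blocks in a stripe is all zero bytes, so any non-zero byte marks a mismatch.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR equally sized byte strings together, byte by byte."""
    if not blocks:
        raise ValueError("need at least one block")
    size = len(blocks[0])
    if any(len(b) != size for b in blocks):
        raise ValueError("all blocks must have the same length")
    # zip(*blocks) yields one tuple per byte position across all disks.
    return bytes(reduce(lambda x, y: x ^ y, column) for column in zip(*blocks))


def mismatch_offsets(blocks):
    """Return the byte offsets (within the block) where the XOR is non-zero."""
    return [i for i, byte in enumerate(xor_blocks(blocks)) if byte != 0]


def report(paths):
    """Read one dump file per disk (hypothetical paths) and print mismatches."""
    blocks = []
    for path in paths:
        with open(path, "rb") as f:
            blocks.append(f.read())
    bad = mismatch_offsets(blocks)
    if bad:
        print("parity mismatch at %d byte(s), first at offset %d" % (len(bad), bad[0]))
    else:
        print("parity is consistent for this block")
    return bad
```

You would call report() with the list of files written by the dd commands, one per disk (the /tmp/adXs2.test names, with X filled in for your disks).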