Date: Sun, 12 Dec 2004 22:05:50 -0800 From: Joe Rhett <jrhett@meer.net> To: Doug White <dwhite@gumbysoft.com> Cc: =?iso-8859-1?Q?S=F8ren?= Schmidt <sos@DeepCore.dk> Subject: Re: drive failure during rebuild causes page fault Message-ID: <20041213060549.GE78120@meer.net> In-Reply-To: <20041212215841.X83257@carver.gumbysoft.com> References: <20041213052628.GB78120@meer.net> <20041213054159.GC78120@meer.net> <20041212215841.X83257@carver.gumbysoft.com>
next in thread | previous in thread | raw e-mail | index | archive | help
> On Sun, 12 Dec 2004, Joe Rhett wrote: > > And another, I can now confirm that it is fairly easy to kill 5.3-release > > during the rebuilding process. The following steps will cause a kernel > > page fault consistently: > > > > atacontrol create RAID0 ad6 ad10 > > atacontrol detach 5 > > log: ad10 deleted from ar0 disk1 > > log: ad10 WARNING - removed from configuration > > atacontrol addspare 0 ad8 > > log: ad8 inserted into ar0 disk1 as spare > > atacontrol rebuild 0 > > atacontrol detach 4 > > log: ad8 deleted from ar0 disk1 > > log: ad8 WARNING - removed from configuration > > > > Fatal trap 12: page fault while in kernel mode > > fault virtual address = 0x10 On Sun, Dec 12, 2004 at 09:59:16PM -0800, Doug White wrote: > Thats a nice shotgun you have there. Yessir. And that's what testing is designed to uncover. The question is why this works, and how do we prevent it? Is there a proper way to handle these sort of events? If so, where is it documented? And fyi just pulling the drives causes the same failure so that means that RAID1 buys you nothing because your system will also crash. -- Joe Rhett Senior Geek Meer.net
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20041213060549.GE78120>