Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 17 Oct 2008 05:08:58 -0700
From:      Jeremy Chadwick <koitsu@FreeBSD.org>
To:        Miroslav Lachman <000.fbsd@quip.cz>
Cc:        Gavin Atkinson <gavin@FreeBSD.org>, freebsd-stable@FreeBSD.org
Subject:   Re: Recommendations for servers running SATA drives [hot-swap]
Message-ID:  <20081017120858.GA20746@icarus.home.lan>
In-Reply-To: <48F87C0E.8060404@quip.cz>
References:  <20080927064417.GA43638@icarus.home.lan> <Pine.OSX.4.64.0809271453550.4630@toasty.nat.fasttrackmonkey.com> <20080927202250.GA60980@icarus.home.lan> <48E0DB7E.20804@quip.cz> <1222699642.24339.12.camel@buffy.york.ac.uk> <48E0F36C.1080400@quip.cz> <20080929153220.GA11459@icarus.home.lan> <48F7964C.4060309@quip.cz> <20081016202322.GA2429@icarus.home.lan> <48F87C0E.8060404@quip.cz>

next in thread | previous in thread | raw e-mail | index | archive | help
On Fri, Oct 17, 2008 at 01:50:38PM +0200, Miroslav Lachman wrote:
> Jeremy Chadwick wrote:
>> On Thu, Oct 16, 2008 at 09:30:20PM +0200, Miroslav Lachman wrote:
>>
>>> Today I was replacing disk in one Sun Fire X2100 M2 so I tried   
>>> hot-swapping. It was as you said: atacontrol detach ata3, replace the 
>>>  HDD, atacontrol attach ata3 and new disk is in the system. I tried 
>>> it 3  times to be sure that it was not coincidence - no panic was 
>>> produced ;o)
>>> So in this case, hot-swapping on Sun Fire X2100 M2 with FreeBSD 7.0 
>>> i386  works.
>>
>>
>> That's excellent news.  So it seems possibly the problem I was seeing
>> was with "reinit" causing some sort of chaos.  I'll have to check things
>> on my testbox here at home to see how I caused the panic last time.
>>
>> Thanks for providing feedback, as usual!  :-)
>
> Unfortunately there is one problem - I see a lot of interrupts after  
> disk swapping (about 193k of atapci1)
>
> Interrupts
> 197k total
>      ohci0 21
>      ehci0 22
> 193k atapci1 23
> 2001 cpu0: time
>    1 bge1 273
> 2001 cpu1: time

Okay, so it looks like the interrupt rate on atapci1 after swapping is
going crazy.  What you're showing there looks like heavily modified
vmstat -i output.

> Full output of systat -vm 2 is attached.
>
> It is shown in top as 50% interrupt (CPU state) and load 1 until I  
> rebooted the machine (I can provide MRTG graphs). The system was not in  
> production load, but almost idle. (I will put it in production tomorrow).
> After reboot, everything is OK.

And this box is running the ATA patch Andrey provided, yes?

> Can somebody test hot-swapping with SATA drives and confirm this  
> behavior? (I can't test it now, because machine is in datacenter)

I can test it on my P4SCE box.

I'll check the interrupt rates after each step of the hot-swap to see
if/when the problem starts.

-- 
| Jeremy Chadwick                                jdc at parodius.com |
| Parodius Networking                       http://www.parodius.com/ |
| UNIX Systems Administrator                  Mountain View, CA, USA |
| Making life hard for others since 1977.              PGP: 4BD6C0CB |




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20081017120858.GA20746>