From owner-freebsd-proliant@FreeBSD.ORG  Tue Oct 31 16:55:26 2006
Return-Path: <owner-freebsd-proliant@FreeBSD.ORG>
X-Original-To: freebsd-proliant@freebsd.org
Delivered-To: freebsd-proliant@freebsd.org
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
	by hub.freebsd.org (Postfix) with ESMTP id 1A0DF16A4FB
	for <freebsd-proliant@freebsd.org>;
	Tue, 31 Oct 2006 16:55:26 +0000 (UTC)
	(envelope-from mikhollo@cisco.com)
Received: from sj-iport-6.cisco.com (sj-iport-6.cisco.com [171.71.176.117])
	by mx1.FreeBSD.org (Postfix) with ESMTP id 1C39343D8B
	for <freebsd-proliant@freebsd.org>;
	Tue, 31 Oct 2006 16:55:11 +0000 (GMT)
	(envelope-from mikhollo@cisco.com)
Received: from sj-dkim-2.cisco.com ([171.71.179.186])
	by sj-iport-6.cisco.com with ESMTP; 31 Oct 2006 08:55:11 -0800
Received: from sj-core-2.cisco.com (sj-core-2.cisco.com [171.71.177.254])
	by sj-dkim-2.cisco.com (8.12.11.20060308/8.12.11) with ESMTP id
	k9VGtBr7007606; Tue, 31 Oct 2006 08:55:11 -0800
Received: from [64.101.130.181] ([64.101.130.181])
	by sj-core-2.cisco.com (8.12.10/8.12.6) with ESMTP id k9VGtBin004364;
	Tue, 31 Oct 2006 08:55:11 -0800 (PST)
In-Reply-To: <8D7B26B6-A16E-4ECD-812A-27E4E9DBF70B@warprecords.com>
References: <8D7B26B6-A16E-4ECD-812A-27E4E9DBF70B@warprecords.com>
Mime-Version: 1.0 (Apple Message framework v752.2)
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed
Message-Id: <675649FB-EDF5-43AD-9BE5-333BC33D0361@cisco.com>
Content-Transfer-Encoding: 7bit
From: Mike Holloway <mikhollo@cisco.com>
Date: Tue, 31 Oct 2006 10:53:12 -0600
To: freebsd-proliant@freebsd.org
X-Mailer: Apple Mail (2.752.2)
DKIM-Signature: a=rsa-sha1; q=dns; l=3338; t=1162313711; x=1163177711;
	c=relaxed/simple; s=sjdkim2002;
	h=Content-Type:From:Subject:Content-Transfer-Encoding:MIME-Version;
	d=cisco.com; i=mikhollo@cisco.com;
	z=From:Mike=20Holloway=20<mikhollo@cisco.com>
	|Subject:Re=3A=20RAID=20monitoring=20tools;
	X=v=3Dcisco.com=3B=20h=3D+pgVBaOaSKnCytkW0o7PqW/IAx0=3D;
	b=PF4dxWGYkhvDoGldSFRGIb5ZrEGbrNFhsC3ZJlsd/D9Bk3C4XosRMYMiuEalVF8E5xIfR2ER
	zsHLLV1iEWU1/A72/V9pF9RKNAoY7ww6CW6MhWlbnFkGgpO5uhMTDrhw;
Authentication-Results: sj-dkim-2.cisco.com; header.From=mikhollo@cisco.com;
	dkim=pass ( sig from cisco.com verified; ); 
Cc: Greg Eden <greg@warprecords.com>
Subject: Re: RAID monitoring tools
X-BeenThere: freebsd-proliant@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: "Technical discussion of FreeBSD on HP ProLiant server platforms."
	<freebsd-proliant.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-proliant>, 
	<mailto:freebsd-proliant-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-proliant>
List-Post: <mailto:freebsd-proliant@freebsd.org>
List-Help: <mailto:freebsd-proliant-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-proliant>, 
	<mailto:freebsd-proliant-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Tue, 31 Oct 2006 16:55:26 -0000


Greg, you might find this thread useful:

http://lists.freebsd.org/pipermail/freebsd-proliant/2006-February/ 
000058.html

It contains a perl script to run hpasmcli and exit if it doesn't  
return in a reasonable amount of time.  John also mentioned disabling  
ASR on the server in the post below, which seemed to fix my reboot  
problem.  Unfortunately, even still, the hpasmcli utility on FreeBSD  
is unstable for me in it's current form.  You may have different  
results.

http://lists.freebsd.org/pipermail/freebsd-proliant/2006-February/ 
000059.html


-mike


On Oct 31, 2006, at 8:17 AM, Greg Eden wrote:

>
> Hi there
>
> Thanks for this very useful post to the proliant list. i'm just in  
> the process of getting to grips with SNMP and nagios with my  
> proliant hardware running freebsd - this will help no end.
>
> another task i'm just starting on is using output from SHOW TEMP,  
> SHOW FANS from the hpasmd/hpasmcli tools to graph hardware info  
> with nrg. if i get anywhere with it i'll post it to the list.
>
> cheers
>
> greg.
>
>
>
>> Whoops, meant to copy the list...
>>
>>
>> Appreciate the pointer to camcontrol, I previously had just been
>> using swatch to watch syslog and send messages to nagios via nsca.
>> My problem was that I never knew the initial state of the disks until
>> an event happened in syslog.
>>
>>
>> For reference, here's what I saw from camcontrol during my tests
>> (FreeBSD 6.0 rel):
>>
>> During normal operation of the raid:
>> # camcontrol inquiry da0 -D
>> pass0: <COMPAQ RAID 1  VOLUME OK> Fixed Direct Access SCSI-0 device
>>
>> After removing one of the raid member disks:
>> # camcontrol inquiry da0 -D
>> pass0: <COMPAQ RAID 1  VOLUME inte> Fixed Direct Access SCSI-0 device
>>
>> After re-inserting the raid member disk:
>> # camcontrol inquiry da0 -D
>> pass0: <COMPAQ RAID 1  VOLUME reco> Fixed Direct Access SCSI-0 device
>>
>> And about 45 minutes later:
>> # camcontrol inquiry da0 -D
>> pass0: <COMPAQ RAID 1  VOLUME OK> Fixed Direct Access SCSI-0 device
>>
>>
>>
>> And here's the configuration I use for swatch to feed nsca in  
>> realtime:
>>
>> watchfor   /ciss0.*removed/
>>          exec "/usr/local/bin/nsca_report 2 \"Disk Array\" Hot-plug
>> drive removed"
>>
>> watchfor   /ciss0.*failure/
>>          exec "/usr/local/bin/nsca_report 2 \"Disk Array\" Physical
>> drive failure"
>>
>> watchfor   /ciss0.*inserted/
>>          exec "/usr/local/bin/nsca_report 1 \"Disk Array\" Hot-plug
>> drive inserted"
>>
>> watchfor   /ciss0.*recovery->recovering/
>>          exec "/usr/local/bin/nsca_report 1 \"Disk Array\" Drive is
>> rebuilding..."
>>
>> watchfor   /ciss0.*recovering->OK/
>>          exec "/usr/local/bin/nsca_report 0 \"Disk Array\" Drive has
>> successfully rebuilt."
>>
>>
>>
>> For completeness, here's the nsca_report script that I use to send
>> the alarms to nagios, substitute your own thishost and -H:
>>
>> #!/bin/bash
>>
>> outcode=$1
>> thisservice=$2
>>
>> thishost=`echo $HOSTNAME | sed -e "s/\./ /g" | cut -f 1 -d ' '`
>>
>> shift
>> shift
>>
>> echo -e "${thishost}\t${thisservice}\t${outcode}\t$*\n" | /usr/local/
>> bin/send_nsca -H www -c /usr/local/etc/send_nsca.cfg 2>&1 >> /dev/ 
>> null
>>
>>
>>
>> -mike