From owner-freebsd-scsi@freebsd.org  Wed Feb 17 08:15:06 2016
Return-Path: <owner-freebsd-scsi@freebsd.org>
Delivered-To: freebsd-scsi@mailman.ysv.freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org
 [IPv6:2001:1900:2254:206a::19:1])
 by mailman.ysv.freebsd.org (Postfix) with ESMTP id 1C604AAA976
 for <freebsd-scsi@mailman.ysv.freebsd.org>;
 Wed, 17 Feb 2016 08:15:06 +0000 (UTC)
 (envelope-from tinkr@openmailbox.org)
Received: from smtp6.openmailbox.org (smtp6.openmailbox.org [62.4.1.40])
 (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits))
 (Client did not present a certificate)
 by mx1.freebsd.org (Postfix) with ESMTPS id CFB11E73
 for <freebsd-scsi@freebsd.org>; Wed, 17 Feb 2016 08:15:05 +0000 (UTC)
 (envelope-from tinkr@openmailbox.org)
Received: by mail2.openmailbox.org (Postfix, from userid 1004)
 id BCCAF2AC46FF; Wed, 17 Feb 2016 08:38:22 +0100 (CET)
DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=openmailbox.org;
 s=openmailbox; t=1455694702;
 bh=sWHKZE5BOpoRV0FUlKivni4mKu7z1BW9pUiD5JGg5iw=;
 h=Date:From:To:Cc:Subject:In-Reply-To:References:From;
 b=Sezlo60fju+xp/mo2jPu+2WWGl/lpsZCnuKXh6g9eQBsMjyijX2SUZn1Qxg02bX/9
 qO64SV5mgoI3NjkrDnS7ag9kftjxpGaJXOFga0oTv/LoDKjGkEhVcOZRGD3LHwoZiE
 BHpMkQrHV4t6L4pjremfMEV3zbFflkJQkToN0IRQ=
X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on openmailbox-b2
X-Spam-Level: 
X-Spam-Status: No, score=0.6 required=5.0 tests=ALL_TRUSTED,BAYES_50,
 DKIM_ADSP_ALL,URIBL_BLOCKED autolearn=no autolearn_force=no version=3.4.0
Received: from www.openmailbox.org (openmailbox-b1 [10.91.69.218])
 by mail2.openmailbox.org (Postfix) with ESMTP id 8A70F2AC4B23;
 Wed, 17 Feb 2016 08:38:10 +0100 (CET)
MIME-Version: 1.0
Content-Type: text/plain; charset=US-ASCII;
 format=flowed
Content-Transfer-Encoding: 7bit
Date: Wed, 17 Feb 2016 14:38:10 +0700
From: Tinker <tinkr@openmailbox.org>
To: Doug Ambrisko <ambrisko@ambrisko.com>
Cc: freebsd-scsi@freebsd.org
Subject: Re: MRSAS driver/LSI MegaRaid 92XX-93XX admin question: When one of
 the Raid's physical drives break, how is it reported in the
 =?UTF-8?Q?logs=3F?=
In-Reply-To: <20160217000002.GA81916@ambrisko.com>
References: <6a648d421b6d611b4f6f411b66303017@openmailbox.org>
 <55de137d1ed81930cfdbee579d881d62@openmailbox.org>
 <20160217000002.GA81916@ambrisko.com>
Message-ID: <fceaf3867796102969153dea4a4cbbde@openmailbox.org>
X-Sender: tinkr@openmailbox.org
User-Agent: Roundcube Webmail/1.0.6
X-BeenThere: freebsd-scsi@freebsd.org
X-Mailman-Version: 2.1.20
Precedence: list
List-Id: SCSI subsystem <freebsd-scsi.freebsd.org>
List-Unsubscribe: <https://lists.freebsd.org/mailman/options/freebsd-scsi>,
 <mailto:freebsd-scsi-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-scsi/>
List-Post: <mailto:freebsd-scsi@freebsd.org>
List-Help: <mailto:freebsd-scsi-request@freebsd.org?subject=help>
List-Subscribe: <https://lists.freebsd.org/mailman/listinfo/freebsd-scsi>,
 <mailto:freebsd-scsi-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Wed, 17 Feb 2016 08:15:06 -0000

Hi Doug,

Would you mind sharing your kernel patch for that functionality? If I 
understand you correctly, you patched your kernel to route the MRSAS 
events to dmesg.

Thanks,
Tinker

On 2016-02-17 07:00, Doug Ambrisko wrote:
> On Sun, Feb 14, 2016 at 10:13:31PM +0700, Tinker wrote:
> | (From now on I will send any follow-ups only to freebsd-scsi@.)
> |
> | Did some additional research and found that the disk failure is indeed
> | reported in MRSAS' "event log".
> |
> | So my final question is: how do you extract it into userland (in
> | the absence of an "mfiutil" such as the MFI driver has)?
> 
> I have local changes to print the event log in dmesg, which gets
> syslogged.  We then watch syslog for issues so that we can report them
> to our customers automatically.  This is similar to mfi(4).
> 
> Thanks,
> 
> Doug A.
> | Details below. Thanks.
> |
> | On 2016-02-14 19:59, Tinker wrote:
> | [...]
> | >
> | > http://www.cisco.com/c/dam/en/us/td/docs/unified_computing/ucs/3rd-party/lsi/mrsas/userguide/LSI_MR_SAS_SW_UG.pdf
> | > on page 305, that is section "A.2 Event Messages" - I don't know
> | > which LSI chip this document is for, but it does not clearly list a
> | > particular event message for when an individual underlying disk has
> | > broken. I don't even see any event for when a hot spare is taken
> | > into use!
> |
> |
> | Wait - this page:
> |
> | https://www.schirmacher.de/display/Linux/Replace+failed+disk+in+MegaRAID+array
> |
> | (and also
> | http://serverfault.com/questions/485147/drive-is-failing-but-lsi-megaraid-controller-does-not-detect-it
> | )
> |
> | gives an example of how the host system learns about broken disks:
> |
> |
> | Code: 0x00000051 .. Event Description: State change on VD 00/1 from
> | OPTIMAL(3) to DEGRADED(2)
> |
> |
> | Code: 0x00000072 .. Event Description: State change on PD 05(e0xfc/s0)
> | from ONLINE(18) to FAILED(11)
> |
> | (a disk failing uncleanly seems to be shown as:)
> |
> | Code: 0x00000071 .. Event Description: Unexpected sense: PD 05(e0xfc/s0)
> | Path 4433221103000000, CDB: 2e 00 3a 38 1b c7 00 00 01 00, Sense:
> | b/00/00
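
The "Unexpected sense" line quoted above can be picked apart a little further. The following is only a sketch: it assumes the CDB field is a standard 10-byte SCSI CDB and that the "Sense:" field is sense key / ASC / ASCQ in hex, which the thread itself does not confirm. The function names are made up for illustration.

```python
# Sense key names as defined by the SCSI standard (SPC); only a subset.
SENSE_KEYS = {
    0x0: "NO SENSE",
    0x1: "RECOVERED ERROR",
    0x2: "NOT READY",
    0x3: "MEDIUM ERROR",
    0x4: "HARDWARE ERROR",
    0x5: "ILLEGAL REQUEST",
    0x6: "UNIT ATTENTION",
    0xB: "ABORTED COMMAND",
}


def decode_cdb10(cdb_hex):
    """Decode a 10-byte CDB given as space-separated hex bytes.

    Layout assumed: byte 0 opcode, bytes 2-5 LBA (big-endian),
    bytes 7-8 transfer length in blocks (big-endian).
    """
    raw = bytes.fromhex(cdb_hex)  # fromhex() ignores the spaces
    if len(raw) != 10:
        raise ValueError("expected a 10-byte CDB, got %d bytes" % len(raw))
    return {
        "opcode": raw[0],
        "lba": int.from_bytes(raw[2:6], "big"),
        "blocks": int.from_bytes(raw[7:9], "big"),
    }


def decode_sense(triple):
    """Decode a 'key/asc/ascq' hex triple (assumed format)."""
    key, asc, ascq = (int(part, 16) for part in triple.split("/"))
    return {
        "key": key,
        "key_name": SENSE_KEYS.get(key, "unknown"),
        "asc": asc,
        "ascq": ascq,
    }


cdb = decode_cdb10("2e 00 3a 38 1b c7 00 00 01 00")
sense = decode_sense("b/00/00")
print("opcode 0x%02x, LBA %d, %d block(s), sense key %s"
      % (cdb["opcode"], cdb["lba"], cdb["blocks"], sense["key_name"]))
```

Under those assumptions the sample line would describe a one-block command at LBA 0x3a381bc7 that ended with an ABORTED COMMAND sense key.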
> |
> |
> | And this version of the LSI documentation
> |
> |
> | http://hwraid.le-vert.net/raw-attachment/wiki/LSIMegaRAIDSAS/megacli_user_guide.pdf
> |
> | gives a clearer definition of the physical and virtual drive states in
> | "1.4.16 Physical Drive States" and "1.4.17 Virtual Disk States" on
> | pages 1-11 to 1-12.
> |
> | So as we see, a physical drive breaking would
> |
> |   * set the physical drive to "FAILED" (from "ONLINE")
> |
> |   * set the Virtual Drive (that is, the logical exported drive) to
> |     "DEGRADED" (from "OPTIMAL")
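
If the event log does end up in syslog, the two state-change events listed above could be matched roughly as follows. This is a minimal sketch, not anything the mrsas driver provides: the regular expression is modeled only on the two sample lines quoted in this thread, and the real message format may differ between firmware and driver versions. The function names and the ALERT_STATES set are hypothetical.

```python
import re

# Modeled on the sample lines in this thread, e.g.
#   State change on PD 05(e0xfc/s0) from ONLINE(18) to FAILED(11)
#   State change on VD 00/1 from OPTIMAL(3) to DEGRADED(2)
STATE_CHANGE = re.compile(
    r"State change on (?P<kind>PD|VD) (?P<dev>\S+) "
    r"from (?P<old>[A-Z_]+)\((?P<old_code>\d+)\) "
    r"to (?P<new>[A-Z_]+)\((?P<new_code>\d+)\)"
)

# Only the two target states seen in the thread; extend as needed.
ALERT_STATES = {"FAILED", "DEGRADED"}


def parse_state_change(line):
    """Return a dict describing the state change, or None if no match."""
    # Collapse line wraps and repeated spaces before matching.
    normalized = " ".join(line.split())
    match = STATE_CHANGE.search(normalized)
    return match.groupdict() if match else None


def needs_alert(line):
    """True if the line reports a transition into an alert state."""
    event = parse_state_change(line)
    return event is not None and event["new"] in ALERT_STATES


if __name__ == "__main__":
    samples = [
        "Code: 0x00000051 .. Event Description: State change on VD 00/1 "
        "from OPTIMAL(3) to DEGRADED(2)",
        "Code: 0x00000072 .. Event Description: State change on PD "
        "05(e0xfc/s0) from ONLINE(18) to FAILED(11)",
    ]
    for sample in samples:
        print(needs_alert(sample), parse_state_change(sample))
```

A watcher could feed each new syslog line to needs_alert() and send a notification when it returns True.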
> |
> |
> | So it was indeed the card's "event log" that contains this info.
> |
> |
> |
> | The last question, then, is *where* FreeBSD's MRSAS driver sends its
> | event log?