Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 7 Nov 2011 03:56:14 -0500
From:      Rich <rercola@acm.jhu.edu>
To:        =?ISO-8859-1?Q?Karli_Sj=F6berg?= <Karli.Sjoberg@slu.se>
Cc:        "freebsd-scsi@freebsd.org" <freebsd-scsi@freebsd.org>, "Kenneth D. Merry" <ken@freebsd.org>, "fs@freebsd.org" <fs@freebsd.org>
Subject:   Re: AOC-USAS2-L8i zfs panics and SCSI errors in messages
Message-ID:  <CAOeNLuqFuA-Ewfj0xyNmfGdbznsoRAYb6GNgGDzN8PtPck0yUw@mail.gmail.com>
In-Reply-To: <75BDE9FA-6130-4BB4-8518-275D68BB3E49@slu.se>
References:  <82B38DBF-DD3A-46CD-93F6-02CDB6506E05@slu.se> <20111025193302.GA30409@nargothrond.kdm.org> <B4D81944-39F5-4053-ACBA-78EBB7DD70EB@slu.se> <20111026101602.GA9768@icarus.home.lan> <75BDE9FA-6130-4BB4-8518-275D68BB3E49@slu.se>

next in thread | previous in thread | raw e-mail | index | archive | help
Observation - the LSI SAS expanders, in my experience, sometimes
misbehave when there are drives which respond slower than some timeout
to commands (as far as I've seen it's only SATA drives it does this
for, but I don't have many SAS drives for comparison), leading to all
further commands to that drive for a bit not working, and then what
happens depending on the OS varies dramatically.

If you could try without an expander (e.g. with 1->4 SAS->SATA fanout
cables), you may be surprised (and/or annoyed) to find your life gets
better.

- Rich

On Mon, Nov 7, 2011 at 3:48 AM, Karli Sj=F6berg <Karli.Sjoberg@slu.se> wrot=
e:
> As a test, I have copied in about 1.5TB and scrubbed several times withou=
t any panic. It stayed solid until periodic weekly:( Same panic as with dai=
ly.
>
> /Karli Sj=F6berg
>
> 26 okt 2011 kl. 12.16 skrev Jeremy Chadwick:
>
> On Wed, Oct 26, 2011 at 11:36:44AM +0200, Karli Sj?berg wrote:
> Hi all,
>
> I tracked down what causes the panics!
>
> I got a tip from aragon and phoenix at the forum about
> /etc/periodic/security/100.chksetuid
>
> And to put:
> daily_status_security_chksetuid_enable=3D"NO"
> into /etc/periodic.conf
>
> This is not truly the cause of the panic, it simply exacerbates it.
>
> Many of the periodic scripts will do things like iterate over all files
> on the filesystem looking for specific attributes, etc.. =A0This tends to
> stress filesystems heavily. =A0This isn't the only one. =A0:-)
>
> I can now run periodic daily without any panics. I?m still wondering
> about the cause of this, the explanation from the forum was that that
> phase is too demanding for multi TB systems. But I have several multi
> TB servers with FreeBSD and ZFS, and none of them has ever behaved
> this way. Besides, the panic is instantaneous, not degenerative. I
> imagine that a run like that would start out OK and then just get
> worse and worse, getting gradually slower and slower until it just
> wouldn?t cope any more and hang. This feels more like hitting a wall.
> As if it found something that is couldn?t deal with and has no choice
> but to panic immediately.
>
> It may be possible that you have some underlying filesystem corruption
> that triggers this situation. =A0Have you actually tried doing a "zpool
> scrub" of your pools and seeing if any errors happen or if the panic
> occurs there?
>
> I'm inclined to think what you're experiencing is probably a bug or
> "quirk" in the storage controller driver you're using. =A0There are other
> drivers that have had fixes applied to them "to make them work decently
> with ZFS", meaning the kind of stressful I/O ZFS puts on them results in
> the controller driver behaving oddly or freaking out, case in point. =A0I=
t
> could also be a controller firmware bug/quirk/design issue. =A0Seriously.
>
> I believe the AOC-USAS2-L8i controller has been discussed on
> freebsd-stable, re: mps(4) driver problems or equivalent, but I'm not
> going to CC that list given that there would be 3 cross-posted lists
> involved and that is liable to upset some folks. =A0You should search the
> mailing lists for discussion of Supermicro controllers that work
> reliably with FreeBSD.
>
> It would be worthwhile to discuss this condition on -stable, mainly with
> something like "Anyone else using the AOC-USAS2-L8i reliably with ZFS?"
> You get the idea.
>
> --
> | Jeremy Chadwick =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0=
 =A0 =A0jdc at parodius.com<http://parodius.com>; |
> | Parodius Networking =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 http://=
www.parodius.com/ |
> | UNIX Systems Administrator =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 Mountain=
 View, CA, US |
> | Making life hard for others since 1977. =A0 =A0 =A0 =A0 =A0 =A0 =A0 PGP=
 4BD6C0CB |
>
>
>
>
> Med V=E4nliga H=E4lsningar
> -------------------------------------------------------------------------=
------
> Karli Sj=F6berg
> Swedish University of Agricultural Sciences
> Box 7079 (Visiting Address Kron=E5sv=E4gen 8)
> S-750 07 Uppsala, Sweden
> Phone: =A0+46-(0)18-67 15 66
> karli.sjoberg@slu.se<mailto:karli.sjoberg@adm.slu.se>
>
> _______________________________________________
> freebsd-fs@freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-fs
> To unsubscribe, send any mail to "freebsd-fs-unsubscribe@freebsd.org"
>



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CAOeNLuqFuA-Ewfj0xyNmfGdbznsoRAYb6GNgGDzN8PtPck0yUw>