Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 12 Nov 2012 10:05:46 +0200
From:      Daniel Kalchev <daniel@digsys.bg>
To:        Freddie Cash <fjwcash@gmail.com>
Cc:        freebsd-fs@freebsd.org
Subject:   Re: ZFS HBAs + LSI chip sets (Was: ZFS hang (system #2))
Message-ID:  <50A0ADDA.9040205@digsys.bg>
In-Reply-To: <CAOjFWZ4FX2TrZ9Ns_uJ19=gXRxRqig3XQKV8Dz1bg-EqEHte_A@mail.gmail.com>
References:  <1350698905.86715.33.camel@btw.pki2.com> <1350711509.86715.59.camel@btw.pki2.com> <50825598.3070505@FreeBSD.org> <1350744349.88577.10.camel@btw.pki2.com> <1350765093.86715.69.camel@btw.pki2.com> <508322EC.4080700@FreeBSD.org> <1350778257.86715.106.camel@btw.pki2.com> <CAOjFWZ7G%2BaLPiPQTaUOE5oJY3So0cWYKvU86y4BZ2MQL%2BbqGMA@mail.gmail.com> <5084F6D5.5080400@digsys.bg> <CAOjFWZ4FX2TrZ9Ns_uJ19=gXRxRqig3XQKV8Dz1bg-EqEHte_A@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help


On 22.10.12 16:47, Freddie Cash wrote:
>
> I'll double-check when I get to work, but I'm pretty sure it's 
> 10.something.
>
> On Oct 22, 2012 12:34 AM, "Daniel Kalchev" <daniel@digsys.bg 
> <mailto:daniel@digsys.bg>> wrote:
>
>
>
>     On 21.10.12 09:52, Freddie Cash wrote:
>
>     [...]
>
>         All three run without any serious issues. The only issues
>         we've had are 3, maybe 4, situations where I've tried to
>         destroy multi-TB filesystems without enough RAM in the
>         machine. We're now running a minimum of 32 GB of RAM with 64
>         GB in one box.
>
>
>     What is the firmware on your LSI2008 controllers?
>
>     I am having weird situation with one server that has LSI2008, on
>     9-stable and all SSD configuration. One or two of the drives would
>     drop off the bus for no reason sometimes few times a day and
>     because the current driver ignores bus reset, someone has to
>     physically remove and re-insert the drives for them to come back.
>     Real pain.
>     My firmware version is 12.00.00.00 -- perhaps it is buggy?
>

As weird as it sounds, I discovered that my SSD-only zpool (raidz1)  was 
using ashift=9. So, changed to ashift=12 and not seen disconnects 
anymore for a week now. Is 4k good for these SSDs? Or 8k is better, or 
larger?

It seems it's really an SSD firmware problem, as the SSDs are likely 
doing more work when used with 512b sectors and from time to time fail 
to communicate properly with the bus. The SSDs are OCZ-VERTEX4 (firmware 
1.5).

Sometimes it seems to be drive related problem and perhaps the mps 
driver/hardware  is too sensitive to drive issues.

Daniel



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?50A0ADDA.9040205>