Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 7 Jun 2016 12:09:08 -0500
From:      list-news <list-news@mindpackstudios.com>
To:        Borja Marcos <borjam@sarenet.es>
Cc:        freebsd-scsi@freebsd.org
Subject:   Re: Avago LSI SAS 3008 & Intel SSD Timeouts
Message-ID:  <4072b65d-25d4-2a79-5911-573517b0ee57@mindpackstudios.com>
In-Reply-To: <2823D96D-881D-4D40-B610-FC8292FA2FC5@sarenet.es>
References:  <30c04d8b-80cb-c637-26dc-97caebad3acb@mindpackstudios.com> <b30f968c-cc41-f7de-5a54-35bed961e65a@multiplay.co.uk> <08C01646-9AF3-4E89-A545-C051A284E039@sarenet.es> <986e03a7-5dc8-f5e0-5a17-4bf49459f905@mindpackstudios.com> <2823D96D-881D-4D40-B610-FC8292FA2FC5@sarenet.es>

next in thread | previous in thread | raw e-mail | index | archive | help
The system is a Twin.  In the first post I mentioned this but I probably 
wasn't clear.

The twin unit is this one:
https://www.supermicro.com/products/system/2u/2028/sys-2028tp-decr.cfm

I've used all components from twin node A and B (cpu / memory / 
mainboard / controller).  I still get the errors.  The backplane was the 
original thought of concern, and that has been RMA'd and replaced - 
errors continue.  I've even swapped out power supplies with another 
identical unit I have here.

In every case the errors continue, until I do this:
#camcontrol daX -N 1
(for each drive in the zpool)

Then the errors stop.

The system errors every few minutes while my application is running.  
Set tags to -N 1, and everything goes quiet.  16 cores at 100% cpu and 
drives 80% busy @ ~15k IO p/s, for about 5 hours solid before it 
finishes a batch, no errors are reported with -N set to 1.  If I set 
tags with -N 255 for each device, errors start again within 5 minutes, 
and continue every 2-5 minutes, until the batch is finished.

-Kyle

> I would try, if possible, to swap the controller.
>
>
>
>
>
>
> Borja.
>
>




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?4072b65d-25d4-2a79-5911-573517b0ee57>