Date: Sun, 27 Jan 2013 17:13:25 +0200 From: "Vladislav Prodan" <universite@ukr.net> To: "Steven Hartland" <killing@multiplay.co.uk> Cc: current@freebsd.org, fs@freebsd.org Subject: Re[2]: Re[2]: AHCI timeout when using ZFS + AIO + NCQ Message-ID: <70362.1359299605.3196836531757973504@ffe11.ukr.net> In-Reply-To: <16B555759C2041ED8185DF478193A59D@multiplay.co.uk> References: <16B555759C2041ED8185DF478193A59D@multiplay.co.uk> <93308.1359297551.14145052969567453184@ffe15.ukr.net> <13391.1359029978.3957795939058384896@ffe16.ukr.net> <221B307551154F489452F89E304CA5F7@multiplay.co.uk>
next in thread | previous in thread | raw e-mail | index | archive | help
> ----- Original Message ----- > From: "Vladislav Prodan" <universite@ukr.net> > > >> Is it always the same disk, of so replace it SMART helps identify issues > >> but doesn't tell you 100% there's no problem. > > > > > > Now it has fallen off a different HDD - ada0. > > I'm 99% sure that MHDD will not find problems in HDD - ada0 and ada2. > > I still have three servers with similar chipsets that have similar problems > > with blade ahci times out. > > I notice your disks are connecting at SATA 3.x, which rings bells. We had > a very similar issue on a new Supermicro machine here and after much > testing we proved to our satisfaction that the problem was the HW. I have a motherboard ASUS M5A97 PRO http://www.asus.com/Motherboard/M5A97_PRO/#specifications Has replacement SATA data cables. Putting hard RAID controller does not guarantee data recovery at his death. > Essentially the combination of SATA 3 speeds the midplane / backplane > degraded the connection between the MB and HDD enough to cause > the disks to randomly drop when under load. > > If we connected the disks directly to the MB with SATA cables the > problem went away. In the end we had midplanes changed from an > AHCI pass-through to active LSI controller. > > So if you have any sort of midplane / backplane connecting your disks > try connecting them direct to the MB / controller via known SATA 3.x > compliant cables and see if that stops the drops. > > Another test you can do is to force the disks to connect at SATA 2.x > this also fixed it in our case, but wasn't something we wanted to > put into production hence the controller swap. > > To force SATA 2 speeds you can use the following in /boot/loader.conf > where 'X' is disk identifier e.g. for ada0 X = 0:- > hint.ahcich.X.sata_rev=2 > > Hope this helps. > > Regards > Steve > -- Vladislav V. Prodan System & Network Administrator http://support.od.ua +380 67 4584408, +380 99 4060508 VVP88-RIPE
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?70362.1359299605.3196836531757973504>