Date: Thu, 27 Oct 2011 18:59:57 -0400 From: Alexander Kabaev <kabaev@gmail.com> To: "C. P. Ghost" <cpghost@cordula.ws> Cc: Alexey Shuvaev <shuvaev@physik.uni-wuerzburg.de>, freebsd-current@freebsd.org Subject: Re: Panics after AHCI timeouts Message-ID: <20111027185957.54ece0ad@kan.dyndns.org> In-Reply-To: <CADGWnjX95yMEO06o%2B8xUho4Yc2-R9S=GJTWkGqvfbzDMHqCiGw@mail.gmail.com> References: <20111008201456.GA3529@lexx.ifp.tuwien.ac.at> <20111017190027.GA9873@lexx.ifp.tuwien.ac.at> <CAJ-Vmokbm5z3GPbKjc6_o0_Ea6u_b7twDu=xLeYpORiUpp6Z=Q@mail.gmail.com> <20111018131353.GA83797@lexx.ifp.tuwien.ac.at> <649509EEAEBA42D4A3DCC1FDF5DA72E5@multiplay.co.uk> <20111025202755.4243ae74@kan.dyndns.org> <CADGWnjX95yMEO06o%2B8xUho4Yc2-R9S=GJTWkGqvfbzDMHqCiGw@mail.gmail.com>
next in thread | previous in thread | raw e-mail | index | archive | help
--Sig_/_ehfTFplfGyk.y.4CYlV8t2 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: quoted-printable On Wed, 26 Oct 2011 16:00:55 +0200 "C. P. Ghost" <cpghost@cordula.ws> wrote: > On Wed, Oct 26, 2011 at 2:27 AM, Alexander Kabaev <kabaev@gmail.com> > wrote: > > I do see timeouts on one of my Samsung ST3750330A disks and they > > definitely do not cause any panics. The weird part in my case is > > that disk then immediately reappears as online and mirror zpool can > > be rebuilt by just onlining the disk with 'zpool online <pool> > > <disk>' command. > > > > It seems to be happening once system has accumulated some uptime. If > > rebooted, it keeps running for a week or two with no issues, but > > then timeouts start to happen more or less reliably every single 24 > > hours. >=20 > Does it correlate with high disk activity, i.e. with periodic(8)? >=20 > On my machine, I have a feeling that timeouts occur more often > at that point, than normally... and that they also occur when multiple > processes access the disk simultaneously. >=20 > If it's only one process, the machine (usually) doesn't hang, even > when that process is copying big files back and forth for a long > period of time (it's a backup process). But interleave that process > with another one accessing the same disk, and poof!, almost > immediately ahci timeouts. occur. Very strange... Maybe a race > condition of some sort after all? >=20 No, I cannot say there is any specific correlation to IO load of the machine, timeouts I saw happen randomly and seem almost always happen as system uptime crosses two weeks boundary. I am suspecting Samsung firmware at this point. --=20 Alexander Kabaev --Sig_/_ehfTFplfGyk.y.4CYlV8t2 Content-Type: application/pgp-signature; name=signature.asc Content-Disposition: attachment; filename=signature.asc -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.18 (FreeBSD) iD8DBQFOqeJzQ6z1jMm+XZYRAnb/AJ4haF0NaxrpT1w3T5iHNwdNe6N/OQCfSSxf XCk/Nu1y9lrJXV+7hO/Mpvo= =miJa -----END PGP SIGNATURE----- --Sig_/_ehfTFplfGyk.y.4CYlV8t2--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20111027185957.54ece0ad>