Date: Thu, 24 Jan 2013 08:50:02 +0200 From: Daniel Braniss <danny@cs.huji.ac.il> To: John Nielsen <lists@jnielsen.net> Cc: Adrian Chadd <adrian@freebsd.org>, freebsd-stable@freebsd.org, Ronald Klop <ronald-freebsd8@klop.yi.org> Subject: Re: time issues and ZFS Message-ID: <E1TyGdP-00031f-7E@kabab.cs.huji.ac.il> In-Reply-To: <575CDBE9-0FF3-4F93-A223-9F8FAF3FE936@jnielsen.net> References: <E1TxFcr-0006dx-MX@kabab.cs.huji.ac.il> <1358780588.32417.414.camel@revolution.hippie.lan> <E1TxJP2-000DS8-DJ@kabab.cs.huji.ac.il> <1358783667.32417.434.camel@revolution.hippie.lan> <CAJ-Vmo=2Dmf4Lb-uoUQDrybyRSS=_bnV5KcNYGg5MnMxfhhu7w@mail.gmail.com> <E1TxYHa-0002yo-4Y@kabab.cs.huji.ac.il> <CAJ-VmomdQORjs55ooW55Rgg0i1M13PPtnmCPRrp__btEWQz=4g@mail.gmail.com> <575CDBE9-0FF3-4F93-A223-9F8FAF3FE936@jnielsen.net>
next in thread | previous in thread | raw e-mail | index | archive | help
> On Jan 22, 2013, at 2:40 AM, Adrian Chadd <adrian=40freebsd.org> wrote:= > > > On Jan 21, 2013, at 4:33 AM, Daniel Braniss <danny=40cs.huji.ac.il> = wrote: > >=20 > >> host: DELL PowerEdge R710, 16GB,=20 >=20 > I administer a Dell PowerEdge R710 and I've been seeing the exact same = =3Dthing. It's currently running FreeBSD 9.0-STABLE =230 r236355. It has = a =3DZFS pool which sees moderate load most of the time but can be very h= igh =3Dat times (when certain scripts run, etc.). I hadn't previously =3D= correlated the issue with ZFS load but that is very possible.> > I set a = cron job to restart ntpd when it dies (because the time =3Ddifference exc= eeds the sanity check). The cron job runs =22every 20 =3Dminutes=22, but = that varies greatly when the system stops counting. The =3Dtime offset fr= om ntpdate (which the script runs before restarting ntpd) =3Dvaries a lot= , but always in increments of 300 seconds. I've seen =3Deverything from 1= 200 to 23100. (Yes, that's 23 thousand seconds aka 6 =3Dhours 25 minutes = that the system wasn't keeping time for.) >=20 > Sysctl kern.timecounter.hardware defaults to HPET. I experimented with = =3Dsetting it to ACPI-fast but the issue persisted so I put it back. > kern.timecounter.choice: TSC-low(-100) ACPI-fast(900) HPET(950) i8254(0= ) =3Ddummy(-1000000)> > I first installed the box with an older 9.0-STABL= E and this issue was =3Dnot present. I have been tracking -STABLE on it (= albeit irregularly) so =3DI'm not sure when the issue came up. >=20 >=20 > Have you run tests with the machdep.idle value changed, and fiddling >=20 > kern.eventtimer.periodic / kern.eventtimer.idletick ? >=20 > I would love to resolve this and am able to do some experimenting. I've= =3D_usually_ been seeing the issue 2-3 times every 1-2 days, but I did j= ust =3Dmake some changes: > disabling ZFS compression and deduplication on all pools > updated to 9.1-STABLE from yesterday (r245821) >=20 > If the issue persists I will try changing some of the sysctls above and= =3Dfollow up with the result. If it goes away, I'll try to remember to = =3Dreport that too. >=20 > JN >=20 set kern.eventtimer.timer=3DLAPIC this solved it for me. danny
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?E1TyGdP-00031f-7E>