Date: Wed, 3 Feb 2010 21:09:27 +0900
From: Tommi Lätti <sty@iki.fi>
To: Attila Nagy <bra@fsn.hu>
Cc: freebsd-fs@freebsd.org
Subject: Re: Machine stops for some seconds with ZFS
Message-ID: <f43ef3191002030409t16fe0056m9d6c83041d8051@mail.gmail.com>
In-Reply-To: <4B694689.2030704@fsn.hu>
References: <4B694689.2030704@fsn.hu>
> After a long time, I've switched back to ZFS on my desktop. It runs
> 8-STABLE/amd64 with two SATA disks and an USB pendrive.
> One-one partition is used from each disk for the zpool, which is encrypted
> using GELI, and the pendrive is there for L2ARC:
>   NAME            STATE     READ WRITE CKSUM
>   data            ONLINE       0     0     0
>     mirror        ONLINE       0     0     0
>       ad0s1d.eli  ONLINE       0     0     0
>       ad1s1d.eli  ONLINE       0     0     0
>   cache
>     da0           ONLINE       0     0     0
>
> Today, after 12 days of uptime the machine has frozen. I could ping it from
> a different machine, even could open a telnet to its ssh port, but I
> couldn't get the ssh banner.
>
> Now I'm building a 9-CURRENT kernel and world to see whether the same
> problem persists with that, and during the make process I've noticed a
> strange thing.
> I build with -j4 (the machine has one dual core CPU), so the fans are
> screaming during the process. But every few minutes (I couldn't recognize
> any patterns in it) the machine goes completely silent (even more silent
> than normally), and everything halts.
>   PID USERNAME    THR PRI NICE   SIZE    RES STATE   C   TIME   WCPU COMMAND
> 16304 root          1  44    0 37944K  4576K zio->i  1   0:00  0.00% sshd
> 16405 bra           1  44    0 37944K  5012K zio->i  0   0:00  0.00% sshd
>  1064 postfix       1  44    0  9104K  1772K zio->i  1   0:00  0.00% pickup

This sounds like you're being hit by the same (extensively documented)
performance slowdown that currently seems to affect everybody (except
maybe those with SSDs or 15k rpm drives in big arrays). There's a long
thread on -STABLE.

Basically, as I see it, it's impossible to read from and write to a ZFS
pool at the same time, which might be caused by how the ARC behaves under
FreeBSD (I didn't have these problems when ZFS was still 'unstable'). I
couldn't even watch a 720p video while doing small writes (less than 1k
every few seconds) to the same array without the smb process going into
the zio->i state, which seems to indicate a complete block on any I/O.
Combine that with 5400 rpm consumer drives... well...

-> switched to OpenSolaris, performance is now great...

-- 
br,
Tommi