Date: Tue, 12 Oct 2010 03:42:53 -0700 From: Jeremy Chadwick <freebsd@jdc.parodius.com> To: Andriy Gapon <avg@icyb.net.ua> Cc: freebsd-fs@freebsd.org Subject: Re: Locked up processes after upgrade to ZFS v15 Message-ID: <20101012104253.GA30501@icarus.home.lan> In-Reply-To: <20101012100709.GA29861@icarus.home.lan> References: <39F05641-4E46-4BE0-81CA-4DEB175A5FBE@free.de> <20101009111241.GA58948@icarus.home.lan> <CF901B53-657E-49FC-A43B-27BC7D49F7A7@free.de> <4CB17983.3020907@icyb.net.ua> <20101011151508.GA10917@icarus.home.lan> <4CB32C75.2060000@icyb.net.ua> <20101011183707.GA13925@icarus.home.lan> <4CB3870F.7070107@icyb.net.ua> <20101012100709.GA29861@icarus.home.lan>
next in thread | previous in thread | raw e-mail | index | archive | help
On Tue, Oct 12, 2010 at 03:07:09AM -0700, Jeremy Chadwick wrote: > On Tue, Oct 12, 2010 at 12:52:15AM +0300, Andriy Gapon wrote: > > on 11/10/2010 21:37 Jeremy Chadwick said the following: > > > EnableMMAP on > > > EnableSendfile on > > > > Yes, it is it. > > > > Jeremy, Kai, > > could you please try to test this patch? > > http://people.freebsd.org/~avg/zfs-mappedread-sendfile.diff > > > > Kostik, > > could you please review it? > > Andriy, > > I've been trying to reproduce this problem on my testbed box without > much luck so far. The box differs severely -- the biggest differences > being the testbed runs i386 (due to CPU), only has 1GB RAM, and is > single-core. I don't have an amd64 testbed system on hand right now. > > I've been trying to reproduce it by enabling Sendfile and MMAP in Apache > on the system, putting up some very large files on an Apache-accessible > ZFS filesystem, and using something like "wget -r" to download > everything. I've been watching "netstat -m" to monitor the number of > sendfile requests. > > There have been a couple cases where I've seen processes go into "zfs" > state, but I have yet to see any lock up. > > Is there something amd64-specific to the problem at hand, or maybe some > VM feature which isn't getting triggered on i386? Or do you know of a > reliable way to reproduce the issue at this point? An additional question/point of interest: The testbed (i386) box I'm using is built from RELENG_8 sources dated October 11th around 20:00 PDT. I went looking through RELENG_8 commits and I found this committed approximately 25 hours ago: http://www.freebsd.org/cgi/cvsweb.cgi/src/sys/cddl/contrib/opensolaris/uts/common/fs/zfs/zfs_znode.c#rev1.24.2.8 The testbed/i386 box therefore has the above commit. Could this commit be the fix for the problem? In the meantime, I'm going to try rolling back my RELENG_8 src-all on the testbed/i386 box to October 8th and then re-try my tests to see if the problem happens. -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB |
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20101012104253.GA30501>