From: Andriy Gapon
Date: Thu, 24 Jan 2008 16:56:29 +0200
To: freebsd-geom@freebsd.org
Cc: Pawel Jakub Dawidek
Subject: Re: gjournal on 6.2: Cannot delete /var/.deleted/#613759
Message-ID: <4798A71D.6090902@icyb.net.ua>
In-Reply-To: <474020BD.4030305@icyb.net.ua>
List-Id: GEOM-specific discussions and implementations

on 18/11/2007 13:23 Andriy Gapon said the following:
> on 10/11/2007 17:45 Andriy Gapon said the following:
>> on 09/11/2007 14:38 Eric Anderson said the following:
>>> When inodes are reused, their gen count should go up (or NFS handles
>>> would get broken quickly). The file is probably being removed
>>> in-between the readdir and the remove.
>>>
>> Eric,
>> thank you for the reply and the hint. I will try to add i_gen to a name
>> that gets assigned to gjournal-managed files under .deleted and see how
>> that works.
>>
>
> Tried and it didn't help. The following was obtained during jdk build:
>
> kernel: UFS_GJGC: Cannot delete /var/.deleted/#1202150:230144382 (error=2)
> [some seconds later]
> $ find /var/ -inum 1202150
> /var/tmp/tmp/hsperfdata_root/72795
> kernel: UFS_GJGC: Cannot delete /var/.deleted/#1202150:230145003 (error=2)
> [some seconds later]
> $ find /var/ -inum 1202150
> /var/tmp/tmp/hsperfdata_root/81211
> ^^^^^^^^^^^^ - btw, my /tmp is symlink to /var/tmp/tmp
>
> So, even adding generation count doesn't fix the issue. Thus it seems
> that there seems to be some other kind of race condition in 6.x gjournal
> code.
>

BTW, more data points.
I am doing buildworld with various -jX options, with /tmp on a
partition with gjournal6. This is on an SMP machine with a 2-core CPU.
If I do buildworld without -j or with -j2, then everything is OK; if I
use -j3 or higher, then the build fails very soon with 'interrupted
system call', and messages like those quoted above appear in the system
log.
I find it curious that the maximum number of make jobs that still works
is the same as the number of logical CPUs.

-- 
Andriy Gapon