From owner-freebsd-current@FreeBSD.ORG  Wed Jan 11 02:20:11 2012
Return-Path: <owner-freebsd-current@FreeBSD.ORG>
Delivered-To: freebsd-current@freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34])
	by hub.freebsd.org (Postfix) with ESMTP id E750A106567D
	for <freebsd-current@freebsd.org>; Wed, 11 Jan 2012 02:20:11 +0000 (UTC)
	(envelope-from alain@bsdgate.org)
Received: from slow3-v.mail.gandi.net (slow3-v.mail.gandi.net [217.70.178.89])
	by mx1.freebsd.org (Postfix) with ESMTP id 698E28FC15
	for <freebsd-current@freebsd.org>; Wed, 11 Jan 2012 02:20:11 +0000 (UTC)
X-WhiteListed: mail was accepted with no delay
X-WhiteListed: mail was accepted with no delay
Received: from relay3-d.mail.gandi.net (relay3-d.mail.gandi.net
	[217.70.183.195])
	by slow3-v.mail.gandi.net (Postfix) with ESMTP id 3D3E93AACF
	for <freebsd-current@freebsd.org>; Wed, 11 Jan 2012 02:55:11 +0100 (CET)
X-Originating-IP: 10.0.10.71
Received: from localhost (front1-v.mgt.gandi.net [10.0.10.71])
	by relay3-d.mail.gandi.net (Postfix) with ESMTP id 7083AA8080;
	Wed, 11 Jan 2012 02:55:00 +0100 (CET)
MIME-Version: 1.0
X-Mailer: Gandi Mail
Message-ID: <57742.1326246900@bsdgate.org>
To: "Kirk McKusick" <mckusick@mckusick.com>
Content-Type: text/plain; charset="utf-8"
X-Origin: 78.192.34.210
Date: Wed, 11 Jan 2012 02:55:00 +0100
From: <alain@bsdgate.org>
Content-Transfer-Encoding: quoted-printable
Cc: freebsd-current@freebsd.org, bryce@bryce.net
Subject: Re: FS hang when creating snapshots on a UFS SU+J setup
X-BeenThere: freebsd-current@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
Reply-To: alain@bsdgate.org
List-Id: Discussions about the use of FreeBSD-current
	<freebsd-current.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-current>, 
	<mailto:freebsd-current-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-current>
List-Post: <mailto:freebsd-current@freebsd.org>
List-Help: <mailto:freebsd-current-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-current>,
	<mailto:freebsd-current-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Wed, 11 Jan 2012 02:20:12 -0000





Le mar 10/01/12 19:30, "Kirk McKusick" mckusick@mckusick.com a =C3=A9crit:
> > Date: Mon, 9 Jan 2012 18:30:51 +0100
> > From: Yamagi Burmeister <lists@yamagi
> .org>
> To: jeff@freebsd
> .org, mckusick
> @freebsd.org
> Cc: f
> reebsd-current@freebsd.org, bryce@bryce.n
> et
> Subject: Re: FS hang when creating snapshots on a
> UFS SU+J setup
>=20
> > Hello,
> >=20
> > I'm sorry to bother you, but you may not be aware of
> this thread and
> this problem. We are several people experiencing
> deadlocks, kernel
> panics and other problems when creating sanpshots on
> file systems
> with SU+J. It would be nice to get some feedback,
> e.g. how can we
> help debugging and / or fixing this
> problem.
>=20
> > Thank you,
> > Yamagi
>=20
> First step in debugging is to find out if the problem is SU+J
> specific. To find out, turn off SU+J but leave SU. This change
> is done by running:
>=20
> umount <filesystem>
> tunefs -j disable <filesystem>
> mount <filesystem>
> cd <filesystem>
> rm .sujournal
>=20
> You may want to run `fsck -f' on the filesystem while you have
> it unmounted just to be sure that it is clean. Then run your
> snapshot request to see if it still fails. If it works, then
> we have narrowed the problem down to something related to SU+J.
> If it fails then we have a broader issue to deal with.
>=20
> If you wish to go back to using SU+J after the test, you can
> reenable SU+J by running:
>=20
> umount <filesystem>
> tunefs -j enable <filesystem>
> mount <filesystem>
>=20
> When responding to me, it is best to use my <mckusic
> k@mckusick.com>
email as I tend to read it more regularly.
>=20
> Kirk McKusick
> _______________________________________________
> f
> reebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscribe@freebsd.org"
>=20
>=20

Hi,

Agree that i wasn't disable journaling completely before doing a clean full=
 FSCK.

Taking actions requested, i wasn't able to recover this race condition with=
 SUJ.
but snapshot still OK with only SU :

So here are some few investigations i have taken: (Sorry being too long)=20

This test system was freshly installed by ISO 9.0 RC1 (18 OCTOBER  / after =
the fix) and is csuped on 9_RELENG=20
(40G avail) very basic setup, just dovecot running, on GENERIC.

Operations and results:

Since it's the rootfs:
Clean shutdown, boot single, disable SUJ, mount in RW and remove the .sujou=
rnal and the bad snapshots file,
clean halt.

I reboot in single again, then fsck_ufs -y /dev/ufs/ROOTFS
I got some very minor fixup with freeblock count wrong and summary informat=
ion bad and BLK missing in bitmaps.

After a normal reboot, issue a successful snapshot without softupdate journ=
aling just su.

I reboot in single again, and reactivate SUJ then reboot in normal mode.

Issue snapshot: and again mksnap_ffs eating all cpu, not suspendable, not k=
illable.

So i try to figure out what's going on: with systat -v / gstat / top -SCHP
and  strace / truss / ktrace on ramfs and nfs for tracking mksanp_ffs:

Here some results:

gstat : 26 seconds intense io activity: like normal snapshot.
Bad spare snapshot file created ( UFS label (ROOTFS) not present and some g=
arbage on the beginning.
real and sparse size of file 'very' near to a normal snapshot file.

Truss begin showing info then hang before being usefull.
mksnap_ffs is in running / runnable mode eating 100% cpu in kernel mode, 0%=
 in user mode.
systat : hang
top still running correctly : 15 to 25 % CPU in interrupt SWI4 : CLOCK  ( C=
PU 2 cores )

strace : only for i386 :-(
ktrace: block before showing valuable info, even on remote nfs.
regular process hanging on suspfs.

hard power cycle:

After normal reboot , after regular SUJ FIXUP:
Got Panic  at the login prompt: ( bg_fsck not started )

panic: ffs_sync: rofs mod ( it's physical machine , no screen shots, )

backtrace show ffs_write_suspend+0x...before the ffs_sync

So i retry to reboot with the 9 RC1 CD in live mode, disable suj, disable s=
u, fsck, renable su, suj,=20
mount the fs,without doing something on it, issue a snapshot ( still in liv=
e mode) ,=20
and this time, the snapshot was OK even with SUJ.

So i wrongly figure out that touching the root fs in single user is not as =
best as touching it with a live CD.

But after returning in normal operation, this race is still there.

After various tracking tests, and rebooting in normal mode after the SUJ st=
andard recovery:

I sometime got a double panic after the login prompt

panic:ffs_blkfree_cg=20
and just after the backtrace softdep_process_worklist ...
-> panic: bufwrite: bufwrite is not busy.

I also saw, when there is more io activity while taking snapshot, a kernel =
panic saying:

panic: softdep_deallocate_dependencies: dangling deps

Sure something wrong in this setup, because SUJ snapshot work well on other=
 systems and on 9.0-RELEASE so
 i am lost in cyberspace :-)
If i don't issue snapshot , the system is very stable, even with heavy acti=
vity.

(Smartd has never showing bad things.)

Since it's not a production system, i can fresh reinstall with 9.0-RELEASE =
but since some other people=20
have troubles we prefer investigate.

Questions:=20

It Look like journal get out of sync after these race situation ?
Idea: Does it can make sense to reinit the log file at shutdown time ?

Is it possible that some bad drive write caching (or too aggressive caching=
 in vm) with bad ordering, can=20
trigger this kind of issues with the journal when snapshot is quiescing the=
 fs ?

(ada0: <ST9320423AS 0002SDM1> ATA-8 SATA 2.x device)

If required, i can do some more tests, with KDB compiled in, or whatever.

Thanks again and again for your wonderful work.

Very best regards.

Alain BRAUNER.

FreeBSD: The way to go :-)