From owner-freebsd-fs@FreeBSD.ORG Wed Oct 15 06:31:49 2003 Return-Path: Delivered-To: freebsd-fs@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 42F8516A4C0; Wed, 15 Oct 2003 06:31:49 -0700 (PDT) Received: from fledge.watson.org (fledge.watson.org [204.156.12.50]) by mx1.FreeBSD.org (Postfix) with ESMTP id 8487243FB1; Wed, 15 Oct 2003 06:31:45 -0700 (PDT) (envelope-from robert@fledge.watson.org) Received: from fledge.watson.org (localhost [127.0.0.1]) by fledge.watson.org (8.12.9p2/8.12.9) with ESMTP id h9FDV8Mg052962; Wed, 15 Oct 2003 09:31:12 -0400 (EDT) (envelope-from robert@fledge.watson.org) Received: from localhost (robert@localhost)h9FDV7tR052959; Wed, 15 Oct 2003 09:31:07 -0400 (EDT) (envelope-from robert@fledge.watson.org) Date: Wed, 15 Oct 2003 09:31:07 -0400 (EDT) From: Robert Watson X-Sender: robert@fledge.watson.org To: "Vladimir B. Grebenschikov" In-Reply-To: <1066203984.1271.27.camel@localhost> Message-ID: MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII cc: freebsd-fs@freebsd.org cc: "current@freebsd.org" Subject: Re: fsck_ufs -B lockup any writes on filesystem X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 15 Oct 2003 13:31:49 -0000 On Wed, 15 Oct 2003, Vladimir B. Grebenschikov wrote: > After unclean shutdown, system boots and work properly. After some time > any processes triing to write was blocked in suspfs state. fsck_ufs > process was blocked in suspwt state. I reported an identical set of symptoms over the weekend to Kirk and Jeff, and a fix was committed as src/sys/kerne/vfs_bio.c:1.412. A missing write lock release in buffer cache locking work would leave the disk suspended for writes, and the result is a file system deadlock just as you describe. Could you check and see which version of vfs_bio.c you're running with -- if it's the earlier version, try updating? If it's the newer version, please let me know. Here's the commit message: Date: Mon, 13 Oct 2003 17:38:35 -0700 (PDT) From: Jeff Roberson To: src-committers@FreeBSD.org, cvs-src@FreeBSD.org, cvs-all@FreeBSD.org Subject: cvs commit: src/sys/kern vfs_bio.c jeff 2003/10/13 17:38:35 PDT FreeBSD src repository Modified files: sys/kern vfs_bio.c Log: - Add a mising vn_finished_write() Pointy hat: jeff Found by: robert Obtained from: kirk Revision Changes Path 1.412 +1 -0 src/sys/kern/vfs_bio.c > > Killing blocked processes does not helps. > > # ps alx | fgrep fsck_ufs > 0 917 736 0 75 0 676 268 suspwt D ?? 0:00.91 > fsck_ufs -p -B /dev/ad0s3f > > # strace -p 917 > mount(0x80af3d3, 0x80cb44c, 0x1211000, 0xbfbffcc0^C > ^C > > # ps alx | fgrep suspfs > 0 559 1 0 75 0 7076 1588 suspfs Is ?? 0:01.92 > /usr/local/sbin/cupsd > 207 611 609 0 75 0 8832 2396 suspfs I ?? 0:00.11 > /usr/X11R6/bin/gdm-binary > 207 836 1 0 75 0 18000 9108 suspfs I ?? 0:02.33 > /usr/X11R6/libexec/stickynotes_applet > 207 865 1 2 75 0 55932 41028 suspfs I ?? 1:59.49 > evolution-1.4 > 207 867 1 0 75 0 32296 16000 suspfs I ?? 0:22.15 > /usr/X11R6/share/opera/bin/opera --bi > 207 942 1 0 75 0 4120 2068 suspfs I p0- 0:00.04 licq > > I can cure situation only be running fsck -y in single-user mode. > > PR: kern/58060 > > -- > Vladimir B. Grebenschikov > SWsoft Inc. > > _______________________________________________ > freebsd-current@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-current > To unsubscribe, send any mail to "freebsd-current-unsubscribe@freebsd.org" >