From owner-cvs-all Tue Jun 27 17:28:57 2000 Delivered-To: cvs-all@freebsd.org Received: from florence.pavilion.net (florence.pavilion.net [212.74.0.25]) by hub.freebsd.org (Postfix) with ESMTP id 2446937BF66; Tue, 27 Jun 2000 17:28:49 -0700 (PDT) (envelope-from joe@pavilion.net) Received: from genius.systems.pavilion.net (genesis.tao.org.uk [194.242.131.254]) by florence.pavilion.net (8.9.3/8.8.8) with ESMTP id BAA37893; Wed, 28 Jun 2000 01:28:44 +0100 (BST) (envelope-from joe@pavilion.net) Received: by genius.systems.pavilion.net (Postfix, from userid 100) id EFEF412547; Wed, 28 Jun 2000 01:29:56 +0100 (BST) Date: Wed, 28 Jun 2000 01:29:56 +0100 From: Josef Karthauser To: Matthew Jacob Cc: Akinori -Aki- MUSHA , andrews@technologist.com, mckusick@FreeBSD.org, cvs-committers@FreeBSD.org, cvs-all@FreeBSD.org Subject: Re: cvs commit: src/sys/contrib/softupdates softdep.h ffs_softdep.c Message-ID: <20000628012956.B19558@pavilion.net> References: <20000627172831.G32373@pavilion.net> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.2i In-Reply-To: ; from mjacob@feral.com on Tue, Jun 27, 2000 at 09:29:24AM -0700 X-NCC-RegID: uk.pavilion Organisation: Pavilion Internet plc, Lees House, 21-23 Dyke Road, Brighton, England Phone: +44-845-333-5000 Fax: +44-845-333-5001 Mobile: +44-403-596893 Sender: owner-cvs-all@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG On Tue, Jun 27, 2000 at 09:29:24AM -0700, Matthew Jacob wrote: > > > > I'm not sure.... I have a feeling that there are softupdate problems > > running under SMP. A number of times this year I've lost whole filesystems > > on an SMP machines. :( > > Really? Have you told Kirk? > No because I've not had a chance to test it - it's on a production webserver. It's running 3.4 currently, and I've had a couple of crashes over the last year. The first one produced a lot of 'SOFTUPDATE INCONSISTANCY' errors and lost a lot of files - we had to recover from tape and diff to see which files had been lost. (This was on a 16gb ccd containing customer web data). The second time was more recently. The machine crashed and in the process the / partition disappeared, so badly that the kernel wasn't available to boot from! We brought it up on another drive and did a fsck and all the files turned up in /lost+found. Not a single file was left in the file system. Fortunately we had a fresh level 0 dump from that evening and so I did a 'newfs' and 'restore' and the machine's been running since. More recently a friend of mine was having crashes doing a make world under 4.0 with SMP and softupdate. Switching softupdates off caused the crashes to go away. The thing that worries me is that softupdates is supposed to render the filesystem safe so that if a crash occurs we only loose recent data. In both my cases we lost data that hadn't been touched for years - i.e. ports tree data that hasn't ever been touched. It definitely wasn't in the buffer cache before the crash. ! Is anyone else here using softupdates in an SMP environment? I'm not saying that softupdates doesnt work in SMP, but I'm worried that if softupdates doesn't leave things in as consistant a state as you'd expect in an SMP environment. Joe To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe cvs-all" in the body of the message