From owner-freebsd-stable@FreeBSD.ORG Tue Aug 26 17:03:59 2008 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id E5B92106567D for ; Tue, 26 Aug 2008 17:03:59 +0000 (UTC) (envelope-from wayne@manor.msen.com) Received: from manor.msen.com (manor.msen.com [148.59.4.66]) by mx1.freebsd.org (Postfix) with ESMTP id 9FDD98FC12 for ; Tue, 26 Aug 2008 17:03:59 +0000 (UTC) (envelope-from wayne@manor.msen.com) Received: from manor.msen.com (localhost [127.0.0.1]) by manor.msen.com (8.12.11/8.12.11) with ESMTP id m7QGgS6p066895 for ; Tue, 26 Aug 2008 12:42:28 -0400 (EDT) (envelope-from wayne@manor.msen.com) Received: (from wayne@localhost) by manor.msen.com (8.12.11/8.12.11/Submit) id m7QGgSIe066894 for freebsd-stable@freebsd.org; Tue, 26 Aug 2008 12:42:28 -0400 (EDT) (envelope-from wayne) Date: Tue, 26 Aug 2008 12:42:28 -0400 From: "Michael R. Wayne" To: freebsd-stable@freebsd.org Message-ID: <20080826164228.GT5557@manor.msen.com> Mail-Followup-To: freebsd-stable@freebsd.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.4.2.1i Subject: Snaphot stability issues on 6.3 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 26 Aug 2008 17:04:00 -0000 One of our servers, running a bunch of jails, has issues when doing nightly dumps only if snapshots are enabled. This box was running 5.X and has been upgraded over time to 6.3. When running 5.X, we attempted to use snapshots on dump (-L) which resulted in almost nightly system hangs during the dump. We ran 6.X for months with no stability issues, backing up nightly w/o snapshots. Took the system down to single user mode, did foreground fsck, enabled -L on dumps and the machine "kinda" hangs twice a week (main host seems OK but jails stop responding and can not be properly stopped), requiring a reset. Removing the snapshots restores system stability. Foreground fsck finds nothing unusual, just what I would expect when doing a reset on a live filesystem. I suspect that there is some corrupt filesystem residue from 5.X since we have no similar issues on 6.x clean installs. Is there something better than just fsck to attempt to resolve these issues? /\/\ \/\/