From owner-freebsd-questions@FreeBSD.ORG Thu Dec 18 09:14:46 2008 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 344DE1065676 for ; Thu, 18 Dec 2008 09:14:46 +0000 (UTC) (envelope-from davidn04@gmail.com) Received: from mail-qy0-f18.google.com (mail-qy0-f18.google.com [209.85.221.18]) by mx1.freebsd.org (Postfix) with ESMTP id D29198FC1D for ; Thu, 18 Dec 2008 09:14:45 +0000 (UTC) (envelope-from davidn04@gmail.com) Received: by qyk11 with SMTP id 11so399576qyk.19 for ; Thu, 18 Dec 2008 01:14:45 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to :subject:in-reply-to:mime-version:content-type :content-transfer-encoding:content-disposition:references; bh=5Yegsg46haoH/znXELF3oiFqbz+6xbIx5S6jhoUKyHQ=; b=aV0eRpNWWGvJjH/CiWzXT45vE6bf4hnd9SUGTddvW/BBNkPRPMDNRyxzMDeO9+r0WX zMgnyeqM6nV4VYvNDJJDBLCK4ujYPtNKpcfwoqPXiIQIkygM45dStEYYA3JxRrVK7szo Vo+0c3QeJsbgfgs1u8B0Otoco8T1bTruPg/mM= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:in-reply-to:mime-version :content-type:content-transfer-encoding:content-disposition :references; b=HEx3hzFmfM7T4VJPnjX3/e0B6vn7S8WK6VZb85hiUYXAGPDhe6diTMP9a1+bc7zLzm M/cqT7agsFjl4/ZiqpKJ6tcIFTmTXInvBBqRSrb3s5I8sPFPWFJSosg8Q7wVavJfduPJ HTXpqjmMCYWOXqbcwS+mYblhH+53bT579huPU= Received: by 10.214.147.11 with SMTP id u11mr2101843qad.131.1229591685176; Thu, 18 Dec 2008 01:14:45 -0800 (PST) Received: by 10.214.150.7 with HTTP; Thu, 18 Dec 2008 01:14:45 -0800 (PST) Message-ID: <4d7dd86f0812180114u1c2935an45f7cb8b112cf0cb@mail.gmail.com> Date: Thu, 18 Dec 2008 20:14:45 +1100 From: "David N" To: freebsd-questions@freebsd.org In-Reply-To: <200812180900.08295.fbsd.questions@rachie.is-a-geek.net> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <4d7dd86f0812150956i3a8d130ak8a4ca462896cd3ff@mail.gmail.com> <200812171015.07390.fbsd.questions@rachie.is-a-geek.net> <4d7dd86f0812170240n28ab5db9qf2816e2d4beefc3d@mail.gmail.com> <200812180900.08295.fbsd.questions@rachie.is-a-geek.net> Subject: Re: Running rsnapshot via cron reboots the machine X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 18 Dec 2008 09:14:46 -0000 2008/12/18 Mel : > On Wednesday 17 December 2008 11:40:00 David N wrote: >> 2008/12/17 Mel : >> > On Monday 15 December 2008 18:56:46 David N wrote: >> >> Hi, >> >> >> >> I have a machine >> >> AMD Sepron LE-1150 >> >> ASUS M2A-VM >> >> 1GB RAM ECC >> >> 2x SATA 300GB >> >> >> >> in a RAID 1 (gmirror). >> >> 7.0-RELEASE-p2 AMD64 generic kernel >> >> >> >> it was doing backups via bacula to an external disk >> >> USB 2.0 SATA disk, and it was working well. (GLabel) /dev/ufs/BackupDisk >> >> >> >> I changed to rsnapshot recently, with the External HDD in glabel + >> >> gjournal (/dev/da0s1.journal -> /dev/ufs/BackupDisk) and it will >> >> reboot the machine roughly 30 minutes after the rsnapshot starts via >> >> CRON. >> > >> > Able to get any crash dumps? [1] I doubt it calls reboot system call >> > after 30 minutes and if it's a heating issue, then it would power down >> > not reboot. So, kernel is probably panicing. >> > >> > [1] >> > http://www.freebsd.org/doc/en_US.ISO8859-1/books/developers-handbook/kern >> >eldebug.html >> > >> > -- >> > Mel >> > >> > Problem with today's modular software: they start with the modules >> > and never get to the software part. >> >> I found something in the vmcore.0 >> >> panic: Journal overflow (joffset=499758276096 active=498475869184 >> inactive=499755984896) >> cpuid = 0 >> Uptime: 16h7m11s >> >> I tried kgdb on on the vmcore but it didn't work, I had -p2 installed, >> but compiled p6 so it might of overwrittin things in /usr/obj >> >> [GDB will not be able to debug user-mode threads: >> /usr/lib/libthread_db.so: Undefined symbol "ps_pglobal_lookup"] >> GNU gdb 6.1.1 [FreeBSD] >> Copyright 2004 Free Software Foundation, Inc. >> GDB is free software, covered by the GNU General Public License, and you >> are welcome to change it and/or distribute copies of it under certain >> conditions. Type "show copying" to see the conditions. >> There is absolutely no warranty for GDB. Type "show warranty" for details. >> This GDB was configured as "amd64-marcel-freebsd". >> Cannot access memory at address 0x0 >> >> >> The journal was set to 2GB on the 400GB USB attached disk. (/dev/da0) >> >> I just formatted the disk without gjournal and see how that goes. I >> guess i can't use gjournal over USB? I have gjournal running on >> another server (gmirror + gjournal) and i thrash it pretty hard >> without any problems. > > It should not panic, but a journal overflow is more likely with USB, cause of > the lower write speed (the journal fills faster then it's being emptied). > Your best bet is to reproduce the panic using the sources that match the > kernel and file a PR and/or post to freebsd-fs list to find out if there are > people with similar problems/usage cases. It could be a tunable that you > missed or that it's a known issue. > > -- > Mel > > Problem with today's modular software: they start with the modules > and never get to the software part. > There are people with similar problems already reported. http://www.freebsd.org/cgi/query-pr.cgi?pr=127420 I tried the tunables kern.geom.journal.force_switch=50 kern.geom.journal.cache.switch=75 which made it crash even faster, in a few minutes and even corrupted the journal. I would test it out more, but its a production server which needs to be up and running. At the moment its just UFS+glabel, I'll try again when 7.1 comes out. Thank you for your help. Regards David N