From owner-freebsd-current@FreeBSD.ORG Tue Apr 19 11:40:56 2005 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id B943416A4CE for ; Tue, 19 Apr 2005 11:40:56 +0000 (GMT) Received: from mh1.centtech.com (moat3.centtech.com [207.200.51.50]) by mx1.FreeBSD.org (Postfix) with ESMTP id 1F2EF43D54 for ; Tue, 19 Apr 2005 11:40:56 +0000 (GMT) (envelope-from anderson@centtech.com) Received: from [10.177.171.220] (neutrino.centtech.com [10.177.171.220]) by mh1.centtech.com (8.13.1/8.13.1) with ESMTP id j3JBetsp066255 for ; Tue, 19 Apr 2005 06:40:55 -0500 (CDT) (envelope-from anderson@centtech.com) Message-ID: <4264EE1B.9050804@centtech.com> Date: Tue, 19 Apr 2005 06:40:11 -0500 From: Eric Anderson User-Agent: Mozilla/5.0 (X11; U; FreeBSD i386; en-US; rv:1.7.5) Gecko/20050325 X-Accept-Language: en-us, en MIME-Version: 1.0 To: FreeBSD Current Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Scanned: ClamAV 0.82/840/Mon Apr 18 20:42:09 2005 on mh1.centtech.com X-Virus-Status: Clean Subject: Panic during ssh/rsync (rsnapshot) X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 19 Apr 2005 11:40:56 -0000 I'm using FreeBSD 5.4RC3 (this happened on 5.3R also) as an rsnapshot server. All this machine does, is run rsnapshot (rsync+ssh+hardlinks = snapshot disk backups). I'm running 6 rsnapshot processes via cron every night at 11pm, so it essentially hammers the machine during that time. About 3 out of 5 nights, my system dies right at the beginning with either the fatal trap (see below) or the panic (see below). It seems to die just when I get about 6 rsync processes running. Any ideas? ---------------- panic: sbflush_locked: cc 0 || mb 0xc3432000 || mbcnt 256 cpuid = 0 boot() called on cpu#0 ---------------- Fatal trap 12: page fault while in kernel mode cpuid = 0; apic id = 00 fault virtual address = 0x15 fault code = supervisor read, page not present instruction pointer = 0x8:0xc077ad60 stack pointer = 0x10:0xe898ab10 frame pointer = 0x10:0xe898ab1c code segment = base 0x0, limit 0xfffff, type 0x1b = DPL 0, pres 1, def32 1, gran 1 processor eflags = interrupt enabled, resume, IOPL = 0 current process = 3100 (ssh) trap number = 12 panic: page fault cpuid = 0 -- ------------------------------------------------------------------------ Eric Anderson Sr. Systems Administrator Centaur Technology A lost ounce of gold may be found, a lost moment of time never. ------------------------------------------------------------------------