From owner-freebsd-hackers@FreeBSD.ORG Mon Oct 7 18:34:05 2013 Return-Path: Delivered-To: hackers@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTP id 3670ACF6; Mon, 7 Oct 2013 18:34:05 +0000 (UTC) (envelope-from rank1seeker@gmail.com) Received: from mail-wg0-x22e.google.com (mail-wg0-x22e.google.com [IPv6:2a00:1450:400c:c00::22e]) (using TLSv1 with cipher ECDHE-RSA-RC4-SHA (128/128 bits)) (No client certificate requested) by mx1.freebsd.org (Postfix) with ESMTPS id A32902856; Mon, 7 Oct 2013 18:34:04 +0000 (UTC) Received: by mail-wg0-f46.google.com with SMTP id k14so7714469wgh.25 for ; Mon, 07 Oct 2013 11:34:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=message-id:from:to:cc:subject:date:in-reply-to:references; bh=yWmPrAbUAEMuHzMoZ6H3ZAUt6Zw80G5771ka4XC80oU=; b=vtnPULQfURC6TbDT5zN5ROQVYBSBcg4A/IHnUyp8G2yJx726wPy+rJdg9hzWDYmEYZ dR4IQShBvuS4plk9p084HA1WsesChYdH5VvdAHmSR6fOuJgatXgBxdYo9KV8IKYqeV1W xXjRCdM0F0eZJvpa5gDx7a8WLB2okZLeMRURXv8gDCj+AIvnX67N4szZ7uIcAv3uUPEy SLM6hRQGOKf5HytEREAqLH0CiWWBC100ZTQ8sUFGWE5r+3KUK4XDfIJBS3JwC/8KKIhc stZpvV07glZhZ9N57KLVs2OhDsRQCJWtb099iD/KPpHC1oe1zJDnhwadOeRcwUXf/WVK vb1w== X-Received: by 10.180.185.203 with SMTP id fe11mr20018746wic.29.1381170843081; Mon, 07 Oct 2013 11:34:03 -0700 (PDT) Received: from DOMYPC ([82.193.208.225]) by mx.google.com with ESMTPSA id ey4sm42006874wic.11.1969.12.31.16.00.00 (version=TLSv1 cipher=RC4-SHA bits=128/128); Mon, 07 Oct 2013 11:34:02 -0700 (PDT) Message-ID: <20131007.183359.947.1@DOMY-PC> From: rank1seeker@gmail.com To: "John Baldwin" Subject: Re: UFS related panic (daily <-> find) Date: Mon, 07 Oct 2013 20:33:59 +0200 In-Reply-To: <201310071212.05281.jhb@freebsd.org> References: <20130719.174511.786.3@DOMY-PC> <201310021702.49174.jhb@freebsd.org> <20131002.214002.280.2@DOMY-PC> <201310071212.05281.jhb@freebsd.org> X-Mailer: POP Peeper (3.8.1.0) Cc: hackers@freebsd.org X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 07 Oct 2013 18:34:05 -0000 > On Wednesday, October 02, 2013 5:40:02 pm rank1seeker@gmail.com wrote: > > > > > > Ok, here is another one, same case, just this time under > > 9.1-RELEASE-p7 > > > > > > > > > > > > ============================================== > > > > > > Fatal trap 12: page fault while in kernel mode > > > > > > fault virtual address = 0x25 > > > > > > fault code = supervisor read, page not present > > > > > > instruction pointer = 0x20:0xc082c552 > > > > > > stack pointer = 0x28:0xe7eed7a8 > > > > > > frame pointer = 0x28:0xe7eed7ac > > > > > > code segment = base 0x0, limit 0xfffff, type 0x1b > > > > > > = DPL 0, pres 1, def32 1, gran 1 > > > > > > processor eflags = interrupt enabled, resume, IOPL = 0 > > > > > > current process = 63645 (find) > > > > > > trap number = 12 > > > > > > panic: page fault > > > > > > Uptime: 11h16m47s > > > > > > Physical memory: 1014 MB > > > > > > Dumping 143 MB: 128 112 96 80 64 48 32 16 > > > > > > > > > > > > #6 0xc0898d4c in calltrap () at > > /usr/src/sys/i386/i386/exception.s:169 > > > > > > #7 0xc082c552 in inodedep_find (inodedephd=Variable "inodedephd" > > is > > > > not > > > > > > available. > > > > > > ) > > > > > > at /usr/src/sys/ufs/ffs/ffs_softdep.c:2073 > > > > > > > > > > Please go to frame 7 and do 'x/i $rip'. > > > > > > > > > > > > > (kgdb) up 7 > > > > #7 0xc082c552 in inodedep_find (inodedephd=Variable "inodedephd" is > > not > > > > available. > > > > ) at /usr/src/sys/ufs/ffs/ffs_softdep.c:2073 > > > > 2073 /usr/src/sys/ufs/ffs/ffs_softdep.c: No such file or directory. > > > > in /usr/src/sys/ufs/ffs/ffs_softdep.c > > > > (kgdb) x/i $rip > > > > Value can't be converted to integer. > > > > > > Oh, this is i386, use "$eip" instead of "$rip", so 'x/i $eip' at frame 7. > > > > > > (kgdb) x/i $eip > > 0xc082c552 : cmp %ecx,0x24(%eax) > > Ok, so %eax must be 1. I think you probably have failing RAM with a stuck bit > or some such. > Today I've just finished HDD scan with recoverdisk and there were 3 bad sectors. It was stuck on them for a 15 hrs, until it finally did read whole disk, Then I've run it again and it read HDD 100%, without a glitch. I don't know was it a firmware realocated those or those looooong read attempts fixed a thing. Then reboted into single user and run fsck, which detected a LOT unreferenced inodes at /usr, which it successfully reconected. Finally fsck again to get clean, non error output. Could that caused a panics? PS: I'll run a memtest86+ when I get some time. For how long do you advise? Domagoj