From owner-freebsd-questions@FreeBSD.ORG  Thu Mar 29 19:20:45 2007
Return-Path: <owner-freebsd-questions@FreeBSD.ORG>
X-Original-To: freebsd-questions@freebsd.org
Delivered-To: freebsd-questions@freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52])
	by hub.freebsd.org (Postfix) with ESMTP id D6AFB16A403
	for <freebsd-questions@freebsd.org>;
	Thu, 29 Mar 2007 19:20:45 +0000 (UTC) (envelope-from ml@netfence.it)
Received: from parrot.aev.net (parrot.aev.net [212.31.247.179])
	by mx1.freebsd.org (Postfix) with ESMTP id 552FE13C4B8
	for <freebsd-questions@freebsd.org>;
	Thu, 29 Mar 2007 19:20:45 +0000 (UTC) (envelope-from ml@netfence.it)
Received: from soth.ventu ([151.77.236.62]) (authenticated bits=128)
	by parrot.aev.net (8.14.0/8.13.8) with ESMTP id l2TJStp5012645
	(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=OK)
	for <freebsd-questions@freebsd.org>;
	Thu, 29 Mar 2007 21:29:01 +0200 (CEST) (envelope-from ml@netfence.it)
Received: from [10.1.2.18] (alamar.ventu [10.1.2.18])
	by soth.ventu (8.14.0/8.13.8) with ESMTP id l2TJKlYr089801
	for <freebsd-questions@freebsd.org>;
	Thu, 29 Mar 2007 21:20:47 +0200 (CEST) (envelope-from ml@netfence.it)
Message-ID: <460C117E.1050902@netfence.it>
Date: Thu, 29 Mar 2007 21:20:30 +0200
From: Andrea Venturoli <ml@netfence.it>
User-Agent: Thunderbird 1.5.0.10 (X11/20070306)
MIME-Version: 1.0
To: freebsd-questions@freebsd.org
References: <45F91AE9.1090006@netfence.it>
In-Reply-To: <45F91AE9.1090006@netfence.it>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
X-Scanned-By: MIMEDefang 2.61 on 212.31.247.179
Subject: UPDATE: Server hanged on VFS lock problem
X-BeenThere: freebsd-questions@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: User questions <freebsd-questions.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-questions>, 
	<mailto:freebsd-questions-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-questions>
List-Post: <mailto:freebsd-questions@freebsd.org>
List-Help: <mailto:freebsd-questions-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-questions>, 
	<mailto:freebsd-questions-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Thu, 29 Mar 2007 19:20:45 -0000

Andrea Venturoli wrote:

> Is there a way I can get these dumps automatically, without entering 
> DDB, since this is an unattended server?

I still don't know if it's possible to get dump and get going... I don't 
think so, actually...
Anyway I found debug.vfs_badlock_ddb=0 should allow this unattended box 
to continue working.
Now I just wonder what would happen if it did...


Futhermore, I got another dump like this and in both case I got to the 
conclusion that the userland situation is that cyrus-imapd is receiving 
a message which it has to forward to another host. This is probably 
irrelevant, but isn't it quite strange that on a busy 
mailserver/fileserver/a-lot-of-other things, both dumps come from 
exactly the same cronjob (logcheck, btw) sending a mail to the same address.


This is bt (which I forgot in the original message):

> (kgdb) bt
> #0  doadump () at pcpu.h:172
> #1  0xffffffff80245a29 in boot (howto=260) at /usr/src/sys/kern/kern_shutdown.c:409
> #2  0xffffffff802454bb in panic (fmt=0xffffffff803c5a09 "from debugger")
>     at /usr/src/sys/kern/kern_shutdown.c:565
> #3  0xffffffff8017bb12 in db_panic (addr=0, have_addr=0, count=0, modif=0x0)
>     at /usr/src/sys/ddb/db_command.c:438
> #4  0xffffffff8017c055 in db_command_loop () at /usr/src/sys/ddb/db_command.c:350
> #5  0xffffffff8017df4d in db_trap (type=-1471015248, code=0)
>     at /usr/src/sys/ddb/db_main.c:222
> #6  0xffffffff80262089 in kdb_trap (type=3, code=0, tf=0xffffffffa85217b0)
>     at /usr/src/sys/kern/subr_kdb.c:473
> #7  0xffffffff80384c84 in trap (frame=
>       {tf_rdi = 0, tf_rsi = -2139025408, tf_rdx = 1, tf_rcx = 1123776, tf_r8 = 1048064, tf_r9 = 10, tf_rax = 27, tf_rbx = -1099401716568, tf_rbp = -1471014800, tf_r10 = -1471015040, tf_r11 = 4294967255, tf_r12 = -2143248681, tf_r13 = 0, tf_r14 = 0, tf_r15 = -1471014064, tf_trapno = 3, tf_addr = 0, tf_flags = -1099401716568, tf_err = 0, tf_rip = -2144986273, tf_cs = 8, tf_rflags = 642, tf_rsp = -1471014800, tf_ss = 16})
>     at /usr/src/sys/amd64/amd64/trap.c:442
> #8  0xffffffff803709db in calltrap () at /usr/src/sys/amd64/amd64/exception.S:168
> #9  0xffffffff80261b5f in kdb_enter (msg=0x0) at cpufunc.h:63
> #10 0xffffffff802adb4d in assert_vop_elocked (vp=0xffffff00068d1ca8,
>     str=0xffffffff80409ed7 "VOP_WRITE") at /usr/src/sys/kern/vfs_subr.c:3436
> #11 0xffffffff803b3eae in VOP_WRITE_APV (vop=0x0, a=0xffffffffa8521a10)
>     at vnode_if.c:709
> #12 0xffffffff802b935c in vn_write (fp=0xffffff00130ecca8, uio=0xffffffffa8521b50,
>     active_cred=0x1, flags=0, td=0xffffff0023565000) at vnode_if.h:372
> #13 0xffffffff80271b37 in dofilewrite (td=0xffffff0023565000, fd=22,
>     fp=0xffffff00130ecca8, auio=0xffffffffa8521b50, offset=1048064, flags=0)
>     at file.h:252
> #14 0xffffffff80271e01 in kern_writev (td=0xffffff0023565000, fd=22,
>     auio=0xffffffffa8521b50) at /usr/src/sys/kern/sys_generic.c:402
> #15 0xffffffff80271efa in write (td=0x0, uap=0xffffffff80811000)
>     at /usr/src/sys/kern/sys_generic.c:326
> #16 0xffffffff803854a1 in syscall (frame=
>       {tf_rdi = 22, tf_rsi = 34429279984, tf_rdx = 1208, tf_rcx = 6557696, tf_r8 = -2143273848, tf_r9 = 140737488336808, tf_rax = 4, tf_rbx = 1208, tf_rbp = 34429279984, tf_r10 = 1, tf_r11 = 642, tf_r12 = 0, tf_r13 = 22, tf_r14 = 312, tf_r15 = 0, tf_trapno = 12, tf_addr = 6652216, tf_flags = 34384627961, tf_err = 2, tf_rip = 34384825260, tf_cs = 43, tf_rflags = 518, tf_rsp = 140737488336808, tf_ss = 35})
>     at /usr/src/sys/amd64/amd64/trap.c:792
> #17 0xffffffff80370b78 in Xfast_syscall ()
>     at /usr/src/sys/amd64/amd64/exception.S:270
> #18 0x00000008017ecbac in ?? ()


I'd still appreciate if someone with more insight than me could comment 
this.

  bye & Thanks
	av.