Date: Thu, 12 Oct 2006 11:56:21 -0400 From: Ernest Natiello <enatiello@broadviewnet.net> To: Gleb Smirnoff <glebius@FreeBSD.org> Cc: freebsd-stable@FreeBSD.org Subject: Re: freebsd panic on HP Proliant DL360 Message-ID: <1160668581.5159.24.camel@localhost> In-Reply-To: <20061012154826.GO59833@cell.sick.ru> References: <20061012091309.GK59833@FreeBSD.org> <E1GXxPc-0009zm-T4@dilbert.firstcallgroup.co.uk> <20061012101525.GM59833@cell.sick.ru> <1160666283.5159.22.camel@localhost> <20061012154826.GO59833@cell.sick.ru>
next in thread | previous in thread | raw e-mail | index | archive | help
here we go: (kgdb) frame 7 #7 0xc072191d in ip_ctloutput (so=0x1, sopt=0xe9226c90) at /usr/src/sys/netinet/ip_output.c:1184 1184 INP_LOCK(inp); (kgdb) p *sopt $1 = {sopt_dir = SOPT_SET, sopt_level = 0, sopt_name = 1, sopt_val = 0x0, sopt_valsize = 0, sopt_td = 0xc73add80} (kgdb) frame 11 #11 0xc06c3ce6 in setsockopt (td=0xc73add80, uap=0x1) at /usr/src/sys/kern/uipc_syscalls.c:1307 1307 return (kern_setsockopt(td, uap->s, uap->level, uap->name, (kgdb) p td->td_proc->p_comm $2 = "tcpserver\000\000\000\000\000\000\000\000\000\000" (kgdb) On Thu, 2006-10-12 at 19:48 +0400, Gleb Smirnoff wrote: > On Thu, Oct 12, 2006 at 11:18:03AM -0400, Ernest Natiello wrote: > E> Hello, > E> Thank you very much for all of the help. I am trying to understand > E> this issue, as it has been plaguing me for quite some time. > E> So, extrapolating from the below kgdb output, am I to assume that > E> the process causing the error is tcpserver? > > Probably it is. However, can you run the gdb commands I mentioned > in previous post, to make us sure. > > E> And should I further infer > E> that tcpserver would cause this issue on all instances of FreeBSD > E> RELENG_6, regardless of hardware? > > I think so. A tcpserver(8) in given configuration. > > E> I have three other servers HP Proliant DL380s (2u) which are > E> operating in a _similar_ capacity, (incoming vs. outgoing mailservers) > E> running the exact same software, which have never had a problem. > E> These three servers are running: FreeBSD unix29 6.1-PRERELEASE > E> FreeBSD 6.1-PRERELEASE #0: Mon Mar 27 10:42:56 EST 2006 > E> root@unix34.broadviewnet.net:/usr/obj/usr/src/sys/UNIX34 i386 > E> The operating system on this machine was rsync'd from one of the > E> servers that is having the panic issue, yet it continues to operate > E> flawlessly. > > The discussed problem is a race between remote client closing TCP > connection (may be resetting?), and local software performing > setsockopt() system call on the same socket. > > It may happen that this particulat server has to deal with clients > that drop the connection randomly, and other servers don't. That's > why other servers are stable. > > E> I guess I could try swapping the services between two of the > E> servers and see if the behavior follows the move. Does that sound > E> viable? > > You can try it. > > And don't forget to run gdb commands, and see what is the actual > socket option that causes the problem. >
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?1160668581.5159.24.camel>