From owner-freebsd-stable@FreeBSD.ORG Mon May 14 12:45:39 2007 Return-Path: X-Original-To: stable@freebsd.org Delivered-To: freebsd-stable@FreeBSD.ORG Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 1984116A403; Mon, 14 May 2007 12:45:39 +0000 (UTC) (envelope-from freebsd@hub.org) Received: from hub.org (hub.org [200.46.204.220]) by mx1.freebsd.org (Postfix) with ESMTP id C153813C459; Mon, 14 May 2007 12:45:38 +0000 (UTC) (envelope-from freebsd@hub.org) Received: from localhost (unknown [200.46.204.183]) by hub.org (Postfix) with ESMTP id 9709285C8E5; Mon, 14 May 2007 09:45:32 -0300 (ADT) Received: from hub.org ([200.46.204.220]) by localhost (mx1.hub.org [200.46.204.183]) (amavisd-maia, port 10024) with ESMTP id 32672-08; Mon, 14 May 2007 09:45:37 -0300 (ADT) Received: from ganymede.hub.org (blk-89-241-126.eastlink.ca [24.89.241.126]) by hub.org (Postfix) with ESMTP id 1642385C8CC; Mon, 14 May 2007 09:45:32 -0300 (ADT) Received: from localhost (localhost [127.0.0.1]) by ganymede.hub.org (Postfix) with ESMTP id 828BD615F0; Mon, 14 May 2007 09:45:41 -0300 (ADT) Date: Mon, 14 May 2007 09:45:40 -0300 From: "Marc G. Fournier" To: Robert Watson Message-ID: X-Mailer: Mulberry/4.0.8 (Linux/x86) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Content-Disposition: inline Cc: stable@FreeBSD.org Subject: Re: UNIX domain sockets MFC's X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 14 May 2007 12:45:39 -0000 -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 - --On Monday, May 14, 2007 11:29:12 +0100 Robert Watson wrote: > > On Sat, 12 May 2007, Marc G. Fournier wrote: > >>> The fix for this has now been merged as 1.155.2.22. As there have been no >>> new reports of UNIX domain socket problems in the last couple of days, it >>> sounds like the MFC of the last batch of fixes and cleanups has not lead to >>> problems. >> >> I've just upgraded my kernel to the latest, to include the MFC'd code above >> ... > > Yes -- I was very specific in my e-mail regarding the MFC's that they were > not believed to address the problem you are reporting. I think we have a > leak in the way some edge case is handled with regard to UNIX domain socket > shutdown. What would be really nice to know is if that persists in 7-CURRENT, > in which we've redone the way the socket life cycle works. However, I don't > know if you are able to tolerate booting a 7-CURRENT kernel in your > environment...? On that server, that could be very difficult ... if this was happening on any of my HP servers, I would in a minute ... > Did we determine whether backing out to before the unpcb socket reference > count change made any difference for you? The problem appeared to persist after backing it out ... I'm curious about something ... way back, when I was using unionfs, I had a major problem with vnode leakage ... as I mentioned before, this server is the only one I have that uses geom/gmirror on its drives, the rest all use hardware RAID ... is there *any* possibility that I'm seeing some sort of interaction issue? It really bothers me that the only server that I'm seeing this one is the one that I'm using software RAID on ... Would it be useful to add some DEBUG statements to the socket code, to trace open/close/flush/etc? Maybe to see where flush's are being started, but never completed? That sort of thing? - ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email . scrappy@hub.org MSN . scrappy@hub.org Yahoo . yscrappy Skype: hub.org ICQ . 7615664 -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.5 (FreeBSD) iD8DBQFGSFn14QvfyHIvDvMRAow2AKC67Y0QuiiF+ZJA5Tpbd3WUvcmdTwCaAgZS OY4em31JQzIIbs1CUcmpHNo= =1Mqr -----END PGP SIGNATURE-----