From owner-freebsd-amd64@freebsd.org Mon May 16 19:25:25 2016 Return-Path: Delivered-To: freebsd-amd64@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 00235B3DF7B for ; Mon, 16 May 2016 19:25:24 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from kenobi.freebsd.org (kenobi.freebsd.org [IPv6:2001:1900:2254:206a::16:76]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id CB6B01268 for ; Mon, 16 May 2016 19:25:24 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from bugs.freebsd.org ([127.0.1.118]) by kenobi.freebsd.org (8.15.2/8.15.2) with ESMTP id u4GJPOYc007925 for ; Mon, 16 May 2016 19:25:24 GMT (envelope-from bugzilla-noreply@freebsd.org) From: bugzilla-noreply@freebsd.org To: freebsd-amd64@FreeBSD.org Subject: [Bug 209471] Listen queue overflow due to too many sockets stuck in CLOSED state Date: Mon, 16 May 2016 19:25:24 +0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: Base System X-Bugzilla-Component: kern X-Bugzilla-Version: 10.3-RELEASE X-Bugzilla-Keywords: X-Bugzilla-Severity: Affects Only Me X-Bugzilla-Who: rblayzor@inoc.net X-Bugzilla-Status: New X-Bugzilla-Resolution: X-Bugzilla-Priority: --- X-Bugzilla-Assigned-To: freebsd-net@FreeBSD.org X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: https://bugs.freebsd.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-Mailman-Approved-At: Mon, 16 May 2016 19:39:44 +0000 X-BeenThere: freebsd-amd64@freebsd.org X-Mailman-Version: 2.1.22 Precedence: list List-Id: Porting FreeBSD to the AMD64 platform List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 16 May 2016 19:25:25 -0000 https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D209471 --- Comment #6 from Robert Blayzor --- I do not have an environment that would allow me really to test it. This problem does seem a lot more apparently after upgrading to 10.3 however. It= 's either that, or the work around for BugID 204426 unmasked this one from happening. Bug 204426 we would see our processes normally die, so they never really ran for longer period of time before we had to restart them. We added the patch for PR 204426, and processes seem stable now, but now we have this bug. I believe we used to see this in in 10.2 as well, but not ne= arly as often now that we seem to have 204426 fixed. The application doesn't seem to matter. Our environment has not changed. The only real special setup we have is also described in 204426. ESXi hyperviso= r, VMX NIC driver, and NFS mounted root FS. Other than the sysctl knobs previo= usly mentioned, nothing else special other than a non-GENERIC kernel; but all we= did was remove modules and drivers we do not use so kernel build time is faster. One thing I did not check is to see if this is a V6 socket only issue or if= V4 is also affected. I can still SSH into the server when this happens, only t= he process with the full queue and lots of sockets stuck in CLOSED seems to h= ung. 9 times out of 10 only a server reboot clears the issue. Rarely we can mana= ge to kill -9 the process and restart it. --=20 You are receiving this mail because: You are on the CC list for the bug.=