Date: Tue, 6 Apr 2004 23:56:53 +0100 (BST) From: Richard Wendland <richard@starburst.demon.co.uk> To: berhart@ErhartGroup.COM (Brandon Erhart) Cc: Andre Oppermann <andre@freebsd.org> Subject: Re: FIN_WAIT_[1,2] and LAST_ACK Message-ID: <200404062256.XAA14667@starburst.demon.co.uk> In-Reply-To: <6.0.2.0.2.20040405180951.01c8d898@mx1.erhartgroup.com> from "Brandon Erhart" at Apr 05, 2004 06:11:41 PM
next in thread | previous in thread | raw e-mail | index | archive | help
> They are not timing out after 2MSL. I set my MSL to the lowest possible > setting (10) as to make TIME_WAIT connections disappear. The FIN_WAIT_[1,2] > and LAST_ACK seem to be sticking around for a while. > >Do the FIN_WAIT_1|2 and LAST_ACK time out after 2MSL or do they stick > >around forever? If they stick around forever, then there is something > >broken. I see quite a lot of connections `stuck' in FIN_WAIT_2 from talking to some particular web servers/OSs, and can reproduce the problem. The FreeBSD stack isn't broken, the far end is just acting strangely. In the cases I see it is simply that the remote web-server thread has become confused or stuck and won't close the TCP connection after my FreeBSD client has sent a FIN. My connection is stuck in FIN_WAIT_2 awaiting a FIN from the server to go into CLOSING state, but the far TCP (crazily) is sending keepalives about every 10 minutes. Essentially the problem is the strange server behaviour; I don't think much can be done about this particular case in the FreeBSD client with the socket API. FreeBSD has the traditional Berkeley non-standard TCP behaviour of forcing idle FIN_WAIT_2 connections into the CLOSING state after 11m 15sec, as described on page 246 of TCP Illus. Vol 1 (by setting the tcp_timer_2msl timer in "case TCPS_FIN_WAIT_1:" in tcp_input.c and later calling tcp_close() for idle connections in tcp_timer_2msl()). The 10 mins keep-alive prevents this from happening. NB it doesn't matter that the FreeBSD client process has terminated, the socket is already closed so process termination does nothing extra. A possible improvement on the traditional Berkeley behaviour would be to require some actual data transfer rather than simply non-idleness to prevent FIN_WAIT_2 being forced into CLOSING state. But I doubt this problem is serious enough to make such a change. I can reproduce this with Novell-HTTP-Server (running on NetWare I think), by connecting and closing immediately without sending a method request. This seems to confuse the web server thread, and it doesn't respond with a FIN as you'd expect. This is probably not the only way to cause this confusion, but it serves for reproducing the problem. Here's a tcpdump of what happens on FreeBSD 4.9-PRERELEASE: 20:57:26.987078 fbsd.4613 > novell.80: S 675591727:675591727(0) win 32768 <mss 1460,nop,wscale 0,nop,nop,timestamp 1325730099 0> (DF) 20:57:27.114442 novell.80 > fbsd.4613: S 4258048106:4258048106(0) ack 675591728 win 1460 <mss 1460,wscale 0,nop> (DF) 20:57:27.114472 fbsd.4613 > novell.80: . ack 1 win 33580 (DF) 20:57:31.396853 fbsd.4613 > novell.80: F 1:1(0) ack 1 win 33580 (DF) 20:57:31.521313 novell.80 > fbsd.4613: . ack 2 win 6143 (DF) # novell.80 doesn't respond with a FIN as expected, # instead sends a keep-alive ever 10 mins: 21:07:24.883771 novell.80 > fbsd.4613: . 0:1(1) ack 2 win 6143 (DF) 21:07:24.883786 fbsd.4613 > novell.80: . ack 1 win 33580 (DF) 21:17:20.404002 novell.80 > fbsd.4613: . 0:1(1) ack 2 win 6143 (DF) 21:17:20.404014 fbsd.4613 > novell.80: . ack 1 win 33580 (DF) 21:27:15.357954 novell.80 > fbsd.4613: . 0:1(1) ack 2 win 6143 (DF) 21:27:15.357966 fbsd.4613 > novell.80: . ack 1 win 33580 (DF) 21:37:14.357869 novell.80 > fbsd.4613: . 0:1(1) ack 2 win 6143 (DF) 21:37:14.357881 fbsd.4613 > novell.80: . ack 1 win 33580 (DF) 21:47:06.875293 novell.80 > fbsd.4613: . 0:1(1) ack 2 win 6143 (DF) 21:47:06.875318 fbsd.4613 > novell.80: . ack 1 win 33580 (DF) 21:57:02.118544 novell.80 > fbsd.4613: . 0:1(1) ack 2 win 6143 (DF) 21:57:02.118556 fbsd.4613 > novell.80: . ack 1 win 33580 (DF) 22:06:53.237910 novell.80 > fbsd.4613: . 0:1(1) ack 2 win 6143 (DF) 22:06:53.237930 fbsd.4613 > novell.80: . ack 1 win 33580 (DF) 22:16:45.884856 novell.80 > fbsd.4613: . 0:1(1) ack 2 win 6143 (DF) 22:16:45.884871 fbsd.4613 > novell.80: . ack 1 win 33580 (DF) 22:26:38.595193 novell.80 > fbsd.4613: . 0:1(1) ack 2 win 6143 (DF) 22:26:38.595206 fbsd.4613 > novell.80: . ack 1 win 33580 (DF) ... stuck in FIN_WAIT_2, usually until reboot Richard -- Richard Wendland richard@wendland.org.uk
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200404062256.XAA14667>