Date: Mon, 02 Nov 2009 18:16:56 +0000 From: Gavin Atkinson <gavin@FreeBSD.org> To: Weldon S Godfrey 3 <weldon@excelsusphoto.com> Cc: freebsd-current@FreeBSD.org Subject: Re: FreeBSD 8.0 - network stack crashes? Message-ID: <1257185816.44755.29.camel@buffy.york.ac.uk> In-Reply-To: <alpine.BSF.2.00.0911020747560.80499@emmett.excelsus.com> References: <alpine.BSF.2.00.0911020747560.80499@emmett.excelsus.com>
next in thread | previous in thread | raw e-mail | index | archive | help
On Mon, 2009-11-02 at 10:52 -0500, Weldon S Godfrey 3 wrote: > Up until yesterday, we have been running FreeBSD-CURRENT of 12/08. We > started to see a couple months ago some very odd network behavior. > Something happens to the stack that causes processes accessing the network > to just hang. After the problem happens, usually (but not always), you > can't ssh in. Always, you can't ssh or telnet out, and nothing can access > the NFS shares on the server. You can ping everything from the server. > You can't even do a route add, you can't ssh if you use just the IP > address (although pinging with hostnames it doesn't have cached or in > hosts table resolves). When you try to ssh out, do a route add from the > box, the process just hangs. You can't control C it at all, it hangs > forever. There is nothing in dmesg or messages to indicate an issue. I > try to up/down the interfaces. In CURRENT-12/08, it may allow things to > work for like 30s. Some things that would be useful: - Does "arp -da" fix things? - What's the output of "netstat -m" while the networking is broken? - What does CTRL-T show for the hung SSH or route processes? - What does "procstat -kk" on the same processes show? - Does going to single user mode ("init 1" and killing off any leftover processes) cause the machine to start working again? If so, what's the output of "netstat -m" afterwards? If you look to be hitting some of the limits shown by "netstat -m", try logging the date, "netstat -m" and "vmstat -m" to a file every 30 seconds or similar so that we can see if it is a memory leak, and what may be leaking. Gavin
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?1257185816.44755.29.camel>