From owner-freebsd-net@FreeBSD.ORG Fri Mar 1 16:04:10 2013 Return-Path: Delivered-To: freebsd-net@freebsd.org Received: from mx1.freebsd.org (mx1.FreeBSD.org [8.8.178.115]) by hub.freebsd.org (Postfix) with ESMTP id E32989FD for ; Fri, 1 Mar 2013 16:04:10 +0000 (UTC) (envelope-from ncrogers@gmail.com) Received: from mail-vc0-f180.google.com (mail-vc0-f180.google.com [209.85.220.180]) by mx1.freebsd.org (Postfix) with ESMTP id 7BF6B726 for ; Fri, 1 Mar 2013 16:04:10 +0000 (UTC) Received: by mail-vc0-f180.google.com with SMTP id fo13so2038805vcb.39 for ; Fri, 01 Mar 2013 08:04:09 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:x-received:in-reply-to:references:date:message-id :subject:from:to:cc:content-type; bh=HLtbcI2TMYPT25tfTxFO2LpDgBj5+bWNzhjTNpqv/aI=; b=I0MOk3Obh0oURq0FtcurjQ81KQlYnXMNszxzsJ6gnJssNFu6UaRCX1JyPFe09Czufm jqEMO/EsmXmQIRBv+OOmeo2fzoz4JjWgjHYuXiZBCiT/FWgUkboA/+o9/1xyaWFiNm+0 VSC0JqcHGirWZaXPgN4kaJKDNTcFWazH35lAsbWADgYT7OU69KwjzSElXe17WcJfsM6p 6uEtbf4UxRMSl0KTapz5gevvR+B9mVrWMn2Bv/owt9xXmKrb5Nyofrayhbxd9hBFae8r TIOgVKcNmMgqbG8mVdksa7qzuhDC9yCTl+gK7AO1MZd26G4OZ/r/U9CUrt8UgaOVdC8R ijGA== MIME-Version: 1.0 X-Received: by 10.58.188.48 with SMTP id fx16mr4320756vec.22.1362153849662; Fri, 01 Mar 2013 08:04:09 -0800 (PST) Received: by 10.52.176.131 with HTTP; Fri, 1 Mar 2013 08:04:09 -0800 (PST) In-Reply-To: References: <512BAA60.3060703@biostat.wisc.edu> <512BAF8D.7080308@biostat.wisc.edu> Date: Fri, 1 Mar 2013 08:04:09 -0800 Message-ID: Subject: Re: igb network lockups From: Nick Rogers To: Jack Vogel Content-Type: text/plain; charset=ISO-8859-1 Cc: freebsd-net@freebsd.org, "Christopher D. Harrison" X-BeenThere: freebsd-net@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Networking and TCP/IP with FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 01 Mar 2013 16:04:10 -0000 FWIW I have been experiencing a similar issue on a number of systems using the em(4) driver under 9.1-RELEASE. This is after upgrading from a snapshot of 8.3-STABLE. My systems use PF+ALTQ as well. The symptoms are: interface stops passing traffic until the system is rebooted. I have not yet been able to gain access to the systems to dig around (after they have crashed), however my kernel/network settings are properly tuned (high mbuf limit, hw.em.rxd/txd=4096, etc). It seems to happen about once a day on systems with around a sustained 50Mb/s of traffic. I realize this is not much to go on but perhaps it helps. I am debating trying the e1000 driver in the latest CURRENT on top of 9.1-RELEASE. I noticed the Intel shared code was updated about a week ago. Would this change or perhaps another change to e1000 since 9.1-RELEASE possibly affect stability in a positive way? Thanks. On Mon, Feb 25, 2013 at 10:45 AM, Jack Vogel wrote: > Have you done any poking around, looking at stats to determine why the > hangs? For instance, > might your mbuf pool be depleted? Some other network resource perhaps? > > Jack > > > On Mon, Feb 25, 2013 at 10:38 AM, Christopher D. Harrison < > harrison@biostat.wisc.edu> wrote: > >> Sure, >> The problem appears on both systems running with ALTQ and vanilla. >> -C >> >> On 02/25/13 12:29, Jack Vogel wrote: >> >> I've not heard of this problem, but I think most users do not use ALTQ, >> and we (Intel) do not >> test using it. Can it be eliminated from the equation? >> >> Jack >> >> >> On Mon, Feb 25, 2013 at 10:16 AM, Christopher D. Harrison < >> harrison@biostat.wisc.edu> wrote: >> >>> I recently have been experiencing network "freezes" and network "lockups" >>> on our Freebsd 9.1 systems which are running zfs and nfs file servers. >>> I upgraded from 9.0 to 9.1 about 2 months ago and we have been having >>> issues with almost bi-monthly. The issue manifests in the system becomes >>> unresponsive to any/all nfs clients. The system is not resource bound as >>> our I/O is low to disk and our network is usually in the 20mbit/40mbit >>> range. We do notice a correlation between temporary i/o spikes and >>> network freezes but not enough to send our system in to "lockup" mode for >>> the next 5min. Currently we have 4 igb nics in 2 aggr's with 8 queue's >>> per nic and our dev.igb reports: >>> >>> dev.igb.3.%desc: Intel(R) PRO/1000 Network Connection version - 2.3.4 >>> >>> I am almost certain the problem is with the ibg driver as a friend is >>> also experiencing the same problem with the same intel igb nic. He has >>> addressed the issue by restarting the network using netif on his systems. >>> According to my friend, once the network interfaces get cleared, everything >>> comes back and starts working as expected. >>> >>> I have noticed an issue with the igb driver and I was looking for >>> thoughts on how to help address this problem. >>> >>> http://freebsd.1045724.n5.nabble.com/em-igb-if-transmit-drbr-and-ALTQ-td5760338.html >>> >>> Thoughts/Ideas are greatly appreciated!!! >>> >>> -C >>> >>> _______________________________________________ >>> freebsd-net@freebsd.org mailing list >>> http://lists.freebsd.org/mailman/listinfo/freebsd-net >>> To unsubscribe, send any mail to "freebsd-net-unsubscribe@freebsd.org" >>> >> >> >> > _______________________________________________ > freebsd-net@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-net > To unsubscribe, send any mail to "freebsd-net-unsubscribe@freebsd.org"