Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 22 Apr 2003 13:12:05 -0700 (PDT)
From:      John Polstra <jdp@polstra.com>
To:        net@freebsd.org
Subject:   Re: em net (optical GigE) driver hangs?
Message-ID:  <200304222012.h3MKC5OZ009213@strings.polstra.com>
In-Reply-To: <FE045D4D9F7AED4CBFF1B3B813C8533701918A83@mail.sandvine.com>
References:  <FE045D4D9F7AED4CBFF1B3B813C8533701918A83@mail.sandvine.com>

next in thread | previous in thread | raw e-mail | index | archive | help
In article <FE045D4D9F7AED4CBFF1B3B813C8533701918A83@mail.sandvine.com>,
Dave Dolson  <ddolson@sandvine.com> wrote:
> 
> Has anyone experienced em interface hangs after approx several days of heavy
> operation?
> 
> We are using a system which is mostly RELENG_4_7, using multiple optical em
> GigE devices.
> 
> The symptom is that the interface stops transmitting or receiving, reporting
> drops on output (no tx descriptors) and input errors (MPC stat-->no receive
> descriptors).
> 
> It turns out that all but 64 transmit descriptors are in use.  The driver is
> waiting for the "done" flag to be set so it can clean the descriptors.
> The device is also in the OACTIVE state at this time.
> 
> After the interface is brought down (or unplugged), the em watchdog timer
> goes off 5s later.
> 
> We are trying to figure out two things:
> 1. why did the driver lock up?
> 2. why didn't the watchdog timer go off earlier?
> 
> I think we would be happy to solve #2 given the rarity of the event.
> Is the RELENG_4 version likely to fix the problem?

I think the RELENG_4 version is likely to eliminate the problem.  See
the comment near the define of EM_RDTR in if_em.h (in the RELENG_4
version of that file, of course).

John
-- 
  John Polstra
  John D. Polstra & Co., Inc.                        Seattle, Washington USA
  "Disappointment is a good sign of basic intelligence."  -- Chögyam Trungpa



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200304222012.h3MKC5OZ009213>