Date: Tue, 22 Apr 2003 13:12:05 -0700 (PDT) From: John Polstra <jdp@polstra.com> To: net@freebsd.org Subject: Re: em net (optical GigE) driver hangs? Message-ID: <200304222012.h3MKC5OZ009213@strings.polstra.com> In-Reply-To: <FE045D4D9F7AED4CBFF1B3B813C8533701918A83@mail.sandvine.com> References: <FE045D4D9F7AED4CBFF1B3B813C8533701918A83@mail.sandvine.com>
next in thread | previous in thread | raw e-mail | index | archive | help
In article <FE045D4D9F7AED4CBFF1B3B813C8533701918A83@mail.sandvine.com>, Dave Dolson <ddolson@sandvine.com> wrote: > > Has anyone experienced em interface hangs after approx several days of heavy > operation? > > We are using a system which is mostly RELENG_4_7, using multiple optical em > GigE devices. > > The symptom is that the interface stops transmitting or receiving, reporting > drops on output (no tx descriptors) and input errors (MPC stat-->no receive > descriptors). > > It turns out that all but 64 transmit descriptors are in use. The driver is > waiting for the "done" flag to be set so it can clean the descriptors. > The device is also in the OACTIVE state at this time. > > After the interface is brought down (or unplugged), the em watchdog timer > goes off 5s later. > > We are trying to figure out two things: > 1. why did the driver lock up? > 2. why didn't the watchdog timer go off earlier? > > I think we would be happy to solve #2 given the rarity of the event. > Is the RELENG_4 version likely to fix the problem? I think the RELENG_4 version is likely to eliminate the problem. See the comment near the define of EM_RDTR in if_em.h (in the RELENG_4 version of that file, of course). John -- John Polstra John D. Polstra & Co., Inc. Seattle, Washington USA "Disappointment is a good sign of basic intelligence." -- Chögyam Trungpa
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200304222012.h3MKC5OZ009213>