From owner-freebsd-stable@FreeBSD.ORG Mon Oct 5 19:32:54 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 36CB810656A3 for ; Mon, 5 Oct 2009 19:32:54 +0000 (UTC) (envelope-from db@danielbond.org) Received: from bgo1smout1.broadpark.no (bgo1smout1.broadpark.no [217.13.4.94]) by mx1.freebsd.org (Postfix) with ESMTP id 599458FC36 for ; Mon, 5 Oct 2009 19:32:53 +0000 (UTC) MIME-version: 1.0 Received: from bgo1sminn1.broadpark.no ([217.13.4.93]) by bgo1smout1.broadpark.no (Sun Java(tm) System Messaging Server 6.3-3.01 (built Jul 12 2007; 32bit)) with ESMTP id <0KR200BOU0T5Q420@bgo1smout1.broadpark.no> for freebsd-stable@freebsd.org; Mon, 05 Oct 2009 20:31:53 +0200 (CEST) Received: from [10.0.0.5] ([89.10.20.96]) by bgo1sminn1.broadpark.no (Sun Java(tm) System Messaging Server 6.3-3.01 (built Jul 12 2007; 32bit)) with ESMTP id <0KR2008970T4OSB1@bgo1sminn1.broadpark.no> for freebsd-stable@freebsd.org; Mon, 05 Oct 2009 20:31:53 +0200 (CEST) Message-id: <57F8F331-E823-4F88-BDD5-A8B95A3B4CB6@danielbond.org> From: Daniel Bond To: Jack Vogel In-reply-to: <2a41acea0910050957x2d085e90w2ebea7f9eb87c3e4@mail.gmail.com> Content-type: multipart/signed; protocol="application/pgp-signature"; micalg=pgp-sha1; boundary=Apple-Mail-3--565055668 Content-transfer-encoding: 7bit Date: Mon, 05 Oct 2009 20:32:14 +0200 References: <4AB9638B.8040607@monkeybrains.net> <4AC3DB8F.7010602@monkeybrains.net> <2a41acea0909301556g1df7dbafv813f5924553c8bfb@mail.gmail.com> <4AC5198E.7030609@monkeybrains.net> <4AC51B4C.7080905@monkeybrains.net> <2a41acea0910011450v41590f3dn112f367f26faed2d@mail.gmail.com> <4AC64835.3060107@monkeybrains.net> <2a41acea0910021237w415efa2cs4354a0f99aef8f6@mail.gmail.com> <4AC66437.4040704@monkeybrains.net> <6194E9BC-3A3D-4941-A777-88C7411905B0@danielbond.org> <2a41acea0910050957x2d085e90w2ebea7f9eb87c3e4@mail.gmail.com> X-Pgp-Agent: GPGMail 1.2.0 (v56) X-Mailer: Apple Mail (2.936) Cc: FreeBSD Stable Subject: Re: em0 watchdog timeouts X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 05 Oct 2009 19:32:54 -0000 This is an OpenPGP/MIME signed message (RFC 2440 and 3156) --Apple-Mail-3--565055668 Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Content-Transfer-Encoding: 7bit Hi Jack, I'll comment your mail inline: On Oct 5, 2009, at 6:57 PM, Jack Vogel wrote: > This posting just muddies the issue, first you talk about having a > problem that > involves Broadcom, ok, so post about that on something other than > em :) I only meant to indicate that the problem might exist outside the intel driver. I'm also indicating that it happens with several drivers (bge, bce and em) on several different machines, on both pci-x and pci-e. I'm sorry if this is confusing to you, but I still think it's relevant to mention. > > Then you make some references to hardware that you "might have bought" > but didn't, I'm not about debugging 'possible worlds problems' > though so > can't help you there either :) No. I only made references to hardware I actually used, and had real- world issues with. > > Finally you never say what the actual hardware is, other than a > person who > I do not know told you it was the best performer... so, what exactly > is it? Sepherosa is a guy that writes drivers for BSD based operating systems. Including FreeBSD. He has a lot of knowledge in this area. http://people.freebsd.org/~sephe/ The NIC you are referring to, the one sephe recommended me, is a 82571EB. I didn't mention specific hardware, as I think it's more important to note this is an issue I'm experiencing across different sets of hardware and drivers. > > You have a problem once every 10 days, and at a specific time no > less, > this almost always means something in your environment, a cron job run > amok, a piece of hardware that resets, I dunno, but the last thing I > would > suspect given this description is the driver. This is not what I wrote. I wrote I had a problem every 1-10 days, but it would usually happen once every 3-4 days. At worst, every day in periods. It's not at any specific time. If you read my email correctly, I say it *usually* happens arround 11-13:00, but it has happened at random times too. This is my point exactly. I don't think it's the Intel-driver, I think the problem is elsewhere. I had a suspicion it had to do with the combination of nic + qlogic fc-controller, but I have no evidence of this. > > You need a good sysadmin for this debugging I would venture, not a > driver > developer. What I need is useful advice/help. I never stated I needed a driver developer. I'd like to be able to run my favorite OS on cool hardware, in the future, for a high-performing NFS-server, without problems like I've experienced the past 6months, on a production system. Please note that I'm managing a server-park almost completely based on FreeBSD, and I'm running many NFS servers on other hardware, for other services, without issues. I've seen several other FreeBSD-users having problems with this too, so I think it's of importance for the project. As I mentioned originally, I'm happy to dispose the hardware to any FreeBSD developer that might want to look further into this. Debugging it further is above my skill-set, I don't even know where to begin looking, especially since I can't produce any panics. I'm sorry to say, but your reply was %0 useful, Jack. > > Jack > - Daniel --Apple-Mail-3--565055668 content-type: application/pgp-signature; x-mac-type=70674453; name=PGP.sig content-description: This is a digitally signed message part content-disposition: inline; filename=PGP.sig content-transfer-encoding: 7bit -----BEGIN PGP SIGNATURE----- Version: GnuPG/MacGPG2 v2.0.11 (Darwin) iEYEARECAAYFAkrKO64ACgkQF4Ca8+3pySXusACfXvo7rHi8Dw/HEePHDv1wb1mS 5pEAoNNif5AYbrkgRcoJrIcYrp0p4kVW =cJSf -----END PGP SIGNATURE----- --Apple-Mail-3--565055668--