From: Pyun YongHyeon
Date: Mon, 29 Mar 2010 11:38:48 -0700
To: Attila Nagy
Cc: Mailing List FreeBSD Stable, Michael Loftis
Subject: Re: 8-STABLE freezes on UDP traffic (DNS), 7.x doesn't
Message-ID: <20100329183848.GE1473@michelle.cdnetworks.com>
References: <4BAB718C.3090001@fsn.hu> <886B21E1787F0003B89E34B6@[192.168.1.44]> <4BB087B7.3030602@fsn.hu>
In-Reply-To: <4BB087B7.3030602@fsn.hu>
Reply-To: pyunyh@gmail.com
List-Id: Production branch of FreeBSD source code

On Mon, Mar 29, 2010 at 12:57:59PM +0200, Attila Nagy wrote:
> Hi,
>
> Michael Loftis wrote:
> >
> >
> > --On Thursday, March 25, 2010 3:22 PM +0100 Attila Nagy
> > wrote:
> >
> > <...>
> >> Both unbound and python accepts DNS requests, and it seems when 25%
> >> interrupt happens, only unbound is in *udp state, where it is 50%, both
> >> programs are in that state.
> >
> > Try turning off hardware TSO/checksum offload if it's available on your
> > chipset? ifconfig -rxcsum -txcsum -tso -- I'm only using
> > nfe chips right now, but w/ the TSO/CSUM on they lock up constantly
> > under high load. We're pretty sure it's mostly the nfe driver, or the
> > chips themselves, but have never ruled out some generic 8.x hardware
> > offload issues.
> Bingo, this solved the problem. The current uptime nears four days.
> Previously I couldn't go further than a day.
>
> The machine gets very light TCP load (and other machines which get work
> well), so I guess it's UDP RX or TX checksum related.
>

Hmm, this is an unexpected result. Since you're using UDP, TSO is not
involved in this issue. Now that you have disabled RX/TX checksum
offloading, could you check how many 'bad checksum' and 'no checksum'
datagrams netstat(1) reports?
To narrow down which side of checksum offloading causes the issue,
would you disable just one side at a time?
For instance, disable TX checksum offloading with RX checksum
offloading enabled and see how bce(4) works.

# ifconfig bce0 -txcsum rxcsum

If that shows the same issue, try disabling RX checksum offloading but
enabling TX checksum offloading.

# ifconfig bce0 txcsum -rxcsum
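
For the netstat(1) counters asked about above, a small sh helper along these lines might make it easier to compare the numbers before and after each ifconfig change. This is only a sketch: the function name udp_csum_counters is made up for illustration, and it assumes the UDP section of `netstat -s -p udp` prints lines ending in "with bad checksum" and "with no checksum", which may vary between releases.

```shell
#!/bin/sh
# Hypothetical helper (not part of FreeBSD): read `netstat -s -p udp`
# output on stdin and print the two UDP checksum counters on one line.
# Assumes each counter line starts with the count, e.g.
#   "3 with bad checksum"
udp_csum_counters() {
    awk '
        /with bad checksum/ { bad = $1 }   # datagrams dropped for a bad checksum
        /with no checksum/  { none = $1 }  # datagrams received without a checksum
        END { printf "bad=%d none=%d\n", bad, none }
    '
}

# Intended usage on the affected machine (not run here):
#   netstat -s -p udp | udp_csum_counters
```

Running it once before changing the offload flags and again after some traffic has passed should show whether either counter grows under a given txcsum/rxcsum combination.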