Date: Thu, 31 Mar 2011 20:07:17 +0200 (CEST)
From: Yamagi Burmeister
To: YongHyeon PYUN
Cc: freebsd-net@freebsd.org, Yamagi Burmeister, yongari@freebsd.org
Subject: Re: Kernel memory corruption(?) with age(4)
In-Reply-To: <20110331171302.GA11981@michelle.cdnetworks.com>
List-Id: Networking and TCP/IP with FreeBSD

On Thu, 31 Mar 2011, YongHyeon PYUN wrote:

>> All boxes are quadcore machines with 8GB RAM, running FreeBSD/amd64.
>> After limiting the memory via hw.physmem to 3GB the problems are gone.
>> The box is running crashfree for more than 6 hours and has served over
>> 300GB of data via age(4).
>>
>
> Thanks for testing.
> Remove the hw.physmem configuration and try
> attached patch and let me know how it goes.

Thanks for your help, but the patch doesn't work. I got another random
panic, this time "page fault in kernel mode", with nothing related to
age(4) or the network stack in the backtrace.

Maybe it'll help to know about a bug fix in the Linux atl1 driver, now
replaced by atlx. In git commit 5f08e46b621a769e52a9545a23ab1d5fb2aec1d4,
64-bit DMA was disabled:

   64-bit DMA causes data corruption with atl1. We don't know why, and
   Atheros is working on it. For now, just use 32-bit DMA. This is a
   big hack that is probably wrong, but it stops the bleeding.

There was no later follow-up on it. I think this can't be the problem
on FreeBSD, but maybe I'm reading the driver code wrong. The kernel.org
gitweb URL is:

http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.23.y.git;a=commitdiff;h=5f08e46b621a769e52a9545a23ab1d5fb2aec1d4

-- 
Homepage:  www.yamagi.org
Jabber:    yamagi@yamagi.org
GnuPG/GPG: 0xEFBCCBCB