From: Sam Leffler
Date: Sat, 29 Jul 2006 08:40:41 -0700
To: Ross Finlayson
Cc: freebsd-mobile@freebsd.org
Subject: Re: Ongoing problems with the "ath" interface - is any relief in sight??
Message-ID: <44CB8179.5050503@errno.com>
List-Id: Mobile computing with FreeBSD

Ross Finlayson wrote:
> For several months now, the "ath" interface has been spazzing out at
> random times (in systems that are acting as wireless base stations).
> For example:
>
> Jul 28 21:44:47 ns kernel: ath0: stuck beacon; resetting (bmiss count 4)
> Jul 28 21:44:47 ns kernel: ath0: ath_reset: unable to reset hardware; hal status 3
> Jul 28 21:45:08 ns kernel: ath0: device timeout
> Jul 28 21:45:08 ns kernel: ath0: stuck beacon; resetting (bmiss count 4)
> Jul 28 21:45:08 ns kernel: ath0: ath_reset: unable to reset hardware; hal status 3
> [and then the interface stops working]
>
> %cat /etc/motd
> FreeBSD 6.1-STABLE (GENERIC) #6: Thu Jul 27 20:55:43 PDT 2006
>
> The error isn't always the same, however.  Often it is
>     ath0: device timeout
> or
>     ath0: discard frame w/o packet header
> or even
>     arp: unknown hardware address format (0x4500)
>
> In each case, however, the "ath" interface stops working immediately
> after the error report, so I don't believe that the latter two error
> reports are legitimate.  I'm wondering if perhaps there's a memory smash
> somewhere that's corrupting some driver data structures (thereby causing
> bogus error reports in addition to stopping the interface from working)?
>
> The last time I asked about this, someone speculated that 'power save
> mode' was the culprit.  Unfortunately, the system is running in a coffee
> shop that provides public WiFi, so it's not possible to stop clients
> from using power save mode.
>
> On my system, these errors are often happening several times a day.  Has
> anyone else run into frequent problems like this, and is anyone looking
> into a solution?

"stuck beacon" means the tx dma of the beacon frame failed to complete in
a full beacon interval.  Diagnosing such a problem requires understanding
why dma failed to complete.  This usually involves checking the dma
descriptor for clues and/or looking at other h/w-related state.  If you
have a "memory smash" then you will see it in the descriptor
contents--but I doubt it.
In my experience this problem is usually caused by feeding bogus data to
the dma engine, which causes it to lock up, but the problem in general is
very complicated and not something I can diagnose remotely.

	Sam
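
To make the "stuck beacon" condition described above concrete, here is a toy model of the check: once per beacon interval, see whether the previous beacon's tx dma is still pending, count consecutive misses, and reset when the count passes a threshold. This is only an illustrative sketch, not the ath driver's code. The class name BeaconMonitor, the method on_beacon_interval, and the threshold of 3 are all assumptions; the threshold is chosen only so that the message matches the "bmiss count 4" in the log quoted earlier.

```python
from dataclasses import dataclass

# Assumption: reset once MORE than this many consecutive beacons are stuck.
# Picked so the toy output matches the "bmiss count 4" log line in the thread.
BMISS_THRESHOLD = 3


@dataclass
class BeaconMonitor:
    """Toy model of stuck-beacon detection (hypothetical, not driver code)."""

    bmiss_count: int = 0  # consecutive intervals with the beacon still pending
    resets: int = 0       # how many times a reset was requested

    def on_beacon_interval(self, tx_pending: bool) -> str:
        """Run once per beacon interval.

        tx_pending is True when the previous beacon's tx dma has not
        completed by the time the next beacon is due.
        """
        if not tx_pending:
            # The beacon went out in time; clear the miss counter.
            self.bmiss_count = 0
            return "ok"

        self.bmiss_count += 1
        if self.bmiss_count > BMISS_THRESHOLD:
            count = self.bmiss_count
            self.bmiss_count = 0
            self.resets += 1
            return f"stuck beacon; resetting (bmiss count {count})"
        return "missed"


if __name__ == "__main__":
    mon = BeaconMonitor()
    # One good interval, then four intervals where the dma never completes.
    for pending in [False, True, True, True, True]:
        result = mon.on_beacon_interval(pending)
    print(result)  # stuck beacon; resetting (bmiss count 4)
```

Note that the model only detects the symptom. As the reply says, finding out why the dma did not complete still means inspecting the descriptor contents and other hardware state, which a counter like this cannot tell you.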