From owner-freebsd-wireless@FreeBSD.ORG Mon Apr 23 08:44:46 2012 Return-Path: Delivered-To: freebsd-wireless@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id CC2BE1065670 for ; Mon, 23 Apr 2012 08:44:46 +0000 (UTC) (envelope-from adrian.chadd@gmail.com) Received: from mail-pz0-f44.google.com (mail-pz0-f44.google.com [209.85.210.44]) by mx1.freebsd.org (Postfix) with ESMTP id 9E4718FC12 for ; Mon, 23 Apr 2012 08:44:46 +0000 (UTC) Received: by dadz14 with SMTP id z14so49640005dad.17 for ; Mon, 23 Apr 2012 01:44:46 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=Ek/nyunMI76t8JIfIYn58HcMZpYoQd91sl9dPEgiYxs=; b=A8h1HGMkRVclvxdW8PwXcigiLiO1T1LaQaZoW8JgthLm0hynuwFvyNKelGYSYbJXkq pgstn82c2GLo4WHysVIMoNJjebzYYSkYQvvSFL5oqKyNlddxLCqYvt0286J+h1S7hPxy +UQ+RGmIX13fqmoRPuR53+BQz/oz5xoFtAX1vlxQsLFy+Fp2mCliFavW3bjmSTI75U/k iLzR/ygmPU11OZKBzEeEqAuCUngYJcYrEbfycDYONAHnP7yc4sjrfgA/WwvBbUCly9pu 4pDVMeL28IOmJaV5nlZyKcm+XoVHsAtJkP5a9TS4A9eEtO7llCEpCO+Lb5qtLK7lgfo1 NpqQ== MIME-Version: 1.0 Received: by 10.68.212.130 with SMTP id nk2mr14274580pbc.166.1335170686399; Mon, 23 Apr 2012 01:44:46 -0700 (PDT) Received: by 10.142.101.9 with HTTP; Mon, 23 Apr 2012 01:44:46 -0700 (PDT) In-Reply-To: <4f94bb9d.0817440a.1e25.1907@mx.google.com> References: <4F9499DA.4000509@yellowspace.net> <4f94bb9d.0817440a.1e25.1907@mx.google.com> Date: Mon, 23 Apr 2012 01:44:46 -0700 Message-ID: From: Adrian Chadd To: Lorenzo Perone Content-Type: text/plain; charset=ISO-8859-1 Cc: "freebsd-wireless@freebsd.org" Subject: Re: Problems with Atheros 9280 - Any chance of an MFC or a test patch against stable/9 for your recent ath(4) work? X-BeenThere: freebsd-wireless@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Discussions of 802.11 stack, tools device driver development." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 23 Apr 2012 08:44:47 -0000 On 22 April 2012 19:16, Adrian Chadd wrote: > Compile with ATH_DEBUG, AH_DEBUG and ATH_DIAGAPI. Then compile up > tools/tools/ath/athstats and run it for me. > To follow up: The LORs you showed (net80211 common lock and net80211 node list lock) are known. I'd like them fixed, but it's going to require a bit of re-jiggling of various bits and pieces to actually fix those. If they were -really- locking things up, you should be able to break into the debugger to find which locks are held (show alllocks) and hopefully we can then identify/fix the dead locks in question. So it could be: * A total crash - eg, because something scribbled into memory it shouldn't have, or some bit of code did something with free'd data somehow, etc * A dead lock - two (or more) locks holding things up I'd like you to run athstats -i ath0 (and then athstats -i ath0 1 for a while) to see what kind of traffic/errors you're seeing. Compiling up tools/tools/net80211/wlanstats/ and running 'wlanstats -i wlan0' (and then wlanstats -i wlan0 1 for a while) couldn't hurt too. You're running it in ap mode, not sta mode, so it's not doing lots of scanning - but are you setting a channel before you enable the hostap interface, or is it also scanning? Is it also seeing lots of stuck beacons in -HEAD ? If not, there's no continous "reset" path going on. I've mostly fixed concurrency issues in the driver; the only real issues are some subtle reset issues and 'RX PCU full' issues that I'm still trying to chase down. Both of which should be logging something. I'd also then like you to try a few things; * try adding kdb/ddb to the kernel, and then break to debugger when things have hung * try then adding the software watchdog to the kernel, enable it before you flip up the wifi device, see if it enters the debugger after the system hangs - the watchdog should fire when it's not patted (assuming it's not being patted!) and enter the debugger Finally, I've not seen this before here, but I've certainly heard the occasional rumour about this kind of behaviour occuring. I'd really like to try and narrow this down but it may take a (long) while. I'm just warning you. :) Thanks, Adrian