From: Vincent Hoffman
Date: Sun, 11 Mar 2012 13:10:52 +0000
To: Adrian Chadd
Cc: freebsd-wireless@freebsd.org
Subject: Re: ath0 timeout was "Re: (more) bugs fixed in -HEAD, AP mode is now mostly (again) stable!"
Message-ID: <4F5CA45C.1010603@unsane.co.uk>

On 11/03/2012 01:58, Adrian Chadd wrote:
> Hiya,
>
> Next time it happens, do the sysctl before the scan.

I did.

> The sysctl will tell me how deep each hardware TX queue is.
>
> I should likely add some further debugging to tell me how deep the
> per-TID software queues are; that'd be helpful here.
>
> What you're seeing there is something weird which is causing the TX
> frames to be queued in software/hardware and not be transmitted, to
> the point of buffer exhaustion. See "total TX buffers: 0" ? That means
> the frames can't go out for some reason. There's nothing in the
> hardware queue, so that also has me slightly concerned.
>
> I wonder if this is a problem with aggregation and buffer exhaustion.
> Hm, can you do "wlandebug +11n" and see if it's trying to exchange
> ADDBA frames (and failing) ? There's a known bug where

Here is 5 minutes' worth of (working) output using wlandebug +11n:

Mar 11 13:01:58 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <1782:1845> (0 frames) rxseq 1775 tid 0 (retransmit)
Mar 11 13:01:58 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <1782:1845> (0 frames) rxseq 1776 tid 0 (retransmit)
Mar 11 13:01:58 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <1782:1845> (0 frames) rxseq 1777 tid 0 (retransmit)
Mar 11 13:01:58 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <1782:1845> (0 frames) rxseq 1778 tid 0 (retransmit)
Mar 11 13:01:58 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <1782:1845> (0 frames) rxseq 1779 tid 0 (retransmit)
Mar 11 13:01:58 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <1782:1845> (0 frames) rxseq 1780 tid 0 (retransmit)
Mar 11 13:01:58 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <1782:1845> (0 frames) rxseq 1781 tid 0 (retransmit)
Mar 11 13:02:31 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <2578:2641> (0 frames) rxseq 2577 tid 0 (retransmit)
Mar 11 13:02:34 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <2771:2834> (0 frames) rxseq 2770 tid 0 (retransmit)
Mar 11 13:02:37 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <3287:3350> (0 frames) rxseq 3286 tid 0 (retransmit)
Mar 11 13:02:38 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <3309:3372> (0 frames) rxseq 3308 tid 0 (retransmit)
Mar 11 13:02:43 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <3622:3685> (0 frames) rxseq 3621 tid 0 (retransmit)
Mar 11 13:02:47 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <3827:3890> (0 frames) rxseq 3826 tid 0 (retransmit)
Mar 11 13:02:48 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <3875:3938> (0 frames) rxseq 3874 tid 0 (retransmit)
Mar 11 13:02:55 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <18:81> (0 frames) rxseq 17 tid 0 (retransmit)
Mar 11 13:02:56 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <169:232> (0 frames) rxseq 168 tid 0 (retransmit)
Mar 11 13:02:57 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <441:504> (0 frames) rxseq 440 tid 0 (retransmit)
Mar 11 13:02:58 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <684:747> (0 frames) rxseq 683 tid 0 (retransmit)
Mar 11 13:03:01 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <1464:1527> (0 frames) rxseq 1461 tid 0 (retransmit)
Mar 11 13:03:01 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <1464:1527> (0 frames) rxseq 1462 tid 0 (retransmit)
Mar 11 13:03:01 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <1464:1527> (0 frames) rxseq 1463 tid 0 (retransmit)
Mar 11 13:03:03 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <1847:1910> (0 frames) rxseq 1846 tid 0 (retransmit)
Mar 11 13:03:16 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <2567:2630> (0 frames) rxseq 2566 tid 0 (retransmit)
Mar 11 13:03:16 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <2684:2747> (0 frames) rxseq 2683 tid 0 (retransmit)
Mar 11 13:03:17 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <2794:2857> (0 frames) rxseq 2793 tid 0 (retransmit)
Mar 11 13:03:20 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <3917:3980> (0 frames) rxseq 3916 tid 0 (retransmit)
Mar 11 13:03:21 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <4087:54> (0 frames) rxseq 4086 tid 0 (retransmit)
Mar 11 13:03:22 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <521:584> (0 frames) rxseq 520 tid 0 (retransmit)
Mar 11 13:03:23 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <653:716> (0 frames) rxseq 652 tid 0 (retransmit)
Mar 11 13:03:23 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <654:717> (0 frames) rxseq 653 tid 0 (retransmit)
Mar 11 13:03:23 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <871:934> (0 frames) rxseq 870 tid 0 (retransmit)
Mar 11 13:03:24 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <1243:1306> (0 frames) rxseq 1242 tid 0 (retransmit)
Mar 11 13:04:46 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <1876:1939> (0 frames) rxseq 1875 tid 0 (retransmit)
Mar 11 13:05:02 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <1965:2028> (0 frames) rxseq 1964 tid 0 (retransmit)
Mar 11 13:05:30 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <2130:2193> (0 frames) rxseq 2129 tid 0 (retransmit)
Mar 11 13:05:33 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] discard MPDU frame, BA win <2189:2252> (0 frames) rxseq 2188 tid 0 (retransmit)
Mar 11 13:06:10 ostracod wpa_supplicant[559]: WPA: Group rekeying completed with e0:91:f5:48:5b:b9 [GTK=TKIP]

I'll try to force a timeout so I can give the same output once it's hung.

Vince

>
> Thanks,
>
>
> Adrian
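
For anyone reading the "discard MPDU frame" lines in the wlandebug output: in every entry the rxseq sits just behind the start of the BA window, which is consistent with the "(retransmit)" tag. Below is a minimal Python sketch for checking that by script. It is not the net80211 code; the names `parse`, `classify`, and `seq_offset` are made up for illustration, and the half-range cutoff used to tell "ahead" from "behind" is my assumption about the usual convention for 12-bit 802.11 sequence numbers. Note that the window <4087:54> wraps around 4095, so the arithmetic has to be done modulo 4096.

```python
import re

SEQ_MOD = 4096  # 802.11 sequence numbers are 12 bits wide

# Matches the "BA win <start:end> ... rxseq N tid T" part of a log line.
# The pattern is derived only from the log format shown in this message.
LOG_RE = re.compile(r"BA win <(\d+):(\d+)> .*?rxseq (\d+) tid (\d+)")


def seq_offset(start, seq):
    """Distance from the window start to seq, modulo the sequence space."""
    return (seq - start) % SEQ_MOD


def classify(start, end, rxseq):
    """Return 'in-window', 'ahead', or 'behind' for rxseq relative to <start:end>."""
    size = seq_offset(start, end) + 1  # handles windows that wrap, e.g. <4087:54>
    off = seq_offset(start, rxseq)
    if off < size:
        return "in-window"
    # Assumption: offsets in the lower half of the sequence space count as
    # ahead of the window; anything else is treated as an old frame.
    if off < SEQ_MOD // 2:
        return "ahead"
    return "behind"


def parse(line):
    """Extract (start, end, rxseq, tid) from a log line, or None if absent."""
    m = LOG_RE.search(line)
    if m is None:
        return None
    start, end, rxseq, tid = map(int, m.groups())
    return start, end, rxseq, tid


if __name__ == "__main__":
    sample = ("Mar 11 13:03:21 ostracod kernel: wlan0: [e0:91:f5:48:5b:b9] "
              "discard MPDU frame, BA win <4087:54> (0 frames) rxseq 4086 "
              "tid 0 (retransmit)")
    start, end, rxseq, tid = parse(sample)
    print(tid, classify(start, end, rxseq))
```

Feeding each kernel line through `parse` and `classify` makes it easy to confirm that all the discards are frames behind the window, rather than frames that jumped ahead of it.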