Date: Mon, 3 Apr 2023 16:18:20 -0700 From: Kevin Oberman <rkoberman@gmail.com> To: "Bjoern A. Zeeb" <bz@freebsd.org> Cc: FreeBSD wireless mailing list <wireless@freebsd.org> Subject: Re: Request for testing - firmware crash, wpa, locking Message-ID: <CAN6yY1u_TwMozQMqm387u6_iteHP%2B_mr69ugg7gwFs%2BfSNPpMw@mail.gmail.com> In-Reply-To: <33qn43q-rn1n-prsr-ppr-p846nq875741@mnoonqbm.arg> References: <33qn43q-rn1n-prsr-ppr-p846nq875741@mnoonqbm.arg>
next in thread | previous in thread | raw e-mail | index | archive | help
--000000000000400a3e05f876c7c0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable On Fri, Mar 31, 2023 at 1:09=E2=80=AFPM Bjoern A. Zeeb <bz@freebsd.org> wro= te: > Hi, > > (1) Colin has fixed the panic (after the firmware crash) so many people > keep > seeing. This may mean that you may now (contrary to before) try a > service netif restart wlan0 > to recover from such a crash. This changes is all the way to 13.2. > I am still very pre-occupied with real life but I am hoping that I can > get a possible fix tested and pushed during my Easter "holidays". > > > (2) Along with enweiwu and cy the "startup problem" showing as > "CTRL-EVENT-SCAN-FAILED" was debugged a bit more. We've for now > backed out the rc startup script change and restored the old behaviour > of wpa_supplicant with additional logging. This changes is currently in > main only but will be MFCed to stable/13 in the next week most likely. > If you etcupdate (mergemaster) and pull that change in I would kindly > ask you to turn on debugging for wpa_supplicant and check if you see > any log lines including "(changed)" [beware most should be "(no > change)" along with IFF_UP in the line. If so please contact me. > I still have a hypothesis that we may simply exploit a race in net80211 > there which will need better fixing. > > > (3) if you are using iwlwifi (or rtw88) I just pushed some locking > changes into main. I would appreciate if you could test and let me know > if there are any new regressions (they do not fix the firmware crash > from (1) yet!). > > Lots of health, > /bz > > -- > Bjoern A. Zeeb r15:7 > Bjoern, Just updated to the latest main and received a previously unseen lock order reversal on my iwlwifi when the network came up. This did not prevent the network from starting normally. FreeBSD 14.0-CURRENT #9 main-n261962-41236539d8dd-dirty: Mon Apr 3 13:06:31 PDT 2023 lock order reversal: (sleepable after non-sleepable) 1st 0xfffffe01466a0020 iwlwifi0_com_lo (iwlwifi0_com_lo, sleep mutex) @ /usr/src/sys/net80211/ieee80211_ioctl.c:3552 2nd 0xffffffff81fa9ce0 rtnl cloner lock (rtnl cloner lock, sx) @ /usr/src/sys/netlink/route/iface.c:306 lock order iwlwifi0_com_lo -> rtnl cloner lock attempted at: #0 0xffffffff80c61093 at witness_checkorder+0xbb3 #1 0xffffffff80bfb5b7 at _sx_slock_int+0x67 #2 0xffffffff80e58241 at dump_iface+0x501 #3 0xffffffff80e578cb at rtnl_handle_ifevent+0xab #4 0xffffffff80d70e75 at ieee80211_notify_ifnet_change+0x65 #5 0xffffffff80d9c29f at ieee80211_start_locked+0x6f #6 0xffffffff80d7fd56 at ieee80211_ioctl+0x356 #7 0xffffffff80d1d2d5 at ifhwioctl+0xe05 #8 0xffffffff80d1ecd5 at ifioctl+0x925 #9 0xffffffff80c66cee at kern_ioctl+0x1fe #10 0xffffffff80c66a84 at sys_ioctl+0x154 #11 0xffffffff810e54f0 at amd64_syscall+0x140 #12 0xffffffff810b8b7b at fast_syscall_common+0xf8 I have done no testing, but everything seems to be operating fine. If there is further information I can provide, just let me know. Thanks! --=20 Kevin Oberman, Part time kid herder and retired Network Engineer E-mail: rkoberman@gmail.com PGP Fingerprint: D03FB98AFA78E3B78C1694B318AB39EF1B055683 --000000000000400a3e05f876c7c0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable <div dir=3D"ltr"><div dir=3D"ltr"><div class=3D"gmail_default" style=3D"fon= t-family:tahoma,sans-serif;font-size:small">On Fri, Mar 31, 2023 at 1:09=E2= =80=AFPM Bjoern A. Zeeb <<a href=3D"mailto:bz@freebsd.org">bz@freebsd.or= g</a>> wrote:</div></div><div class=3D"gmail_quote"><blockquote class=3D= "gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(2= 04,204,204);padding-left:1ex">Hi,<br> <br> (1) Colin has fixed the panic (after the firmware crash) so many people kee= p<br> seeing.=C2=A0 This may mean that you may now (contrary to before) try a<br> =C2=A0 =C2=A0 =C2=A0 =C2=A0 service netif restart wlan0<br> to recover from such a crash.=C2=A0 This changes is all the way to 13.2.<br= > I am still very pre-occupied with real life but I am hoping that I can<br> get a possible fix tested and pushed during my Easter "holidays".= <br> <br> <br> (2) Along with enweiwu and cy the "startup problem" showing as<br= > "CTRL-EVENT-SCAN-FAILED" was debugged a bit more.=C2=A0 We've= for now<br> backed out the rc startup script change and restored the old behaviour<br> of wpa_supplicant with additional logging.=C2=A0 This changes is currently = in<br> main only but will be MFCed to stable/13 in the next week most likely.<br> If you etcupdate (mergemaster) and pull that change in I would kindly<br> ask you to turn on debugging for wpa_supplicant and check if you see<br> any log lines including "(changed)"=C2=A0 [beware most should be = "(no<br> change)" along with IFF_UP in the line.=C2=A0 If so please contact me.= <br> I still have a hypothesis that we may simply exploit a race in net80211<br> there which will need better fixing.<br> <br> <br> (3) if you are using iwlwifi (or rtw88) I just pushed some locking<br> changes into main.=C2=A0 I would appreciate if you could test and let me kn= ow<br> if there are any new regressions (they do not fix the firmware crash<br> from (1) yet!).<br> <br> Lots of health,<br> /bz<br> <br> -- <br> Bjoern A. Zeeb=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2= =A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0r15:7<br></blockquot= e><div><br></div><div style=3D"font-family:tahoma,sans-serif;font-size:smal= l" class=3D"gmail_default">Bjoern,</div><div style=3D"font-family:tahoma,sa= ns-serif;font-size:small" class=3D"gmail_default"><br></div><div style=3D"f= ont-family:tahoma,sans-serif;font-size:small" class=3D"gmail_default">Just = updated to the latest main and received=C2=A0 a previously unseen lock orde= r reversal on my iwlwifi when the network came up. This did not prevent the= network from starting normally.</div><div style=3D"font-family:tahoma,sans= -serif;font-size:small" class=3D"gmail_default">FreeBSD 14.0-CURRENT #9 mai= n-n261962-41236539d8dd-dirty: Mon Apr =C2=A03 13:06:31 PDT 2023</div><div s= tyle=3D"font-family:tahoma,sans-serif;font-size:small" class=3D"gmail_defau= lt"><br><div style=3D"font-family:tahoma,sans-serif;font-size:small" class= =3D"gmail_default"></div><div style=3D"font-family:tahoma,sans-serif;font-s= ize:small" class=3D"gmail_default">lock order reversal: (sleepable after no= n-sleepable)<br>=C2=A01st 0xfffffe01466a0020 iwlwifi0_com_lo (iwlwifi0_com_= lo, sleep mutex) @ /usr/src/sys/net80211/ieee80211_ioctl.c:3552<br>=C2=A02n= d 0xffffffff81fa9ce0 rtnl cloner lock (rtnl cloner lock, sx) @ /usr/src/sys= /netlink/route/iface.c:306<br>lock order iwlwifi0_com_lo -> rtnl cloner = lock attempted at:<br>#0 0xffffffff80c61093 at witness_checkorder+0xbb3<br>= #1 0xffffffff80bfb5b7 at _sx_slock_int+0x67<br>#2 0xffffffff80e58241 at dum= p_iface+0x501<br>#3 0xffffffff80e578cb at rtnl_handle_ifevent+0xab<br>#4 0x= ffffffff80d70e75 at ieee80211_notify_ifnet_change+0x65<br>#5 0xffffffff80d9= c29f at ieee80211_start_locked+0x6f<br>#6 0xffffffff80d7fd56 at ieee80211_i= octl+0x356<br>#7 0xffffffff80d1d2d5 at <span class=3D"gmail-LI gmail-ng">if= hwioctl+0xe05</span><br>#8 0xffffffff80d1ecd5 at <span class=3D"gmail-LI gm= ail-ng">ifioctl+0x925</span><br>#9 0xffffffff80c66cee at kern_ioctl+0x1fe<b= r>#10 0xffffffff80c66a84 at sys_ioctl+0x154<br>#11 0xffffffff810e54f0 at am= d64_syscall+0x140<br>#12 0xffffffff810b8b7b at fast_syscall_common+0xf8</di= v><div style=3D"font-family:tahoma,sans-serif;font-size:small" class=3D"gma= il_default"><br></div></div></div><div style=3D"font-family:tahoma,sans-ser= if;font-size:small" class=3D"gmail_default">I have done no testing, but eve= rything seems to be operating fine.</div><div style=3D"font-family:tahoma,s= ans-serif;font-size:small" class=3D"gmail_default"><br></div><div style=3D"= font-family:tahoma,sans-serif;font-size:small" class=3D"gmail_default">If t= here is further information I can provide, just let me know.<br></div><div = style=3D"font-family:tahoma,sans-serif;font-size:small" class=3D"gmail_defa= ult"><br></div><div style=3D"font-family:tahoma,sans-serif;font-size:small"= class=3D"gmail_default">Thanks!<br></div><span class=3D"gmail_signature_pr= efix">-- </span><br><div dir=3D"ltr" class=3D"gmail_signature"><div dir=3D"= ltr"><div><div dir=3D"ltr"><div><div dir=3D"ltr"><div><div dir=3D"ltr">Kevi= n Oberman, Part time kid herder and retired Network Engineer<br>E-mail: <a = href=3D"mailto:rkoberman@gmail.com" target=3D"_blank">rkoberman@gmail.com</= a><br></div><div>PGP Fingerprint: D03FB98AFA78E3B78C1694B318AB39EF1B055683<= /div></div></div></div></div></div></div></div></div> --000000000000400a3e05f876c7c0--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CAN6yY1u_TwMozQMqm387u6_iteHP%2B_mr69ugg7gwFs%2BfSNPpMw>