Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 6 Mar 2025 06:56:27 -0800
From:      Warner Losh <imp@bsdimp.com>
To:        John Baldwin <jhb@freebsd.org>
Cc:        Mateusz Guzik <mjguzik@gmail.com>, Zhenlei Huang <zlei@freebsd.org>, Mateusz Guzik <mjg@freebsd.org>,  src-committers <src-committers@freebsd.org>,  "<dev-commits-src-all@freebsd.org>" <dev-commits-src-all@freebsd.org>, Warner Losh <imp@freebsd.org>,  "<dev-commits-src-main@freebsd.org>" <dev-commits-src-main@freebsd.org>
Subject:   Re: git: 234683726708 - main - devclass: make devclass_alloc_unit use M_NOWAIT
Message-ID:  <CANCZdfr-1WTp0XoWYieHfo9GZ3Du6QKtyVzh_6aX%2BQnPHyN8ig@mail.gmail.com>
In-Reply-To: <d09b8cd1-ac71-43a4-9625-e796fd142f16@FreeBSD.org>
References:  <202503061103.526B32Id022652@gitrepo.freebsd.org> <F1B3652E-0D0C-402A-8509-D510992DAC15@FreeBSD.org> <CAGudoHF=eRaHCcjRrvd4sG4-OBu0GrmVRpiHEeU1ayG=M9oXrg@mail.gmail.com> <d09b8cd1-ac71-43a4-9625-e796fd142f16@FreeBSD.org>

index | next in thread | previous in thread | raw e-mail

[-- Attachment #1 --]
On Thu, Mar 6, 2025, 5:33 AM John Baldwin <jhb@freebsd.org> wrote:

> On 3/6/25 06:35, Mateusz Guzik wrote:
> > On Thu, Mar 6, 2025 at 12:32 PM Zhenlei Huang <zlei@freebsd.org> wrote:
> >>
> >>
> >>
> >> On Mar 6, 2025, at 7:03 PM, Mateusz Guzik <mjg@FreeBSD.org> wrote:
> >>
> >> The branch main has been updated by mjg:
> >>
> >> URL:
> https://cgit.FreeBSD.org/src/commit/?id=234683726708cf5212d672d676d30056d4133859
> >>
> >> commit 234683726708cf5212d672d676d30056d4133859
> >> Author:     Mateusz Guzik <mjg@FreeBSD.org>
> >> AuthorDate: 2025-03-06 11:01:49 +0000
> >> Commit:     Mateusz Guzik <mjg@FreeBSD.org>
> >> CommitDate: 2025-03-06 11:01:49 +0000
> >>
> >>     devclass: make devclass_alloc_unit use M_NOWAIT
> >>
> >>     The only caller already does this.
> >>
> >>     The routine can be called with a mutex held making M_WAITOK illegal.
> >>
> >>     Sponsored by:   Rubicon Communications, LLC ("Netgate")
> >> ---
> >> sys/kern/subr_bus.c | 8 ++++++--
> >> 1 file changed, 6 insertions(+), 2 deletions(-)
> >>
> >> diff --git a/sys/kern/subr_bus.c b/sys/kern/subr_bus.c
> >> index 9506e471705c..0422352bba51 100644
> >> --- a/sys/kern/subr_bus.c
> >> +++ b/sys/kern/subr_bus.c
> >> @@ -1208,6 +1208,7 @@ devclass_get_sysctl_tree(devclass_t dc)
> >> static int
> >> devclass_alloc_unit(devclass_t dc, device_t dev, int *unitp)
> >> {
> >> + device_t *devices;
> >> const char *s;
> >> int unit = *unitp;
> >>
> >> @@ -1264,8 +1265,11 @@ devclass_alloc_unit(devclass_t dc, device_t dev,
> int *unitp)
> >> int newsize;
> >>
> >> newsize = unit + 1;
> >> - dc->devices = reallocf(dc->devices,
> >> -    newsize * sizeof(*dc->devices), M_BUS, M_WAITOK);
> >> + devices = reallocf(dc->devices,
> >> +    newsize * sizeof(*dc->devices), M_BUS, M_NOWAIT);
> >>
> >>
> >> I'd recommend against this. From the commit message of f3d3c63442ff,
> Warner said,
> >>> In addition, transition to M_WAITOK since this is a sleepable context
> >> So, the M_WAITOK is intentional.
> >>
> >> Rather than reverting this, the caller devclass_add_device() should use
> M_WAITOK.
> >>
> >
> > Per my commit message this is callable from a *NOT* sleepable context.
> >
> > Here is a splat we got at Netgate:
> >
> > uma_zalloc_debug: zone "malloc-16" with the following non-sleepable
> locks held:
> > exclusive sleep mutex SD slot mtx (sdhci) r = 0 (0xd8dec028) locked @
> >
> /var/jenkins/workspace/pfSense-Plus-snapshots-25_03-main/sources/FreeBSD-src-plus-RELENG_25_03/sys/dev/sdhci/sdhci.c:688
> > stack backtrace:
> > #0 0xc0330ebc at witness_debugger+0x78
> > #1 0xc033217c at witness_warn+0x428
> > #2 0xc05b0a58 at uma_zalloc_debug+0x34
> > #3 0xc05b067c at uma_zalloc_arg+0x30
> > #4 0xc0291760 at malloc+0x8c
> > #5 0xc02920ec at reallocf+0x14
> > #6 0xc02f8894 at devclass_add_device+0x1e8
> > #7 0xc02f6c78 at make_device+0xe0
> > #8 0xc02f6abc at device_add_child_ordered+0x30
> > #9 0xc0156e0c at sdhci_card_task+0x238
> > #10 0xc0324090 at taskqueue_run_locked+0x1b4
> > #11 0xc0323ea0 at taskqueue_run+0x50
> > #12 0xc0275f88 at ithread_loop+0x264
>
> Just use a regular taskqueue like taskqueue_thread instead of
> taskqueue_swi?
> PCI hotplug defines its own thread taskqueue for adding and removing
> devices.
>
> The bug is here, IMO.  Eventually new-bus will need some sort of topology
> lock and that will have to be an sx lock, so this code needs to change
> anyway.  The sound code that tries to frob devices with a regular mutex
> also needs to change.
>

I should dust off the branch that i have this one. There's about a dozen
places I had to change at the time...

In terms of taskqueue_swi, it's probably something that needs to go away.
> Generally speaking, code uses tasks for functions that need to sleep,
> and taskqueue_swi breaks that.
>

Its a holdover from spl days for sure.

I will fix sdhci to use a proper taskqueue and then revert this commit.
>

Thanks. You can add me to the review

Warner

> --
> John Baldwin
>
>

[-- Attachment #2 --]
<div dir="auto"><div><br><br><div class="gmail_quote gmail_quote_container"><div dir="ltr" class="gmail_attr">On Thu, Mar 6, 2025, 5:33 AM John Baldwin &lt;<a href="mailto:jhb@freebsd.org">jhb@freebsd.org</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">On 3/6/25 06:35, Mateusz Guzik wrote:<br>
&gt; On Thu, Mar 6, 2025 at 12:32 PM Zhenlei Huang &lt;<a href="mailto:zlei@freebsd.org" target="_blank" rel="noreferrer">zlei@freebsd.org</a>&gt; wrote:<br>
&gt;&gt;<br>
&gt;&gt;<br>
&gt;&gt;<br>
&gt;&gt; On Mar 6, 2025, at 7:03 PM, Mateusz Guzik &lt;mjg@FreeBSD.org&gt; wrote:<br>
&gt;&gt;<br>
&gt;&gt; The branch main has been updated by mjg:<br>
&gt;&gt;<br>
&gt;&gt; URL: <a href="https://cgit.FreeBSD.org/src/commit/?id=234683726708cf5212d672d676d30056d4133859" rel="noreferrer noreferrer" target="_blank">https://cgit.FreeBSD.org/src/commit/?id=234683726708cf5212d672d676d30056d4133859</a><br>;
&gt;&gt;<br>
&gt;&gt; commit 234683726708cf5212d672d676d30056d4133859<br>
&gt;&gt; Author:     Mateusz Guzik &lt;mjg@FreeBSD.org&gt;<br>
&gt;&gt; AuthorDate: 2025-03-06 11:01:49 +0000<br>
&gt;&gt; Commit:     Mateusz Guzik &lt;mjg@FreeBSD.org&gt;<br>
&gt;&gt; CommitDate: 2025-03-06 11:01:49 +0000<br>
&gt;&gt;<br>
&gt;&gt;     devclass: make devclass_alloc_unit use M_NOWAIT<br>
&gt;&gt;<br>
&gt;&gt;     The only caller already does this.<br>
&gt;&gt;<br>
&gt;&gt;     The routine can be called with a mutex held making M_WAITOK illegal.<br>
&gt;&gt;<br>
&gt;&gt;     Sponsored by:   Rubicon Communications, LLC (&quot;Netgate&quot;)<br>
&gt;&gt; ---<br>
&gt;&gt; sys/kern/subr_bus.c | 8 ++++++--<br>
&gt;&gt; 1 file changed, 6 insertions(+), 2 deletions(-)<br>
&gt;&gt;<br>
&gt;&gt; diff --git a/sys/kern/subr_bus.c b/sys/kern/subr_bus.c<br>
&gt;&gt; index 9506e471705c..0422352bba51 100644<br>
&gt;&gt; --- a/sys/kern/subr_bus.c<br>
&gt;&gt; +++ b/sys/kern/subr_bus.c<br>
&gt;&gt; @@ -1208,6 +1208,7 @@ devclass_get_sysctl_tree(devclass_t dc)<br>
&gt;&gt; static int<br>
&gt;&gt; devclass_alloc_unit(devclass_t dc, device_t dev, int *unitp)<br>
&gt;&gt; {<br>
&gt;&gt; + device_t *devices;<br>
&gt;&gt; const char *s;<br>
&gt;&gt; int unit = *unitp;<br>
&gt;&gt;<br>
&gt;&gt; @@ -1264,8 +1265,11 @@ devclass_alloc_unit(devclass_t dc, device_t dev, int *unitp)<br>
&gt;&gt; int newsize;<br>
&gt;&gt;<br>
&gt;&gt; newsize = unit + 1;<br>
&gt;&gt; - dc-&gt;devices = reallocf(dc-&gt;devices,<br>
&gt;&gt; -    newsize * sizeof(*dc-&gt;devices), M_BUS, M_WAITOK);<br>
&gt;&gt; + devices = reallocf(dc-&gt;devices,<br>
&gt;&gt; +    newsize * sizeof(*dc-&gt;devices), M_BUS, M_NOWAIT);<br>
&gt;&gt;<br>
&gt;&gt;<br>
&gt;&gt; I&#39;d recommend against this. From the commit message of f3d3c63442ff, Warner said,<br>
&gt;&gt;&gt; In addition, transition to M_WAITOK since this is a sleepable context<br>
&gt;&gt; So, the M_WAITOK is intentional.<br>
&gt;&gt;<br>
&gt;&gt; Rather than reverting this, the caller devclass_add_device() should use M_WAITOK.<br>
&gt;&gt;<br>
&gt; <br>
&gt; Per my commit message this is callable from a *NOT* sleepable context.<br>
&gt; <br>
&gt; Here is a splat we got at Netgate:<br>
&gt; <br>
&gt; uma_zalloc_debug: zone &quot;malloc-16&quot; with the following non-sleepable locks held:<br>
&gt; exclusive sleep mutex SD slot mtx (sdhci) r = 0 (0xd8dec028) locked @<br>
&gt; /var/jenkins/workspace/pfSense-Plus-snapshots-25_03-main/sources/FreeBSD-src-plus-RELENG_25_03/sys/dev/sdhci/sdhci.c:688<br>
&gt; stack backtrace:<br>
&gt; #0 0xc0330ebc at witness_debugger+0x78<br>
&gt; #1 0xc033217c at witness_warn+0x428<br>
&gt; #2 0xc05b0a58 at uma_zalloc_debug+0x34<br>
&gt; #3 0xc05b067c at uma_zalloc_arg+0x30<br>
&gt; #4 0xc0291760 at malloc+0x8c<br>
&gt; #5 0xc02920ec at reallocf+0x14<br>
&gt; #6 0xc02f8894 at devclass_add_device+0x1e8<br>
&gt; #7 0xc02f6c78 at make_device+0xe0<br>
&gt; #8 0xc02f6abc at device_add_child_ordered+0x30<br>
&gt; #9 0xc0156e0c at sdhci_card_task+0x238<br>
&gt; #10 0xc0324090 at taskqueue_run_locked+0x1b4<br>
&gt; #11 0xc0323ea0 at taskqueue_run+0x50<br>
&gt; #12 0xc0275f88 at ithread_loop+0x264<br>
<br>
Just use a regular taskqueue like taskqueue_thread instead of taskqueue_swi?<br>
PCI hotplug defines its own thread taskqueue for adding and removing devices.<br>
<br>
The bug is here, IMO.  Eventually new-bus will need some sort of topology<br>
lock and that will have to be an sx lock, so this code needs to change<br>
anyway.  The sound code that tries to frob devices with a regular mutex<br>
also needs to change.<br></blockquote></div></div><div dir="auto"><br></div><div dir="auto">I should dust off the branch that i have this one. There&#39;s about a dozen places I had to change at the time...</div><div dir="auto"><br></div><div dir="auto"><div class="gmail_quote gmail_quote_container"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
In terms of taskqueue_swi, it&#39;s probably something that needs to go away.<br>
Generally speaking, code uses tasks for functions that need to sleep,<br>
and taskqueue_swi breaks that.<br></blockquote></div></div><div dir="auto"><br></div><div dir="auto">Its a holdover from spl days for sure.</div><div dir="auto"><br></div><div dir="auto"><div class="gmail_quote gmail_quote_container"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
I will fix sdhci to use a proper taskqueue and then revert this commit.<br></blockquote></div></div><div dir="auto"><br></div><div dir="auto">Thanks. You can add me to the review </div><div dir="auto"><br></div><div dir="auto">Warner</div><div dir="auto"><div class="gmail_quote gmail_quote_container"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
-- <br>
John Baldwin<br>
<br>
</blockquote></div></div></div>
help

Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CANCZdfr-1WTp0XoWYieHfo9GZ3Du6QKtyVzh_6aX%2BQnPHyN8ig>