Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 10 Jan 2025 07:05:12 -0700
From:      Warner Losh <imp@bsdimp.com>
To:        John Baldwin <jhb@freebsd.org>
Cc:        Konstantin Belousov <kostikbel@gmail.com>, src-committers@freebsd.org,  dev-commits-src-all@freebsd.org, dev-commits-src-main@freebsd.org
Subject:   Re: git: ccabc7c2e556 - main - DEVICE_IDENTIFY.9: Modernize description and use cases
Message-ID:  <CANCZdfoL32QO9PnPnLe3rvxh1y2c2NjRmhTszbb1XNT6JM0X8w@mail.gmail.com>
In-Reply-To: <9acd1878-2ee3-47c8-aab9-29d5be200081@FreeBSD.org>
References:  <202501092020.509KKt1U058876@gitrepo.freebsd.org> <Z4BVVXpyh1MALzc_@kib.kiev.ua> <9acd1878-2ee3-47c8-aab9-29d5be200081@FreeBSD.org>

next in thread | previous in thread | raw e-mail | index | archive | help
--0000000000000b282d062b5a967c
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

On Fri, Jan 10, 2025 at 5:45=E2=80=AFAM John Baldwin <jhb@freebsd.org> wrot=
e:

> On 1/9/25 18:01, Konstantin Belousov wrote:
> > On Thu, Jan 09, 2025 at 08:20:55PM +0000, John Baldwin wrote:
> >> The branch main has been updated by jhb:
> >>
> >> URL:
> https://cgit.FreeBSD.org/src/commit/?id=3Dccabc7c2e556ac0b14da9b682b706cc=
af251c0fe
> >>
> >> commit ccabc7c2e556ac0b14da9b682b706ccaf251c0fe
> >> Author:     John Baldwin <jhb@FreeBSD.org>
> >> AuthorDate: 2025-01-09 20:20:16 +0000
> >> Commit:     John Baldwin <jhb@FreeBSD.org>
> >> CommitDate: 2025-01-09 20:20:16 +0000
> >>
> >>      DEVICE_IDENTIFY.9: Modernize description and use cases
> >>
> >>      Mention adding devices based on firmware tables and software-only
> >>      pseudo-devices as use cases for identify methods as those are mor=
e
> >>      common than reading random I/O ports to identify a legacy ISA
> device.
> >>
> >>      Describe how device_find_chid can be used to avoid duplicates.
> While
> >>      here, explicitly note that devices added in identify methods
> typically
> >>      use a fixed device name.
> >>
> >>      Trim the cross-references a bit.
> >>
> >>      Reviewed by:    ziaee, imp
> >>      Differential Revision:  https://reviews.freebsd.org/D48367
> >> ---
> >>   share/man/man9/DEVICE_IDENTIFY.9 | 52
> +++++++++++++++++++---------------------
> >>   1 file changed, 25 insertions(+), 27 deletions(-)
> >>
> >> diff --git a/share/man/man9/DEVICE_IDENTIFY.9
> b/share/man/man9/DEVICE_IDENTIFY.9
> >> index d75c1a91ce4a..b10d94143050 100644
> >> --- a/share/man/man9/DEVICE_IDENTIFY.9
> >> +++ b/share/man/man9/DEVICE_IDENTIFY.9
> >> @@ -26,44 +26,46 @@
> >>   .\" (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF TH=
E
> USE OF
> >>   .\" THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE=
.
> >>   .\"
> >> -.Dd January 15, 2017
> >> +.Dd January 9, 2025
> >>   .Dt DEVICE_IDENTIFY 9
> >>   .Os
> >>   .Sh NAME
> >>   .Nm DEVICE_IDENTIFY
> >> -.Nd identify a device, register it
> >> +.Nd identify new child devices and register them
> >>   .Sh SYNOPSIS
> >>   .In sys/param.h
> >>   .In sys/bus.h
> >>   .Ft void
> >>   .Fn DEVICE_IDENTIFY "driver_t *driver" "device_t parent"
> > So what is the 'parent' for driver which creates devices based on the
> > firmware tables?
>
> Hmmm, I could maybe try to clarify this further.  In new-bus, drivers are
> associated with a parent bus devclass.  All of the drivers associated wit=
h
> a given parent bus are then eligible for use with children of any bus
> devices for that bus.  Thus, for example:
>
> DRIVER_MODULE(foo, bar, ....)
>
> Associates the "bar" driver with the bus "foo".  For any fooX bus devices=
,
> the "bar" driver can attach to children of fooX.
>
> Most device_if.m methods operate on "child" devices, so device_probe,
> device_attach, device_detach, etc. all operate on a given barX device
> that is a child of a fooX.
>
> device_identify is different.  Instead, each fooX bus device can call
> bus_identify_children (formerly the somewhat misnamed bus_generic_probe)
> during the fooX device_attach routine.  bus_identify_children looks for
> the devclass of foo and then walks all of the eligible device drivers
> for potential children of foo invoking this method with fooX as the
> parent.  The idea being that device_identify will create "barX" children
> of "fooX" explicitly using BUS_ADD_CHILD.
>
> In terms of which parent device, it's really about where a given "barX"
> device should live.  For system-wide "top-level" devices that aren't
> behind some other bridge on an ACPI system, the pattern we use is to
> hang those devices as children of acpi0, so you end up with
>
> DRIVER_MODULE(acpi, bar, ...)
>
> and the parent device at the time of DEVICE_IDENTIFY is acpi0.  However,
> we also use identify in some other places.  nexus0 children all tend to
> either be explicitly added in nexus_attach() or added via identify
> routines that use nexus as the parent.  legacy0 on x86 also uses identify
> routines to add child devices like pcibX.
>

These are likely OK. It's arguably possible to model this as devices known
in other places, but it gets super awkward and you'd have to do hand-stands
that are worse in other ways.


> Another case is the ipmi(4) device.  Legacy ISA IPMI devices are describe=
d
> by an entry in the SMBIOS table.  The ipmi(4) driver uses an identify
> routine to add an ipmi0 device as a child of isa0 (since it has I/O ports
> like a typical ISA device), but the identify routine is table-driven sinc=
e
> it depends on parsing the smbios table.
>

This is bogus, imho, but not worth fixing. We should have a isasmbb device
that parses the child, creates a isasmb bus and then adds the children it
finds from parsing the smbtable and moves on. It would be much cleaner.
There's several other devices that kinda live here, but none worth
supporting
these days, so a rewrite is a waste of time.


> cpufreqX is another odd case, and I'm not quite convinced it is correct.
> Today we enumerate cpuX devices hung off of some nexus-like device (on
> x86 cpuX are children of either acpi0 or legacy0).  Each cpufreqX
> driver then uses identify routines to add named children (p4tccX, estX,
> hwpstateX, etc.).  Those identify routines all have "cpu" as the parent
> bus so that cpuX is the parent device (and they are called for each
> instance of a cpuX device).  For cpufreq I feel like we actually want
> something a bit different on x86 at least.  I think we want to create
> an explicit cpufreqX device in cpu_attach, and that the various cpufreq
> drivers that manage frequency should all be "cpufreq" drivers that bid
> to attach to that device node instead of creating duplicate nodes that
> try to duplicate work.  Today the various identify routines try to
> check for each other instead which is fragile.  It may be that we'd
> need/want two device_t nodes, one for P-states and one for throttling,
> though it might be that we only want the throttling for certain P-state
> drivers or some such.
>

This is definitely wrong in a big way. I have about 80-90% of the conversio=
n
to be proper children of cpu nodes, as appropriate.

Warner


> For your IOMMU case, you can use an ACPI table "anywhere" in the device
> tree to enumerate device nodes if necessary (though PCI buses don't
> currently call bus_identify_children today and don't have a BUS_ADD_CHILD
> method), but if you want to be "ready" before other generic children like
> PCI bridges are attached, you probably need to be a child of acpi0.
>




> --
> John Baldwin
>
>

--0000000000000b282d062b5a967c
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div dir=3D"ltr"><br></div><br><div class=3D"gmail_quote g=
mail_quote_container"><div dir=3D"ltr" class=3D"gmail_attr">On Fri, Jan 10,=
 2025 at 5:45=E2=80=AFAM John Baldwin &lt;<a href=3D"mailto:jhb@freebsd.org=
">jhb@freebsd.org</a>&gt; wrote:<br></div><blockquote class=3D"gmail_quote"=
 style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);p=
adding-left:1ex">On 1/9/25 18:01, Konstantin Belousov wrote:<br>
&gt; On Thu, Jan 09, 2025 at 08:20:55PM +0000, John Baldwin wrote:<br>
&gt;&gt; The branch main has been updated by jhb:<br>
&gt;&gt;<br>
&gt;&gt; URL: <a href=3D"https://cgit.FreeBSD.org/src/commit/?id=3Dccabc7c2=
e556ac0b14da9b682b706ccaf251c0fe" rel=3D"noreferrer" target=3D"_blank">http=
s://cgit.FreeBSD.org/src/commit/?id=3Dccabc7c2e556ac0b14da9b682b706ccaf251c=
0fe</a><br>
&gt;&gt;<br>
&gt;&gt; commit ccabc7c2e556ac0b14da9b682b706ccaf251c0fe<br>
&gt;&gt; Author:=C2=A0 =C2=A0 =C2=A0John Baldwin &lt;jhb@FreeBSD.org&gt;<br=
>
&gt;&gt; AuthorDate: 2025-01-09 20:20:16 +0000<br>
&gt;&gt; Commit:=C2=A0 =C2=A0 =C2=A0John Baldwin &lt;jhb@FreeBSD.org&gt;<br=
>
&gt;&gt; CommitDate: 2025-01-09 20:20:16 +0000<br>
&gt;&gt;<br>
&gt;&gt;=C2=A0 =C2=A0 =C2=A0 DEVICE_IDENTIFY.9: Modernize description and u=
se cases<br>
&gt;&gt;=C2=A0 =C2=A0 =C2=A0 <br>
&gt;&gt;=C2=A0 =C2=A0 =C2=A0 Mention adding devices based on firmware table=
s and software-only<br>
&gt;&gt;=C2=A0 =C2=A0 =C2=A0 pseudo-devices as use cases for identify metho=
ds as those are more<br>
&gt;&gt;=C2=A0 =C2=A0 =C2=A0 common than reading random I/O ports to identi=
fy a legacy ISA device.<br>
&gt;&gt;=C2=A0 =C2=A0 =C2=A0 <br>
&gt;&gt;=C2=A0 =C2=A0 =C2=A0 Describe how device_find_chid can be used to a=
void duplicates.=C2=A0 While<br>
&gt;&gt;=C2=A0 =C2=A0 =C2=A0 here, explicitly note that devices added in id=
entify methods typically<br>
&gt;&gt;=C2=A0 =C2=A0 =C2=A0 use a fixed device name.<br>
&gt;&gt;=C2=A0 =C2=A0 =C2=A0 <br>
&gt;&gt;=C2=A0 =C2=A0 =C2=A0 Trim the cross-references a bit.<br>
&gt;&gt;=C2=A0 =C2=A0 =C2=A0 <br>
&gt;&gt;=C2=A0 =C2=A0 =C2=A0 Reviewed by:=C2=A0 =C2=A0 ziaee, imp<br>
&gt;&gt;=C2=A0 =C2=A0 =C2=A0 Differential Revision:=C2=A0 <a href=3D"https:=
//reviews.freebsd.org/D48367" rel=3D"noreferrer" target=3D"_blank">https://=
reviews.freebsd.org/D48367</a><br>
&gt;&gt; ---<br>
&gt;&gt;=C2=A0 =C2=A0share/man/man9/DEVICE_IDENTIFY.9 | 52 ++++++++++++++++=
+++---------------------<br>
&gt;&gt;=C2=A0 =C2=A01 file changed, 25 insertions(+), 27 deletions(-)<br>
&gt;&gt;<br>
&gt;&gt; diff --git a/share/man/man9/DEVICE_IDENTIFY.9 b/share/man/man9/DEV=
ICE_IDENTIFY.9<br>
&gt;&gt; index d75c1a91ce4a..b10d94143050 100644<br>
&gt;&gt; --- a/share/man/man9/DEVICE_IDENTIFY.9<br>
&gt;&gt; +++ b/share/man/man9/DEVICE_IDENTIFY.9<br>
&gt;&gt; @@ -26,44 +26,46 @@<br>
&gt;&gt;=C2=A0 =C2=A0.\&quot; (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING I=
N ANY WAY OUT OF THE USE OF<br>
&gt;&gt;=C2=A0 =C2=A0.\&quot; THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBI=
LITY OF SUCH DAMAGE.<br>
&gt;&gt;=C2=A0 =C2=A0.\&quot;<br>
&gt;&gt; -.Dd January 15, 2017<br>
&gt;&gt; +.Dd January 9, 2025<br>
&gt;&gt;=C2=A0 =C2=A0.Dt DEVICE_IDENTIFY 9<br>
&gt;&gt;=C2=A0 =C2=A0.Os<br>
&gt;&gt;=C2=A0 =C2=A0.Sh NAME<br>
&gt;&gt;=C2=A0 =C2=A0.Nm DEVICE_IDENTIFY<br>
&gt;&gt; -.Nd identify a device, register it<br>
&gt;&gt; +.Nd identify new child devices and register them<br>
&gt;&gt;=C2=A0 =C2=A0.Sh SYNOPSIS<br>
&gt;&gt;=C2=A0 =C2=A0.In sys/param.h<br>
&gt;&gt;=C2=A0 =C2=A0.In sys/bus.h<br>
&gt;&gt;=C2=A0 =C2=A0.Ft void<br>
&gt;&gt;=C2=A0 =C2=A0.Fn DEVICE_IDENTIFY &quot;driver_t *driver&quot; &quot=
;device_t parent&quot;<br>
&gt; So what is the &#39;parent&#39; for driver which creates devices based=
 on the<br>
&gt; firmware tables?<br>
<br>
Hmmm, I could maybe try to clarify this further.=C2=A0 In new-bus, drivers =
are<br>
associated with a parent bus devclass.=C2=A0 All of the drivers associated =
with<br>
a given parent bus are then eligible for use with children of any bus<br>
devices for that bus.=C2=A0 Thus, for example:<br>
<br>
DRIVER_MODULE(foo, bar, ....)<br>
<br>
Associates the &quot;bar&quot; driver with the bus &quot;foo&quot;.=C2=A0 F=
or any fooX bus devices,<br>
the &quot;bar&quot; driver can attach to children of fooX.<br>
<br>
Most device_if.m methods operate on &quot;child&quot; devices, so device_pr=
obe,<br>
device_attach, device_detach, etc. all operate on a given barX device<br>
that is a child of a fooX.<br>
<br>
device_identify is different.=C2=A0 Instead, each fooX bus device can call<=
br>
bus_identify_children (formerly the somewhat misnamed bus_generic_probe)<br=
>
during the fooX device_attach routine.=C2=A0 bus_identify_children looks fo=
r<br>
the devclass of foo and then walks all of the eligible device drivers<br>
for potential children of foo invoking this method with fooX as the<br>
parent.=C2=A0 The idea being that device_identify will create &quot;barX&qu=
ot; children<br>
of &quot;fooX&quot; explicitly using BUS_ADD_CHILD.<br>
<br>
In terms of which parent device, it&#39;s really about where a given &quot;=
barX&quot;<br>
device should live.=C2=A0 For system-wide &quot;top-level&quot; devices tha=
t aren&#39;t<br>
behind some other bridge on an ACPI system, the pattern we use is to<br>
hang those devices as children of acpi0, so you end up with<br>
<br>
DRIVER_MODULE(acpi, bar, ...)<br>
<br>
and the parent device at the time of DEVICE_IDENTIFY is acpi0.=C2=A0 Howeve=
r,<br>
we also use identify in some other places.=C2=A0 nexus0 children all tend t=
o<br>
either be explicitly added in nexus_attach() or added via identify<br>
routines that use nexus as the parent.=C2=A0 legacy0 on x86 also uses ident=
ify<br>
routines to add child devices like pcibX.<br></blockquote><div><br></div><d=
iv>These are likely OK. It&#39;s arguably possible to model this as devices=
 known</div><div>in other places, but it gets super awkward and you&#39;d h=
ave to do hand-stands</div><div>that are worse in other ways.</div><div>=C2=
=A0</div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8e=
x;border-left:1px solid rgb(204,204,204);padding-left:1ex">
Another case is the ipmi(4) device.=C2=A0 Legacy ISA IPMI devices are descr=
ibed<br>
by an entry in the SMBIOS table.=C2=A0 The ipmi(4) driver uses an identify<=
br>
routine to add an ipmi0 device as a child of isa0 (since it has I/O ports<b=
r>
like a typical ISA device), but the identify routine is table-driven since<=
br>
it depends on parsing the smbios table.<br></blockquote><div><br></div><div=
>This is bogus, imho, but not worth fixing. We should have a isasmbb device=
</div><div>that parses the child, creates=C2=A0a isasmb bus and then adds t=
he children it</div><div>finds from parsing the smbtable and moves on. It w=
ould be much cleaner.</div><div>There&#39;s several other devices that kind=
a live here, but none worth supporting</div><div>these days, so a rewrite i=
s a waste of time.</div><div>=C2=A0</div><blockquote class=3D"gmail_quote" =
style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);pa=
dding-left:1ex">
cpufreqX is another odd case, and I&#39;m not quite convinced it is correct=
.<br>
Today we enumerate cpuX devices hung off of some nexus-like device (on<br>
x86 cpuX are children of either acpi0 or legacy0).=C2=A0 Each cpufreqX<br>
driver then uses identify routines to add named children (p4tccX, estX,<br>
hwpstateX, etc.).=C2=A0 Those identify routines all have &quot;cpu&quot; as=
 the parent<br>
bus so that cpuX is the parent device (and they are called for each<br>
instance of a cpuX device).=C2=A0 For cpufreq I feel like we actually want<=
br>
something a bit different on x86 at least.=C2=A0 I think we want to create<=
br>
an explicit cpufreqX device in cpu_attach, and that the various cpufreq<br>
drivers that manage frequency should all be &quot;cpufreq&quot; drivers tha=
t bid<br>
to attach to that device node instead of creating duplicate nodes that<br>
try to duplicate work.=C2=A0 Today the various identify routines try to<br>
check for each other instead which is fragile.=C2=A0 It may be that we&#39;=
d<br>
need/want two device_t nodes, one for P-states and one for throttling,<br>
though it might be that we only want the throttling for certain P-state<br>
drivers or some such.<br></blockquote><div><br></div><div>This is definitel=
y wrong in a big way. I have about 80-90% of the conversion</div><div>to be=
 proper children of cpu nodes, as appropriate.</div><div><br></div><div>War=
ner</div><div>=C2=A0</div><blockquote class=3D"gmail_quote" style=3D"margin=
:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"=
>
For your IOMMU case, you can use an ACPI table &quot;anywhere&quot; in the =
device<br>
tree to enumerate device nodes if necessary (though PCI buses don&#39;t<br>
currently call bus_identify_children today and don&#39;t have a BUS_ADD_CHI=
LD<br>
method), but if you want to be &quot;ready&quot; before other generic child=
ren like<br>
PCI bridges are attached, you probably need to be a child of acpi0.<br></bl=
ockquote><div><br></div><div><br></div><div>=C2=A0</div><blockquote class=
=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rg=
b(204,204,204);padding-left:1ex">
-- <br>
John Baldwin<br>
<br>
</blockquote></div></div>

--0000000000000b282d062b5a967c--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CANCZdfoL32QO9PnPnLe3rvxh1y2c2NjRmhTszbb1XNT6JM0X8w>