From nobody Wed May 24 09:47:02 2023 X-Original-To: freebsd-arch@mlmmj.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 4QR5v53XNLz4CVBs for ; Wed, 24 May 2023 09:47:13 +0000 (UTC) (envelope-from hps@selasky.org) Received: from mail.turbocat.net (turbocat.net [IPv6:2a01:4f8:c17:6c4b::2]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 4QR5v32g2Zz4BJb; Wed, 24 May 2023 09:47:11 +0000 (UTC) (envelope-from hps@selasky.org) Authentication-Results: mx1.freebsd.org; none Received: from [10.36.2.145] (unknown [46.212.121.255]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by mail.turbocat.net (Postfix) with ESMTPSA id 1B1842600D2; Wed, 24 May 2023 11:47:03 +0200 (CEST) Message-ID: <39e6793a-33f9-8f48-30d3-eaa30d321452@selasky.org> Date: Wed, 24 May 2023 11:47:02 +0200 List-Id: Discussion related to FreeBSD architecture List-Archive: https://lists.freebsd.org/archives/freebsd-arch List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-freebsd-arch@freebsd.org MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; FreeBSD amd64; rv:102.0) Gecko/20100101 Thunderbird/102.10.1 Subject: Re: [RFC] An idea for general kernel post-processing automation in FreeBSD To: Emmanuel Vadot , Kyle Evans Cc: John Baldwin , Mark Johnston , Warner Losh , Konstantin Belousov , "freebsd-arch@freebsd.org" , Jessica Clarke References: <394775eb-1000-60d9-2ddd-5c2aca6c99ea@selasky.org> <08ba506f-66c5-35eb-e992-ecf9dfeccf93@selasky.org> <20230524082850.ffc8793b6bb1b47325ae0139@bidouilliste.com> Content-Language: en-US From: Hans Petter Selasky In-Reply-To: <20230524082850.ffc8793b6bb1b47325ae0139@bidouilliste.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Rspamd-Queue-Id: 4QR5v32g2Zz4BJb X-Spamd-Bar: ---- X-Spamd-Result: default: False [-4.00 / 15.00]; REPLY(-4.00)[]; ASN(0.00)[asn:24940, ipnet:2a01:4f8::/32, country:DE] X-Rspamd-Pre-Result: action=no action; module=replies; Message is reply to one we originated X-ThisMailContainsUnwantedMimeParts: N On 5/24/23 08:28, Emmanuel Vadot wrote: >>> +1. Hi, >>> Also, the concern over the runtime for the potential worst-case for qsort >>> seems_way_ overblown compared to the actual unsorted arrays one will get >>> out of the linker. The only possible downside of qsort that matters in >>> this case is the stack usage. I think it is fair to say there are two levels of questions: The first question is: Runtime sorting vs Compile time sorting The second question is: Mathematically correct or System/Political correct sorting Level 1: 1.1) Runtime sorting puts the least requirements on the toolchain. Data is concated in an unsorted binary object section. But in order to read and debug this section, special tools are needed. 1.2) Compiletime sorting puts more requirements on the toolchain. Referring Jessica Clarke, this is a "gross hack". On the other hand, debugging and visibility makes this desirable. Level 2: Should an appropriate sorting function be used or not? Typically there are like 1K of entries to sort, but speaking about the indexer, there are far less sorting keys for the 1K of entries. 2.1) libkern's qsort() aka https://en.wikipedia.org/wiki/Quicksort Using Quicksort is basically like first randomizing the data, and then sorting it again. It is not optimal in any ways with regards to CPU time. But system wise you could save code by using the same sorting algorithm over and over again. 2.2) mergesort() Using mergesort() has a predictable executing time, bound to log2(N) * N, where N is the number of elements to sort. The code in question already allocates a new array to store the output of the sort, and a kernel mergesort() routine could easily use the output array as temporary storage. 2.3) radixsort() There are less 64-bit keys than sysinits, and I see a radix sort algorithm would complete in less than log2(N) * N time. Radix sort is an in-place sorting algorithm, and requires little code. It works similarly to QuickSort, but doesn't go around a guess pivots. It already knows them, because there is a sorting key. 2.4) insertionsort() When arrays are already sorted, the execution time is O(N), else O(N²) 2.5) bubblesort() Sorting time is more or less constant O(N²). In case compile time sorting is selected, there is no need for runtime sorting. FreeBSD controls the build of all kernel modules, in-tree and out-of-the tree. We go by source and not binaries, and that is our strength. This can be an ABI contract thing for SYSINITs and kernel modules. In case runtime sorting is selected, I see a radix sorter as the less CPU and stack intensive algorithm. This would be the mathematically correct choice. qsort() as-of-today is the politically correct choice. Less code, but more CPU and stack. >>> >> IMO sorting at runtime is a minor feature by itself, too. I mentioned >> in one of the reviews somewhere, we have in the past have had bugs due >> to poor specification of sysinit order within a subsystem -- I'm >> recalling a specific one that wasn't even that long ago, maybe three >> or four years, that manu@ had hit because he did or didn't include >> some device in his config that ended up shifting link order of other >> objects and reversing two sysinits into an order that wasn't actually >> functional. > Yes this problem was a pain to debug (I mean just to understand the > actual problem), and iirc it was never solved, I only had the issue on > one machine that I don't use anymore and the problem went away and > re-apears from time to time when sysinits where added. > >> We can still add some bits to test that (preferably a >> tunable to reverse order within a subsystem rather than having to >> re-link the kernel) even with sorting at link-time, but it sure does >> look a lot less fragile if we have to sort it anyways and we just >> reverse one criteria. I think if you could just diff two plaintext files, listing all the constructors in the correct order, that would be of great help when problems arise in this area. You cannot always rely on having a console during bringup. At the mention of manu@, I remember how difficult it was to bringup the graphics driver code and kernel modules which are common among FreeBSD desktop users. What do you do, if all you have is a black screen, and you need to debug something? Does it make sense if I enumerate and collect the different views on this issue on a WikiPage at FreeBSD.org, so it doesn't get lost for the future? --HPS