From: Warner Losh
Date: Wed, 2 Oct 2024 08:54:52 -0600
Subject: Re: FYI: make's "max_jobs" needs to be separated from -j (now?)
To: Dan Mack
Cc: dsdqmzk@hotmail.com, freebsd-current@freebsd.org
List-Id: Discussions about the use of FreeBSD-current
List-Archive: https://lists.freebsd.org/archives/freebsd-current
In-Reply-To: <0db2d927-3299-2c0f-2310-d8e386fb31c6@macktronics.com>


On Wed, Oct 2, 2024 at 8:42 AM Dan Mack <mack@macktronics.com> wrote:

> On Wed, 2 Oct 2024, dsdqmzk@hotmail.com wrote:
>
> > Dan Mack wrote:
> >> On Wed, 2 Oct 2024, dsdqmzk@hotmail.com wrote:
> >>
> >>> David Wolfskill wrote:
> >>>> I have been tracking stable/ and head (daily, with a few exceptions) for
> >>>> many years, now.  Over time, I set up a set of ([t]csh) aliases to
> >>>> simplify the exercise for me.
> >>>>
> >>>> Until yesterday, the "make -j${max_jobs} buildworld" construct had
> >>>> worked without issue, but (yesterday), the invocation failed quite
> >>>> quickly:
> >>>>
> >>>> | Tue Oct  1 11:54:18 UTC 2024
> >>>> | --- buildworld ---
> >>>> | make[1]: "/usr/src/Makefile.inc1" line 362: SYSTEM_COMPILER:
> >>>> Determined that CC=cc matches the source tree.  Not bootstrapping a
> >>>> cross-compiler.
> >>>> | make[1]: "/usr/src/Makefile.inc1" line 367: SYSTEM_LINKER:
> >>>> Determined that LD=ld matches the source tree.  Not bootstrapping a
> >>>> cross-linker.
> >>>> | --------------------------------------------------------------
> >>>> | >>> World build started on Tue Oct  1 11:54:18 UTC 2024
> >>>> | --------------------------------------------------------------
> >>>> | >>> Deleting stale files in build tree...
> >>>> |         0.14 real         0.23 user         0.10 sys
> >>>> | *** [_cleanworldtmp] Error code 6
> >>>> |
> >>>> | make[1]: stopped making "buildworld" in /usr/src
> >>>> | .ERROR_TARGET='_cleanworldtmp'
> >>>> | .ERROR_META_FILE=''
> >>>>
> >>>> On a bit of a whim, I tried adjusting the "max_jobs" values (downward),
> >>>> which didn't help, but removing the "-j14" entirely did not produce a
> >>>> failure.
> >>>>
> >>>> On the other hand, rebuilding clang/llvm with a single core on a laptop
> >>>> (when I actually want to be able to use the laptop later in the day
> >>>> while I'm at work) didn't seem productive.
> >>>>
> >>>> A bit more rather randomly "trying stuff" yielded the result that while
> >>>>
> >>>>     make -j14 buildworld
> >>>>
> >>>> failed (as described above),
> >>>>
> >>>>     make -j 14 buildworld
> >>>>
> >>>> carries on as before -- it's building lib/clang (and using multiple
> >>>> cores to do so)....  :-}
> >>>
> >>> Just got the same error, but both invocations didn't work, and I noticed
> >>> that bootstrapped version of mtree failed to run because of (now)
> >>> missing libmd.so.6.  I think it's not really related to whitespace
> >>> between -j and jobs number, rather you had to (re)build the bootstrap
> >>> tools.
> >>
> >> I have been building current twice daily for a while and didn't notice
> >> this regression but I do have the space after "-j"
> >>
> >>   #!/bin/sh
> >>    make -j 16 buildworld              > /logs/bw.$$ 2>&1 && \
> >>    make -j 8 kernel KERNCONF=GENERIC  > /logs/bk.$$ 2>&1 && \
> >>    sync && reboot
> >
> > Do you also do `make delete-old-libs`?
> >
> >> I grepped all my logs across 3 servers and did not see a single instance
> >> of [_cleanworldtmp] Error code ... in any of the logs.  What was the
> >> hash of the build you were on there, I can try to reproduce it quickly
> >> (but it might only trigger with your builddir state I guess)
> >
> > If I understand the problem correctly, it should be as easy as:
> >
> > 1. build on pre-e7a629c851d system
> > 2. install/reboot
> > 3. make delete-old-libs
> > 4. try to build world/kernel that fail as above, and, I think, make
> > kernel-toolchain was the one failing because mtree failed to run
> > (because of libmd.so.6 gone now)
> >
> > In any case, wiping out /usr/obj solved it for me.
>
> Ack, okay.   I can't trigger it with a fresh or my /usr/obj but in any
> event the error number 6 is probably referring to a path or directory
> missing while doing a parallel build given some input state :-)
>
> #define ENXIO           6               /* Device not configured */

ENXIO usually is reserved for hardware errors when a device disappears
for block I/O contexts. So I'm not sure that this theory is so good.

But shell error exit statuses are largely independent of errnos.
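
A minimal sketch of that last point, assuming a POSIX sh. A process chooses its own exit status, so make's "Error code 6" reports whatever the failing command returned, which need not be errno 6. The file path below is hypothetical and only needs to not exist.

```sh
#!/bin/sh
# A child process picks its own exit status; this one simply chooses 6.
# No system call fails here, so no errno (ENXIO or otherwise) is involved.
status=0
sh -c 'exit 6' || status=$?
echo "child exit status: ${status}"

# cat fails here because open(2) returns ENOENT, yet cat reports the
# failure through its own nonzero exit status rather than the errno value.
# The path is hypothetical; it only needs to not exist.
cat_status=0
cat /nonexistent/hypothetical/file 2>/dev/null || cat_status=$?
echo "cat exit status: ${cat_status}"
```

The first status is 6 purely because the child asked for it; the second depends on cat's own conventions, not on the numeric value of ENOENT.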

Warner