Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 17 Nov 2025 21:08:29 -0800
From:      Mark Millard <marklmi@yahoo.com>
To:        Adrian Chadd <adrian@freebsd.org>
Cc:        bob prohaska <fbsd@www.zefox.net>, Carl Shapiro <cshapiro@panix.com>, Ronald Klop <ronald@freebsd.org>, freebsd-arm@freebsd.org, freebsd-current@freebsd.org
Subject:   Re: Still seeing Failed assertion: "p[i] == 0" on armv7 buildworld
Message-ID:  <13E753F4-84F8-4ADB-96B6-908897D6971C@yahoo.com>
In-Reply-To: <CAJ-Vmo=TbT7nD7rBrNnq3cutwMp9f7WXtQ-k9mUBne5ht4zGWg@mail.gmail.com>
References:  <aOvTG-20QRJtJJwf@int21h> <CANCZdfrJ8rph_rkT3Mk-sNYKNspoV15SvHWLsahzS0HnULi4ww@mail.gmail.com> <aO068RrAehdiHOoZ@www.zefox.net> <aRUJPryA4Vmu8dDD@www.zefox.net> <4957be52-e57f-4f5f-9626-d0f706480fe1@FreeBSD.org> <87ldkai9lu.fsf@panix.com> <aRXuLTN4hkGykHIl@www.zefox.net> <877bvthymv.fsf@panix.com> <aRdJ5xYeKEmhuIgh@www.zefox.net> <ouy1pm0nued.fsf@panix3.panix.com> <aRtBYaaa0n3_lwar@www.zefox.net> <CAJ-Vmo=TbT7nD7rBrNnq3cutwMp9f7WXtQ-k9mUBne5ht4zGWg@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help

On Nov 17, 2025, at 19:53, Adrian Chadd <adrian@freebsd.org> wrote:

> (random reply, sorry bob)
> 
> i think i saw someone say they can trigger it with a single super large source file, is that right? No need for parallelism, just build that one file?

I do not remember seeing such a claim about "p[i] == 0" failures.

All examples that I'm aware of involve both parallelism and memory
pressure. Parallelism without memory pressure did not get have the
issue happen in my armv7 testing on the Windows Dev Kit 2023.
(Trying to test such on a RPi2B v1.1 would be problematical.)

Parallelism with memory pressure has not shown the problem so far
for aarch64 testing.


> If so please pipe up, i'd like to see if you can get that over to mark on his armv8 box and then we can try some stuff (like using cpuset to pin the compilation to a single core so it doesn't migrate)
> 
> 
> -adrian
> 
> 
> On Mon, 17 Nov 2025 at 07:37, bob prohaska <fbsd@www.zefox.net> wrote:
> On Fri, Nov 14, 2025 at 05:04:10PM -0500, Carl Shapiro wrote:
> > bob prohaska <fbsd@www.zefox.net> writes:
> > 
> > > Those files have been overwritten by restarting the buildworld sessions.
> > > They tend to be large and diffcult to synchronize with the .cpp and .sh
> > > files generated by the crash. It could be done if it's useful.
> > 
> > At least from the perspective of debugging malloc(3), they'd be useful,
> > even if the files for reproducing the crash are not synchronized with
> > the std{err,out} output.  For example, there might be other log messages
> > generated by jemalloc.
> > 
> > I need a moment to look at the code and step through what it is doing on
> > FreeBSD but my first guess is that there might just be an incorrect
> > assumption about committed memory always coming back zeroed.  That
> > should be true on 64-bit Linux when MADV_DONTNEED is used but not true
> > if another advice is used like MADV_FREE on either FreeBSD or Linux.  It
> > is always possible that the kernel is mishanding some memory but I would
> > like to rule out jemalloc itself before pointing a finger there.
> 
> Here is an example of both the buildworld.log file and the generated
> diagnostic files, which for some reason didn't include .sh and .cpp files.
> 
> http://www.zefox.net/~fbsd/assertion_failure/hostname_www.zefox.org/buildworld.log
> http://www.zefox.net/~fbsd/assertion_failure/hostname_www.zefox.org/symbolizer-input-bcaebf
> http://www.zefox.net/~fbsd/assertion_failure/hostname_www.zefox.org/symbolizer-output-1aa401
> 
> This host's particular buildworld attempt has been going on for a long time, to the extent that
> world and kernel are mismatched:
> root@www:/usr/src # uname -KU
> 1600000 1500063
> The immediate goal is to get them back in sync.




===
Mark Millard
marklmi at yahoo.com




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?13E753F4-84F8-4ADB-96B6-908897D6971C>