Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 24 Feb 2022 03:23:51 +0200
From:      Konstantin Belousov <kostikbel@gmail.com>
To:        Alexander Motin <mav@freebsd.org>
Cc:        Mike Karels <mike@karels.net>, Tomoaki AOKI <junchoon@dec.sakura.ne.jp>, "Chen, Alvin W" <Weike.Chen@dell.com>, freebsd-current@freebsd.org
Subject:   Re: [Intel AlderLake] Read&Write files to FAT32 or UFS partition cause data corrupt due to P-Core&E-Core
Message-ID:  <YhbeJ%2B5KvGt9sr9U@kib.kiev.ua>
In-Reply-To: <bd125a09-3b72-92dd-fcb1-466a9a81fe43@FreeBSD.org>
References:  <YhE1rWoA%2BhMfebq/@kib.kiev.ua> <59cbcfe2-cd53-69d8-65d6-7a79e656f494@FreeBSD.org> <YhVnsB5ZwLYmpAFP@kib.kiev.ua> <1f968af1-1c57-9a09-7e01-145a5262e27f@FreeBSD.org> <YhVyFIFA5XnbGHej@kib.kiev.ua> <06768ef6-c88e-b6c7-0db3-f61eb4230937@FreeBSD.org> <YhV0waCqFLfBy8s7@kib.kiev.ua> <470db559-7e7d-1af7-5983-2858814329d2@FreeBSD.org> <YhV5LOCJMuumTJZw@kib.kiev.ua> <bd125a09-3b72-92dd-fcb1-466a9a81fe43@FreeBSD.org>

next in thread | previous in thread | raw e-mail | index | archive | help
On Wed, Feb 23, 2022 at 12:25:24PM -0500, Alexander Motin wrote:
> On 22.02.2022 19:00, Konstantin Belousov wrote:
> > On Tue, Feb 22, 2022 at 06:53:09PM -0500, Alexander Motin wrote:
> > > On 22.02.2022 18:41, Konstantin Belousov wrote:
> > > > On Tue, Feb 22, 2022 at 06:38:24PM -0500, Alexander Motin wrote:
> > > > > On 22.02.2022 18:30, Konstantin Belousov wrote:
> > > > > > As another blind guess, try to disable pcid, vm.pmap.pcid_enabled=0.
> > > > > 
> > > > > Do you mean it to be a workaround for TrueNAS 12, or it should provide some
> > > > > information?  The system is at the office and has no IPMI, so I can't switch
> > > > > the boot device from home right now.
> > > > I intended to see if it is the cause or related feature.
> > > 
> > > I'll try that on the 12 tomorrow, if applicable.
> > 
> > Yes should be relevant still.
> 
> It did the trick.  I repeated several times successful boots with the pcid
> disabled, and failed ones with default enabled.  In attachment you may find
> verbose serial console output captures with pcid disabled and enabled,
> though without the cpuinfo patch.  During the testing I had only one P and
> one E cores enabled to reduce noise.  Only after that I found P core having
> SMT enabled, but I then repeated without SMT also, so it is indeed
> irrelevant.
> 
> I'm curios, what in pcid could differentiate the P and E cores, and have it
> got fixed in latest stable/13, or I am just "unlucky" to not reproduce it
> there?

I am curious as well.  PCID works on both big Intel cores, and on small
cores like Apollo Lake etc.  So the fact that it does not properly interact
in P/E settings either mean that there is something I did not accounted
for from the spec, or there is a bug in silicon.

I have no idea why do we work on stable/13 and HEAD.  There were enough
changes to PCID code there, but it was mostly restructuring and polishing.

So the only way to get more understanding is to bisect to see which commit
on HEAD fixed the boot.



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?YhbeJ%2B5KvGt9sr9U>