Date: Thu, 24 Feb 2022 03:23:51 +0200 From: Konstantin Belousov <kostikbel@gmail.com> To: Alexander Motin <mav@freebsd.org> Cc: Mike Karels <mike@karels.net>, Tomoaki AOKI <junchoon@dec.sakura.ne.jp>, "Chen, Alvin W" <Weike.Chen@dell.com>, freebsd-current@freebsd.org Subject: Re: [Intel AlderLake] Read&Write files to FAT32 or UFS partition cause data corrupt due to P-Core&E-Core Message-ID: <YhbeJ%2B5KvGt9sr9U@kib.kiev.ua> In-Reply-To: <bd125a09-3b72-92dd-fcb1-466a9a81fe43@FreeBSD.org> References: <YhE1rWoA%2BhMfebq/@kib.kiev.ua> <59cbcfe2-cd53-69d8-65d6-7a79e656f494@FreeBSD.org> <YhVnsB5ZwLYmpAFP@kib.kiev.ua> <1f968af1-1c57-9a09-7e01-145a5262e27f@FreeBSD.org> <YhVyFIFA5XnbGHej@kib.kiev.ua> <06768ef6-c88e-b6c7-0db3-f61eb4230937@FreeBSD.org> <YhV0waCqFLfBy8s7@kib.kiev.ua> <470db559-7e7d-1af7-5983-2858814329d2@FreeBSD.org> <YhV5LOCJMuumTJZw@kib.kiev.ua> <bd125a09-3b72-92dd-fcb1-466a9a81fe43@FreeBSD.org>
next in thread | previous in thread | raw e-mail | index | archive | help
On Wed, Feb 23, 2022 at 12:25:24PM -0500, Alexander Motin wrote: > On 22.02.2022 19:00, Konstantin Belousov wrote: > > On Tue, Feb 22, 2022 at 06:53:09PM -0500, Alexander Motin wrote: > > > On 22.02.2022 18:41, Konstantin Belousov wrote: > > > > On Tue, Feb 22, 2022 at 06:38:24PM -0500, Alexander Motin wrote: > > > > > On 22.02.2022 18:30, Konstantin Belousov wrote: > > > > > > As another blind guess, try to disable pcid, vm.pmap.pcid_enabled=0. > > > > > > > > > > Do you mean it to be a workaround for TrueNAS 12, or it should provide some > > > > > information? The system is at the office and has no IPMI, so I can't switch > > > > > the boot device from home right now. > > > > I intended to see if it is the cause or related feature. > > > > > > I'll try that on the 12 tomorrow, if applicable. > > > > Yes should be relevant still. > > It did the trick. I repeated several times successful boots with the pcid > disabled, and failed ones with default enabled. In attachment you may find > verbose serial console output captures with pcid disabled and enabled, > though without the cpuinfo patch. During the testing I had only one P and > one E cores enabled to reduce noise. Only after that I found P core having > SMT enabled, but I then repeated without SMT also, so it is indeed > irrelevant. > > I'm curios, what in pcid could differentiate the P and E cores, and have it > got fixed in latest stable/13, or I am just "unlucky" to not reproduce it > there? I am curious as well. PCID works on both big Intel cores, and on small cores like Apollo Lake etc. So the fact that it does not properly interact in P/E settings either mean that there is something I did not accounted for from the spec, or there is a bug in silicon. I have no idea why do we work on stable/13 and HEAD. There were enough changes to PCID code there, but it was mostly restructuring and polishing. So the only way to get more understanding is to bisect to see which commit on HEAD fixed the boot.
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?YhbeJ%2B5KvGt9sr9U>