Date: Tue, 7 Aug 2018 16:11:13 -0700
From: Doug Ambrisko <ambrisko@ambrisko.com>
To: Konstantin Belousov <kostikbel@gmail.com>
Cc: Kyle Evans <kevans@freebsd.org>, Eitan Adler <eadler@freebsd.org>, freebsd-current Current <freebsd-current@freebsd.org>
Subject: Re: panic: mutex pmap not owned at ... efirt_machdep.c:255
Message-ID: <20180807231113.GA58652@ambrisko.com>
In-Reply-To: <20180807224207.GF1884@kib.kiev.ua>
References: <CACNAnaFv9Gj%2BM2Gb8FbwY5q56dnTR6OgjZ5qth9gjr8LreHeow@mail.gmail.com> <20180804083720.GJ6049@kib.kiev.ua> <CACNAnaF1anCXzjivX0jqur%2Bomy_%2B6Xv4D%2BDttzLQY9g6qaxYbw@mail.gmail.com> <20180805104341.GX6049@kib.kiev.ua> <CACNAnaHAoVE57aw7UUPO=psOXz0nnUaCSXtJSFhpBo%2BBhp_qnw@mail.gmail.com> <CAF6rxgmGLbLnMacc_Bb7_RKEAb6vazBjXwcECzsuVqJWKMrBvQ@mail.gmail.com> <CACNAnaHdgbe%2BVDZC-gV-oU7=DgNZhEfg1Uanrs0Lh4MgkLXsOQ@mail.gmail.com> <20180807172949.GD1884@kib.kiev.ua> <20180807214910.GA53911@ambrisko.com> <20180807224207.GF1884@kib.kiev.ua>

On Wed, Aug 08, 2018 at 01:42:07AM +0300, Konstantin Belousov wrote:
| On Tue, Aug 07, 2018 at 02:49:10PM -0700, Doug Ambrisko wrote:
| > On Tue, Aug 07, 2018 at 08:29:49PM +0300, Konstantin Belousov wrote:
| > | On Tue, Aug 07, 2018 at 11:50:44AM -0500, Kyle Evans wrote:
| > | > On Tue, Aug 7, 2018 at 12:09 AM, Eitan Adler <eadler@freebsd.org> wrote:
| > | > > On Mon, 6 Aug 2018 at 11:27, Kyle Evans <kevans@freebsd.org> wrote:
| > | > >>
| > | > >> On Sun, Aug 5, 2018 at 5:43 AM, Konstantin Belousov <kostikbel@gmail.com> wrote:
| > | > >> > On Sat, Aug 04, 2018 at 09:46:39PM -0500, Kyle Evans wrote:
| > | > >> >>
| > | > >> >> He now gets a little further, but ends up with the same panic due to
| > | > >> >> efirtc_probe trying to get time to verify the rtc's actually
| > | > >> >> implemented. What kind of approach must we take to ensure curcpu is
| > | > >> >> synced?
| > | > >> >
| > | > >> > It does not panic for me, when I load efirt.ko from the loader prompt.
| > | > >> > Anyway, try this
| > | > >>
| > | > >> Right, I also don't get a panic on any of my machines from this.
| > | > >> Hopefully he'll have a chance to try this soon.
| > | > >
| > | > > This change has no impact: it still panics in the same way as without the patch.
| > | > >
| > | >
| > | > That seems indicative of a bigger problem, since we use proc0
| > | > throughout all these bits so we should still be dealing with the same
| > | > pmap that got passed to pmap_pinit0 when we grab
| > | > curthread->td_proc->p_vmspace->vm_pmap.
| > |
| > | Can you confirm that you get the early efi_enter() call from rtc code,
| > | when you preload the module or compile it into the kernel ?
| >
| > When I ran into this, I did this change:
| >
| > Index: dev/efidev/efirt.c
| > ===================================================================
| > --- dev/efidev/efirt.c	(revision 337264)
| > +++ dev/efidev/efirt.c	(working copy)
| > @@ -257,7 +257,8 @@
| >  	if (efi_runtime == NULL)
| >  		return (ENXIO);
| >  	td = curthread;
| > -	curpmap = &td->td_proc->p_vmspace->vm_pmap;
| > +//	curpmap = &td->td_proc->p_vmspace->vm_pmap;
| > +	curpmap = PCPU_GET(curpmap);
| >  	PMAP_LOCK(curpmap);
| >  	mtx_lock(&efi_lock);
| >  	fpu_kern_enter(td, NULL, FPU_KERN_NOCTX);
| > @@ -272,7 +273,8 @@
| >
| >  	efi_arch_leave();
| >
| > -	curpmap = &curproc->p_vmspace->vm_pmap;
| > +//	curpmap = &curproc->p_vmspace->vm_pmap;
| > +	curpmap = PCPU_GET(curpmap);
| >  	td = curthread;
| >  	fpu_kern_leave(td, NULL);
| >  	mtx_unlock(&efi_lock);
| >
| > Don't know if it is right.  Some previous code used both
| > 	curpmap = PCPU_GET(curpmap);
| > and
| > 	curpmap = &td->td_proc->p_vmspace->vm_pmap;
| > recently it was changes to only use
| > 	curpmap = &td->td_proc->p_vmspace->vm_pmap;
| >
| > Things seem to work after that.  I was able to repro. it with
| > qemu-system-x86_64 in UEFI mode.  I think it also failed in
| > bhyve UEFI mode.
|
| The pcpu curpmap and curproc vmspace pmap should be synced.  Esp. since
| there is code relying on this early.  I do not want to paper it over.
|
| In fact, try this please.  Ignore my previous change.
|
| diff --git a/sys/amd64/amd64/pmap.c b/sys/amd64/amd64/pmap.c
| index 572b2197453..4bce36cc0e5 100644
| --- a/sys/amd64/amd64/pmap.c
| +++ b/sys/amd64/amd64/pmap.c
| @@ -7536,7 +7536,8 @@ pmap_activate_sw(struct thread *td)
|  			PCPU_SET(kcr3, pmap->pm_cr3);
|  			PCPU_SET(ucr3, pmap->pm_ucr3);
|  		}
| -	}
| +	} else
| +		PCPU_SET(curpmap, pmap);
|  	if (pmap->pm_ucr3 != PMAP_NO_CR3) {
|  		rsp0 = ((vm_offset_t)PCPU_PTR(pti_stack) +
|  		    PC_PTI_STACK_SZ * sizeof(uint64_t)) & ~0xful;

That works for qemu and bhyve booting in UEFI PXE mode.  I backed out my
other change and synced to head.

Thanks,

Doug A.