Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 15 May 2023 13:26:31 +0000
From:      bugzilla-noreply@freebsd.org
To:        bugs@FreeBSD.org
Subject:   [Bug 271438] panic due to memory corruption at zfs/disk probing stage during install of 20230504 snapshot
Message-ID:  <bug-271438-227@https.bugs.freebsd.org/bugzilla/>

next in thread | raw e-mail | index | archive | help
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D271438

            Bug ID: 271438
           Summary: panic due to memory corruption at zfs/disk probing
                    stage during install of 20230504 snapshot
           Product: Base System
           Version: CURRENT
          Hardware: amd64
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: kern
          Assignee: bugs@FreeBSD.org
          Reporter: bcran@FreeBSD.org

Created attachment 242195
  --> https://bugs.freebsd.org/bugzilla/attachment.cgi?id=3D242195&action=
=3Dedit
dmesg from a dual-EPYC PowerEdge R7525

During an install of FreeBSD 14-CURRENT snapshot 20230504, my dual-EPYC
PowerEdge R7525 system panics at the zfs/disk probing stage due to memory
corruption.
It appears to be because of memory corruption, because I've also seen it ca=
use
bsdinstall to simply segfault.

I ended up working around the panic by disabling the mrsas controller during
install, and re-enabling it later.=20

panic: mtx_lock() of spin mutex (null) @ /usr/src/sys/kern/kern_clock.c:269
cpuid =3D 67
time =3D 1683225642
KDB: stack backtrace:
db_trace_self_wrapper() at db_trace_self_wrapper+0x2b/frame 0xfffffe037aa72=
db0
vpanic() at vpanic+0x152/frame 0xfffffe037aa72e00
panic() at panic+0x43/frame 0xfffffe037aa72e60
__mtx_lock_flags() at __mtx_lock_flags+0x13b/frame 0xfffffe037aa72eb0
deadlkres() at deadlkres+0xef/frame 0xfffffe037aa72ef0
fork_exit() at fork_exit+0x80/frame 0xfffffe037aa72f30
fork_trampoline() at fork_trampoline+0xe/frame 0xfffffe037aa72f30
--- trap 0, rip =3D 0, rsp =3D 0, rbp =3D 0 ---
KDB: enter: panic
[ thread pid 0 tid 101238 ]
Stopped at      kdb_enter+0x32: movq    $0,0xddd833(%rip)
db>


Start of dmesg:

FreeBSD 14.0-CURRENT #0 main-n262746-4194bbb34c60: Thu May  4 08:05:46 UTC =
2023
root@releng1.nyi.freebsd.org:/usr/obj/usr/src/amd64.amd64/sys/GENERIC amd64
FreeBSD clang version 15.0.7 (https://github.com/llvm/llvm-project.git
llvmorg-15.0.7-0-g8dfdcc7b7bf6)
WARNING: WITNESS option enabled, expect reduced performance.
SRAT: Too many memory domains
VT(efifb): resolution 1024x768
CPU: AMD EPYC 7713 64-Core Processor (1996.32-MHz K8-class CPU)
  Origin=3D"AuthenticAMD"  Id=3D0xa00f11  Family=3D0x19  Model=3D0x1 Steppi=
ng=3D1
Features=3D0x178bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,=
MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2,HTT>
Features2=3D0x7efa320b<SSE3,PCLMULQDQ,MON,SSSE3,FMA,CX16,PCID,SSE4.1,SSE4.2=
,x2APIC,MOVBE,POPCNT,AESNI,XSAVE,OSXSAVE,AVX,F16C,RDRAND>
  AMD Features=3D0x2e500800<SYSCALL,NX,MMX+,FFXSR,Page1GB,RDTSCP,LM>
  AMD
Features2=3D0x75c237ff<LAHF,CMP,SVM,ExtAPIC,CR8,ABM,SSE4A,MAS,Prefetch,OSVW=
,IBS,SKINIT,WDT,TCE,Topology,PCXC,PNXC,DBE,PL2I,MWAITX,ADMSKX>
  Structured Extended
Features=3D0x219c97a9<FSGSBASE,BMI1,AVX2,SMEP,BMI2,ERMS,INVPCID,PQM,PQE,RDS=
EED,ADX,SMAP,CLFLUSHOPT,CLWB,SHA>
  Structured Extended Features2=3D0x40069c<UMIP,PKU,OSPKE,VAES,VPCLMULQDQ,R=
DPID>
  Structured Extended Features3=3D0x10<FSRM>
  XSAVE Features=3D0xf<XSAVEOPT,XSAVEC,XINUSE,XSAVES>
  AMD Extended Feature Extensions ID
EBX=3D0x91bef75f<CLZERO,IRPerf,XSaveErPtr,RDPRU,MCOMMIT,WBNOINVD,IBPB,IBRS,=
STIBP,STIBP_ALWAYSON,PREFER_IBRS,PPIN,SSBD>
  SVM: NP,NRIP,VClean,AFlush,DAssist,NAsids=3D32768
  TSC: P-state invariant, performance statistics
real memory  =3D 277021196288 (264188 MB)
avail memory =3D 266164191232 (253833 MB)

disk info:

root@:~ # gpart show
=3D>        6  233308149  nvd0  GPT  (890G)
          6  233308149        - free -  (890G)

=3D>        6  233308149  diskid/DISK-S61ANA0R100142  GPT  (890G)
          6  233308149                              - free - (890G)

=3D>       34  937571901  ada0  GPT  (447G)
         34       2014        - free -  (1.0M)
       2048  937568256     1  ms-basic-data  (447G)
  937570304       1631        - free -  (816K)

=3D>       34  937571901  diskid/DISK-74ff0e529b990010  GPT (447G)
         34       2014                                - free - (1.0M)
       2048  937568256                             1 ms-basic-data  (447G)
  937570304       1631                                - free - (816K)

=3D>        34  3125627501  da0  GPT  (1.5T)
          34        2014       - free -  (1.0M)
        2048     1048576    1  efi  (512M)
     1050624  2929686528    2  linux-data  (1.4T)
  2930737152    62500864    3  linux-swap  (30G)
  2993238016   132389519       - free -  (63G)

=3D>        34  3125627501  da1  GPT  (1.5T)
          34        2014       - free -  (1.0M)
        2048       32768    1  ms-reserved  (16M)
       34816  3125592064    2  ms-basic-data  (1.5T)
  3125626880         655       - free -  (328K)

=3D>        34  3125627501  diskid/DISK-S61ENE0R302148  GPT (1.5T)
          34        2014                              - free - (1.0M)
        2048     1048576                           1  efi  (512M)
     1050624  2929686528                           2  linux-data (1.4T)
  2930737152    62500864                           3  linux-swap (30G)
  2993238016   132389519                              - free - (63G)

=3D>        34  3125627501  diskid/DISK-202229EBAC90  GPT  (1.5T)
          34        2014                            - free - (1.0M)
        2048       32768                         1  ms-reserved (16M)
       34816  3125592064                         2  ms-basic-data (1.5T)
  3125626880         655                            - free - (328K)

=3D>    17  544157  cd0  MBR  (1.0G)
      17  544157       - free -  (1.0G)

=3D>    17  544157  iso9660/14_0_CURRENT_AMD64_CD  MBR  (1.0G)
      17  544157                                 - free -  (1.0G)

--=20
You are receiving this mail because:
You are the assignee for the bug.=



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?bug-271438-227>