Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 7 Nov 2019 20:53:17 -0500
From:      Kurt Hackenberg <kh@panix.com>
To:        freebsd-x11@freebsd.org
Subject:   AMD Ryzen Vega problem
Message-ID:  <24004.51853.331522.684781@rain.home>

next in thread | raw e-mail | index | archive | help
Hello people.

FreeBSD graphics seems to have a problem on my desktop PC.  I hope
somebody here will have some ideas about this.

A recent build of 13.0-CURRENT hung silently during boot, apparently
shortly after loading and starting amdgpu.  I saw some messages from
drm on the console, then everything stopped.

A build from October 24 (r353967) worked much better, but had at least
one problem, which looks serious.  Details of that are at the end of
this message.

Both builds seem to work fine when amdgpu is not loaded (though with
no /dev/dri, and presumably no hardware graphics acceleration).

The PC is an HP Pavilion 590-p0025.  The processor is an AMD Ryzen 3
2200G, with an integrated graphics processor -- "Radeon Vega 8
graphics".  FreeBSD says the processor is this:

CPU: AMD Ryzen 3 2200G with Radeon Vega Graphics     (3493.53-MHz K8-class CPU)
  Origin="AuthenticAMD"  Id=0x810f10  Family=0x17  Model=0x11  Stepping=0
  Features=0x178bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2,HTT>
  Features2=0x7ed8320b<SSE3,PCLMULQDQ,MON,SSSE3,FMA,CX16,SSE4.1,SSE4.2,MOVBE,POPCNT,AESNI,XSAVE,OSXSAVE,AVX,F16C,RDRAND>
  AMD Features=0x2e500800<SYSCALL,NX,MMX+,FFXSR,Page1GB,RDTSCP,LM>
  AMD Features2=0x35c233ff<LAHF,CMP,SVM,ExtAPIC,CR8,ABM,SSE4A,MAS,Prefetch,OSVW,SKINIT,WDT,TCE,Topology,PCXC,PNXC,DBE,PL2I,MWAITX>
  Structured Extended Features=0x209c01a9<FSGSBASE,BMI1,AVX2,SMEP,BMI2,RDSEED,ADX,SMAP,CLFLUSHOPT,SHA>
  XSAVE Features=0xf<XSAVEOPT,XSAVEC,XINUSE,XSAVES>
  AMD Extended Feature Extensions ID EBX=0x1007<CLZERO,IRPerf,XSaveErPtr,IBPB>
  SVM: (disabled in BIOS) NP,NRIP,VClean,AFlush,DAssist,NAsids=32768
  TSC: P-state invariant, performance statistics

The software is this:

~$ uname -a
FreeBSD rain.home 13.0-CURRENT FreeBSD 13.0-CURRENT #0 r354368: Wed Nov  6 02:19:37 EST 2019     root@rain.home:/usr/obj/usr/src/amd64.amd64/sys/GENERIC  amd64
~$ pkg info 'drm*'
drm-devel-kmod-5.0.g20191023
~$ pkg info '*firmware*'
gpu-firmware-kmod-g20191015
~$

-----------------------------------

The version that mostly worked was an earlier build of FreeBSD, but
with the same graphics processor software:

~$ uname -a
FreeBSD rain.home 13.0-CURRENT FreeBSD 13.0-CURRENT #2 r353967: Thu Oct 24 01:53:28 EDT 2019     root@rain.home:/usr/obj/usr/src/amd64.amd64/sys/GENERIC  amd64
~$ pkg info 'drm*'
drm-devel-kmod-5.0.g20191023
~$ pkg info '*firmware*'
gpu-firmware-kmod-g20191015
~$

With this software, when mpv plays a short video, a pair of errors
happens five times in a row:

Oct 24 19:09:53 rain kernel: uma_zalloc_arg: zone "128" with the following non-sleepable locks held:
Oct 24 19:09:53 rain kernel: exclusive rw vm object (vm object) r = 0 (0xfffff801dac97500) locked @ /usr/src/sys/vm/vm_fault.c:767
Oct 24 19:09:53 rain kernel: stack backtrace:
Oct 24 19:09:53 rain kernel: #0 0xffffffff80c2cab1 at witness_debugger+0x71
Oct 24 19:09:53 rain kernel: #1 0xffffffff80c2dab0 at witness_warn+0x430
Oct 24 19:09:53 rain kernel: #2 0xffffffff80ef4a2b at uma_zalloc_arg+0x3b
Oct 24 19:09:53 rain kernel: #3 0xffffffff80b98f19 at malloc+0x99
Oct 24 19:09:53 rain kernel: #4 0xffffffff8261c5ad at linux_alloc_current+0x3d
Oct 24 19:09:53 rain kernel: #5 0xffffffff8261bdb7 at linux_cdev_pager_populate+0x237
Oct 24 19:09:53 rain kernel: #6 0xffffffff80efc131 at vm_fault+0x5c1
Oct 24 19:09:53 rain kernel: #7 0xffffffff80efba1e at vm_fault_trap+0x6e
Oct 24 19:09:53 rain kernel: #8 0xffffffff81033c05 at trap_pfault+0x205
Oct 24 19:09:53 rain kernel: #9 0xffffffff81033253 at trap+0x453
Oct 24 19:09:53 rain kernel: #10 0xffffffff81009f9c at calltrap+0x8
Oct 24 19:09:53 rain kernel: uma_zalloc_arg: zone "64" with the following non-sleepable locks held:
Oct 24 19:09:53 rain kernel: exclusive rw vm object (vm object) r = 0 (0xfffff801dac97500) locked @ /usr/src/sys/vm/vm_fault.c:767
Oct 24 19:09:53 rain kernel: stack backtrace:
Oct 24 19:09:53 rain kernel: #0 0xffffffff80c2cab1 at witness_debugger+0x71
Oct 24 19:09:53 rain kernel: #1 0xffffffff80c2dab0 at witness_warn+0x430
Oct 24 19:09:53 rain kernel: #2 0xffffffff80ef4a2b at uma_zalloc_arg+0x3b
Oct 24 19:09:53 rain kernel: #3 0xffffffff80b98f19 at malloc+0x99
Oct 24 19:09:53 rain kernel: #4 0xffffffff8261c5d2 at linux_alloc_current+0x62
Oct 24 19:09:53 rain kernel: #5 0xffffffff8261bdb7 at linux_cdev_pager_populate+0x237
Oct 24 19:09:53 rain kernel: #6 0xffffffff80efc131 at vm_fault+0x5c1
Oct 24 19:09:53 rain kernel: #7 0xffffffff80efba1e at vm_fault_trap+0x6e
Oct 24 19:09:53 rain kernel: #8 0xffffffff81033c05 at trap_pfault+0x205
Oct 24 19:09:53 rain kernel: #9 0xffffffff81033253 at trap+0x453
Oct 24 19:09:53 rain kernel: #10 0xffffffff81009f9c at calltrap+0x8

(and four more times)

and then this happens:

Oct 24 19:10:05 rain kernel: [drm ERROR :drm_sched_entity_flush] ==========> BUG: entity->rq->sched is NULL
Oct 24 19:10:05 rain syslogd: last message repeated 5 times

Eventually the system hangs or crashes, though I'm not sure that's
caused by this specific error.



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?24004.51853.331522.684781>