Date: Fri, 27 Nov 2020 15:07:10 -0800
From: Bakul Shah <bakul@iitbombay.org>
To: Rebecca Cran <rebecca@bsdio.com>
Cc: Hans Petter Selasky <hps@selasky.org>, freebsd-current@freebsd.org, kib@freebsd.org
Subject: Re: panic shortly after boot when amdgpu.ko is loaded (fpu?)
Message-ID: <916B4D57-6C8A-4510-AE29-5E289717CBCA@iitbombay.org>
In-Reply-To: <0075A3F0-C106-4970-B840-0DFAEA29DBC9@iitbombay.org>
References: <2a0f9031-a96d-2989-4d6c-a7691c451b74@bsdio.com> <d19ff5d6-65a8-251a-693b-3ff42b60a252@selasky.org> <40ac5686-aa96-f9e4-7c9c-5dbe628af49a@bsdio.com> <0075A3F0-C106-4970-B840-0DFAEA29DBC9@iitbombay.org>
> On Nov 27, 2020, at 1:47 PM, Bakul Shah <bakul@iitbombay.org> wrote:
>
>
>
>> On Nov 27, 2020, at 9:09 AM, Rebecca Cran <rebecca@bsdio.com> wrote:
>>
>> On 11/27/20 4:29 AM, Hans Petter Selasky wrote:
>>>
>>> Is the problem always triggered by hald? If you disable hald in rc.conf, does the system run for a longer period of time?
>>
>> It turns out that disabling ntpd let the system run for a longer period of time - until I ran "sysctl sys" at which point I got a panic.
>>
>> And this time the panic actually implicates amdgpu.ko, which is an improvement:
>>
>>
>> #9 0x0000000000000000 in ?? ()
>> #10 0xffffffff82a14c4e in amdgpu_device_get_pcie_replay_count ()
>> from /boot/modules/amdgpu.ko
>> #11 0xffffffff82a14b80 in sysctl_handle_attr () from /boot/modules/amdgpu.ko
>>
>> #12 0xffffffff80c06cc1 in sysctl_root_handler_locked (oid=0xfffffe02133ff000,
>> arg1=0xfffffe016e360980, arg2=-8724518803888, req=0xfffffe016e360980,
>> tracker=0xfffff81099af6280) at /usr/src/sys/kern/kern_sysctl.c:184
>> #13 0xffffffff80c0610c in sysctl_root (oidp=<optimized out>,
>> arg1=0xfffff810aa27e650, arg2=-2100190360, req=0xfffffe016e360980)
>> at /usr/src/sys/kern/kern_sysctl.c:2211
>>
>>
>> Since it _is_ a problem in amdgpu, I'll stop this thread and re-post on freebsd-x11.
>
> FWIW, I am using amdgpu on a Ryzen 5 3500U system on a couple days old
> -current (r368025). "sysctl sys" complains about "unknown oid 'sys'".
> I am running hald & ntpd. I had a few amdgpu-related panics initially
> but they vanished once I added
> PORTS_MODULES=graphics/drm-devel-kmod
> to /etc/src.conf to compile it along with the kernel. I am running
> GENERIC-NODEBUG. The machine gets rebooted when I install a new kernel
> (usually once a week).
>
> My guess is some weird interaction rather than something in amdgpu.
To get "sysctl sys" working I compiled a GENERIC kernel from today's
revision (r368108), and so far there are no problems.

$ sysctl sys.device.drmn0.pcie_replay_count
sys.device.drmn0.pcie_replay_count: 0

"sysctl -a" also works.

Last commit log on drm-devel-kmod (the last item may be what you're
running into):

Author: manu <manu@FreeBSD.org>
Date: Mon Nov 9 13:37:12 2020 +0000

    drm-current-kmod/drm-devel-kmod: Update to latest version

    - Use acpi code from base (thanks to wulf@)
    - Add radeon/i386 patches (thanks to tilj@)
    - Translate O_ flags for linuxulator (thanks to Greg V)
    - Lot of linuxkpi cleanup
    - Hack for amdgpu when the IP isn't init properly, this happens
      on one of my laptop with a dGPU. We still don't support it but
      we don't panic when we load amdgpu
