Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 04 Feb 2020 11:32:30 +0000
From:      bugzilla-noreply@freebsd.org
To:        bugs@FreeBSD.org
Subject:   [Bug 243867] random fork_trampoline() panic on Dell R540
Message-ID:  <bug-243867-227@https.bugs.freebsd.org/bugzilla/>

next in thread | raw e-mail | index | archive | help
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D243867

            Bug ID: 243867
           Summary: random fork_trampoline() panic on Dell R540
           Product: Base System
           Version: 11.3-RELEASE
          Hardware: amd64
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: kern
          Assignee: bugs@FreeBSD.org
          Reporter: oleg.cherkasov@linux.com

Created attachment 211341
  --> https://bugs.freebsd.org/bugzilla/attachment.cgi?id=3D211341&action=
=3Dedit
stack trace dump from the iDRAC virtual console

Hi,

One of our Dell R540 servers have period issue since few months ago put in
production.  Eventually it reboots every ~25 days with not minidumps or
messages in logs.  The suspicion was the motherboard and it had been replac=
ed
by Dell before Xmas so it helped to keep the server running for more then 40
days and then it happen again, and than again in less than 2 days.

Yesterday the server stopped responding and after quick glance at iDRAC vir=
tual
console it reveal the panic screen, see attached screenshot.  Unfortunately=
 I
failed to scroll the screen up because of iDRAC virtual console.

The swap is 24Gb and Dumping stalled for 10-15 so I had to cold reboot beca=
use
of no actions.

Any ideas if it a hardware or software issue?

The system has been upgraded to 11.3-RELEASE-p6 recently.  It is Dell R540 =
with
128Gb RAM, Dell BOSS NVME RAID0, H730P raid controller with 12 JBOD disks +=
 HBA
connected MD1400 via multipath 12 disks.  2 VDEVs ZFS pool, the system is on
UFS partition on M.2/BOSS flash disk.

/boot/device.hints:

hw.mfi.mrsas_enable=3D"1"


/boot/loader.conf.local:

vm.kmem_size_max=3D130000000000
vm.kmem_size=3D130000000000
vfs.zfs.arc_max=3D128000000000
vm.pmap.pti=3D0
hw.ibrs_disable=3D1
hw.spec_store_bypass_disable=3D1
geom_multipath_load=3D"YES"


The system is 100% NAS with samba 4.10, so jails or VMs or active users.

Appreciate any ideas how to debug or diagnose the issue.

--=20
You are receiving this mail because:
You are the assignee for the bug.=



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?bug-243867-227>