Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 21 Oct 2016 20:57:08 +0200
From:      "O. Hartmann" <ohartman@zedat.fu-berlin.de>
To:        FreeBSD CURRENT <freebsd-current@freebsd.org>
Subject:   r3077XX: LOR in vfs
Message-ID:  <20161021205708.678fe3cc.ohartman@zedat.fu-berlin.de>

next in thread | raw e-mail | index | archive | help
--Sig_/sA7qdw.qFyQLeiDiZRmr+Yk
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: quoted-printable

Since r307157 I'm bugged with sporadic reboots/crashes of all boxes running=
 CURRENT. That
is different CPU types (XEON C2D, XEON Haswell, XEON IvyBridge, mobile Hasw=
ell and
desktop IvyBridge).

Below, you'll finde some messages I gathered from the console. I hope this =
is of any use,
if not: I'm very sorry.

I'm also sorry being incapable of providing you with more informations of t=
he crashes. I
do not have any clue how the crashes are related - I try to configure some =
boxes I have
direct access to with debugging options, but unfortunately they all have nv=
idia GPUs and
configured vt() for the UEFI and non UEFI systems - that means, the console=
 is wreckage
since there is an unresolved buf as long as the nvidia-mode.ko is loaded. S=
o far, the
crashing boxes didn't capture a core.

=46rom the reports I got of the other crashing servers running recent CURRENT=
, last process
in braces is in most cases "(swapper)" or "(pager)" - I got this report via=
 phone, sorry.

Anothe observation is that boxes with larger amounts of RAM last longer - a=
 workstation
with 32 GB and r3076XX lasted for ~ 32 hours befor it crashed, a server wit=
h 16 GB
takes several hours, sometimes only minutes and another machine with 8 GB g=
oes down mor
frequently.

Anyway, try to give some mor infos in case I'm able to capture those. I'm r=
eally
surprised that it seems that I'm the only one facing those problems at the =
moment. =20

[...]
Oct 21 05:09:59 <0.2> thor kernel: lock order reversal:
Oct 21 05:09:59 <0.2> thor kernel: 1st 0xfffffe01e98449f8 bufwait (bufwait)
@ /usr/src/sys/kern/vfs_bio.c:3505 Oct 21 05:09:59 <0.2> thor kernel: 2nd
0xfffff800098c2800 dirhash (dirhash) @ /usr/src/sys/ufs/ufs/ufs_dirhash.c:2=
81 Oct 21
05:09:59 <0.2> thor kernel: stack backtrace: Oct 21 05:09:59 <0.2> thor ker=
nel: #0
0xffffffff80818410 at witness_debugger+0x70 Oct 21 05:09:59 <0.2> thor kern=
el: #1
0xffffffff8081830d at witness_checkorder+0xf0d Oct 21 05:09:59 <0.2> thor k=
ernel: #2
0xffffffff807bbdc2 at _sx_xlock+0x72 Oct 21 05:09:59 <0.2> thor kernel: #3
0xffffffff80a8b507 at ufsdirhash_remove+0x37 Oct 21 05:09:59 <0.2> thor ker=
nel: #4
0xffffffff80a8e760 at ufs_dirremove+0x120 Oct 21 05:09:59 <0.2> thor kernel=
: #5
0xffffffff80a961a8 at ufs_rename+0x14a8 Oct 21 05:09:59 <0.2> thor kernel: =
#6
0xffffffff80b6ce88 at VOP_RENAME_APV+0xe8 Oct 21 05:09:59 <0.2> thor kernel=
: #7
0xffffffff80891fa8 at kern_renameat+0x4a8 Oct 21 05:09:59 <0.2> thor kernel=
: #8
0xffffffff80af6934 at amd64_syscall+0x314 Oct 21 05:09:59 <0.2> thor kernel=
: #9
0xffffffff80ad6f1b at Xfast_syscall+0xfb Oct 21 05:10:00 <3.3> thor nslcd[5=
43]: [d0722e]
<group/member=3D"root"> no available LDAP server found: Server is unavailab=
le: Resource
temporarily unavailable Oct 21 05:10:00 <3.3> thor nslcd[543]: [d0722e]
<group/member=3D"root"> no available LDAP server found: Server is unavailab=
le: Resource
temporarily unavailable Oct 21 05:10:00 <0.2> thor kernel: acquiring duplic=
ate lock of
same type: "os.lock_sx" Oct 21 05:10:00 <0.2> thor kernel: 1st os.lock_sx @
nvidia_os.c:599 Oct 21 05:10:00 <0.2> thor kernel: 2nd os.lock_sx @ nvidia_=
os.c:599 Oct
21 05:10:00 <0.2> thor kernel: stack backtrace: Oct 21 05:10:00 <0.2> thor =
kernel: #0
0xffffffff80818410 at witness_debugger+0x70 Oct 21 05:10:00 <0.2> thor kern=
el: #1
0xffffffff8081830d at witness_checkorder+0xf0d Oct 21 05:10:00 <0.2> thor k=
ernel: #2
0xffffffff807bbdc2 at _sx_xlock+0x72 Oct 21 05:10:00 <0.2> thor kernel: #3
0xffffffff81e792b2 at os_acquire_mutex+0x32 Oct 21 05:10:00 <0.2> thor kern=
el: #4
0xffffffff81e48bc8 at _nv017102rm+0x18 Oct 21 05:10:00 <0.2> thor kernel: a=
cquiring
duplicate lock of same type: "os.lock_mtx" Oct 21 05:10:00 <0.2> thor kerne=
l: 1st
os.lock_mtx @ nvidia_os.c:778 Oct 21 05:10:00 <0.2> thor kernel: 2nd os.loc=
k_mtx @
nvidia_os.c:778 Oct 21 05:10:00 <0.2> thor kernel: stack backtrace: Oct 21 =
05:10:00 <0.2>
thor kernel: #0 0xffffffff80818410 at witness_debugger+0x70 Oct 21 05:10:00=
 <0.2> thor
kernel: #1 0xffffffff8081830d at witness_checkorder+0xf0d Oct 21 05:10:00 <=
0.2> thor
kernel: #2 0xffffffff80792604 at __mtx_lock_flags+0xa4 Oct 21 05:10:00 <0.2=
> thor kernel:
#3 0xffffffff81e7962b at os_acquire_spinlock+0x1b Oct 21 05:10:00 <0.2> tho=
r kernel: #4
0xffffffff81b97705 at _nv011755rm+0x185 Oct 21 05:10:00 <0.6> thor kernel:
nvidia-modeset: Allocated GPU:0 (GPU-85fde95a-7974-9962-f1a4-d7c164413929) @
PCI:0000:01:00.0

[...]

Oct 21 05:23:19 <0.2> thor kernel: lock order reversal:
Oct 21 05:23:19 <0.2> thor kernel: 1st 0xfffff8017ee32d50 ufs (ufs)
@ /usr/src/sys/kern/vfs_lookup.c:522 Oct 21 05:23:19 <0.2> thor kernel: 2nd
0xfffffe01ea131fa8 bufwait (bufwait) @ /usr/src/sys/ufs/ffs/ffs_vnops.c:278=
 Oct 21
05:23:19 <0.2> thor kernel: 3rd 0xfffff8017eebe5f0 ufs (ufs)
@ /usr/src/sys/kern/vfs_subr.c:2592 Oct 21 05:23:19 <0.2> thor kernel: stac=
k backtrace:
Oct 21 05:23:19 <0.2> thor kernel: #0 0xffffffff80818410 at witness_debugge=
r+0x70 Oct 21
05:23:19 <0.2> thor kernel: #1 0xffffffff8081830d at witness_checkorder+0xf=
0d Oct 21
05:23:19 <0.2> thor kernel: #2 0xffffffff80788e5f at __lockmgr_args+0x4df O=
ct 21 05:23:19
<0.2> thor kernel: #3 0xffffffff80a855f6 at ffs_lock+0xa6 Oct 21 05:23:19 <=
0.2> thor
kernel: #4 0xffffffff80b6d9f0 at VOP_LOCK1_APV+0xe0 Oct 21 05:23:19 <0.2> t=
hor kernel: #5
0xffffffff808955ca at _vn_lock+0x9a Oct 21 05:23:19 <0.2> thor kernel: #6
0xffffffff80884eb4 at vget+0x64 Oct 21 05:23:19 <0.2> thor kernel: #7 0xfff=
fffff80876d7e
at vfs_hash_get+0xfe Oct 21 05:23:19 <0.2> thor kernel: #8 0xffffffff80a812=
20 at
ffs_vgetf+0x40 Oct 21 05:23:19 <0.2> thor kernel: #9 0xffffffff80a76dee at
softdep_sync_buf+0xd4e Oct 21 05:23:19 <0.2> thor kernel: #10 0xffffffff80a=
86973 at
ffs_syncvnode+0x2a3 Oct 21 05:23:19 <0.2> thor kernel: #11 0xffffffff80a59f=
d2 at
ffs_truncate+0x7d2 Oct 21 05:23:19 <0.2> thor kernel: #12 0xffffffff80a8e24=
b at
ufs_direnter+0x68b Oct 21 05:23:19 <0.2> thor kernel: #13 0xffffffff80a978b=
d at
ufs_makeinode+0x59d Oct 21 05:23:19 <0.2> thor kernel: #14 0xffffffff80a935=
c3 at
ufs_create+0x33 Oct 21 05:23:19 <0.2> thor kernel: #15 0xffffffff80b6b2da at
VOP_CREATE_APV+0xda Oct 21 05:23:19 <0.2> thor kernel: #16 0xffffffff80894d=
e3 at
vn_open_cred+0x573 Oct 21 05:23:19 <0.2> thor kernel: #17 0xffffffff8088d52=
c at
kern_openat+0x25c

[...]

Oct 21 20:16:57 <0.2> thor kernel: All buffers synced.
Oct 21 20:16:57 <0.2> thor kernel: lock order reversal:
Oct 21 20:16:57 <0.2> thor kernel: 1st 0xfffff800080419a0 ufs (ufs)
@ /usr/src/sys/kern/vfs_mount.c:1253 Oct 21 20:16:57 <0.2> thor kernel: 2nd
0xfffff80004f2a240 syncer (syncer) @ /usr/src/sys/kern/vfs_subr.c:2732 Oct =
21 20:16:57
<0.2> thor kernel: stack backtrace: Oct 21 20:16:57 <0.2> thor kernel: #0
0xffffffff80818410 at witness_debugger+0x70 Oct 21 20:16:57 <0.2> thor kern=
el: #1
0xffffffff8081830d at witness_checkorder+0xf0d Oct 21 20:16:57 <0.2> thor k=
ernel: #2
0xffffffff80788e5f at __lockmgr_args+0x4df Oct 21 20:16:57 <0.2> thor kerne=
l: #3
0xffffffff8087318c at vop_stdlock+0x3c Oct 21 20:16:57 <0.2> thor kernel: #4
0xffffffff80b6d9f0 at VOP_LOCK1_APV+0xe0 Oct 21 20:16:57 <0.2> thor kernel:=
 #5
0xffffffff808955ca at _vn_lock+0x9a Oct 21 20:16:57 <0.2> thor kernel: #6
0xffffffff80885539 at vputx+0x169 Oct 21 20:16:57 <0.2> thor kernel: #7
0xffffffff8087cfeb at dounmount+0x5eb Oct 21 20:16:57 <0.2> thor kernel: #8
0xffffffff80886ccb at vfs_unmountall+0x6b Oct 21 20:16:57 <0.2> thor kernel=
: #9
0xffffffff80863441 at bufshutdown+0x421 Oct 21 20:16:57 <0.2> thor kernel: =
#10
0xffffffff807b2fc9 at kern_reboot+0x189 Oct 21 20:16:57 <0.2> thor kernel: =
#11
0xffffffff807b2de3 at sys_reboot+0x3e3 Oct 21 20:16:57 <0.2> thor kernel: #=
12
0xffffffff80af6934 at amd64_syscall+0x314 Oct 21 20:16:57 <0.2> thor kernel=
: #13
0xffffffff80ad6f1b at Xfast_syscall+0xfb Oct 21 20:16:57 <0.2> thor kernel:=
 lock order
reversal: Oct 21 20:16:57 <0.2> thor kernel: 1st 0xfffff80008166240 devfs (=
devfs)
@ /usr/src/sys/kern/vfs_mount.c:1253 Oct 21 20:16:57 <0.2> thor kernel: 2nd
0xfffff80008165d50 syncer (syncer) @ /usr/src/sys/kern/vfs_subr.c:2732 Oct =
21 20:16:57
<0.2> thor kernel: stack backtrace: Oct 21 20:16:57 <0.2> thor kernel: #0
0xffffffff80818410 at witness_debugger+0x70 Oct 21 20:16:57 <0.2> thor kern=
el: #1
0xffffffff8081830d at witness_checkorder+0xf0d Oct 21 20:16:57 <0.2> thor k=
ernel: #2
0xffffffff80788e5f at __lockmgr_args+0x4df Oct 21 20:16:57 <0.2> thor kerne=
l: #3
0xffffffff8087318c at vop_stdlock+0x3c Oct 21 20:16:57 <0.2> thor kernel: #4
0xffffffff80b6d9f0 at VOP_LOCK1_APV+0xe0 Oct 21 20:16:57 <0.2> thor kernel:=
 #5
0xffffffff808955ca at _vn_lock+0x9a Oct 21 20:16:57 <0.2> thor kernel: #6
0xffffffff80885539 at vputx+0x169 Oct 21 20:16:57 <0.2> thor kernel: #7
0xffffffff8087cfeb at dounmount+0x5eb Oct 21 20:16:57 <0.2> thor kernel: #8
0xffffffff80886ccb at vfs_unmountall+0x6b Oct 21 20:16:57 <0.2> thor kernel=
: #9
0xffffffff80863441 at bufshutdown+0x421 Oct 21 20:16:57 <0.2> thor kernel: =
#10
0xffffffff807b2fc9 at kern_reboot+0x189 Oct 21 20:16:57 <0.2> thor kernel: =
#11
0xffffffff807b2de3 at sys_reboot+0x3e3 Oct 21 20:16:57 <0.2> thor kernel: #=
12
0xffffffff80af6934 at amd64_syscall+0x314 Oct 21 20:16:57 <0.2> thor kernel=
: #13
0xffffffff80ad6f1b at Xfast_syscall+0xfb Oct 21 20:16:57 <0.2> thor kernel:=
 lock order
reversal: Oct 21 20:16:57 <0.2> thor kernel: 1st 0xfffff800081669a0 ufs (uf=
s)
@ /usr/src/sys/kern/vfs_mount.c:1253 Oct 21 20:16:57 <0.2> thor kernel: 2nd
0xfffff80004f2b068 devfs (devfs) @ /usr/src/sys/ufs/ffs/ffs_vfsops.c:1386 O=
ct 21 20:16:57
<0.2> thor kernel: stack backtrace: Oct 21 20:16:57 <0.2> thor kernel: #0
0xffffffff80818410 at witness_debugger+0x70 Oct 21 20:16:57 <0.2> thor kern=
el: #1
0xffffffff8081830d at witness_checkorder+0xf0d Oct 21 20:16:57 <0.2> thor k=
ernel: #2
0xffffffff80788e5f at __lockmgr_args+0x4df Oct 21 20:16:57 <0.2> thor kerne=
l: #3
0xffffffff8087318c at vop_stdlock+0x3c Oct 21 20:16:57 <0.2> thor kernel: #4
0xffffffff80b6d9f0 at VOP_LOCK1_APV+0xe0 Oct 21 20:16:57 <0.2> thor kernel:=
 #5
0xffffffff808955ca at _vn_lock+0x9a Oct 21 20:16:57 <0.2> thor kernel: #6
0xffffffff80a81168 at ffs_flushfiles+0x118 Oct 21 20:16:57 <0.2> thor kerne=
l: #7
0xffffffff80a62aa9 at softdep_flushfiles+0x569 Oct 21 20:16:57 <0.2> thor k=
ernel: #8
0xffffffff80a837a2 at ffs_unmount+0x82 Oct 21 20:16:57 <0.2> thor kernel: #9
0xffffffff8087d10d at dounmount+0x70d Oct 21 20:16:57 <0.2> thor kernel: #10
0xffffffff80886ccb at vfs_unmountall+0x6b Oct 21 20:16:57 <0.2> thor kernel=
: #11
0xffffffff80863441 at bufshutdown+0x421 Oct 21 20:16:57 <0.2> thor kernel: =
#12
0xffffffff807b2fc9 at kern_reboot+0x189 Oct 21 20:16:57 <0.2> thor kernel: =
#13
0xffffffff807b2de3 at sys_reboot+0x3e3 Oct 21 20:16:57 <0.2> thor kernel: #=
14
0xffffffff80af6934 at amd64_syscall+0x314 Oct 21 20:16:57 <0.2> thor kernel=
: #15
0xffffffff80ad6f1b at Xfast_syscall+0xfb

--Sig_/sA7qdw.qFyQLeiDiZRmr+Yk
Content-Type: application/pgp-signature
Content-Description: OpenPGP digital signature

-----BEGIN PGP SIGNATURE-----

iQEcBAEBCAAGBQJYCmUEAAoJEOgBcD7A/5N85HIIAOH0Odzs3lGKoiSBaqW7eD0q
IgzcmQu2j5nqvm3h/J7oCUc+hplF82tOHZQnZAr3qI4RVOHkH+3WFhIknxO66/vb
rovvT2WA+qwrGTFHvwPCB9u004IKpqZA0Hz7zv/Z+YLVjWE9/VcrW3QmLxPzYQvV
fGvOeV1YsKM/5QkQo96+4KiqJnP+qdzjR0i4u5XSlD8dOzyHqfR6r93eUNnDraE4
iOvdGHDCvouuTXnQgsZhSfNtFv/eB69A5Kg+DoI9QVv7C5oH3XM+0UCS1bnbTyOc
179CCTjQlcg6/AHlsynSA5KUAsUgu8wftwMeppgpO3wnWQSK5DU2jfBc0KBUWpo=
=Cooa
-----END PGP SIGNATURE-----

--Sig_/sA7qdw.qFyQLeiDiZRmr+Yk--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20161021205708.678fe3cc.ohartman>