Skip site navigation (1)Skip section navigation (2)
Date:      Sat, 30 Apr 2016 01:39:56 +0000
From:      bugzilla-noreply@freebsd.org
To:        freebsd-bugs@FreeBSD.org
Subject:   [Bug 209158] node / npm triggering zfs rename deadlock
Message-ID:  <bug-209158-8@https.bugs.freebsd.org/bugzilla/>

next in thread | raw e-mail | index | archive | help
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D209158

            Bug ID: 209158
           Summary: node / npm triggering zfs rename deadlock
           Product: Base System
           Version: 11.0-CURRENT
          Hardware: Any
                OS: Any
            Status: New
          Severity: Affects Some People
          Priority: ---
         Component: kern
          Assignee: freebsd-bugs@FreeBSD.org
          Reporter: peter@FreeBSD.org

The freebsd.org cluster package builder has been deadlocking regularly with
things like this:

root@bob.nyi:/home/peter # ps -axHl -O lwp | grep node
    0 92502 92499   0   52  0 771080 197588 kqread   I+J   1       0:07.57 =
npm
(node)       92502 101429  1  I+J       0:07.57 npm (node)
    0 92502 92499   0   22  0 771080 197588 usem     I+J   1       0:00.11 =
npm
(node)       92502 101858  1  I+J       0:00.11 npm (node)
    0 92502 92499   0   22  0 771080 197588 usem     I+J   1       0:00.11 =
npm
(node)       92502 101859  1  I+J       0:00.11 npm (node)
    0 92502 92499   0   22  0 771080 197588 usem     I+J   1       0:00.11 =
npm
(node)       92502 101860  1  I+J       0:00.11 npm (node)
    0 92502 92499   0   22  0 771080 197588 usem     I+J   1       0:00.10 =
npm
(node)       92502 101861  1  I+J       0:00.10 npm (node)
    0 92502 92499   0   28  0 771080 197588 zfs      D+J   1       0:00.89 =
npm
(node)       92502 101862  1  D+J       0:00.89 npm (node)
    0 92502 92499   0   28  0 771080 197588 zfs      D+J   1       0:00.90 =
npm
(node)       92502 101863  1  D+J       0:00.90 npm (node)
    0 92502 92499   0   28  0 771080 197588 zfs      D+J   1       0:00.89 =
npm
(node)       92502 101864  1  D+J       0:00.89 npm (node)
    0 92502 92499   0   28  0 771080 197588 zfs      D+J   1       0:00.91 =
npm
(node)       92502 101865  1  D+J       0:00.91 npm (node)
    0 84004 81177   0   20  0  14864   2532 piperd   S+    5       0:00.00 =
grep
node        84004 101732  5  S+        0:00.00 grep node
root@bob.nyi:/home/peter # kgdb


(kgdb) tid 101862
[Switching to thread 580 (Thread 101862)]#0  0xffffffff807e9f83 in sched_sw=
itch
()
(kgdb) where
#0  0xffffffff807e9f83 in sched_switch ()
#1  0xffffffff807c3f88 in mi_switch ()
#2  0xffffffff8080be0a in sleepq_wait ()
#3  0xffffffff807941d5 in sleeplk ()
#4  0xffffffff8079350f in __lockmgr_args ()
#5  0xffffffff808710bc in vop_stdlock ()
#6  0xffffffff80b9b64f in VOP_LOCK1_APV ()
#7  0xffffffff808953c3 in _vn_lock ()
#8  0xffffffff803cb542 in zfs_lookup ()
#9  0xffffffff803cb86d in zfs_freebsd_lookup ()
#10 0xffffffff80b99810 in VOP_CACHEDLOOKUP_APV ()
#11 0xffffffff8086dc56 in vfs_cache_lookup ()
#12 0xffffffff80b99710 in VOP_LOOKUP_APV ()
#13 0xffffffff80877131 in lookup ()
#14 0xffffffff80876811 in namei ()
#15 0xffffffff80891da1 in kern_renameat ()
#16 0xffffffff80af24a8 in amd64_syscall ()
#17 0xffffffff80ad69bb in Xfast_syscall ()

(kgdb) tid 101863
[Switching to thread 579 (Thread 101863)]#0  0xffffffff807e9f83 in sched_sw=
itch
()
(kgdb) where
#0  0xffffffff807e9f83 in sched_switch ()
#1  0xffffffff807c3f88 in mi_switch ()
#2  0xffffffff8080be0a in sleepq_wait ()
#3  0xffffffff807941d5 in sleeplk ()
#4  0xffffffff8079350f in __lockmgr_args ()
#5  0xffffffff808710bc in vop_stdlock ()
#6  0xffffffff80b9b64f in VOP_LOCK1_APV ()
#7  0xffffffff808953c3 in _vn_lock ()
#8  0xffffffff808839b7 in vget ()
#9  0xffffffff8086c3d8 in cache_lookup ()
#10 0xffffffff8086dc2c in vfs_cache_lookup ()
#11 0xffffffff80b99710 in VOP_LOOKUP_APV ()
#12 0xffffffff80877131 in lookup ()
#13 0xffffffff80876811 in namei ()
#14 0xffffffff80891da1 in kern_renameat ()
#15 0xffffffff80af24a8 in amd64_syscall ()
#16 0xffffffff80ad69bb in Xfast_syscall ()


(kgdb) tid 101864
[Switching to thread 578 (Thread 101864)]#0  0xffffffff807e9f83 in sched_sw=
itch
()
(kgdb) where
#0  0xffffffff807e9f83 in sched_switch ()
#1  0xffffffff807c3f88 in mi_switch ()
#2  0xffffffff8080be0a in sleepq_wait ()
#3  0xffffffff807941d5 in sleeplk ()
#4  0xffffffff80793d35 in __lockmgr_args ()
#5  0xffffffff808710bc in vop_stdlock ()
#6  0xffffffff80b9b64f in VOP_LOCK1_APV ()
#7  0xffffffff808953c3 in _vn_lock ()
#8  0xffffffff808840d8 in vputx ()
#9  0xffffffff803d2303 in zfs_freebsd_rename ()
#10 0xffffffff80b9adbf in VOP_RENAME_APV ()
#11 0xffffffff80892198 in kern_renameat ()
#12 0xffffffff80af24a8 in amd64_syscall ()
#13 0xffffffff80ad69bb in Xfast_syscall ()

(kgdb) tid 101865
[Switching to thread 577 (Thread 101865)]#0  0xffffffff807e9f83 in sched_sw=
itch
()
(kgdb) where
#0  0xffffffff807e9f83 in sched_switch ()
#1  0xffffffff807c3f88 in mi_switch ()
#2  0xffffffff8080be0a in sleepq_wait ()
#3  0xffffffff807941d5 in sleeplk ()
#4  0xffffffff8079350f in __lockmgr_args ()
#5  0xffffffff808710bc in vop_stdlock ()
#6  0xffffffff80b9b64f in VOP_LOCK1_APV ()
#7  0xffffffff808953c3 in _vn_lock ()
#8  0xffffffff808839b7 in vget ()
#9  0xffffffff8086c3d8 in cache_lookup ()
#10 0xffffffff8086dc2c in vfs_cache_lookup ()
#11 0xffffffff80b99710 in VOP_LOOKUP_APV ()
#12 0xffffffff80877131 in lookup ()
#13 0xffffffff80876811 in namei ()
#14 0xffffffff80891da1 in kern_renameat ()
#15 0xffffffff80af24a8 in amd64_syscall ()
#16 0xffffffff80ad69bb in Xfast_syscall ()
(kgdb)=20

Build info is:
FreeBSD bob.nyi.freebsd.org 11.0-CURRENT FreeBSD 11.0-CURRENT #0 r296980: T=
hu
Mar 17 07:14:40 UTC 2016=20

This is a relatively new development.  This used to build robustly for a wh=
ile
but we've been having lockups reliably for a few weeks.

--=20
You are receiving this mail because:
You are the assignee for the bug.=



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?bug-209158-8>