Date: Thu, 18 Aug 2022 09:57:00 -0500 From: Larry Rosenman <ler@lerctr.org> To: Freebsd current <freebsd-current@freebsd.org>, Mark Johnston <markj@freebsd.org> Subject: Re: Hangs in bacula / NFS? on recent Current Message-ID: <9e63f40233fc29c9fbc3fac0cb82acd5@lerctr.org> In-Reply-To: <405b3873b709d42feb438b5b954ecdc2@lerctr.org> References: <405b3873b709d42feb438b5b954ecdc2@lerctr.org>
next in thread | previous in thread | raw e-mail | index | archive | help
On 08/18/2022 9:49 am, Larry Rosenman wrote: > I didn't get all my mail on my bacula backups today (they backup to > NFS mounted TrueNAS). > Also a df hangs. > > Here are procstat -kk's for all: > ler in 🌐 borg in ~ via C v14.0.5-clang on ☁️ (us-east-1) > ❯ ps auxxxwww|grep bacula > bacula 2067 0.0 0.0 63188 13652 - Is 11:30 > 0:17.49 /usr/local/sbin/bacula-sd -u bacula -g bacula -v -c > /usr/local/etc/bacula/bacula-sd.conf > root 2072 0.0 0.0 59280 31276 - Is 11:30 > 0:00.31 /usr/local/sbin/bacula-fd -u root -g wheel -v -c > /usr/local/etc/bacula/bacula-fd.conf > bacula 2075 0.0 0.0 86992 19352 - Is 11:30 > 0:56.95 /usr/local/sbin/bacula-dir -u bacula -g bacula -v -c > /usr/local/etc/bacula/bacula-dir.conf > postgres 50241 0.0 0.1 285764 160244 - Is 23:05 > 0:00.38 postgres: bacula bacula [local] (postgres) > postgres 50244 0.0 0.1 298784 74448 - Ds 23:05 > 0:00.67 postgres: bacula bacula [local] (postgres) > ler 66595 0.0 0.0 12888 2600 3 S+ 09:46 > 0:00.00 grep --color=auto bacula > > ler in 🌐 borg in ~ via C v14.0.5-clang on ☁️ (us-east-1) > ❯ sudo procstat -kk 2067 > PID TID COMM TDNAME KSTACK > 2067 100742 bacula-sd - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 sleepq_wait_sig+0x9 > _cv_wait_sig+0x137 kern_select+0x9fe sys_select+0x56 > amd64_syscall+0x12e fast_syscall_common+0xf8 > 2067 101036 bacula-sd - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 > sleepq_timedwait_sig+0x12 _sleep+0x27d umtxq_sleep+0x242 do_wait+0x26b > __umtx_op_wait_uint_private+0x54 sys__umtx_op+0x7e amd64_syscall+0x12e > fast_syscall_common+0xf8 > 2067 101038 bacula-sd - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 > sleepq_timedwait_sig+0x12 _sleep+0x27d umtxq_sleep+0x242 do_wait+0x26b > __umtx_op_wait_uint_private+0x54 sys__umtx_op+0x7e amd64_syscall+0x12e > fast_syscall_common+0xf8 > 2067 124485 bacula-sd - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 > sleepq_timedwait_sig+0x12 _cv_timedwait_sig_sbt+0x15c > kern_poll_kfds+0x457 kern_poll+0x9f sys_poll+0x50 amd64_syscall+0x12e > fast_syscall_common+0xf8 > > ler in 🌐 borg in ~ via C v14.0.5-clang on ☁️ (us-east-1) > ❯ sudo procstat -kk 2072 > PID TID COMM TDNAME KSTACK > 2072 100677 bacula-fd - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 sleepq_wait_sig+0x9 > _cv_wait_sig+0x137 kern_select+0x9fe sys_select+0x56 > amd64_syscall+0x12e fast_syscall_common+0xf8 > 2072 101039 bacula-fd - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 > sleepq_timedwait_sig+0x12 _sleep+0x27d umtxq_sleep+0x242 do_wait+0x26b > __umtx_op_wait_uint_private+0x54 sys__umtx_op+0x7e amd64_syscall+0x12e > fast_syscall_common+0xf8 > 2072 101040 bacula-fd - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 > sleepq_timedwait_sig+0x12 _sleep+0x27d umtxq_sleep+0x242 do_wait+0x26b > __umtx_op_wait_uint_private+0x54 sys__umtx_op+0x7e amd64_syscall+0x12e > fast_syscall_common+0xf8 > 2072 124490 bacula-fd - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 > sleepq_timedwait_sig+0x12 _cv_timedwait_sig_sbt+0x15c > kern_poll_kfds+0x457 kern_poll+0x9f sys_poll+0x50 amd64_syscall+0x12e > fast_syscall_common+0xf8 > > ler in 🌐 borg in ~ via C v14.0.5-clang on ☁️ (us-east-1) > ❯ sudo procstat -kk 2075 > PID TID COMM TDNAME KSTACK > 2075 101007 bacula-dir - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 sleepq_wait_sig+0x9 > _sleep+0x29b umtxq_sleep+0x242 do_wait+0x26b __umtx_op_wait+0x53 > sys__umtx_op+0x7e amd64_syscall+0x12e fast_syscall_common+0xf8 > 2075 101041 bacula-dir - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 > sleepq_timedwait_sig+0x12 _sleep+0x27d umtxq_sleep+0x242 do_wait+0x26b > __umtx_op_wait_uint_private+0x54 sys__umtx_op+0x7e amd64_syscall+0x12e > fast_syscall_common+0xf8 > 2075 101045 bacula-dir - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 sleepq_wait_sig+0x9 > _cv_wait_sig+0x137 kern_select+0x9fe sys_select+0x56 > amd64_syscall+0x12e fast_syscall_common+0xf8 > 2075 101046 bacula-dir - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 > sleepq_timedwait_sig+0x12 _sleep+0x27d umtxq_sleep+0x242 do_wait+0x26b > __umtx_op_wait_uint_private+0x54 sys__umtx_op+0x7e amd64_syscall+0x12e > fast_syscall_common+0xf8 > 2075 101047 bacula-dir - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 > sleepq_timedwait_sig+0x12 _sleep+0x27d kern_clock_nanosleep+0x1d1 > sys_nanosleep+0x3b amd64_syscall+0x12e fast_syscall_common+0xf8 > 2075 124479 bacula-dir - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 sleepq_wait_sig+0x9 > _cv_wait_sig+0x137 kern_poll_kfds+0x48c kern_poll+0x9f sys_poll+0x50 > amd64_syscall+0x12e fast_syscall_common+0xf8 > 2075 124480 bacula-dir - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 > sleepq_timedwait_sig+0x12 _sleep+0x27d kern_clock_nanosleep+0x1d1 > sys_nanosleep+0x3b amd64_syscall+0x12e fast_syscall_common+0xf8 > 2075 124481 bacula-dir - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 > sleepq_timedwait_sig+0x12 _sleep+0x27d kern_clock_nanosleep+0x1d1 > sys_nanosleep+0x3b amd64_syscall+0x12e fast_syscall_common+0xf8 > 2075 124489 bacula-dir - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 > sleepq_timedwait_sig+0x12 _cv_timedwait_sig_sbt+0x15c > kern_poll_kfds+0x457 kern_poll+0x9f sys_poll+0x50 amd64_syscall+0x12e > fast_syscall_common+0xf8 > 2075 124506 bacula-dir - mi_switch+0x157 > sleepq_switch+0x107 sleepq_catch_signals+0x266 > sleepq_timedwait_sig+0x12 _sleep+0x27d kern_clock_nanosleep+0x1d1 > sys_nanosleep+0x3b amd64_syscall+0x12e fast_syscall_common+0xf8 > > ler in 🌐 borg in ~ via C v14.0.5-clang on ☁️ (us-east-1) > ❯ sudo procstat -kk 66390 > PID TID COMM TDNAME KSTACK > 66390 101514 df - mi_switch+0x157 > sleepq_switch+0x107 sleepq_timedwait+0x4b _sleep+0x28e > clnt_reconnect_call+0x809 newnfs_request+0xa95 nfscl_request+0x5a > nfsrpc_statfs+0x19d nfs_statfs+0x148 vfs_statfs_sigdefer+0x2e > kern_getfsstat+0x1f1 sys_getfsstat+0x22 amd64_syscall+0x12e > fast_syscall_common+0xf8 > > ler in 🌐 borg in ~ via C v14.0.5-clang on ☁️ (us-east-1) > ❯ > > this was built yesterday: > ❯ uname -a > FreeBSD borg.lerctr.org 14.0-CURRENT FreeBSD 14.0-CURRENT #142 > ler/freebsd-main-changes-n257453-175a127a72f: Wed Aug 17 09:23:32 CDT > 2022 > root@borg.lerctr.org:/usr/obj/usr/src/amd64.amd64/sys/LER-MINIMAL > amd64 > > ler in 🌐 borg in ~ via C v14.0.5-clang on ☁️ (us-east-1) > ❯ > > What else do we need? It looks like a BUNCH of processes are hung, including poudriere, and my PostgreSQL buildfarm animal. I can keep it in this state for a bit and give access to <someone> that wants to look. -- Larry Rosenman http://www.lerctr.org/~ler Phone: +1 214-642-9640 E-Mail: ler@lerctr.org US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?9e63f40233fc29c9fbc3fac0cb82acd5>