Date: Wed, 8 Sep 2010 16:07:44 GMT
From: Rich Ercolani <rercola@acm.jhu.edu>
To: freebsd-gnats-submit@FreeBSD.org
Subject: misc/150390: zfs deadlock when arcmsr reports drive faulted
Message-ID: <201009081607.o88G7i7U062316@www.freebsd.org>
Resent-Message-ID: <201009081610.o88GA2NO070900@freefall.freebsd.org>

>Number:         150390
>Category:       misc
>Synopsis:       zfs deadlock when arcmsr reports drive faulted
>Confidential:   no
>Severity:       serious
>Priority:       low
>Responsible:    freebsd-bugs
>State:          open
>Quarter:
>Keywords:
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Wed Sep 08 16:10:02 UTC 2010
>Closed-Date:
>Last-Modified:
>Originator:     Rich Ercolani
>Release:        8.1
>Organization:
JHU ACM
>Environment:
FreeBSD manticore.acm.jhu.edu 8.1-STABLE FreeBSD 8.1-STABLE #4 r211397M: Mon Aug 16 18:47:31 EDT 2010 root@manticore.acm.jhu.edu:/usr/obj/usr/local/ncvs/src/sys/DTRACE amd64
>Description:
System deadlocks 100% reliably when a disk is reported FAULTED in the arcmsr card.

dmesg looks like:
arcmsr0:block 'read/write' command with gone raid volume Cmd= 8, TargetId=1, Lun=5
arcmsr0:block 'read/write' command with gone raid volume Cmd= 8, TargetId=1, Lun=5
arcmsr0:block 'read/write' command with gone raid volume Cmd= 8, TargetId=1, Lun=5

zpool and zfs-related commands, and all IO to the affected pool, hang forever in state D.

procstat reports:
[root@manticore ~]# ps aux | grep zpool
stump  3287  0.0  0.0 15700  1540   0  D+   12:03PM   0:00.00 zpool status
root   3286  0.0  0.0 15700  1528   1  T+   12:03PM   0:00.00 zpool status
root   3316  0.0  0.0  9120  1164   3  S+   12:07PM   0:00.00 grep zpool
[root@manticore ~]# procstat -k 3286
  PID    TID COMM             TDNAME           KSTACK
 3286 100484 zpool            -                mi_switch sleepq_wait _cv_wait spa_config_enter spa_config_generate spa_open_common spa_get_stats zfs_ioc_pool_stats zfsdev_ioctl devfs_ioctl_f kern_ioctl ioctl syscall Xfast_syscall
[root@manticore ~]# procstat -k 3287
  PID    TID COMM             TDNAME           KSTACK
 3287 100532 zpool            -                mi_switch sleepq_wait _cv_wait spa_config_enter spa_config_generate spa_open_common spa_get_stats zfs_ioc_pool_stats zfsdev_ioctl devfs_ioctl_f kern_ioctl ioctl syscall Xfast_syscall
>How-To-Repeat:
1) Have a disk fault on an arcmsr card.
2) Hang!
>Fix:

>Release-Note:
>Audit-Trail:
>Unformatted: