Date: Mon, 22 Oct 2012 17:15:46 +0200 From: Attila Nagy <bra@fsn.hu> To: Dennis Glatting <freebsd@pki2.com> Cc: freebsd-fs@FreeBSD.org Subject: Re: ZFS HBAs + LSI chip sets (Was: ZFS hang (system #2)) Message-ID: <50856322.9070307@fsn.hu> In-Reply-To: <1350778257.86715.106.camel@btw.pki2.com> References: <1350698905.86715.33.camel@btw.pki2.com> <1350711509.86715.59.camel@btw.pki2.com> <50825598.3070505@FreeBSD.org> <1350744349.88577.10.camel@btw.pki2.com> <1350765093.86715.69.camel@btw.pki2.com> <508322EC.4080700@FreeBSD.org> <1350778257.86715.106.camel@btw.pki2.com>
next in thread | previous in thread | raw e-mail | index | archive | help
Hi, On 10/21/2012 02:10 AM, Dennis Glatting wrote: > I chosen the LSI2008 chip set because the code was donated by LSI, and > they therefore demonstrated interest in supporting their products under > FreeBSD, and that chip set is found in a lot of places, notably > Supermicro boards. Additionally, there were stories of success on the > lists for several boards. That said, I have received private email from > others expressing frustration with ZFS and the "hang" problems, which I > believe are also the LSI chips. > I have a Sun X4540, which shows similar symptoms. It has some (6) on-board LSI 1068E SAS controllers with 1.27.02.00-IT firmware (latest from Sun/Oracle) and 48 SATA disks. It runs stable/9@r240134. Currently the machine does a resilver on its 48 disk pool (heavy IO happens), which stops periodically. I've set up watchdogd with a command of "ls /data" (the pool is mounted there). It doesn't restart the machine when the IO freezes, because the command always succeeds (coming from cache I guess). But if something wants to touch the disks, it stucks in D state. zpool status shows: scan: resilver in progress since Sun Oct 21 15:40:50 2012 3.16T scanned out of 13.8T at 26.4M/s, 117h45m to go 133G resilvered, 22.82% done And the estimated time grows constantly. gstat shows no IO. If I issue an ls -R /data, it gets stuck: root 36217 0.0 0.0 14380 1800 3 D+ 4:45PM 0:00.00 ls -R /data/ # procstat -k 36217 PID TID COMM TDNAME KSTACK 36217 101469 ls - mi_switch sleepq_wait _cv_wait zio_wait dbuf_read dbuf_findbp dbuf_hold_impl dbuf_hold dmu_buf_hold zap_lockdir zap_cursor_retrieve zfs_freebsd_readdir kern_getdirentries sys_getdirentries amd64_syscall Xfast_syscall Also, a dd on any of the disks waits forever, without reading a single byte: root 36570 0.0 0.0 9876 1356 4 DL+ 4:46PM 0:00.00 dd if=/dev/da0 of=/dev/null # procstat -k 36570 PID TID COMM TDNAME KSTACK 36570 101489 dd - mi_switch sleepq_wait _sleep bwait physio devfs_read_f dofileread kern_readv sys_read amd64_syscall Xfast_syscall Camcontrol works: # camcontrol devlist <ATA SEAGATE ST35002N SU0F> at scbus0 target 0 lun 0 (pass0,da0) <ATA SEAGATE ST35002N SU0F> at scbus0 target 1 lun 0 (pass1,da1) <ATA SEAGATE ST35002N SU0F> at scbus0 target 2 lun 0 (pass2,da2) <ATA HITACHI HDS7250S AJ0A> at scbus0 target 3 lun 0 (pass3,da3) <ATA SEAGATE ST35002N SU0F> at scbus0 target 4 lun 0 (pass4,da4) <ATA HITACHI HUA7250S AC5A> at scbus0 target 5 lun 0 (pass5,da5) <ATA SEAGATE ST35002N SU0F> at scbus0 target 6 lun 0 (pass6,da6) <ATA ST3500320NS SN04> at scbus0 target 7 lun 0 (pass7,da7) <ATA HITACHI HDS7250S AJ0A> at scbus1 target 0 lun 0 (pass8,da8) <ATA SEAGATE ST35002N SU0F> at scbus1 target 1 lun 0 (pass9,da9) <ATA SEAGATE ST35002N SU0F> at scbus1 target 2 lun 0 (pass10,da10) <ATA SEAGATE ST35002N SU0F> at scbus1 target 3 lun 0 (pass11,da11) <ATA SEAGATE ST35002N SU0F> at scbus1 target 4 lun 0 (pass12,da12) <ATA SEAGATE ST35002N SU0F> at scbus1 target 5 lun 0 (pass13,da13) <ATA SEAGATE ST35002N SU0F> at scbus1 target 6 lun 0 (pass14,da14) <ATA SEAGATE ST35002N SU0F> at scbus1 target 7 lun 0 (pass15,da15) <ATA SEAGATE ST35002N SU0F> at scbus2 target 0 lun 0 (pass16,da16) <ATA SEAGATE ST35002N SU0F> at scbus2 target 1 lun 0 (pass17,da17) <ATA HITACHI HUA7250S AC5A> at scbus2 target 2 lun 0 (pass18,da18) <ATA SEAGATE ST35002N SU0F> at scbus2 target 3 lun 0 (pass19,da19) <ATA SEAGATE ST35002N SU0F> at scbus2 target 4 lun 0 (pass20,da20) <ATA SEAGATE ST35002N SU0F> at scbus2 target 5 lun 0 (pass21,da21) <ATA SEAGATE ST35002N SU0F> at scbus2 target 6 lun 0 (pass22,da22) <ATA SEAGATE ST35002N SU0F> at scbus2 target 7 lun 0 (pass23,da23) <ATA SEAGATE ST35002N SU0F> at scbus3 target 0 lun 0 (pass24,da24) <ATA SEAGATE ST35002N SU0F> at scbus3 target 1 lun 0 (pass25,da25) <ATA SEAGATE ST35002N SU0F> at scbus3 target 2 lun 0 (pass26,da26) <ATA SEAGATE ST35002N SU0F> at scbus3 target 3 lun 0 (pass27,da27) <ATA SEAGATE ST35002N SU0F> at scbus3 target 4 lun 0 (pass28,da28) <ATA SEAGATE ST35002N SU0F> at scbus3 target 5 lun 0 (pass29,da29) <ATA SEAGATE ST35002N SU0F> at scbus3 target 6 lun 0 (pass30,da30) <ATA SEAGATE ST35002N SU0F> at scbus3 target 7 lun 0 (pass31,da31) <ATA SEAGATE ST35002N SU0F> at scbus4 target 0 lun 0 (pass32,da32) <ATA SEAGATE ST35002N SU0F> at scbus4 target 1 lun 0 (pass33,da33) <ATA SEAGATE ST35002N SU0F> at scbus4 target 2 lun 0 (pass34,da34) <ATA SEAGATE ST35002N SU0F> at scbus4 target 3 lun 0 (pass35,da35) <ATA SEAGATE ST35002N SU0F> at scbus4 target 4 lun 0 (pass36,da36) <ATA SEAGATE ST35002N SU0F> at scbus4 target 5 lun 0 (pass37,da37) <ATA SEAGATE ST35002N SU0F> at scbus4 target 6 lun 0 (pass38,da38) <ATA SEAGATE ST35002N SU0F> at scbus4 target 7 lun 0 (pass39,da39) <ATA SEAGATE ST35002N SU0F> at scbus5 target 0 lun 0 (pass40,da40) <ATA SEAGATE ST35002N SU0F> at scbus5 target 1 lun 0 (pass41,da41) <ATA SEAGATE ST35002N SU0F> at scbus5 target 2 lun 0 (pass42,da42) <ATA SEAGATE ST35002N SU0F> at scbus5 target 3 lun 0 (pass43,da43) <ATA SEAGATE ST35002N SU0F> at scbus5 target 4 lun 0 (pass44,da44) <ATA SEAGATE ST35002N SU0F> at scbus5 target 5 lun 0 (pass45,da45) <ATA SEAGATE ST35002N SU0F> at scbus5 target 6 lun 0 (pass46,da46) <ATA SEAGATE ST35002N SU0F> at scbus5 target 7 lun 0 (pass47,da47) # camcontrol tags da0 (pass0:mpt0:0:0:0): device openings: 255 Also works (I guess it doesn't touch the disks): # zfs list NAME USED AVAIL REFER MOUNTPOINT logpool 13.1T 7.17T 507K /data logpool/jail 7.08G 7.17T 7.08G /data/jail logpool/logs 13.1T 7.17T 3.40T /data/jail/logvm/logs logpool/logs/OTHER 9.24T 7.17T 2.36T /data/jail/logvm/logs/OTHER But this doesn't: root 36686 0.0 0.0 33384 2512 5 D+ 4:49PM 0:00.00 zfs list -t snapshot # procstat -k 36686 PID TID COMM TDNAME KSTACK 36686 101593 zfs - mi_switch sleepq_wait _cv_wait zio_wait dbuf_read dmu_buf_hold zap_lockdir zap_cursor_retrieve dmu_snapshot_list_next zfs_ioc_snapshot_list_next zfsdev_ioctl devfs_ioctl_f kern_ioctl sys_ioctl amd64_syscall Xfast_syscall Entering into the debugger: KDB: enter: sysctl debug.kdb.enter [ thread pid 36959 tid 101484 ] Stopped at kdb_enter+0x3b: movq $0,0x95ab72(%rip) db> ps pid ppid pgrp uid state wmesg wchan cmd 36959 1769 36959 0 R+ CPU 0 sysctl 36691 919 919 0 S sbwait 0xfffffe009d752144 perl 36686 36677 36686 0 D+ zio->io_ 0xfffffe001ccb7d70 zfs 36677 36208 36677 0 Ss+ pause 0xfffffe009d0030a0 csh 36570 36567 36570 0 DL+ physrd 0xffffff87005a2980 dd 36567 36208 36567 0 Ss+ pause 0xfffffe00115c4540 csh 36217 36209 36217 0 D+ zio->io_ 0xfffffe001c2b2320 ls 36209 36208 36209 0 Ss+ pause 0xfffffe022c8aa0a0 csh 36208 36207 36208 0 Ss select 0xfffffe0665c92e40 screen 36207 1782 36207 0 S+ pause 0xfffffe009d0010a0 screen 32921 883 873 0 DL cbwait 0xfffffe000f7f7848 camcontrol 1782 1780 1782 0 Ss+ pause 0xfffffe009d4559e0 csh 1780 897 1780 0 Ss select 0xfffffe001d546740 sshd 1776 1774 1776 0 Ss+ ttyin 0xfffffe001c02a4a8 csh 1774 897 1774 0 Ss select 0xfffffe001cb4d0c0 sshd 1769 1767 1769 0 Ss+ pause 0xfffffe001191a540 csh 1767 897 1767 0 Ss select 0xfffffe000fd72bc0 sshd 1079 1 1079 0 Ss+ ttyin 0xfffffe000c82c4a8 getty 1078 1 1078 0 Ss+ ttyin 0xfffffe000c82c8a8 getty 1077 1 1077 0 Ss+ ttyin 0xfffffe000c82cca8 getty 1076 1 1076 0 Ss+ ttyin 0xfffffe000c82d0a8 getty 1075 1 1075 0 Ss+ ttyin 0xfffffe000c82d4a8 getty 1074 1 1074 0 Ss+ ttyin 0xfffffe000c82d8a8 getty 1073 1 1073 0 Ss+ ttyin 0xfffffe000c82dca8 getty 1072 1 1072 0 Ss+ ttyin 0xfffffe000c82f0a8 getty 919 1 919 0 Ss select 0xfffffe000f5ac940 perl 907 1 907 0 Ss nanslp 0xffffffff81244f08 cron 903 1 903 25 Ss pause 0xfffffe001125e0a0 sendmail 900 1 900 0 Ss select 0xfffffe001d549340 sendmail 897 1 897 0 Ss select 0xfffffe001d546cc0 sshd 892 884 873 0 S piperd 0xfffffe001e940888 fghack 884 878 873 0 S wait 0xfffffe000fdee000 sh 883 879 873 0 S piperd 0xfffffe022c08b000 perl 879 875 873 0 S select 0xfffffe001ca6a8c0 supervise 878 875 873 0 S select 0xfffffe000fd73d40 supervise 876 1 873 0 S piperd 0xfffffe001e9c5b60 readproctitle 875 1 873 0 S nanslp 0xffffffff81244f08 svscan 870 868 867 123 S select 0xfffffe000fd934c0 ntpd 868 867 867 123 S select 0xfffffe001ca68e40 ntpd 867 1 867 0 Ss select 0xfffffe000fddd740 ntpd 796 0 0 0 DL mdwait 0xfffffe000f52a000 [md2] 774 1 774 53 Ss (threaded) named 101524 S kqread 0xfffffe00115dd100 named 101523 S uwait 0xfffffe000fde5200 named 101522 S uwait 0xfffffe00110ce680 named 101521 S uwait 0xfffffe000fda0300 named 101520 S uwait 0xfffffe000fddd380 named 101519 S uwait 0xfffffe001198ca00 named 101518 S uwait 0xfffffe000fd58880 named 101517 S uwait 0xfffffe000fd7ab80 named 101516 S uwait 0xfffffe000f80e480 named 101515 S uwait 0xfffffe000f80f400 named 101501 S sigwait 0xfffffe00110dd000 named 751 750 751 0 Ss select 0xfffffe001d549440 syslog-ng 750 1 749 0 S wait 0xfffffe000c8144a0 syslog-ng 612 608 608 64 S bpf 0xfffffe001ca94800 pflogd 608 1 608 0 Ss sbwait 0xfffffe001eb4ae8c pflogd 605 0 0 0 DL pftm 0xffffffff817547a0 [pfpurge] 78 0 0 0 DL (threaded) [zfskern] 101459 D spa->spa 0xfffffe0011462680 [txg_thread_enter] 101458 D tx->tx_q 0xfffffe001b199230 [txg_thread_enter] 100122 D l2arc_fe 0xffffffff8173ebc0 [l2arc_feed_thread] 100121 D arc_recl 0xffffffff8172ed20 [arc_reclaim_thread] 59 0 0 0 DL mdwait 0xfffffe000f521000 [md1] 47 0 0 0 DL mdwait 0xfffffe000f523800 [md0] 24 0 0 0 DL sdflush 0xffffffff812a6158 [softdepflush] 23 0 0 0 DL syncer 0xffffffff812928c0 [syncer] 22 0 0 0 DL vlruwt 0xfffffe000c80d000 [vnlru] 21 0 0 0 DL psleep 0xffffffff81292348 [bufdaemon] 20 0 0 0 DL pgzero 0xffffffff812b019c [pagezero] 19 0 0 0 DL psleep 0xffffffff812af368 [vmdaemon] 18 0 0 0 DL psleep 0xffffffff812af32c [pagedaemon] 17 0 0 0 DL ccb_scan 0xffffffff811ff260 [xpt_thrd] 16 0 0 0 DL idle 0xffffff8001df3000 [mpt_recovery5] 9 0 0 0 DL idle 0xffffff8001dde000 [mpt_recovery4] 8 0 0 0 DL idle 0xffffff8001dc9000 [mpt_recovery3] 7 0 0 0 DL idle 0xffffff8001daa000 [mpt_recovery2] 6 0 0 0 DL idle 0xffffff8001d95000 [mpt_recovery1] 5 0 0 0 DL idle 0xffffff8001d80000 [mpt_recovery0] 15 0 0 0 DL (threaded) [usb] 100048 D - 0xffffff8001d73e18 [usbus1] 100047 D - 0xffffff8001d73dc0 [usbus1] 100046 D - 0xffffff8001d73d68 [usbus1] 100045 D - 0xffffff8001d73d10 [usbus1] 100043 D - 0xffffff8001d6b460 [usbus0] 100042 D - 0xffffff8001d6b408 [usbus0] 100041 D - 0xffffff8001d6b3b0 [usbus0] 100040 D - 0xffffff8001d6b358 [usbus0] 4 0 0 0 DL ctl_work 0xffffff8000a41000 [ctl_thrd] 14 0 0 0 DL - 0xffffffff81243ba4 [yarrow] 3 0 0 0 DL crypto_r 0xffffffff812a4ae0 [crypto returns] 2 0 0 0 DL crypto_w 0xffffffff812a4aa0 [crypto] 13 0 0 0 DL (threaded) [geom] 100023 D - 0xffffffff8123d030 [g_down] 100022 D - 0xffffffff8123d028 [g_up] 100021 D - 0xffffffff8123d018 [g_event] 12 0 0 0 RL (threaded) [intr] 100065 I [swi0: uart] 100063 I [irq293: mpt5] 100061 I [irq292: mpt4] 100059 I [irq291: mpt3] 100055 I [irq274: mpt2] 100053 I [irq273: mpt1] 100051 I [irq272: mpt0] 100044 I [irq22: ehci0] 100039 I [irq21: ohci0] 100034 I [swi2: cambio] 100031 I [swi6: task queue] 100030 I [swi6: Giant taskq] 100028 I [swi5: +] 100020 I [swi1: netisr 0] 100019 I [swi4: clock] 100018 I [swi4: clock] 100017 I [swi4: clock] 100016 I [swi4: clock] 100015 I [swi4: clock] 100014 I [swi4: clock] 100013 I [swi4: clock] 100012 RunQ [swi4: clock] 100011 I [swi3: vm] 11 0 0 0 RL (threaded) [idle] 100010 Run CPU 7 [idle: cpu7] 100009 Run CPU 6 [idle: cpu6] 100008 Run CPU 5 [idle: cpu5] 100007 Run CPU 4 [idle: cpu4] 100006 Run CPU 3 [idle: cpu3] 100005 Run CPU 2 [idle: cpu2] 100004 Run CPU 1 [idle: cpu1] 100003 CanRun [idle: cpu0] 1 0 1 0 SLs wait 0xfffffe000c068940 [init] 10 0 0 0 DL audit_wo 0xffffffff812a50d0 [audit] 0 0 0 0 DLs (threaded) [kernel] 101463 D - 0xfffffe000fddab00 [zil_clean] 101462 D - 0xfffffe000fd6a800 [zil_clean] 101461 D - 0xfffffe000fdf6180 [zil_clean] 101460 D - 0xfffffe001d546600 [zil_clean] 101457 D - 0xfffffe000f359e00 [zfs_vn_rele_taskq] 101456 D - 0xfffffe001198d080 [zio_ioctl_intr] 101455 D - 0xfffffe001cb4fa80 [zio_ioctl_issue] 101454 D - 0xfffffe000ffbf380 [zio_claim_intr] 101453 D - 0xfffffe00110cf580 [zio_claim_issue] 101452 D - 0xfffffe00110cf880 [zio_free_intr] 101451 D - 0xfffffe000ffc1b80 [zio_free_issue_99] 101450 D - 0xfffffe000ffc1b80 [zio_free_issue_98] 101449 D - 0xfffffe000ffc1b80 [zio_free_issue_97] 101448 D - 0xfffffe000ffc1b80 [zio_free_issue_96] 101447 D - 0xfffffe000ffc1b80 [zio_free_issue_95] 101446 D - 0xfffffe000ffc1b80 [zio_free_issue_94] 101445 D - 0xfffffe000ffc1b80 [zio_free_issue_93] 101444 D - 0xfffffe000ffc1b80 [zio_free_issue_92] 101443 D - 0xfffffe000ffc1b80 [zio_free_issue_91] 101442 D - 0xfffffe000ffc1b80 [zio_free_issue_90] 101441 D - 0xfffffe000ffc1b80 [zio_free_issue_89] 101440 D - 0xfffffe000ffc1b80 [zio_free_issue_88] 101439 D - 0xfffffe000ffc1b80 [zio_free_issue_87] 101438 D - 0xfffffe000ffc1b80 [zio_free_issue_86] 101437 D - 0xfffffe000ffc1b80 [zio_free_issue_85] 101436 D - 0xfffffe000ffc1b80 [zio_free_issue_84] 101435 D - 0xfffffe000ffc1b80 [zio_free_issue_83] 101434 D - 0xfffffe000ffc1b80 [zio_free_issue_82] 101433 D - 0xfffffe000ffc1b80 [zio_free_issue_81] 101432 D - 0xfffffe000ffc1b80 [zio_free_issue_80] 101431 D - 0xfffffe000ffc1b80 [zio_free_issue_79] 101430 D - 0xfffffe000ffc1b80 [zio_free_issue_78] 101429 D - 0xfffffe000ffc1b80 [zio_free_issue_77] 101428 D - 0xfffffe000ffc1b80 [zio_free_issue_76] 101427 D - 0xfffffe000ffc1b80 [zio_free_issue_75] 101426 D - 0xfffffe000ffc1b80 [zio_free_issue_74] 101425 D - 0xfffffe000ffc1b80 [zio_free_issue_73] 101424 D - 0xfffffe000ffc1b80 [zio_free_issue_72] 101423 D - 0xfffffe000ffc1b80 [zio_free_issue_71] 101422 D - 0xfffffe000ffc1b80 [zio_free_issue_70] 101421 D - 0xfffffe000ffc1b80 [zio_free_issue_69] 101420 D - 0xfffffe000ffc1b80 [zio_free_issue_68] 101419 D - 0xfffffe000ffc1b80 [zio_free_issue_67] 101418 D - 0xfffffe000ffc1b80 [zio_free_issue_66] 101417 D - 0xfffffe000ffc1b80 [zio_free_issue_65] 101416 D - 0xfffffe000ffc1b80 [zio_free_issue_64] 101415 D - 0xfffffe000ffc1b80 [zio_free_issue_63] 101414 D - 0xfffffe000ffc1b80 [zio_free_issue_62] 101413 D - 0xfffffe000ffc1b80 [zio_free_issue_61] 101412 D - 0xfffffe000ffc1b80 [zio_free_issue_60] 101411 D - 0xfffffe000ffc1b80 [zio_free_issue_59] 101410 D - 0xfffffe000ffc1b80 [zio_free_issue_58] 101409 D - 0xfffffe000ffc1b80 [zio_free_issue_57] 101408 D - 0xfffffe000ffc1b80 [zio_free_issue_56] 101407 D - 0xfffffe000ffc1b80 [zio_free_issue_55] 101406 D - 0xfffffe000ffc1b80 [zio_free_issue_54] 101405 D - 0xfffffe000ffc1b80 [zio_free_issue_53] 101404 D - 0xfffffe000ffc1b80 [zio_free_issue_52] 101403 D - 0xfffffe000ffc1b80 [zio_free_issue_51] 101402 D - 0xfffffe000ffc1b80 [zio_free_issue_50] 101401 D - 0xfffffe000ffc1b80 [zio_free_issue_49] 101400 D - 0xfffffe000ffc1b80 [zio_free_issue_48] 101399 D - 0xfffffe000ffc1b80 [zio_free_issue_47] 101398 D - 0xfffffe000ffc1b80 [zio_free_issue_46] 101397 D - 0xfffffe000ffc1b80 [zio_free_issue_45] 101396 D - 0xfffffe000ffc1b80 [zio_free_issue_44] 101395 D - 0xfffffe000ffc1b80 [zio_free_issue_43] 101394 D - 0xfffffe000ffc1b80 [zio_free_issue_42] 101393 D - 0xfffffe000ffc1b80 [zio_free_issue_41] 101392 D - 0xfffffe000ffc1b80 [zio_free_issue_40] 101391 D - 0xfffffe000ffc1b80 [zio_free_issue_39] 101390 D - 0xfffffe000ffc1b80 [zio_free_issue_38] 101389 D - 0xfffffe000ffc1b80 [zio_free_issue_37] 101388 D - 0xfffffe000ffc1b80 [zio_free_issue_36] 101387 D - 0xfffffe000ffc1b80 [zio_free_issue_35] 101386 D - 0xfffffe000ffc1b80 [zio_free_issue_34] 101385 D - 0xfffffe000ffc1b80 [zio_free_issue_33] 101384 D - 0xfffffe000ffc1b80 [zio_free_issue_32] 101383 D - 0xfffffe000ffc1b80 [zio_free_issue_31] 100569 D - 0xfffffe000ffc1b80 [zio_free_issue_30] 100567 D - 0xfffffe000ffc1b80 [zio_free_issue_29] 100565 D - 0xfffffe000ffc1b80 [zio_free_issue_28] 100560 D - 0xfffffe000ffc1b80 [zio_free_issue_27] 100554 D - 0xfffffe000ffc1b80 [zio_free_issue_26] 100553 D - 0xfffffe000ffc1b80 [zio_free_issue_25] 100547 D - 0xfffffe000ffc1b80 [zio_free_issue_24] 100545 D - 0xfffffe000ffc1b80 [zio_free_issue_23] 100542 D - 0xfffffe000ffc1b80 [zio_free_issue_22] 100539 D - 0xfffffe000ffc1b80 [zio_free_issue_21] 100536 D - 0xfffffe000ffc1b80 [zio_free_issue_20] 100530 D - 0xfffffe000ffc1b80 [zio_free_issue_19] 100487 D - 0xfffffe000ffc1b80 [zio_free_issue_18] 100415 D - 0xfffffe000ffc1b80 [zio_free_issue_17] 100413 D - 0xfffffe000ffc1b80 [zio_free_issue_16] 100407 D - 0xfffffe000ffc1b80 [zio_free_issue_15] 100403 D - 0xfffffe000ffc1b80 [zio_free_issue_14] 100400 D - 0xfffffe000ffc1b80 [zio_free_issue_13] 100393 D - 0xfffffe000ffc1b80 [zio_free_issue_12] 100391 D - 0xfffffe000ffc1b80 [zio_free_issue_11] 100387 D - 0xfffffe000ffc1b80 [zio_free_issue_10] 100386 D - 0xfffffe000ffc1b80 [zio_free_issue_9] 100385 D - 0xfffffe000ffc1b80 [zio_free_issue_8] 100384 D - 0xfffffe000ffc1b80 [zio_free_issue_7] 100383 D - 0xfffffe000ffc1b80 [zio_free_issue_6] 100379 D - 0xfffffe000ffc1b80 [zio_free_issue_5] 100372 D - 0xfffffe000ffc1b80 [zio_free_issue_4] 100367 D - 0xfffffe000ffc1b80 [zio_free_issue_3] 100366 D - 0xfffffe000ffc1b80 [zio_free_issue_2] 100361 D - 0xfffffe000ffc1b80 [zio_free_issue_1] 100360 D - 0xfffffe000ffc1b80 [zio_free_issue_0] 100359 D - 0xfffffe001ca67280 [zio_write_intr_high] 100358 D - 0xfffffe001ca67280 [zio_write_intr_high] 100357 D - 0xfffffe001ca67280 [zio_write_intr_high] 100354 D - 0xfffffe001ca67280 [zio_write_intr_high] 100353 D - 0xfffffe001ca67280 [zio_write_intr_high] 100349 D - 0xfffffe000fd72700 [zio_write_intr_7] 100348 D - 0xfffffe000fd72700 [zio_write_intr_6] 100345 D - 0xfffffe000fd72700 [zio_write_intr_5] 100343 D - 0xfffffe000fd72700 [zio_write_intr_4] 100342 D - 0xfffffe000fd72700 [zio_write_intr_3] 100341 D - 0xfffffe000fd72700 [zio_write_intr_2] 100340 D - 0xfffffe000fd72700 [zio_write_intr_1] 100339 D - 0xfffffe000fd72700 [zio_write_intr_0] 100337 D - 0xfffffe001196ce00 [zio_write_issue_hig] 100336 D - 0xfffffe001196ce00 [zio_write_issue_hig] 100334 D - 0xfffffe001196ce00 [zio_write_issue_hig] 100330 D - 0xfffffe001196ce00 [zio_write_issue_hig] 100327 D - 0xfffffe001196ce00 [zio_write_issue_hig] 100324 D - 0xfffffe00110cfb00 [zio_write_issue_7] 100322 D - 0xfffffe00110cfb00 [zio_write_issue_6] 100321 D - 0xfffffe00110cfb00 [zio_write_issue_5] 100316 D - 0xfffffe00110cfb00 [zio_write_issue_4] 100314 D - 0xfffffe00110cfb00 [zio_write_issue_3] 100312 D - 0xfffffe00110cfb00 [zio_write_issue_2] 100311 D - 0xfffffe00110cfb00 [zio_write_issue_1] 100307 D - 0xfffffe00110cfb00 [zio_write_issue_0] 100306 D - 0xfffffe000ffbfc80 [zio_read_intr_7] 100305 D - 0xfffffe000ffbfc80 [zio_read_intr_6] 100303 D - 0xfffffe000ffbfc80 [zio_read_intr_5] 100300 D - 0xfffffe000ffbfc80 [zio_read_intr_4] 100298 D - 0xfffffe000ffbfc80 [zio_read_intr_3] 100297 D - 0xfffffe000ffbfc80 [zio_read_intr_2] 100293 D - 0xfffffe000ffbfc80 [zio_read_intr_1] 100292 D - 0xfffffe000ffbfc80 [zio_read_intr_0] 100291 D - 0xfffffe00110cf000 [zio_read_issue_7] 100289 D - 0xfffffe00110cf000 [zio_read_issue_6] 100288 D - 0xfffffe00110cf000 [zio_read_issue_5] 100286 D - 0xfffffe00110cf000 [zio_read_issue_4] 100282 D - 0xfffffe00110cf000 [zio_read_issue_3] 100281 D - 0xfffffe00110cf000 [zio_read_issue_2] 100280 D - 0xfffffe00110cf000 [zio_read_issue_1] 100278 D - 0xfffffe00110cf000 [zio_read_issue_0] 100275 D - 0xfffffe001113b500 [zio_null_intr] 100273 D - 0xfffffe001196c800 [zio_null_issue] 100120 D - 0xfffffe0011370300 [system_taskq_7] 100119 D - 0xfffffe0011370300 [system_taskq_6] 100118 D - 0xfffffe0011370300 [system_taskq_5] 100117 D - 0xfffffe0011370300 [system_taskq_4] 100116 D - 0xfffffe0011370300 [system_taskq_3] 100115 D - 0xfffffe0011370300 [system_taskq_2] 100114 D - 0xfffffe0011370300 [system_taskq_1] 100113 D - 0xfffffe0011370300 [system_taskq_0] 100066 D - 0xfffffe000f239a80 [mca taskq] 100058 D - 0xfffffe000c69b900 [nfe3 taskq] 100057 D - 0xfffffe000c698480 [nfe2 taskq] 100050 D - 0xfffffe000c620400 [nfe1 taskq] 100049 D - 0xfffffe000c61b500 [nfe0 taskq] 100037 D - 0xfffffe000c24bb00 [acpi_task_2] 100036 D - 0xfffffe000c24bb00 [acpi_task_1] 100035 D - 0xfffffe000c24bb00 [acpi_task_0] 100033 D - 0xfffffe000c24be00 [kqueue taskq] 100032 D - 0xfffffe000c24c000 [ffs_trim taskq] 100029 D - 0xfffffe000c20c780 [thread taskq] 100024 D - 0xfffffe000c07fb80 [firmware taskq] 100000 D sched 0xffffffff8123d280 [swapper] 895 892 873 0 Z perl Setting this: # sysctl dev.mpt.0.debug=255 and doing a dd again from a disk on that controller prints this onto the console: SCSI IO Request @ 0xffffff80003046f0 Chain Offset 0x00 MsgFlags 0x00 MsgContext 0x000201c5 Bus: 0 TargetID 0 SenseBufferLength 32 LUN: 0x0 Control 0x02000200 READ ORDEREDQ DataLength 0x00000200 SenseBufAddr 0x0c678be0 CDB[0:6] 08 00 00 00 01 00 SE64 0xffffff87ffd33a30: Addr=0x000000070cc08400 FlagsLength=0xd3000200 64_BIT_ADDRESSING LAST_ELEMENT END_OF_BUFFER END_OF_LIST mpt0: Send Request 453 (c678a00): mpt0: 00000000 00002006 000201c5 00000000 00000000 02000200 00000008 00000001 mpt0: 00000000 00000000 00000200 0c678be0 d3000200 0cc08400 00000007 ffffffff mpt0: ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff mpt0: ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff mpt0: enter mpt_intr mpt0: Context Reply: 0x000201c5 mpt0: exit mpt_intr And dd freezes. Alltrace from a couple of stuck processes: Tracing command dd pid 36971 tid 101570 td 0xfffffe001efce000 sched_switch() at sched_switch+0x115 mi_switch() at mi_switch+0x186 sleepq_wait() at sleepq_wait+0x42 _sleep() at _sleep+0x379 bwait() at bwait+0x64 physio() at physio+0x1c8 devfs_read_f() at devfs_read_f+0x90 dofileread() at dofileread+0xa1 kern_readv() at kern_readv+0x6c sys_read() at sys_read+0x64 amd64_syscall() at amd64_syscall+0x540 Xfast_syscall() at Xfast_syscall+0xf7 --- syscall (3, FreeBSD ELF64, sys_read), rip = 0x800916c8c, rsp = 0x7fffffffd658, rbp = 0x7fffffffd6b0 --- Tracing command zfs pid 36686 tid 101593 td 0xfffffe001ecb3900 sched_switch() at sched_switch+0x115 mi_switch() at mi_switch+0x186 sleepq_wait() at sleepq_wait+0x42 _cv_wait() at _cv_wait+0x112 zio_wait() at zio_wait+0x61 dbuf_read() at dbuf_read+0x5e5 dmu_buf_hold() at dmu_buf_hold+0xe0 zap_lockdir() at zap_lockdir+0x58 zap_cursor_retrieve() at zap_cursor_retrieve+0x19b dmu_snapshot_list_next() at dmu_snapshot_list_next+0xaf zfs_ioc_snapshot_list_next() at zfs_ioc_snapshot_list_next+0x101 zfsdev_ioctl() at zfsdev_ioctl+0xe6 devfs_ioctl_f() at devfs_ioctl_f+0x7b kern_ioctl() at kern_ioctl+0x106 sys_ioctl() at sys_ioctl+0xfd amd64_syscall() at amd64_syscall+0x540 Xfast_syscall() at Xfast_syscall+0xf7 --- syscall (54, FreeBSD ELF64, sys_ioctl), rip = 0x801be2c2c, rsp = 0x7fffffff8938, rbp = 0x4000 ---
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?50856322.9070307>