Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 22 Oct 2012 17:15:46 +0200
From:      Attila Nagy <bra@fsn.hu>
To:        Dennis Glatting <freebsd@pki2.com>
Cc:        freebsd-fs@FreeBSD.org
Subject:   Re: ZFS HBAs + LSI chip sets (Was: ZFS hang (system #2))
Message-ID:  <50856322.9070307@fsn.hu>
In-Reply-To: <1350778257.86715.106.camel@btw.pki2.com>
References:  <1350698905.86715.33.camel@btw.pki2.com> <1350711509.86715.59.camel@btw.pki2.com> <50825598.3070505@FreeBSD.org> <1350744349.88577.10.camel@btw.pki2.com> <1350765093.86715.69.camel@btw.pki2.com> <508322EC.4080700@FreeBSD.org> <1350778257.86715.106.camel@btw.pki2.com>

next in thread | previous in thread | raw e-mail | index | archive | help
Hi,

On 10/21/2012 02:10 AM, Dennis Glatting wrote:
> I chosen the LSI2008 chip set because the code was donated by LSI, and
> they therefore demonstrated interest in supporting their products under
> FreeBSD, and that chip set is found in a lot of places, notably
> Supermicro boards. Additionally, there were stories of success on the
> lists for several boards. That said, I have received private email from
> others expressing frustration with ZFS and the "hang" problems, which I
> believe are also the LSI chips.
>
I have a Sun X4540, which shows similar symptoms. It has some (6) 
on-board LSI 1068E SAS controllers with 1.27.02.00-IT firmware (latest 
from Sun/Oracle) and 48 SATA disks.
It runs stable/9@r240134.

Currently the machine does a resilver on its 48 disk pool (heavy IO 
happens), which stops periodically.
I've set up watchdogd with a command of "ls /data" (the pool is mounted 
there). It doesn't restart the machine when the IO freezes, because the 
command always succeeds (coming from cache I guess).
But if something wants to touch the disks, it stucks in D state.

zpool status shows:
   scan: resilver in progress since Sun Oct 21 15:40:50 2012
         3.16T scanned out of 13.8T at 26.4M/s, 117h45m to go
         133G resilvered, 22.82% done
And the estimated time grows constantly.
gstat shows no IO.

If I issue an ls -R /data, it gets stuck:
root    36217   0.0  0.0 14380  1800  3  D+    4:45PM   0:00.00 ls -R /data/
# procstat -k 36217
   PID    TID COMM             TDNAME KSTACK
36217 101469 ls               -                mi_switch sleepq_wait 
_cv_wait zio_wait dbuf_read dbuf_findbp dbuf_hold_impl dbuf_hold 
dmu_buf_hold zap_lockdir zap_cursor_retrieve zfs_freebsd_readdir 
kern_getdirentries sys_getdirentries amd64_syscall Xfast_syscall

Also, a dd on any of the disks waits forever, without reading a single byte:
root    36570   0.0  0.0  9876  1356  4  DL+   4:46PM   0:00.00 dd 
if=/dev/da0 of=/dev/null
# procstat -k 36570
   PID    TID COMM             TDNAME KSTACK
36570 101489 dd               -                mi_switch sleepq_wait 
_sleep bwait physio devfs_read_f dofileread kern_readv sys_read 
amd64_syscall Xfast_syscall


Camcontrol works:

# camcontrol devlist
<ATA SEAGATE ST35002N SU0F>        at scbus0 target 0 lun 0 (pass0,da0)
<ATA SEAGATE ST35002N SU0F>        at scbus0 target 1 lun 0 (pass1,da1)
<ATA SEAGATE ST35002N SU0F>        at scbus0 target 2 lun 0 (pass2,da2)
<ATA HITACHI HDS7250S AJ0A>        at scbus0 target 3 lun 0 (pass3,da3)
<ATA SEAGATE ST35002N SU0F>        at scbus0 target 4 lun 0 (pass4,da4)
<ATA HITACHI HUA7250S AC5A>        at scbus0 target 5 lun 0 (pass5,da5)
<ATA SEAGATE ST35002N SU0F>        at scbus0 target 6 lun 0 (pass6,da6)
<ATA ST3500320NS SN04>             at scbus0 target 7 lun 0 (pass7,da7)
<ATA HITACHI HDS7250S AJ0A>        at scbus1 target 0 lun 0 (pass8,da8)
<ATA SEAGATE ST35002N SU0F>        at scbus1 target 1 lun 0 (pass9,da9)
<ATA SEAGATE ST35002N SU0F>        at scbus1 target 2 lun 0 (pass10,da10)
<ATA SEAGATE ST35002N SU0F>        at scbus1 target 3 lun 0 (pass11,da11)
<ATA SEAGATE ST35002N SU0F>        at scbus1 target 4 lun 0 (pass12,da12)
<ATA SEAGATE ST35002N SU0F>        at scbus1 target 5 lun 0 (pass13,da13)
<ATA SEAGATE ST35002N SU0F>        at scbus1 target 6 lun 0 (pass14,da14)
<ATA SEAGATE ST35002N SU0F>        at scbus1 target 7 lun 0 (pass15,da15)
<ATA SEAGATE ST35002N SU0F>        at scbus2 target 0 lun 0 (pass16,da16)
<ATA SEAGATE ST35002N SU0F>        at scbus2 target 1 lun 0 (pass17,da17)
<ATA HITACHI HUA7250S AC5A>        at scbus2 target 2 lun 0 (pass18,da18)
<ATA SEAGATE ST35002N SU0F>        at scbus2 target 3 lun 0 (pass19,da19)
<ATA SEAGATE ST35002N SU0F>        at scbus2 target 4 lun 0 (pass20,da20)
<ATA SEAGATE ST35002N SU0F>        at scbus2 target 5 lun 0 (pass21,da21)
<ATA SEAGATE ST35002N SU0F>        at scbus2 target 6 lun 0 (pass22,da22)
<ATA SEAGATE ST35002N SU0F>        at scbus2 target 7 lun 0 (pass23,da23)
<ATA SEAGATE ST35002N SU0F>        at scbus3 target 0 lun 0 (pass24,da24)
<ATA SEAGATE ST35002N SU0F>        at scbus3 target 1 lun 0 (pass25,da25)
<ATA SEAGATE ST35002N SU0F>        at scbus3 target 2 lun 0 (pass26,da26)
<ATA SEAGATE ST35002N SU0F>        at scbus3 target 3 lun 0 (pass27,da27)
<ATA SEAGATE ST35002N SU0F>        at scbus3 target 4 lun 0 (pass28,da28)
<ATA SEAGATE ST35002N SU0F>        at scbus3 target 5 lun 0 (pass29,da29)
<ATA SEAGATE ST35002N SU0F>        at scbus3 target 6 lun 0 (pass30,da30)
<ATA SEAGATE ST35002N SU0F>        at scbus3 target 7 lun 0 (pass31,da31)
<ATA SEAGATE ST35002N SU0F>        at scbus4 target 0 lun 0 (pass32,da32)
<ATA SEAGATE ST35002N SU0F>        at scbus4 target 1 lun 0 (pass33,da33)
<ATA SEAGATE ST35002N SU0F>        at scbus4 target 2 lun 0 (pass34,da34)
<ATA SEAGATE ST35002N SU0F>        at scbus4 target 3 lun 0 (pass35,da35)
<ATA SEAGATE ST35002N SU0F>        at scbus4 target 4 lun 0 (pass36,da36)
<ATA SEAGATE ST35002N SU0F>        at scbus4 target 5 lun 0 (pass37,da37)
<ATA SEAGATE ST35002N SU0F>        at scbus4 target 6 lun 0 (pass38,da38)
<ATA SEAGATE ST35002N SU0F>        at scbus4 target 7 lun 0 (pass39,da39)
<ATA SEAGATE ST35002N SU0F>        at scbus5 target 0 lun 0 (pass40,da40)
<ATA SEAGATE ST35002N SU0F>        at scbus5 target 1 lun 0 (pass41,da41)
<ATA SEAGATE ST35002N SU0F>        at scbus5 target 2 lun 0 (pass42,da42)
<ATA SEAGATE ST35002N SU0F>        at scbus5 target 3 lun 0 (pass43,da43)
<ATA SEAGATE ST35002N SU0F>        at scbus5 target 4 lun 0 (pass44,da44)
<ATA SEAGATE ST35002N SU0F>        at scbus5 target 5 lun 0 (pass45,da45)
<ATA SEAGATE ST35002N SU0F>        at scbus5 target 6 lun 0 (pass46,da46)
<ATA SEAGATE ST35002N SU0F>        at scbus5 target 7 lun 0 (pass47,da47)

# camcontrol tags da0
(pass0:mpt0:0:0:0): device openings: 255

Also works (I guess it doesn't touch the disks):
# zfs list
NAME                 USED  AVAIL  REFER  MOUNTPOINT
logpool             13.1T  7.17T   507K  /data
logpool/jail        7.08G  7.17T  7.08G  /data/jail
logpool/logs        13.1T  7.17T  3.40T /data/jail/logvm/logs
logpool/logs/OTHER  9.24T  7.17T  2.36T /data/jail/logvm/logs/OTHER

But this doesn't:
root    36686   0.0  0.0 33384  2512  5  D+    4:49PM   0:00.00 zfs list 
-t snapshot
# procstat -k 36686
   PID    TID COMM             TDNAME KSTACK
36686 101593 zfs              -                mi_switch sleepq_wait 
_cv_wait zio_wait dbuf_read dmu_buf_hold zap_lockdir zap_cursor_retrieve 
dmu_snapshot_list_next zfs_ioc_snapshot_list_next zfsdev_ioctl 
devfs_ioctl_f kern_ioctl sys_ioctl amd64_syscall Xfast_syscall

Entering into the debugger:
KDB: enter: sysctl debug.kdb.enter
[ thread pid 36959 tid 101484 ]
Stopped at      kdb_enter+0x3b: movq    $0,0x95ab72(%rip)
db> ps
   pid  ppid  pgrp   uid   state   wmesg wchan        cmd
36959  1769 36959     0  R+      CPU 0                       sysctl
36691   919   919     0  S       sbwait 0xfffffe009d752144 perl
36686 36677 36686     0  D+      zio->io_ 0xfffffe001ccb7d70 zfs
36677 36208 36677     0  Ss+     pause 0xfffffe009d0030a0 csh
36570 36567 36570     0  DL+     physrd 0xffffff87005a2980 dd
36567 36208 36567     0  Ss+     pause 0xfffffe00115c4540 csh
36217 36209 36217     0  D+      zio->io_ 0xfffffe001c2b2320 ls
36209 36208 36209     0  Ss+     pause 0xfffffe022c8aa0a0 csh
36208 36207 36208     0  Ss      select 0xfffffe0665c92e40 screen
36207  1782 36207     0  S+      pause 0xfffffe009d0010a0 screen
32921   883   873     0  DL      cbwait 0xfffffe000f7f7848 camcontrol
  1782  1780  1782     0  Ss+     pause 0xfffffe009d4559e0 csh
  1780   897  1780     0  Ss      select 0xfffffe001d546740 sshd
  1776  1774  1776     0  Ss+     ttyin 0xfffffe001c02a4a8 csh
  1774   897  1774     0  Ss      select 0xfffffe001cb4d0c0 sshd
  1769  1767  1769     0  Ss+     pause 0xfffffe001191a540 csh
  1767   897  1767     0  Ss      select 0xfffffe000fd72bc0 sshd
  1079     1  1079     0  Ss+     ttyin 0xfffffe000c82c4a8 getty
  1078     1  1078     0  Ss+     ttyin 0xfffffe000c82c8a8 getty
  1077     1  1077     0  Ss+     ttyin 0xfffffe000c82cca8 getty
  1076     1  1076     0  Ss+     ttyin 0xfffffe000c82d0a8 getty
  1075     1  1075     0  Ss+     ttyin 0xfffffe000c82d4a8 getty
  1074     1  1074     0  Ss+     ttyin 0xfffffe000c82d8a8 getty
  1073     1  1073     0  Ss+     ttyin 0xfffffe000c82dca8 getty
  1072     1  1072     0  Ss+     ttyin 0xfffffe000c82f0a8 getty
   919     1   919     0  Ss      select 0xfffffe000f5ac940 perl
   907     1   907     0  Ss      nanslp 0xffffffff81244f08 cron
   903     1   903    25  Ss      pause 0xfffffe001125e0a0 sendmail
   900     1   900     0  Ss      select 0xfffffe001d549340 sendmail
   897     1   897     0  Ss      select 0xfffffe001d546cc0 sshd
   892   884   873     0  S       piperd 0xfffffe001e940888 fghack
   884   878   873     0  S       wait 0xfffffe000fdee000 sh
   883   879   873     0  S       piperd 0xfffffe022c08b000 perl
   879   875   873     0  S       select 0xfffffe001ca6a8c0 supervise
   878   875   873     0  S       select 0xfffffe000fd73d40 supervise
   876     1   873     0  S       piperd 0xfffffe001e9c5b60 readproctitle
   875     1   873     0  S       nanslp 0xffffffff81244f08 svscan
   870   868   867   123  S       select 0xfffffe000fd934c0 ntpd
   868   867   867   123  S       select 0xfffffe001ca68e40 ntpd
   867     1   867     0  Ss      select 0xfffffe000fddd740 ntpd
   796     0     0     0  DL      mdwait 0xfffffe000f52a000 [md2]
   774     1   774    53  Ss (threaded)                  named
101524                   S       kqread 0xfffffe00115dd100 named
101523                   S       uwait 0xfffffe000fde5200 named
101522                   S       uwait 0xfffffe00110ce680 named
101521                   S       uwait 0xfffffe000fda0300 named
101520                   S       uwait 0xfffffe000fddd380 named
101519                   S       uwait 0xfffffe001198ca00 named
101518                   S       uwait 0xfffffe000fd58880 named
101517                   S       uwait 0xfffffe000fd7ab80 named
101516                   S       uwait 0xfffffe000f80e480 named
101515                   S       uwait 0xfffffe000f80f400 named
101501                   S       sigwait 0xfffffe00110dd000 named
   751   750   751     0  Ss      select 0xfffffe001d549440 syslog-ng
   750     1   749     0  S       wait 0xfffffe000c8144a0 syslog-ng
   612   608   608    64  S       bpf 0xfffffe001ca94800 pflogd
   608     1   608     0  Ss      sbwait 0xfffffe001eb4ae8c pflogd
   605     0     0     0  DL      pftm 0xffffffff817547a0 [pfpurge]
    78     0     0     0  DL (threaded)                  [zfskern]
101459                   D       spa->spa 0xfffffe0011462680 
[txg_thread_enter]
101458                   D       tx->tx_q 0xfffffe001b199230 
[txg_thread_enter]
100122                   D       l2arc_fe 0xffffffff8173ebc0 
[l2arc_feed_thread]
100121                   D       arc_recl 0xffffffff8172ed20 
[arc_reclaim_thread]
    59     0     0     0  DL      mdwait 0xfffffe000f521000 [md1]
    47     0     0     0  DL      mdwait 0xfffffe000f523800 [md0]
    24     0     0     0  DL      sdflush 0xffffffff812a6158 [softdepflush]
    23     0     0     0  DL      syncer 0xffffffff812928c0 [syncer]
    22     0     0     0  DL      vlruwt 0xfffffe000c80d000 [vnlru]
    21     0     0     0  DL      psleep 0xffffffff81292348 [bufdaemon]
    20     0     0     0  DL      pgzero 0xffffffff812b019c [pagezero]
    19     0     0     0  DL      psleep 0xffffffff812af368 [vmdaemon]
    18     0     0     0  DL      psleep 0xffffffff812af32c [pagedaemon]
    17     0     0     0  DL      ccb_scan 0xffffffff811ff260 [xpt_thrd]
    16     0     0     0  DL      idle 0xffffff8001df3000 [mpt_recovery5]
     9     0     0     0  DL      idle 0xffffff8001dde000 [mpt_recovery4]
     8     0     0     0  DL      idle 0xffffff8001dc9000 [mpt_recovery3]
     7     0     0     0  DL      idle 0xffffff8001daa000 [mpt_recovery2]
     6     0     0     0  DL      idle 0xffffff8001d95000 [mpt_recovery1]
     5     0     0     0  DL      idle 0xffffff8001d80000 [mpt_recovery0]
    15     0     0     0  DL (threaded)                  [usb]
100048                   D       - 0xffffff8001d73e18 [usbus1]
100047                   D       - 0xffffff8001d73dc0 [usbus1]
100046                   D       - 0xffffff8001d73d68 [usbus1]
100045                   D       - 0xffffff8001d73d10 [usbus1]
100043                   D       - 0xffffff8001d6b460 [usbus0]
100042                   D       - 0xffffff8001d6b408 [usbus0]
100041                   D       - 0xffffff8001d6b3b0 [usbus0]
100040                   D       - 0xffffff8001d6b358 [usbus0]
     4     0     0     0  DL      ctl_work 0xffffff8000a41000 [ctl_thrd]
    14     0     0     0  DL      - 0xffffffff81243ba4 [yarrow]
     3     0     0     0  DL      crypto_r 0xffffffff812a4ae0 [crypto 
returns]
     2     0     0     0  DL      crypto_w 0xffffffff812a4aa0 [crypto]
    13     0     0     0  DL (threaded)                  [geom]
100023                   D       - 0xffffffff8123d030 [g_down]
100022                   D       - 0xffffffff8123d028 [g_up]
100021                   D       - 0xffffffff8123d018 [g_event]
    12     0     0     0  RL (threaded)                  [intr]
100065 I                                   [swi0: uart]
100063 I                                   [irq293: mpt5]
100061 I                                   [irq292: mpt4]
100059 I                                   [irq291: mpt3]
100055 I                                   [irq274: mpt2]
100053 I                                   [irq273: mpt1]
100051 I                                   [irq272: mpt0]
100044 I                                   [irq22: ehci0]
100039 I                                   [irq21: ohci0]
100034 I                                   [swi2: cambio]
100031 I                                   [swi6: task queue]
100030 I                                   [swi6: Giant taskq]
100028 I                                   [swi5: +]
100020 I                                   [swi1: netisr 0]
100019 I                                   [swi4: clock]
100018 I                                   [swi4: clock]
100017 I                                   [swi4: clock]
100016 I                                   [swi4: clock]
100015 I                                   [swi4: clock]
100014 I                                   [swi4: clock]
100013 I                                   [swi4: clock]
100012 RunQ                                [swi4: clock]
100011 I                                   [swi3: vm]
    11     0     0     0  RL (threaded)                  [idle]
100010                   Run     CPU 7                       [idle: cpu7]
100009                   Run     CPU 6                       [idle: cpu6]
100008                   Run     CPU 5                       [idle: cpu5]
100007                   Run     CPU 4                       [idle: cpu4]
100006                   Run     CPU 3                       [idle: cpu3]
100005                   Run     CPU 2                       [idle: cpu2]
100004                   Run     CPU 1                       [idle: cpu1]
100003 CanRun                              [idle: cpu0]
     1     0     1     0  SLs     wait 0xfffffe000c068940 [init]
    10     0     0     0  DL      audit_wo 0xffffffff812a50d0 [audit]
     0     0     0     0  DLs (threaded)                  [kernel]
101463                   D       - 0xfffffe000fddab00 [zil_clean]
101462                   D       - 0xfffffe000fd6a800 [zil_clean]
101461                   D       - 0xfffffe000fdf6180 [zil_clean]
101460                   D       - 0xfffffe001d546600 [zil_clean]
101457                   D       - 0xfffffe000f359e00 [zfs_vn_rele_taskq]
101456                   D       - 0xfffffe001198d080 [zio_ioctl_intr]
101455                   D       - 0xfffffe001cb4fa80 [zio_ioctl_issue]
101454                   D       - 0xfffffe000ffbf380 [zio_claim_intr]
101453                   D       - 0xfffffe00110cf580 [zio_claim_issue]
101452                   D       - 0xfffffe00110cf880 [zio_free_intr]
101451                   D       - 0xfffffe000ffc1b80 [zio_free_issue_99]
101450                   D       - 0xfffffe000ffc1b80 [zio_free_issue_98]
101449                   D       - 0xfffffe000ffc1b80 [zio_free_issue_97]
101448                   D       - 0xfffffe000ffc1b80 [zio_free_issue_96]
101447                   D       - 0xfffffe000ffc1b80 [zio_free_issue_95]
101446                   D       - 0xfffffe000ffc1b80 [zio_free_issue_94]
101445                   D       - 0xfffffe000ffc1b80 [zio_free_issue_93]
101444                   D       - 0xfffffe000ffc1b80 [zio_free_issue_92]
101443                   D       - 0xfffffe000ffc1b80 [zio_free_issue_91]
101442                   D       - 0xfffffe000ffc1b80 [zio_free_issue_90]
101441                   D       - 0xfffffe000ffc1b80 [zio_free_issue_89]
101440                   D       - 0xfffffe000ffc1b80 [zio_free_issue_88]
101439                   D       - 0xfffffe000ffc1b80 [zio_free_issue_87]
101438                   D       - 0xfffffe000ffc1b80 [zio_free_issue_86]
101437                   D       - 0xfffffe000ffc1b80 [zio_free_issue_85]
101436                   D       - 0xfffffe000ffc1b80 [zio_free_issue_84]
101435                   D       - 0xfffffe000ffc1b80 [zio_free_issue_83]
101434                   D       - 0xfffffe000ffc1b80 [zio_free_issue_82]
101433                   D       - 0xfffffe000ffc1b80 [zio_free_issue_81]
101432                   D       - 0xfffffe000ffc1b80 [zio_free_issue_80]
101431                   D       - 0xfffffe000ffc1b80 [zio_free_issue_79]
101430                   D       - 0xfffffe000ffc1b80 [zio_free_issue_78]
101429                   D       - 0xfffffe000ffc1b80 [zio_free_issue_77]
101428                   D       - 0xfffffe000ffc1b80 [zio_free_issue_76]
101427                   D       - 0xfffffe000ffc1b80 [zio_free_issue_75]
101426                   D       - 0xfffffe000ffc1b80 [zio_free_issue_74]
101425                   D       - 0xfffffe000ffc1b80 [zio_free_issue_73]
101424                   D       - 0xfffffe000ffc1b80 [zio_free_issue_72]
101423                   D       - 0xfffffe000ffc1b80 [zio_free_issue_71]
101422                   D       - 0xfffffe000ffc1b80 [zio_free_issue_70]
101421                   D       - 0xfffffe000ffc1b80 [zio_free_issue_69]
101420                   D       - 0xfffffe000ffc1b80 [zio_free_issue_68]
101419                   D       - 0xfffffe000ffc1b80 [zio_free_issue_67]
101418                   D       - 0xfffffe000ffc1b80 [zio_free_issue_66]
101417                   D       - 0xfffffe000ffc1b80 [zio_free_issue_65]
101416                   D       - 0xfffffe000ffc1b80 [zio_free_issue_64]
101415                   D       - 0xfffffe000ffc1b80 [zio_free_issue_63]
101414                   D       - 0xfffffe000ffc1b80 [zio_free_issue_62]
101413                   D       - 0xfffffe000ffc1b80 [zio_free_issue_61]
101412                   D       - 0xfffffe000ffc1b80 [zio_free_issue_60]
101411                   D       - 0xfffffe000ffc1b80 [zio_free_issue_59]
101410                   D       - 0xfffffe000ffc1b80 [zio_free_issue_58]
101409                   D       - 0xfffffe000ffc1b80 [zio_free_issue_57]
101408                   D       - 0xfffffe000ffc1b80 [zio_free_issue_56]
101407                   D       - 0xfffffe000ffc1b80 [zio_free_issue_55]
101406                   D       - 0xfffffe000ffc1b80 [zio_free_issue_54]
101405                   D       - 0xfffffe000ffc1b80 [zio_free_issue_53]
101404                   D       - 0xfffffe000ffc1b80 [zio_free_issue_52]
101403                   D       - 0xfffffe000ffc1b80 [zio_free_issue_51]
101402                   D       - 0xfffffe000ffc1b80 [zio_free_issue_50]
101401                   D       - 0xfffffe000ffc1b80 [zio_free_issue_49]
101400                   D       - 0xfffffe000ffc1b80 [zio_free_issue_48]
101399                   D       - 0xfffffe000ffc1b80 [zio_free_issue_47]
101398                   D       - 0xfffffe000ffc1b80 [zio_free_issue_46]
101397                   D       - 0xfffffe000ffc1b80 [zio_free_issue_45]
101396                   D       - 0xfffffe000ffc1b80 [zio_free_issue_44]
101395                   D       - 0xfffffe000ffc1b80 [zio_free_issue_43]
101394                   D       - 0xfffffe000ffc1b80 [zio_free_issue_42]
101393                   D       - 0xfffffe000ffc1b80 [zio_free_issue_41]
101392                   D       - 0xfffffe000ffc1b80 [zio_free_issue_40]
101391                   D       - 0xfffffe000ffc1b80 [zio_free_issue_39]
101390                   D       - 0xfffffe000ffc1b80 [zio_free_issue_38]
101389                   D       - 0xfffffe000ffc1b80 [zio_free_issue_37]
101388                   D       - 0xfffffe000ffc1b80 [zio_free_issue_36]
101387                   D       - 0xfffffe000ffc1b80 [zio_free_issue_35]
101386                   D       - 0xfffffe000ffc1b80 [zio_free_issue_34]
101385                   D       - 0xfffffe000ffc1b80 [zio_free_issue_33]
101384                   D       - 0xfffffe000ffc1b80 [zio_free_issue_32]
101383                   D       - 0xfffffe000ffc1b80 [zio_free_issue_31]
100569                   D       - 0xfffffe000ffc1b80 [zio_free_issue_30]
100567                   D       - 0xfffffe000ffc1b80 [zio_free_issue_29]
100565                   D       - 0xfffffe000ffc1b80 [zio_free_issue_28]
100560                   D       - 0xfffffe000ffc1b80 [zio_free_issue_27]
100554                   D       - 0xfffffe000ffc1b80 [zio_free_issue_26]
100553                   D       - 0xfffffe000ffc1b80 [zio_free_issue_25]
100547                   D       - 0xfffffe000ffc1b80 [zio_free_issue_24]
100545                   D       - 0xfffffe000ffc1b80 [zio_free_issue_23]
100542                   D       - 0xfffffe000ffc1b80 [zio_free_issue_22]
100539                   D       - 0xfffffe000ffc1b80 [zio_free_issue_21]
100536                   D       - 0xfffffe000ffc1b80 [zio_free_issue_20]
100530                   D       - 0xfffffe000ffc1b80 [zio_free_issue_19]
100487                   D       - 0xfffffe000ffc1b80 [zio_free_issue_18]
100415                   D       - 0xfffffe000ffc1b80 [zio_free_issue_17]
100413                   D       - 0xfffffe000ffc1b80 [zio_free_issue_16]
100407                   D       - 0xfffffe000ffc1b80 [zio_free_issue_15]
100403                   D       - 0xfffffe000ffc1b80 [zio_free_issue_14]
100400                   D       - 0xfffffe000ffc1b80 [zio_free_issue_13]
100393                   D       - 0xfffffe000ffc1b80 [zio_free_issue_12]
100391                   D       - 0xfffffe000ffc1b80 [zio_free_issue_11]
100387                   D       - 0xfffffe000ffc1b80 [zio_free_issue_10]
100386                   D       - 0xfffffe000ffc1b80 [zio_free_issue_9]
100385                   D       - 0xfffffe000ffc1b80 [zio_free_issue_8]
100384                   D       - 0xfffffe000ffc1b80 [zio_free_issue_7]
100383                   D       - 0xfffffe000ffc1b80 [zio_free_issue_6]
100379                   D       - 0xfffffe000ffc1b80 [zio_free_issue_5]
100372                   D       - 0xfffffe000ffc1b80 [zio_free_issue_4]
100367                   D       - 0xfffffe000ffc1b80 [zio_free_issue_3]
100366                   D       - 0xfffffe000ffc1b80 [zio_free_issue_2]
100361                   D       - 0xfffffe000ffc1b80 [zio_free_issue_1]
100360                   D       - 0xfffffe000ffc1b80 [zio_free_issue_0]
100359                   D       - 0xfffffe001ca67280 [zio_write_intr_high]
100358                   D       - 0xfffffe001ca67280 [zio_write_intr_high]
100357                   D       - 0xfffffe001ca67280 [zio_write_intr_high]
100354                   D       - 0xfffffe001ca67280 [zio_write_intr_high]
100353                   D       - 0xfffffe001ca67280 [zio_write_intr_high]
100349                   D       - 0xfffffe000fd72700 [zio_write_intr_7]
100348                   D       - 0xfffffe000fd72700 [zio_write_intr_6]
100345                   D       - 0xfffffe000fd72700 [zio_write_intr_5]
100343                   D       - 0xfffffe000fd72700 [zio_write_intr_4]
100342                   D       - 0xfffffe000fd72700 [zio_write_intr_3]
100341                   D       - 0xfffffe000fd72700 [zio_write_intr_2]
100340                   D       - 0xfffffe000fd72700 [zio_write_intr_1]
100339                   D       - 0xfffffe000fd72700 [zio_write_intr_0]
100337                   D       - 0xfffffe001196ce00 [zio_write_issue_hig]
100336                   D       - 0xfffffe001196ce00 [zio_write_issue_hig]
100334                   D       - 0xfffffe001196ce00 [zio_write_issue_hig]
100330                   D       - 0xfffffe001196ce00 [zio_write_issue_hig]
100327                   D       - 0xfffffe001196ce00 [zio_write_issue_hig]
100324                   D       - 0xfffffe00110cfb00 [zio_write_issue_7]
100322                   D       - 0xfffffe00110cfb00 [zio_write_issue_6]
100321                   D       - 0xfffffe00110cfb00 [zio_write_issue_5]
100316                   D       - 0xfffffe00110cfb00 [zio_write_issue_4]
100314                   D       - 0xfffffe00110cfb00 [zio_write_issue_3]
100312                   D       - 0xfffffe00110cfb00 [zio_write_issue_2]
100311                   D       - 0xfffffe00110cfb00 [zio_write_issue_1]
100307                   D       - 0xfffffe00110cfb00 [zio_write_issue_0]
100306                   D       - 0xfffffe000ffbfc80 [zio_read_intr_7]
100305                   D       - 0xfffffe000ffbfc80 [zio_read_intr_6]
100303                   D       - 0xfffffe000ffbfc80 [zio_read_intr_5]
100300                   D       - 0xfffffe000ffbfc80 [zio_read_intr_4]
100298                   D       - 0xfffffe000ffbfc80 [zio_read_intr_3]
100297                   D       - 0xfffffe000ffbfc80 [zio_read_intr_2]
100293                   D       - 0xfffffe000ffbfc80 [zio_read_intr_1]
100292                   D       - 0xfffffe000ffbfc80 [zio_read_intr_0]
100291                   D       - 0xfffffe00110cf000 [zio_read_issue_7]
100289                   D       - 0xfffffe00110cf000 [zio_read_issue_6]
100288                   D       - 0xfffffe00110cf000 [zio_read_issue_5]
100286                   D       - 0xfffffe00110cf000 [zio_read_issue_4]
100282                   D       - 0xfffffe00110cf000 [zio_read_issue_3]
100281                   D       - 0xfffffe00110cf000 [zio_read_issue_2]
100280                   D       - 0xfffffe00110cf000 [zio_read_issue_1]
100278                   D       - 0xfffffe00110cf000 [zio_read_issue_0]
100275                   D       - 0xfffffe001113b500 [zio_null_intr]
100273                   D       - 0xfffffe001196c800 [zio_null_issue]
100120                   D       - 0xfffffe0011370300 [system_taskq_7]
100119                   D       - 0xfffffe0011370300 [system_taskq_6]
100118                   D       - 0xfffffe0011370300 [system_taskq_5]
100117                   D       - 0xfffffe0011370300 [system_taskq_4]
100116                   D       - 0xfffffe0011370300 [system_taskq_3]
100115                   D       - 0xfffffe0011370300 [system_taskq_2]
100114                   D       - 0xfffffe0011370300 [system_taskq_1]
100113                   D       - 0xfffffe0011370300 [system_taskq_0]
100066                   D       - 0xfffffe000f239a80 [mca taskq]
100058                   D       - 0xfffffe000c69b900 [nfe3 taskq]
100057                   D       - 0xfffffe000c698480 [nfe2 taskq]
100050                   D       - 0xfffffe000c620400 [nfe1 taskq]
100049                   D       - 0xfffffe000c61b500 [nfe0 taskq]
100037                   D       - 0xfffffe000c24bb00 [acpi_task_2]
100036                   D       - 0xfffffe000c24bb00 [acpi_task_1]
100035                   D       - 0xfffffe000c24bb00 [acpi_task_0]
100033                   D       - 0xfffffe000c24be00 [kqueue taskq]
100032                   D       - 0xfffffe000c24c000 [ffs_trim taskq]
100029                   D       - 0xfffffe000c20c780 [thread taskq]
100024                   D       - 0xfffffe000c07fb80 [firmware taskq]
100000                   D       sched 0xffffffff8123d280 [swapper]
   895   892   873     0 Z                                   perl

Setting this:
# sysctl dev.mpt.0.debug=255
and doing a dd again from a disk on that controller prints this onto the 
console:
SCSI IO Request @ 0xffffff80003046f0
     Chain Offset  0x00
     MsgFlags      0x00
     MsgContext    0x000201c5
     Bus:                0
     TargetID            0
     SenseBufferLength   32
     LUN:              0x0
     Control           0x02000200  READ  ORDEREDQ
     DataLength    0x00000200
     SenseBufAddr    0x0c678be0
     CDB[0:6]    08 00 00 00 01 00
     SE64 0xffffff87ffd33a30: Addr=0x000000070cc08400 FlagsLength=0xd3000200
       64_BIT_ADDRESSING LAST_ELEMENT END_OF_BUFFER END_OF_LIST
mpt0: Send Request 453 (c678a00):
mpt0:   00000000 00002006 000201c5 00000000 00000000 02000200 00000008 
00000001
mpt0:   00000000 00000000 00000200 0c678be0 d3000200 0cc08400 00000007 
ffffffff
mpt0:   ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff 
ffffffff
mpt0:   ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff 
ffffffff
mpt0: enter mpt_intr
mpt0: Context Reply: 0x000201c5
mpt0: exit mpt_intr

And dd freezes.

Alltrace from a couple of stuck processes:
Tracing command dd pid 36971 tid 101570 td 0xfffffe001efce000
sched_switch() at sched_switch+0x115
mi_switch() at mi_switch+0x186
sleepq_wait() at sleepq_wait+0x42
_sleep() at _sleep+0x379
bwait() at bwait+0x64
physio() at physio+0x1c8
devfs_read_f() at devfs_read_f+0x90
dofileread() at dofileread+0xa1
kern_readv() at kern_readv+0x6c
sys_read() at sys_read+0x64
amd64_syscall() at amd64_syscall+0x540
Xfast_syscall() at Xfast_syscall+0xf7
--- syscall (3, FreeBSD ELF64, sys_read), rip = 0x800916c8c, rsp = 
0x7fffffffd658, rbp = 0x7fffffffd6b0 ---

Tracing command zfs pid 36686 tid 101593 td 0xfffffe001ecb3900
sched_switch() at sched_switch+0x115
mi_switch() at mi_switch+0x186
sleepq_wait() at sleepq_wait+0x42
_cv_wait() at _cv_wait+0x112
zio_wait() at zio_wait+0x61
dbuf_read() at dbuf_read+0x5e5
dmu_buf_hold() at dmu_buf_hold+0xe0
zap_lockdir() at zap_lockdir+0x58
zap_cursor_retrieve() at zap_cursor_retrieve+0x19b
dmu_snapshot_list_next() at dmu_snapshot_list_next+0xaf
zfs_ioc_snapshot_list_next() at zfs_ioc_snapshot_list_next+0x101
zfsdev_ioctl() at zfsdev_ioctl+0xe6
devfs_ioctl_f() at devfs_ioctl_f+0x7b
kern_ioctl() at kern_ioctl+0x106
sys_ioctl() at sys_ioctl+0xfd
amd64_syscall() at amd64_syscall+0x540
Xfast_syscall() at Xfast_syscall+0xf7
--- syscall (54, FreeBSD ELF64, sys_ioctl), rip = 0x801be2c2c, rsp = 
0x7fffffff8938, rbp = 0x4000 ---




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?50856322.9070307>