Date: Sat, 10 Dec 2016 17:03:56 -0700
From: Alan Somers <asomers@freebsd.org>
To: FreeBSD-scsi <freebsd-scsi@freebsd.org>, Alexander Motin <mav@freebsd.org>
Subject: Fwd: frequent timeouts with mvs(4) SATA controller, GELI, and ZFS
Message-ID: <CAOtMX2g7pjAVhRFcxOVN%2BucMVvMzH%2B5ZnVDo17eTPNAaPC86tA@mail.gmail.com>
In-Reply-To: <CAOtMX2ghs_KwQDJQ4hyqb0mebZw_hvRVBS_48DYm=DvekVP=rw@mail.gmail.com>
References: <CAOtMX2jYzMatN5WSZjBL5hi%2B_EMpa4bv9QsVxeHthMkaSR9FNw@mail.gmail.com> <CAOtMX2ghs_KwQDJQ4hyqb0mebZw_hvRVBS_48DYm=DvekVP=rw@mail.gmail.com>
I have an 11.0-RELEASE machine with a VIA Nano CPU and a Marvell
88SX7042 SATA controller. I have a GELI-encrypted triple-mirror zpool
with disks on that controller. The number of disks doesn't matter,
though; I see the same problem even when only one disk is connected.
Whenever I write to this pool, after a few GB of writes I get a timeout
on one of the mvs(4) slots, followed shortly by timeouts on every disk
on that controller. From that point until I reboot, no command sent to
any disk on that controller ever completes. CAM tries to reprobe the
disks, fails, and their ada nodes disappear. This is repeatable.
Does anybody have any idea what's going on?
Does anybody know any dirt about this SATA controller?
pciconf -lv
...
atapci0@pci0:0:15:0: class=0x01018f card=0xaa241106 chip=0x90011106 rev=0x00 hdr=0x00
    vendor     = 'VIA Technologies, Inc.'
    device     = 'VX900 Serial ATA Controller'
    class      = mass storage
    subclass   = ATA
mvs0@pci0:1:0:0: class=0x010000 card=0x11ab11ab chip=0x704211ab rev=0x02 hdr=0x00
    vendor     = 'Marvell Technology Group Ltd.'
    device     = '88SX7042 PCI-e 4-port SATA-II'
    class      = mass storage
    subclass   = SCSI
...
dmesg
...
mvsch3: Timeout on slot 7
mvsch3: iec 02000000 sstat 00000123 serr 00000000 edma_s 000000e1
dma_c 20000708 dma_s 00000008 rs 000000f2 status 40
mvsch3: ... waiting for slots 00000072
mvsch3: Timeout on slot 6
mvsch3: iec 02000000 sstat 00000123 serr 00000000 edma_s 000000e1
dma_c 20000708 dma_s 00000008 rs 000000f2 status 40
mvsch3: ... waiting for slots 00000032
mvsch3: Timeout on slot 5
mvsch3: iec 02000000 sstat 00000123 serr 00000000 edma_s 000000e1
dma_c 20000708 dma_s 00000008 rs 000000f2 status 40
mvsch3: ... waiting for slots 00000012
mvsch3: Timeout on slot 4
mvsch3: iec 02000000 sstat 00000123 serr 00000000 edma_s 000000e1
dma_c 20000708 dma_s 00000008 rs 000000f2 status 40
mvsch3: ... waiting for slots 00000002
mvsch3: Timeout on slot 1
mvsch3: iec 02000000 sstat 00000123 serr 00000000 edma_s 000000e1
dma_c 20000708 dma_s 00000008 rs 000000f2 status 40
(ada3:mvsch3:0:0:0): READ_FPDMA_QUEUED. ACB: 60 00 95 e4 11 40 4d 00 00 01 00 00
(ada3:mvsch3:0:0:0): CAM status: Command timeout
(ada3:mvsch3:0:0:0): Retrying command
(ada3:mvsch3:0:0:0): READ_FPDMA_QUEUED. ACB: 60 00 f2 5f 00 40 21 00 00 01 00 00
(ada3:mvsch3:0:0:0): CAM status: Command timeout
(ada3:mvsch3:0:0:0): Retrying command
(ada3:mvsch3:0:0:0): READ_FPDMA_QUEUED. ACB: 60 00 f2 61 00 40 21 00 00 01 00 00
(ada3:mvsch3:0:0:0): CAM status: Command timeout
(ada3:mvsch3:0:0:0): Retrying command
(ada3:mvsch3:0:0:0): READ_FPDMA_QUEUED. ACB: 60 00 f2 63 00 40 21 00 00 01 00 00
(ada3:mvsch3:0:0:0): CAM status: Command timeout
(ada3:mvsch3:0:0:0): Retrying command
(ada3:mvsch3:0:0:0): READ_FPDMA_QUEUED. ACB: 60 00 f2 67 00 40 21 00 00 01 00 00
(ada3:mvsch3:0:0:0): CAM status: Command timeout
(ada3:mvsch3:0:0:0): Retrying command
...
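
For anyone reading the log above, here is a minimal sketch for decoding
a few of the hex fields. It assumes that "rs" and "waiting for slots"
are bitmaps in which bit n stands for command slot n (an assumption,
though it matches the order of the timeouts shown), and that "sstat"
follows the standard SATA SStatus layout (DET in bits 3:0, SPD in bits
7:4, IPM in bits 11:8). The helper names are hypothetical and are not
taken from the driver.

```python
def slots_in_mask(mask):
    """Return the slot numbers whose bits are set in a 32-bit bitmap.

    Assumption: bit n of the mask corresponds to command slot n.
    """
    return [n for n in range(32) if mask & (1 << n)]


def decode_sstatus(sstat):
    """Split a SATA SStatus value into its DET, SPD and IPM fields."""
    return {
        "det": sstat & 0xF,         # device detection state
        "spd": (sstat >> 4) & 0xF,  # negotiated interface speed
        "ipm": (sstat >> 8) & 0xF,  # interface power management state
    }


if __name__ == "__main__":
    running = 0xF2  # the "rs" value from the log
    print("running slots:", slots_in_mask(running))

    # Replay the timeouts in the order they appear in the log and show
    # which slots would still be outstanding after each one.
    timed_out = 0
    for slot in (7, 6, 5, 4, 1):
        timed_out |= 1 << slot
        waiting = running & ~timed_out
        print("after slot %d: waiting for %08x %s"
              % (slot, waiting, slots_in_mask(waiting)))

    print("sstat:", decode_sstatus(0x123))
```

Under those assumptions, rs 000000f2 covers slots 1, 4, 5, 6 and 7,
which are exactly the slots that time out, and sstat 00000123 decodes to
DET=3, SPD=2, IPM=1: the link still reports a device present with
communication established, at Gen2 speed, in the active power state.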
-Alan
