Date: Mon, 25 Apr 2016 11:36:11 -0400
From: Dan Langille <dan@langille.org>
To: Scott Long <scott4long@yahoo.com>
Cc: freebsd-scsi@freebsd.org
Subject: Re: terminated ioc 804b scsi 0 state c xfer 0
Message-ID: <C4C58F38-4496-4C3C-B9CB-9A21E4CF6980@langille.org>
In-Reply-To: <B27E370A-53B2-46B8-8663-F3AA4D389A64@yahoo.com>
References: <2E8752E5-76AF-4042-86D9-8C6733658A80@langille.org> <B27E370A-53B2-46B8-8663-F3AA4D389A64@yahoo.com>

> On Apr 25, 2016, at 11:32 AM, Scott Long <scott4long@yahoo.com> wrote:
>
> Hi Dan,
>
> Can you share the entire console log for the uptime? What you’ve pasted is missing the initial messages of the problem. the “terminated ioc” messages are likely because the driver has decided to reset the drive and terminate all outstanding I/O to it. In other words, they’re red herrings. The reason for the driver deciding to do the reset is likely earlier in the log.

Yes, does this help anything?

The 'core dumped' messages relate to Bacula regression testing. I don't think there is anything helpful here for you:

Apr 13 07:59:52 knew kernel: (sa0:sym0:0:1:0): 64512-byte tape record bigger than supplied buffer
Apr 13 12:06:14 knew kernel: pid 57706 (bacula-sd), uid 1001: exited on signal 11 (core dumped)
Apr 13 15:17:42 knew sshd[31059]: fatal: Read from socket failed: Connection reset by peer [preauth]
Apr 14 07:23:05 knew kernel: sonewconn: pcb 0xfffff8035dd21dc8: Listen queue overflow: 8 already in queue awaiting acceptance (1 occurrences)
Apr 16 12:54:07 knew kernel: (sa0:sym0:0:1:0): 64512-byte tape record bigger than supplied buffer
Apr 17 03:19:05 knew kernel: pid 38425 (bacula-sd), uid 1001: exited on signal 11 (core dumped)
Apr 17 06:43:26 knew kernel: (sa0:sym0:0:1:0): 64512-byte tape record bigger than supplied buffer
Apr 17 06:55:53 knew kernel: (sa0:sym0:0:1:0): 64512-byte tape record bigger than supplied buffer
Apr 17 09:21:16 knew kernel: (sa0:sym0:0:1:0): 64512-byte tape record bigger than supplied buffer
Apr 19 18:12:19 knew kernel: (sa1:mps0:0:0:0): 64512-byte tape record bigger than supplied buffer
Apr 20 14:03:05 knew su: BAD SU dan to root on /dev/pts/2
Apr 20 14:03:11 knew last message repeated 2 times
Apr 20 14:03:15 knew su: dan to root on /dev/pts/2
Apr 20 18:52:14 knew kernel: (sa1:mps0:0:0:0): 64512-byte tape record bigger than supplied buffer
Apr 21 08:10:52 knew kernel: (sa0:sym0:0:1:0): 64512-byte tape record bigger than supplied buffer
Apr 23 03:56:28 knew kernel: pid 80961 (bacula-fd), uid 1002: exited on signal 11 (core dumped)
Apr 23 09:41:33 knew kernel: pid 51735 (bacula-sd), uid 1002: exited on signal 11 (core dumped)
Apr 24 05:14:46 knew kernel: pid 4529 (bacula-dir), uid 1002: exited on signal 11 (core dumped)
Apr 24 07:22:09 knew kernel: (sa0:sym0:0:1:0): 64512-byte tape record bigger than supplied buffer
Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 8d 90 c6 18 00 00 10 00 length 8192 SMID 774 terminated ioc 804b scsi 0 state c xfer 0
Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 8b d9 97 70 00 00 20 00 length 16384 SMID 614 terminated ioc 804b scsi 0 state c xfer 0
Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 8b d9 97 50 00 00 20 00 length 16384 SMID 792 terminated ioc 804b scsi 0 state c xfer 0

It includes three lines which appear below.

>
> Thanks,
> Scott
>
>> On Apr 24, 2016, at 7:35 AM, Dan Langille <dan@langille.org> wrote:
>>
>> More of the pasted output is also at https://gist.github.com/dlangille/1fa3135334089c6603e2ec5da946d9ae and added smartctl output.
>>
>> I have a FreeBSD 10.2-RELEASE-p14 box in which there is an LSI SAS2008 card. It's running a zfs root system.
>>
>> This morning the system was unresponsive via ssh. Attempts to log in at the console did not yield a password prompt.
>>
>> A power cycle brought the system online. Inspecting /var/log/messages, I found about 63,000 entries similar to those which appear below.
>>
>> zpool status of all are OK. A scrub is in progress for one pool (since before this issue arose). da7 is in that pool.
>>
>>
>> Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 8d 90 c6 18 00 00 10 00 length 8192 SMID 774 terminated ioc 804b scsi 0 state c xfer 0
>> Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 8b d9 97 70 00 00 20 00 length 16384 SMID 614 terminated ioc 804b scsi 0 state c xfer 0
>> Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 8b d9 97 50 00 00 20 00 length 16384 SMID 792 terminated ioc 804b scsi 0 state c xfer 0
>> Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 8b d9 97 08 00 00 20 00 length 16384 SMID 974 terminated ioc 804b scsi 0 state c xfer 0
>> Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 8b 6f ef 50 00 00 08 00 length 4096 SMID 674 terminated ioc 804b scsi 0 state c xfer 0
>> Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): WRITE(10). CDB: 2a 00 8b 0f a2 48 00 00 18 00 length 12288 SMID 177 terminated ioc 804b scsi 0 state c xfer 12288
>> Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 ab 8f a1 38 00 00 08 00 length 4096 SMID 908 terminated ioc 804b scsi 0 state c xfer 0
>> Apr 24 11:25:56 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 8b d9 97 70 00 00 20 00 length 16384 SMID 376 terminated ioc 804b scsi 0 state c xfer 0
>> Apr 24 11:25:56 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 8b d9 97 50 00 00 20 00 length 16384 SMID 172 terminated ioc 804b scsi 0 state c xfer 0
>>
>> Is this a cabling issue? The drive is a SATA device (smartctl output in the URL above). Anyone familiar with these errors?
>>
>> --
>> Dan Langille - BSDCan / PGCon
>> dan@langille.org
>>
>>
>> _______________________________________________
>> freebsd-scsi@freebsd.org mailing list
>> https://lists.freebsd.org/mailman/listinfo/freebsd-scsi
>> To unsubscribe, send any mail to "freebsd-scsi-unsubscribe@freebsd.org"
>
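
A side note for anyone reading the "terminated ioc" lines: the CDB bytes in each entry can be decoded to see which blocks the aborted command was addressing. The sketch below is only an illustration, not part of the original exchange. It assumes the standard 10-byte READ/WRITE CDB layout (opcode in byte 0, big-endian LBA in bytes 2-5, transfer length in bytes 7-8) and a 512-byte logical block size, which is an assumption that happens to agree with the "length" values logged for da7. The function name decode_cdb10 is hypothetical.

```python
import re

# Matches the ten CDB bytes and the logged byte count in a line such as:
# (da7:mps1:0:17:0): READ(10). CDB: 28 00 8d 90 c6 18 00 00 10 00 length 8192 SMID 774 ...
CDB_RE = re.compile(r"CDB: ((?:[0-9a-f]{2} ){9}[0-9a-f]{2}) length (\d+)")

# Only the two opcodes that appear in this thread are named.
OPCODES = {0x28: "READ(10)", 0x2A: "WRITE(10)"}


def decode_cdb10(line, block_size=512):
    """Decode a 10-byte READ/WRITE CDB from one log line.

    block_size=512 is an assumption, not something the log states.
    Returns None when the line carries no 10-byte CDB.
    """
    m = CDB_RE.search(line)
    if m is None:
        return None
    cdb = bytes.fromhex(m.group(1).replace(" ", ""))
    blocks = int.from_bytes(cdb[7:9], "big")  # transfer length in blocks
    return {
        "op": OPCODES.get(cdb[0], hex(cdb[0])),
        "lba": int.from_bytes(cdb[2:6], "big"),  # starting logical block
        "blocks": blocks,
        "bytes": blocks * block_size,
        "logged_length": int(m.group(2)),
    }


if __name__ == "__main__":
    sample = ("Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). "
              "CDB: 28 00 8d 90 c6 18 00 00 10 00 length 8192 SMID 774 "
              "terminated ioc 804b scsi 0 state c xfer 0")
    print(decode_cdb10(sample))
```

If the decoded byte count matches the logged length, the 512-byte assumption holds for that entry. This only describes what each terminated command was doing; it does not explain why the driver reset the drive.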