Date: Wed, 10 Aug 2005 21:14:53 +0200 From: Volker <volker@vwsoft.com> To: freebsd-scsi@freebsd.org Subject: DDS trouble - device hanging Message-ID: <42FA522D.2050508@vwsoft.com>
next in thread | raw e-mail | index | archive | help
Hi guys, on a system with 5.4-STABLE I'm experiencing trouble with a DAT drive. I've used a Sony SDT-7000 (DDS-2) attached to an Adaptec 3985 for the last years without trouble (narrow SCSI). A year ago I've changed the controller to an Adaptec AAA-133B. When trying to get a backup some time later, the backup was hanging in the middle of the backup and I thought the SDT-7000 would be faulty. Now I've changed the tape drive to a Sony SDT-11000 (DDS-4) and the same thing happens, it stops _in the middle_ of the backup (whether using tar or bacula doesn't make a difference). The cabling has been changed when changing the tape drive. SCSI bus termination is ok, tape drive firmware and jumper settings have been double checked. When the error occours, the tape drive doesn't respond to any commands being sent (camcontrol) and will not eject media (emergency eject). In the meantime (a year ago) I've changed the server from RELENG_4 to RELENG_5 but while the backup has been disabled over months I can't say for sure if the fBSD version update or the hardware change was causing the fault. The following error messages are taken from the console after the backup has been aborted. Please note a manual `camcontrol reset 1:6:0' at the end (which did _not_ solve the frozen device). How do I debug, what can be read from the card dump state or the debug messages? Is the controller at fault? Any hints? I'm near of pulling out the controller and going single channel (that would be possible in my setup without causing additional trouble). Thanks, Volker #uname -v FreeBSD 5.4-STABLE #10: Fri May 13 16:12:28 CEST 2005 ahc2: Recovery Initiated >>>>>>>>>>>>>>>>>> Dump Card State Begins >>>>>>>>>>>>>>>>>> <<<<<<<<<<<<<<<<< ahc2: Dumping Card State while idle, at SEQADDR 0x7 Card was paused ACCUM = 0xd9, SINDEX = 0x67, DINDEX = 0x27, ARG_2 = 0x3 HCNT = 0x0 SCBPTR = 0x0 SCSISIGI[0x0] ERROR[0x0] SCSIBUSL[0x0] LASTPHASE[0x1]:(P_BUSFREE) SCSISEQ[0x12]:(ENAUTOATNP|ENRSELI) SBLKCTL[0x2]:(SELWIDE) SCSIRATE[0x0] SEQCTL[0x10]:(FASTMODE) SEQ_FLAGS[0xc0]:(NO_CDB_SENT|NOT_IDENTIFIED) SSTAT0[0x5]:(DMADONE|SDONE) SSTAT1[0xa]:(PHASECHG|BUSFREE) SSTAT2[0x0] SSTAT3[0x0] SIMODE0[0x0] SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO) SXFRCTL0[0x80]:(DFON) DFCNTRL[0x0] DFSTATUS[0x2d]:(FIFOEMP|DFTHRESH|HDONE|FIFOQWDEMP) STACK: 0xcc 0x151 0x192 0x3 SCB count = 20 Kernel NEXTQSCB = 17 Card NEXTQSCB = 17 QINFIFO entries: Waiting Queue entries: Disconnected Queue entries: 0:14 QOUTFIFO entries: Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Sequencer SCB Info: 0 SCB_CONTROL[0x4c]:(DISCONNECTED|ULTRAENB|DISCENB) SCB_SCSIID[0x67] SCB_LUN[0x0] SCB_TAG[0xe] 1 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 2 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 3 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 4 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 5 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 6 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 7 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 8 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 9 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 10 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 11 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 12 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 13 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 14 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 15 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] Pending list: 14 SCB_CONTROL[0x48]:(ULTRAENB|DISCENB) SCB_SCSIID[0x67] SCB_LUN[0x0] Kernel Free SCB list: 18 9 8 6 5 3 2 0 19 16 15 1 7 4 13 12 11 10 Untagged Q(6): 14 <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>> (sa0:ahc2:0:6:0): SCB 0xe - timed out sg[0] - Addr 0x219d028 : Length 4056 sg[1] - Addr 0x96de000 : Length 4096 sg[2] - Addr 0x15d3f000 : Length 4096 sg[3] - Addr 0x3c00000 : Length 4096 sg[4] - Addr 0xbe61000 : Length 4096 sg[5] - Addr 0xb9e2000 : Length 4096 sg[6] - Addr 0xeca3000 : Length 4096 sg[7] - Addr 0xc064000 : Length 4096 sg[8] - Addr 0x7485000 : Length 4096 sg[9] - Addr 0x41e6000 : Length 4096 sg[10] - Addr 0xdde7000 : Length 4096 sg[11] - Addr 0xb488000 : Length 4096 sg[12] - Addr 0x7689000 : Length 4096 sg[13] - Addr 0xb7ca000 : Length 4096 sg[14] - Addr 0xceb000 : Length 4096 sg[15] - Addr 0x3a4c000 : Length 3112 (sa0:ahc2:0:6:0): Queuing a BDR SCB (sa0:ahc2:0:6:0): Bus Device Reset Message Sent ahc2: Timedout SCBs already complete. Interrupts may not be functioning. (sa0:ahc2:0:6:0): no longer in timeout, status = 24b ahc2: Bus Device Reset on A:6. 1 SCBs aborted (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): Sense Error Code 0x0 (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): Sense Error Code 0x75 (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): NO SENSE ILI (length mismatch): -2048 asc:0,0 (sa0:ahc2:0:6:0): No additional sense information (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): NO SENSE ILI (length mismatch): -2048 asc:0,0 (sa0:ahc2:0:6:0): No additional sense information (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): NO SENSE ILI (length mismatch): -2048 asc:0,0 (sa0:ahc2:0:6:0): No additional sense information (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): Sense Error Code 0x0 (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): Sense Error Code 0x0 (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): Sense Error Code 0x75 (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): Sense Error Code 0x75 (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): Sense Error Code 0x75 (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): Sense Error Code 0x75 (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): Sense Error Code 0x75 (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): Sense Error Code 0x75 (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): Sense Error Code 0x75 (sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc2:0:6:0): Sense Error Code 0x75 (pass3:ahc2:0:6:0): Bus Device Reset Message Sent ahc2: Bus Device Reset on A:6. 1 SCBs aborted -- GPG/PGP fingerprint: FF93 13A1 2477 B631 E953 06DF 4C49 ADD9 E4BF 79B1
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?42FA522D.2050508>
