Date: Tue, 22 Feb 2005 19:17:31 +0800
From: Rong-En Fan <rafan@csie.org>
To: scsi@freebsd.org
Subject: strange SCSI problem on 5.3, 4.10 ok
Message-ID: <20050222111731.GA10825@svm.csie.ntu.edu.tw>
[please CC me, thanks]

Hi,

Recently I upgraded a 4.10 box to 5.3 and encountered a strange SCSI problem that causes a panic. My configuration is:

  Adaptec 29160, ahc(4)
  SliverStar GT1008 Hardware (manufactured by Infortrend), 160MB/s, da1
  Infortrend IFT7200 RAID, 160MB/s, da2, da3

dmesg: http://rafan.infor.org/tmp/scsi/dmesg.boot

While I was rsyncing all of da2's data to da1 (about 20MB/s), after a few minutes I got:

(da1:ahc1:0:0:0): Unexpected busfree in Command phase
SEQADDR == 0x16c
(da1:ahc1:0:0:0): lost device
(da1:ahc1:0:0:0): Invalidating pack
(da1:ahc1:0:0:0): Invalidating pack
(da1:ahc1:0:0:0): Invalidating pack
(da1:ahc1:0:0:0): Invalidating pack
(da1:ahc1:0:0:0): Invalidating pack
(da1:ahc1:0:0:0): WRITE(10). CDB: 2a 0 0 33 6d 3f 0 0 80 0
(da1:ahc1:0:0:0): CAM Status: SCSI Status Error
(da1:ahc1:0:0:0): SCSI Status: Check Condition
(da1:ahc1:0:0:0): UNIT ATTENTION asc:29,0
(da1:ahc1:0:0:0): Power on, reset, or bus device reset occurred
(da1:ahc1:0:0:0): Retrying Command (per Sense Data)
panic: softdep_move_dependencies: need merge code
cpuid = 1
boot() called on cpu#1
Uptime: 23h1m3s
(da1:ahc1:0:0:0): SYNCHRONIZE CACHE. CDB: 35 0 0 0 0 0 0 0 0 0
(da1:ahc1:0:0:0): Sense Error Code 0x80 at block no. -1051665751 (decimal)
Cannot dump. No dump device defined.
Shutting down ACPI

Then I tried again and again:

login: (da1:ahc1:0:0:0): Unexpected busfree in Command phase
SEQADDR == 0x16c
(da1:ahc1:0:0:0): lost device
(da1:ahc1:0:0:0): Invalidating pack
(da1:ahc1:0:0:0): Invalidating pack
(da1:ahc1:0:0:0): Invalidating pack
(da1:ahc1:0:0:0): Invalidating pack
(da1:ahc1:0:0:0): WRITE(10). CDB: 2a 0 d 12 a8 ff 0 0 80 0
(da1:ahc1:0:0:0): CAM Status: SCSI Status Error
(da1:ahc1:0:0:0): SCSI Status: Check Condition
(da1:ahc1:0:0:0): UNIT ATTENTION asc:29,0
(da1:ahc1:0:0:0): Power on, reset, or bus device reset occurred
(da1:ahc1:0:0:0): Retries Exhausted
(da1:ahc1:0:0:0): Invalidating pack
panic: softdep_move_dependencies: need merge code
cpuid = 0
boot() called on cpu#0
Uptime: 8h56m7s
(da1:ahc1:0:0:0): SYNCHRONIZE CACHE. CDB: 35 0 0 0 0 0 0 0 0 0
(da1:ahc1:0:0:0): Sense Error Code 0x50

Another one:

(da1:ahc1:0:0:0): Unexpected busfree in Command phase
SEQADDR == 0x16c
(da1:ahc1:0:0:0): lost device
(da1:ahc1:0:0:0): Invalidating pack
initiate_write_filepage: already started
...
(da1:ahc1:0:0:0): Invalidating pack
initiate_write_filepage: already started
...
panic: initiate_write_inodeblock_ufs2: already started
cpuid = 0
boot() called on cpu#0
Uptime: 22m28s

In the first two cases the tag depth was 32 (the same as the RAID); in the last one it was 8.

Before upgrading to 5.3, this box worked pretty well under this kind of I/O. I saw some old posts from Xin Li, from last Oct or Nov, about problems on 5.3 that were OK on RELENG_4. I'm upgrading to RELENG_5 and will try current later to see whether they are OK or not.

Since the hardware (even the firmware) is identical to before, I suspect that there is some fix in RELENG_4 that did not make it into current or RELENG_5. Or is this a new problem in RELENG_5?

Regards,
Rong-En Fan
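For anyone comparing the failing writes, the WRITE(10) CDBs in the logs above can be decoded by hand. The following is only a minimal sketch: the helper name `decode_rw10` is made up for illustration, and it assumes the standard 10-byte READ(10)/WRITE(10) layout (opcode in byte 0, big-endian LBA in bytes 2-5, big-endian transfer length in bytes 7-8). The 512-byte block size used for the size estimate is also an assumption, since the message does not state it.

```python
def decode_rw10(cdb_text):
    """Decode a 10-byte READ(10)/WRITE(10) CDB printed as hex bytes.

    Hypothetical helper, not part of any FreeBSD tool.
    Returns (opcode, starting LBA, transfer length in blocks).
    """
    cdb = bytes(int(tok, 16) for tok in cdb_text.split())
    if len(cdb) != 10:
        raise ValueError("expected a 10-byte CDB, got %d bytes" % len(cdb))
    opcode = cdb[0]                           # 0x2a is WRITE(10)
    lba = int.from_bytes(cdb[2:6], "big")     # bytes 2-5: logical block address
    blocks = int.from_bytes(cdb[7:9], "big")  # bytes 7-8: transfer length
    return opcode, lba, blocks


ASSUMED_BLOCK_SIZE = 512  # assumption: not stated anywhere in the message

for text in ("2a 0 0 33 6d 3f 0 0 80 0", "2a 0 d 12 a8 ff 0 0 80 0"):
    opcode, lba, blocks = decode_rw10(text)
    print("opcode=0x%02x lba=%d blocks=%d (%d bytes if 512-byte blocks)"
          % (opcode, lba, blocks, blocks * ASSUMED_BLOCK_SIZE))
```

Both failing commands decode to 128-block writes, at LBA 3370303 and LBA 219326719 respectively, so the two panics did not hit the same block on da1.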