Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 10 Dec 2003 18:05:43 +1030
From:      Jon.Soong@imvs.sa.gov.au
To:        aic7xxx@freebsd.org
Subject:   Please help me redhat 9 aic7xxx
Message-ID:  <1071041742.3fd6cccef4207@ablett.imvs.sa.gov.au>

next in thread | raw e-mail | index | archive | help
Hi there

I hope someone can help me, i'm having trouble with aic7xxx connected to a
Promise UltraTrak RM4000 external RAID 5 array on my redhat 9 box.

I have tried both the stock redhat9.0 drivers as well as the 6.3.3 drivers found
at http://people.freebsd.org/~gibbs/linux/RPM/.

In both cases the module is loaded fine during boot up (i can see the module in
lsmod, i can access the scsi disk using 'fdisk /dev/sda')

I then use fdisk to create a partition on /dev/sda - this is fine too.

When i try to format the partition (mkfs.ext3 /dev/sda1), the machine hangs on
'Writing inodes'.

After a while i get a whole bunch of SCSI errors. They are like: 
 <<<<<<<<<<<Dump Card State Ends >>>>>>>>>
 (scsi0:A:0:0): SCB 0xf - timed out
   sg[0] - Addr 0x57ea000 : Length 4096
  Recovery SCB completes
  scsi0: Issued Channel A Bus Reset. 32 SCBs aborted

-- SEE BELOW FOR BETTER PRINTOUT

Things i have tried:
1. RedHat 9 default aic7xxx drivers and 6.3.3 drivers (installed from RPM)
2. Updating FirmWare on Promise UltraTrak RM4000
3. Fedora (the 6.3.3 RPMs did not work)
4. Resetting the Adaptec to defaults (going thru the bios 'ctrl-a')
5. Resetting the machines bios defaults
6. 2 or 3 different SCSI cables 
7. 2 or 3 different SCSI terminators 

I would be most most grateful if anyone could shed some light on this
situation.I have been at this for a couple of days now and its really starting
to hurt! :)

Kind Regards

Jon

OUTPUT OF DMESG:

 SCSI subsystem driver Revision: 1.00
 PCI: Found IRQ 10 for device 05:09.0
 PCI: Sharing IRQ 10 with 00:1f.1
 scsi0 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 6.3.3
        <Adaptec 29160 Ultra160 SCSI adapter>
        aic7892: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs

BELOW IS WHAT COMES UP ON THE SCREEN ONCE I'VE STARTED mkfs.ext3 /dev/sda1 and
it has paused during 'Writing inodes' (i've shorted some of it where the same
message was being printed out):

Dec 10 17:48:58 ablettjnr kernel: scsi0: Recovery Initiated
Dec 10 17:48:58 ablettjnr kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins
<<<<<<<<<<<<<<<<<
Dec 10 17:48:58 ablettjnr kernel: scsi0: Dumping Card State while idle, at
SEQADDR 0x9
Dec 10 17:48:58 ablettjnr kernel: Card was paused
Dec 10 17:48:58 ablettjnr kernel: ACCUM = 0x4, SINDEX = 0x64, DINDEX = 0x65,
ARG_2 = 0x4
Dec 10 17:48:58 ablettjnr kernel: HCNT = 0x0 SCBPTR = 0x19
Dec 10 17:48:58 ablettjnr kernel: SCSIPHASE[0x0] SCSISIGI[0x0] ERROR[0x0]
SCSIBUSL[0x0]
Dec 10 17:48:58 ablettjnr kernel: LASTPHASE[0x1]:(P_BUSFREE)
SCSISEQ[0x12]:(ENAUTOATNP|ENRSELI)
Dec 10 17:48:58 ablettjnr kernel: SBLKCTL[0x6]:(SELWIDE|ENAB20) SCSIRATE[0x0]
SEQCTL[0x10]:(FASTMODE)
Dec 10 17:48:58 ablettjnr kernel: SEQ_FLAGS[0xc0]:(NO_CDB_SENT|NOT_IDENTIFIED)
SSTAT0[0x0]
Dec 10 17:48:58 ablettjnr kernel: SSTAT1[0x8]:(BUSFREE) SSTAT2[0x0] SSTAT3[0x0]
SIMODE0[0x8]:(ENSWRAP)
Dec 10 17:48:58 ablettjnr kernel: SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO)
SXFRCTL0[0x80]:(DFON)
Dec 10 17:48:58 ablettjnr kernel: DFCNTRL[0x0]
DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
Dec 10 17:48:58 ablettjnr kernel: STACK: 0x0 0x164 0x179 0x3
Dec 10 17:48:58 ablettjnr kernel: SCB count = 35
Dec 10 17:48:58 ablettjnr kernel: Kernel NEXTQSCB = 16
Dec 10 17:48:58 ablettjnr kernel: Card NEXTQSCB = 16
Dec 10 17:48:58 ablettjnr kernel: QINFIFO entries:
Dec 10 17:48:58 ablettjnr kernel: Waiting Queue entries:
Dec 10 17:48:58 ablettjnr kernel: Disconnected Queue entries: 25:17 24:18 23:19
22:10 21:11 20:12 19:13 18:14 17:5 16:6 15:7 14:8 13:9 12:0 11:4 9:2 10:1 8:3
7:32 6:33 5:34 4:25 3:26 1:27 2:28 0:29 31:20 30:21 29:22 28:23 27:24 26:15
Dec 10 17:48:58 ablettjnr kernel: QOUTFIFO entries:
Dec 10 17:48:58 ablettjnr kernel: Sequencer Free SCB List:
Dec 10 17:48:58 ablettjnr kernel: Sequencer SCB Info:
Dec 10 17:48:58 ablettjnr kernel:   0
SCB_CONTROL[0x64]:(DISCONNECTED|TAG_ENB|DISCENB) SCB_SCSIID[0x7]
Dec 10 17:48:58 ablettjnr kernel: SCB_LUN[0x0] SCB_TAG[0x1d]
Dec 10 17:48:58 ablettjnr kernel:  15 SCB_CONTROL[0x60]:(TAG_ENB|DISCENB)
SCB_SCSIID[0x7] SCB_LUN[0x0]
                                                                                
                                           
**********NOTE:: the above lines repeated 100's times...
                                                                                
                                           
Dec 10 17:48:58 ablettjnr kernel: Kernel Free SCB list: 31 30
Dec 10 17:48:58 ablettjnr kernel: scsi0: Host Status: Failed(0)
Dec 10 17:48:58 ablettjnr kernel: DevQ(0:0:0): 0 waiting
Dec 10 17:48:58 ablettjnr kernel: DevQ(0:1:0): 0 waiting
Dec 10 17:48:58 ablettjnr kernel:
Dec 10 17:48:58 ablettjnr kernel: <<<<<<<<<<<<<<<<< Dump Card State Ends
>>>>>>>>>>>>>>>>>>
Dec 10 17:48:58 ablettjnr kernel: (scsi0:A:0:0): SCB 0xf - timed out
Dec 10 17:48:58 ablettjnr kernel: sg[0] - Addr 0x57ea000 : Length 4096
                                                                                
                                           
**********NOTE:: the above lines repeated 100's times... only the number and
Addr change
                                                                                
                                           
Dec 10 17:48:58 ablettjnr kernel: sg[101] - Addr 0x5778000 : Length 4096
Dec 10 17:48:58 ablettjnr kernel: (scsi0:A:0:0): Queuing a BDR SCB
Dec 10 17:48:58 ablettjnr kernel: (scsi0:A:0:0): Bus Device Reset Message Sent
Dec 10 17:48:58 ablettjnr kernel: Recovery SCB completes
Dec 10 17:48:58 ablettjnr kernel: scsi0: Bus Device Reset on A:0. 32 SCBs
aborted
Dec 10 17:49:58 ablettjnr kernel: 503b000 : Length 4096
Dec 10 17:49:58 ablettjnr kernel: sg[12] - Addr 0x503a000 : Length 4096
                                                                                
                                           
**********NOTE:: the above lines repeated 100's times... only the number and
Addr change
                                                                                
                                           
Dec 10 17:49:58 ablettjnr kernel: sg[98] - Addr 0x4fd8000 : Length 4096
Dec 10 17:49:58 ablettjnr kernel: (scsi0:A:0:0): Other SCB Timeout
Dec 10 17:49:58 ablettjnr kernel: (scsi0:A:0:0): SCB 0x1d - timed out




-------------------------------------------------
This mail sent through IMP: http://horde.org/imp/



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?1071041742.3fd6cccef4207>