Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 23 Feb 1999 14:37:34 +0000 (GMT)
From:      russell@lls.lls.com (Russell Brown)
To:        AIC7xxx@FreeBSD.ORG
Subject:   Timeout Errors when using DAT
Message-ID:  <m10FIxq-000BnJC@lls.lls.com>

next in thread | raw e-mail | index | archive | help

I'm using aix7xxx version:  5.1.11/3.2.4 with Linux 2.0.36 and seeing
intermittent write errors when writing to a DAT drive.  This is
happening on three identical boxes so I'm ruling out any particular
individual hardware problem.

The Hardware configuration is ASUS P2B-DS motherboards with SMP enabled
(and two processors installed).  This Motherboard has 68pin Wide, 68pin
Ultra and 50pin FAST SCSI connectors.

The SCSI chain has two seagate ST39102LW drives on the 68pin Wide
connector (I've also had them on the Ultra2 connector with no
difference), an Iomega ZIP and a Sony SDT-9000 on the 50pin cable.

The two disks are in a RAID 1 configuration using software RAID.

There is a terminator (Amp) on the end of the 68pin cable.  The ZIP
drive (the last device on the 50pin cable) is terminated.

The BIOS (LVD and SE) and Adaptec BIOS termination is enabled.

The system(s) work fine until one starts writing to the DAT drives
(using tar).  At some point, although not every time, tar terminates
with a write error and the kernel log shows loads of timeout entries for
the disks.

Can anyone suggest what's going wrong?  I've tried playing with the
termination on the controller to no avail. Might it be the AIC driver,
the software RAID driver or the Motherboard setup?

What can I do to track the problem down?

Here's the kernel log for a failing case (the system was freshly booted):-

Feb 23 12:50:54 newbox kernel: VFS: Mounted root (ext2 filesystem) readonly.
Feb 23 12:50:54 newbox kernel: Adding Swap: 130748k swap-space (priority 2144)
Feb 23 12:50:54 newbox last message repeated 3 times
Feb 23 12:50:54 newbox kernel: Unable to load NLS charset cp437(nls_cp437)
Feb 23 12:50:58 newbox kernel: iBCS: socksys registered on character major 30
Feb 23 12:51:35 newbox kernel: (scsi0:0:4:0) Synchronous at 10.0 Mbyte/sec, offset 15.
Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797919, scsi0, channel 0, id 1, lun 0 Write (6) 01 f8 d8 02 00 
Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797920, scsi0, channel 0, id 1, lun 0 Write (6) 01 f8 e2 02 00 
Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797921, scsi0, channel 0, id 1, lun 0 Write (6) 0c b8 d0 02 00 
Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797922, scsi0, channel 0, id 1, lun 0 Write (6) 0e 38 de 30 00 
Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797923, scsi0, channel 0, id 1, lun 0 Write (10) 00 00 36 f8 d0 00 00 02 00 
Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797924, scsi0, channel 0, id 1, lun 0 Write (10) 00 00 36 f8 d4 00 00 1c 00 
Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797925, scsi0, channel 0, id 1, lun 0 Write (10) 00 00 37 38 d0 00 00 1e 00 
Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797926, scsi0, channel 0, id 1, lun 0 Write (10) 00 00 37 78 d0 00 00 10 00 
Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797911, scsi0, channel 0, id 0, lun 0 Write (6) 01 f8 d8 02 00 
Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797912, scsi0, channel 0, id 0, lun 0 Write (6) 01 f8 e2 02 00 
Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797913, scsi0, channel 0, id 0, lun 0 Write (6) 0c b8 d0 02 00 
Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797914, scsi0, channel 0, id 0, lun 0 Write (6) 0e 38 de 30 00 

This repeats in a similar vein for a while.

FWIW, here's the AIC detection log (I've enabled VERBOSE_PROBE2 so we can
see the termination settings):-

<snip....>
Feb 23 12:50:53 newbox kernel: md driver 0.90.0 MAX_MD_DEVS=256, MAX_REAL=12
Feb 23 12:50:53 newbox kernel: linear personality registered
Feb 23 12:50:53 newbox kernel: raid1 personality registered
Feb 23 12:50:53 newbox kernel: aic7xxx: <Adaptec AIC-7890/1 Ultra2 SCSI host adapter> at PCI 6/0
Feb 23 12:50:53 newbox kernel: aic7xxx: Initial PCI_COMMAND value was 0x17
Feb 23 12:50:53 newbox kernel: aic7xxx: Initial DEVCONFIG value was 0x540
Feb 23 12:50:53 newbox kernel: aic7xxx: Loading serial EEPROM...done
Feb 23 12:50:53 newbox kernel: (scsi0) <Adaptec AIC-7890/1 Ultra2 SCSI host adapter> found at PCI 6/0
Feb 23 12:50:53 newbox kernel: (scsi0) Wide Channel, SCSI ID=7, 32/255 SCBs
Feb 23 12:50:53 newbox kernel: (scsi0) BIOS enabled, IO Port 0xd000, IRQ 14
Feb 23 12:50:53 newbox kernel: (scsi0) IO Memory at 0xe3000000, MMAP Memory at 0x10807000
Feb 23 12:50:53 newbox kernel: (scsi0) EEPROM is present.
Feb 23 12:50:53 newbox kernel: (scsi0) SE High byte termination Enabled
Feb 23 12:50:53 newbox kernel: (scsi0) SE Low byte termination Enabled
Feb 23 12:50:53 newbox kernel: (scsi0) LVD Low byte termination Enabled
Feb 23 12:50:53 newbox kernel: (scsi0) LVD High byte termination Enabled
Feb 23 12:50:53 newbox kernel: (scsi0) Downloading sequencer code... 407 instructions downloaded
Feb 23 12:50:53 newbox kernel: (scsi0) Resetting channel
Feb 23 12:50:53 newbox kernel: scsi0 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 5.1.11/3.2.4
Feb 23 12:50:53 newbox kernel:        <Adaptec AIC-7890/1 Ultra2 SCSI host adapter>
Feb 23 12:50:54 newbox kernel: scsi : 1 host.
Feb 23 12:50:54 newbox kernel:   Vendor: SEAGATE   Model: ST39102LW         Rev: 0005
Feb 23 12:50:54 newbox kernel:   Type:   Direct-Access                      ANSI SCSI revision: 02
Feb 23 12:50:54 newbox kernel: Detected scsi disk sda at scsi0, channel 0, id 0, lun 0
Feb 23 12:50:54 newbox kernel:   Vendor: SEAGATE   Model: ST39102LW         Rev: 0005
Feb 23 12:50:54 newbox kernel:   Type:   Direct-Access                      ANSI SCSI revision: 02
Feb 23 12:50:54 newbox kernel: Detected scsi disk sdb at scsi0, channel 0, id 1, lun 0
Feb 23 12:50:54 newbox kernel:   Vendor: SONY      Model: SDT-9000          Rev: 0400
Feb 23 12:50:54 newbox kernel:   Type:   Sequential-Access                  ANSI SCSI revision: 02
Feb 23 12:50:54 newbox kernel: Detected scsi tape st0 at scsi0, channel 0, id 4, lun 0
Feb 23 12:50:54 newbox kernel:   Vendor: IOMEGA    Model: ZIP 100           Rev: J.03
Feb 23 12:50:54 newbox kernel:   Type:   Direct-Access                      ANSI SCSI revision: 02
Feb 23 12:50:54 newbox kernel: Detected scsi removable disk sdc at scsi0, channel 0, id 5, lun 0
Feb 23 12:50:54 newbox kernel: scsi : detected 1 SCSI tape 3 SCSI disks total.
Feb 23 12:50:54 newbox kernel: (scsi0:0:0:0) Synchronous at 40.0 Mbyte/sec, offset 15.
Feb 23 12:50:54 newbox kernel: SCSI device sda: hdwr sector= 512 bytes. Sectors= 17783240 [8683 MB] [8.7 GB]
Feb 23 12:50:54 newbox kernel: (scsi0:0:1:0) Synchronous at 40.0 Mbyte/sec, offset 15.
Feb 23 12:50:54 newbox kernel: SCSI device sdb: hdwr sector= 512 bytes. Sectors= 17783240 [8683 MB] [8.7 GB]
Feb 23 12:50:54 newbox kernel: SCSI device sdc: hdwr sector= 512 bytes. Sectors= 196608 [96 MB] [0.1 GB]
Feb 23 12:50:54 newbox kernel: sdc: Write Protect is off
Feb 23 12:50:54 newbox kernel: SLIP: version 0.8.4-NET3.019-NEWTTY (dynamic channels, max=256).

-- 
 Regards,
     Russell
 --------------------------------------------------------------------------
| Russell Brown          | MAIL: russell@lls.com PHONE: 01780 471800       |
| Lady Lodge Systems     | WWW Work: http://www.kings.demon.co.uk/         |
| Peterborough, England  | WWW Play: http://www.kings.demon.co.uk/russell/ |
 --------------------------------------------------------------------------


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe aic7xxx" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?m10FIxq-000BnJC>