Date: Tue, 23 Feb 1999 14:37:34 +0000 (GMT) From: russell@lls.lls.com (Russell Brown) To: AIC7xxx@FreeBSD.ORG Subject: Timeout Errors when using DAT Message-ID: <m10FIxq-000BnJC@lls.lls.com>
next in thread | raw e-mail | index | archive | help
I'm using aix7xxx version: 5.1.11/3.2.4 with Linux 2.0.36 and seeing intermittent write errors when writing to a DAT drive. This is happening on three identical boxes so I'm ruling out any particular individual hardware problem. The Hardware configuration is ASUS P2B-DS motherboards with SMP enabled (and two processors installed). This Motherboard has 68pin Wide, 68pin Ultra and 50pin FAST SCSI connectors. The SCSI chain has two seagate ST39102LW drives on the 68pin Wide connector (I've also had them on the Ultra2 connector with no difference), an Iomega ZIP and a Sony SDT-9000 on the 50pin cable. The two disks are in a RAID 1 configuration using software RAID. There is a terminator (Amp) on the end of the 68pin cable. The ZIP drive (the last device on the 50pin cable) is terminated. The BIOS (LVD and SE) and Adaptec BIOS termination is enabled. The system(s) work fine until one starts writing to the DAT drives (using tar). At some point, although not every time, tar terminates with a write error and the kernel log shows loads of timeout entries for the disks. Can anyone suggest what's going wrong? I've tried playing with the termination on the controller to no avail. Might it be the AIC driver, the software RAID driver or the Motherboard setup? What can I do to track the problem down? Here's the kernel log for a failing case (the system was freshly booted):- Feb 23 12:50:54 newbox kernel: VFS: Mounted root (ext2 filesystem) readonly. Feb 23 12:50:54 newbox kernel: Adding Swap: 130748k swap-space (priority 2144) Feb 23 12:50:54 newbox last message repeated 3 times Feb 23 12:50:54 newbox kernel: Unable to load NLS charset cp437(nls_cp437) Feb 23 12:50:58 newbox kernel: iBCS: socksys registered on character major 30 Feb 23 12:51:35 newbox kernel: (scsi0:0:4:0) Synchronous at 10.0 Mbyte/sec, offset 15. Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797919, scsi0, channel 0, id 1, lun 0 Write (6) 01 f8 d8 02 00 Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797920, scsi0, channel 0, id 1, lun 0 Write (6) 01 f8 e2 02 00 Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797921, scsi0, channel 0, id 1, lun 0 Write (6) 0c b8 d0 02 00 Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797922, scsi0, channel 0, id 1, lun 0 Write (6) 0e 38 de 30 00 Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797923, scsi0, channel 0, id 1, lun 0 Write (10) 00 00 36 f8 d0 00 00 02 00 Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797924, scsi0, channel 0, id 1, lun 0 Write (10) 00 00 36 f8 d4 00 00 1c 00 Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797925, scsi0, channel 0, id 1, lun 0 Write (10) 00 00 37 38 d0 00 00 1e 00 Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797926, scsi0, channel 0, id 1, lun 0 Write (10) 00 00 37 78 d0 00 00 10 00 Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797911, scsi0, channel 0, id 0, lun 0 Write (6) 01 f8 d8 02 00 Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797912, scsi0, channel 0, id 0, lun 0 Write (6) 01 f8 e2 02 00 Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797913, scsi0, channel 0, id 0, lun 0 Write (6) 0c b8 d0 02 00 Feb 23 13:55:07 newbox kernel: scsi : aborting command due to timeout : pid 797914, scsi0, channel 0, id 0, lun 0 Write (6) 0e 38 de 30 00 This repeats in a similar vein for a while. FWIW, here's the AIC detection log (I've enabled VERBOSE_PROBE2 so we can see the termination settings):- <snip....> Feb 23 12:50:53 newbox kernel: md driver 0.90.0 MAX_MD_DEVS=256, MAX_REAL=12 Feb 23 12:50:53 newbox kernel: linear personality registered Feb 23 12:50:53 newbox kernel: raid1 personality registered Feb 23 12:50:53 newbox kernel: aic7xxx: <Adaptec AIC-7890/1 Ultra2 SCSI host adapter> at PCI 6/0 Feb 23 12:50:53 newbox kernel: aic7xxx: Initial PCI_COMMAND value was 0x17 Feb 23 12:50:53 newbox kernel: aic7xxx: Initial DEVCONFIG value was 0x540 Feb 23 12:50:53 newbox kernel: aic7xxx: Loading serial EEPROM...done Feb 23 12:50:53 newbox kernel: (scsi0) <Adaptec AIC-7890/1 Ultra2 SCSI host adapter> found at PCI 6/0 Feb 23 12:50:53 newbox kernel: (scsi0) Wide Channel, SCSI ID=7, 32/255 SCBs Feb 23 12:50:53 newbox kernel: (scsi0) BIOS enabled, IO Port 0xd000, IRQ 14 Feb 23 12:50:53 newbox kernel: (scsi0) IO Memory at 0xe3000000, MMAP Memory at 0x10807000 Feb 23 12:50:53 newbox kernel: (scsi0) EEPROM is present. Feb 23 12:50:53 newbox kernel: (scsi0) SE High byte termination Enabled Feb 23 12:50:53 newbox kernel: (scsi0) SE Low byte termination Enabled Feb 23 12:50:53 newbox kernel: (scsi0) LVD Low byte termination Enabled Feb 23 12:50:53 newbox kernel: (scsi0) LVD High byte termination Enabled Feb 23 12:50:53 newbox kernel: (scsi0) Downloading sequencer code... 407 instructions downloaded Feb 23 12:50:53 newbox kernel: (scsi0) Resetting channel Feb 23 12:50:53 newbox kernel: scsi0 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 5.1.11/3.2.4 Feb 23 12:50:53 newbox kernel: <Adaptec AIC-7890/1 Ultra2 SCSI host adapter> Feb 23 12:50:54 newbox kernel: scsi : 1 host. Feb 23 12:50:54 newbox kernel: Vendor: SEAGATE Model: ST39102LW Rev: 0005 Feb 23 12:50:54 newbox kernel: Type: Direct-Access ANSI SCSI revision: 02 Feb 23 12:50:54 newbox kernel: Detected scsi disk sda at scsi0, channel 0, id 0, lun 0 Feb 23 12:50:54 newbox kernel: Vendor: SEAGATE Model: ST39102LW Rev: 0005 Feb 23 12:50:54 newbox kernel: Type: Direct-Access ANSI SCSI revision: 02 Feb 23 12:50:54 newbox kernel: Detected scsi disk sdb at scsi0, channel 0, id 1, lun 0 Feb 23 12:50:54 newbox kernel: Vendor: SONY Model: SDT-9000 Rev: 0400 Feb 23 12:50:54 newbox kernel: Type: Sequential-Access ANSI SCSI revision: 02 Feb 23 12:50:54 newbox kernel: Detected scsi tape st0 at scsi0, channel 0, id 4, lun 0 Feb 23 12:50:54 newbox kernel: Vendor: IOMEGA Model: ZIP 100 Rev: J.03 Feb 23 12:50:54 newbox kernel: Type: Direct-Access ANSI SCSI revision: 02 Feb 23 12:50:54 newbox kernel: Detected scsi removable disk sdc at scsi0, channel 0, id 5, lun 0 Feb 23 12:50:54 newbox kernel: scsi : detected 1 SCSI tape 3 SCSI disks total. Feb 23 12:50:54 newbox kernel: (scsi0:0:0:0) Synchronous at 40.0 Mbyte/sec, offset 15. Feb 23 12:50:54 newbox kernel: SCSI device sda: hdwr sector= 512 bytes. Sectors= 17783240 [8683 MB] [8.7 GB] Feb 23 12:50:54 newbox kernel: (scsi0:0:1:0) Synchronous at 40.0 Mbyte/sec, offset 15. Feb 23 12:50:54 newbox kernel: SCSI device sdb: hdwr sector= 512 bytes. Sectors= 17783240 [8683 MB] [8.7 GB] Feb 23 12:50:54 newbox kernel: SCSI device sdc: hdwr sector= 512 bytes. Sectors= 196608 [96 MB] [0.1 GB] Feb 23 12:50:54 newbox kernel: sdc: Write Protect is off Feb 23 12:50:54 newbox kernel: SLIP: version 0.8.4-NET3.019-NEWTTY (dynamic channels, max=256). -- Regards, Russell -------------------------------------------------------------------------- | Russell Brown | MAIL: russell@lls.com PHONE: 01780 471800 | | Lady Lodge Systems | WWW Work: http://www.kings.demon.co.uk/ | | Peterborough, England | WWW Play: http://www.kings.demon.co.uk/russell/ | -------------------------------------------------------------------------- To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe aic7xxx" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?m10FIxq-000BnJC>