Date: Mon, 3 Jan 2005 10:56:47 -0700 From: hal <hal@cc.usu.edu> To: freebsd-questions@FreeBSD.ORG Subject: FreeBSD vs. SCSI tape drive Message-ID: <D6578D60-5DB0-11D9-A9D2-000A959670A0@cc.usu.edu>
next in thread | raw e-mail | index | archive | help
I have a backup server: OS freeBSD 4.7 P25 SuperMicro X5DP8-G2 mother board Symbios 875 SCSI controller with 1 Exabyte VXA-1 tape drive on channel 0 Adaptec 3960D SCSI controller with 2 Seagate ST39173LW disk drives on Channel 0 with 1 Dell Ultrium 2 tape drive on channel 1 2 3ware raid controllers with 2 mirror sets each The problem: About 50% of the time dump crashes writing to the Ultrium tape drive. See the output of dmesg and /var/log/messages below. A look at the tape drive's onboard error log shows nothing. The tape drive diagnostics show no problems. Can anyone offer a solution/insight/sympathy? If you need more info please ask. hal ############ output of dmseg ############################################### Copyright (c) 1992-2002 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 4.7-RELEASE-p25 #1: Fri Dec 10 13:55:55 MST 2004 root@jack.ss.usu.edu:/usr/src/sys/compile/JACK Timecounter "i8254" frequency 1193182 Hz CPU: Pentium 4 (2799.22-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0xf29 Stepping = 9 Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE ,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,<b28>,ACC,<b31 >> real memory = 2146959360 (2096640K bytes) config> q avail memory = 2088710144 (2039756K bytes) Programming 24 pins in IOAPIC #0 IOAPIC #0 intpin 2 -> irq 0 Programming 24 pins in IOAPIC #1 Programming 24 pins in IOAPIC #2 Programming 24 pins in IOAPIC #3 Programming 24 pins in IOAPIC #4 FreeBSD/SMP: Multiprocessor motherboard cpu0 (BSP): apic id: 0, version: 0x00050014, at 0xfee00000 cpu1 (AP): apic id: 6, version: 0x00050014, at 0xfee00000 cpu2 (AP): apic id: 1, version: 0x00050014, at 0xfee00000 cpu3 (AP): apic id: 7, version: 0x00050014, at 0xfee00000 io0 (APIC): apic id: 2, version: 0x00178020, at 0xfec00000 io1 (APIC): apic id: 3, version: 0x00178020, at 0xfec80000 io2 (APIC): apic id: 4, version: 0x00178020, at 0xfec80400 io3 (APIC): apic id: 5, version: 0x00178020, at 0xfec81000 io4 (APIC): apic id: 8, version: 0x00178020, at 0xfec81400 Preloaded elf kernel "kernel" at 0xc02f8000. Preloaded userconfig_script "/boot/kernel.conf" at 0xc02f809c. Pentium Pro MTRR support enabled Using $PIR table, 29 entries at 0xc00fddf0 npx0: <math processor> on motherboard npx0: INT 16 interface pcib0: <Host to PCI bridge> on motherboard IOAPIC #0 intpin 16 -> irq 2 IOAPIC #0 intpin 19 -> irq 10 IOAPIC #0 intpin 18 -> irq 11 pci0: <PCI bus> on pcib0 pci0: <unknown card> (vendor=0x8086, dev=0x2541) at 0.1 pcib1: <PCI to PCI bridge (vendor=8086 device=2543)> at device 2.0 on pci0 pci1: <PCI bus> on pcib1 pci1: <unknown card> (vendor=0x8086, dev=0x1461) at 28.0 pcib2: <PCI to PCI bridge (vendor=8086 device=1460)> at device 29.0 on pci1 IOAPIC #2 intpin 0 -> irq 16 IOAPIC #2 intpin 1 -> irq 17 pci2: <PCI bus> on pcib2 ahc0: <Adaptec 3960D Ultra160 SCSI adapter> port 0x3000-0x30ff mem 0xfb200000-0xfb200fff irq 16 at device 1.0 on pci2 aic7899: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs ahc1: <Adaptec 3960D Ultra160 SCSI adapter> port 0x3400-0x34ff mem 0xfb201000-0xfb201fff irq 17 at device 1.1 on pci2 aic7899: Ultra160 Wide Channel B, SCSI Id=7, 32/253 SCBs pci1: <unknown card> (vendor=0x8086, dev=0x1461) at 30.0 pcib3: <PCI to PCI bridge (vendor=8086 device=1460)> at device 31.0 on pci1 IOAPIC #1 intpin 0 -> irq 18 IOAPIC #1 intpin 1 -> irq 19 IOAPIC #1 intpin 4 -> irq 20 IOAPIC #1 intpin 5 -> irq 21 pci3: <PCI bus> on pcib3 sym0: <875> port 0x4000-0x40ff mem 0xfb340000-0xfb340fff,0xfb342000-0xfb3420ff irq 18 at device 1.0 on pci3 sym0: Symbios NVRAM, ID 7, Fast-20, SE, parity checking sym0: open drain IRQ line driver, using on-chip SRAM sym0: using LOAD/STORE-based firmware. sym1: <875> port 0x4400-0x44ff mem 0xfb341000-0xfb341fff,0xfb342400-0xfb3424ff irq 19 at device 1.1 on pci3 sym1: Symbios NVRAM, ID 7, Fast-20, SE, parity checking sym1: open drain IRQ line driver, using on-chip SRAM sym1: using LOAD/STORE-based firmware. em0: <Intel(R) PRO/1000 Network Connection, Version - 1.3.14> port 0x4800-0x483f mem 0xfb300000-0xfb31ffff irq 20 at device 2.0 on pci3 em0: Speed:100 Mbps Duplex:Full em1: <Intel(R) PRO/1000 Network Connection, Version - 1.3.14> port 0x4840-0x487f mem 0xfb320000-0xfb33ffff irq 21 at device 2.1 on pci3 em1: Speed:N/A Duplex:N/A pcib4: <PCI to PCI bridge (vendor=8086 device=2545)> at device 3.0 on pci0 pci4: <PCI bus> on pcib4 pci4: <unknown card> (vendor=0x8086, dev=0x1461) at 28.0 pcib5: <PCI to PCI bridge (vendor=8086 device=1460)> at device 29.0 on pci4 IOAPIC #4 intpin 4 -> irq 22 pci5: <PCI bus> on pcib5 twe0: <3ware Storage Controller> port 0x5000-0x500f mem 0xfb800000-0xfbffffff,0xfb500000-0xfb50000f irq 22 at device 2.0 on pci5 twe0: 8 ports, Firmware FE7X 1.05.00.065, BIOS BE7X 1.08.00.048 pci4: <unknown card> (vendor=0x8086, dev=0x1461) at 30.0 pcib6: <PCI to PCI bridge (vendor=8086 device=1460)> at device 31.0 on pci4 IOAPIC #3 intpin 0 -> irq 23 pci6: <PCI bus> on pcib6 twe1: <3ware Storage Controller> port 0x6000-0x600f mem 0xfc000000-0xfc7fffff,0xfc800000-0xfc80000f irq 23 at device 1.0 on pci6 twe1: 4 ports, Firmware FE7X 1.05.00.023, BIOS BE7X 1.08.00.036 pci0: <UHCI USB controller> at 29.0 irq 2 pci0: <UHCI USB controller> at 29.1 irq 10 pci0: <UHCI USB controller> at 29.2 irq 11 pcib7: <Intel 82801BA/BAM (ICH2) Hub to PCI bridge> at device 30.0 on pci0 pci7: <PCI bus> on pcib7 pci7: <ATI Mach64-GR graphics accelerator> at 1.0 irq 2 isab0: <PCI to ISA bridge (vendor=8086 device=2480)> at device 31.0 on pci0 isa0: <ISA bus> on isab0 atapci0: <Intel ICH3 ATA100 controller> port 0x2060-0x206f,0-0x3,0-0x7,0x3f4-0x3f7,0x1f0-0x1f7 irq 0 at device 31.1 on pci0 ata0: at 0x1f0 irq 14 on atapci0 ata1: at 0x170 irq 15 on atapci0 pci0: <unknown card> (vendor=0x8086, dev=0x2483) at 31.3 irq 0 orm0: <Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xc8fff,0xce800-0xcefff,0xcf000 -0xcffff,0xd0800-0xd17ff,0xdc000-0xdffff,0xe0000-0xe3fff on isa0 fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0 fdc0: FIFO enabled, 8 bytes threshold fd0: <1440-KB 3.5" drive> on fdc0 drive 0 atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0 atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0 kbd0 at atkbd0 psm0: <PS/2 Mouse> irq 12 on atkbdc0 psm0: model IntelliMouse Explorer, device ID 4 vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 sc0: <System console> at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0 sio0: type 16550A ppc0: parallel port not found. APIC_IO: Testing 8254 interrupt delivery APIC_IO: routing 8254 via IOAPIC #0 intpin 2 IP packet filtering initialized, divert disabled, rule-based forwarding enabled, default to deny, logging disabled SMP: AP CPU #2 Launched! SMP: AP CPU #1 Launched! acd0: CDROM <CDU5211> at ata0-master PIO4 Waiting 8 seconds for SCSI devices to settle (noperiph:sym0:0:-1:-1): SCSI BUS reset delivered. (noperiph:sym1:0:-1:-1): SCSI BUS reset delivered. twed0: <TwinStor, Normal> on twe0 twed0: 76318MB (156299440 sectors) twed1: <TwinStor, Normal> on twe0 twed1: 76318MB (156299440 sectors) twe0: command interrupt twed2: <TwinStor, Normal> on twe1 twed2: 286102MB (585938272 sectors) twed3: <TwinStor, Normal> on twe1 twed3: 190733MB (390622952 sectors) twe1: command interrupt SMP: AP CPU #3 Launched! sa0 at ahc1 bus 0 target 6 lun 0 sa0: <IBM ULTRIUM-TD2 3AYC> Removable Sequential Access SCSI-3 device sa0: 160.000MB/s transfers (80.000MHz, offset 31, 16bit) sa1 at sym0 bus 0 target 5 lun 0 sa1: <ECRIX VXA-1 V2161618 x001> Removable Sequential Access SCSI-2 device sa1: 10.000MB/s transfers (10.000MHz, offset 16) Mounting root from ufs:/dev/da0s1a da0 at ahc0 bus 0 target 0 lun 0 da0: <SEAGATE ST39173LW 6246> Fixed Direct Access SCSI-2 device da0: 80.000MB/s transfers (40.000MHz, offset 15, 16bit), Tagged Queueing Enabled da0: 8683MB (17783240 512 byte sectors: 255H 63S/T 1106C) da1 at ahc0 bus 0 target 1 lun 0 da1: <SEAGATE ST39173LW 6246> Fixed Direct Access SCSI-2 device da1: 80.000MB/s transfers (40.000MHz, offset 15, 16bit), Tagged Queueing Enabled da1: 8683MB (17783240 512 byte sectors: 255H 63S/T 1106C) ################ a snippet of /var/log/messages ########################## Jan 3 08:44:30 jack /kernel: (sa0:ahc1:0:6:0): SCB 0xe - timed out Jan 3 08:44:30 jack /kernel: ahc1: Dumping Card State while idle, at SEQADDR 0x9 Jan 3 08:44:30 jack /kernel: ACCUM = 0x4, SINDEX = 0x67, DINDEX = 0x27, ARG_2 = 0x3 Jan 3 08:44:30 jack /kernel: HCNT = 0x0 SCBPTR = 0x0 Jan 3 08:44:30 jack /kernel: SCSISEQ = 0x12, SBLKCTL = 0xa Jan 3 08:44:30 jack /kernel: DFCNTRL = 0x0, DFSTATUS = 0x89 Jan 3 08:44:30 jack /kernel: LASTPHASE = 0x1, SCSISIGI = 0x0, SXFRCTL0 = 0x80 Jan 3 08:44:30 jack /kernel: SSTAT0 = 0x0, SSTAT1 = 0x8 Jan 3 08:44:30 jack /kernel: SCSIPHASE = 0x0 Jan 3 08:44:30 jack /kernel: STACK == 0x3, 0x175, 0x160, 0xe7 Jan 3 08:44:30 jack /kernel: SCB count = 20 Jan 3 08:44:30 jack /kernel: Kernel NEXTQSCB = 3 Jan 3 08:44:30 jack /kernel: Card NEXTQSCB = 3 Jan 3 08:44:30 jack /kernel: QINFIFO entries: Jan 3 08:44:30 jack /kernel: Waiting Queue entries: Jan 3 08:44:30 jack /kernel: Disconnected Queue entries: 0:14 Jan 3 08:44:30 jack /kernel: QOUTFIFO entries: Jan 3 08:44:30 jack /kernel: Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Jan 3 08:44:30 jack /kernel: Sequencer SCB Info: 0(c 0x44, s 0x67, l 0, t 0xe) 1(c 0x0, s 0xff, l 255, t 0xff) 2(c 0x0, s 0xff, l 255, t 0xff) 3(c 0x0, s 0xff, l 255, t 0xff) 4(c 0x0, s 0xff, l 255, t 0xff) 5(c 0x0, s 0xff, l 255, t 0xff) 6(c 0x0, s 0xff, l 255, t 0xff) 7(c 0x0, s 0xff, l 255, t 0xff) 8(c 0x0, s 0xff, l 255, t 0xff) 9(c 0x0, s 0xff, l 255, t 0xff) 10(c 0x0, s 0xff, l 255, t 0xff) 11(c 0x0, s 0xff, l 255, t 0xff) 12(c 0x0, s 0xff, l 255, t 0xff) 13(c 0x0, s 0xff, l 255, t 0xff) 14(c 0x0, s 0xff, l 255, t 0xff) 15(c 0x0, s 0xff, l 255, t 0xff) 16(c 0x0, s 0xff, l 255, t 0xff) 17(c 0x0, s 0xff, l 255, t 0xff) 18(c 0x0, s 0xff, l 255, t 0xff) 19(c 0x0, s 0xff, l 255, t 0xff) 20(c 0x0, s 0xff, l 255, t 0xff) 21(c 0x0, s 0xff, l 255, t 0xff) 22(c 0x0, s 0xff, l 255, t 0xff) 23(c 0x0, s 0xff, l 255, t 0xff) 24(c 0x0, s 0xff, l 255, t 0xff) 25(c 0x0, s 0xff, l 255, t 0xff) 26(c 0x0, s 0xff, l 255, t 0xff) 27(c 0x0, s 0xff, l 255, t 0xff) 28(c 0x0, s 0xff, l 255, t 0xff) 29(c 0x0, s 0xff, l 255, t 0xff) 30(c 0x0, s 0xff, Jan 3 08:44:30 jack /kernel: t 0xff) 31(c 0x0, s 0xff, l 255, t 0xff) Jan 3 08:44:30 jack /kernel: Pending list: 14(c 0x40, s 0x67, l 0) Jan 3 08:44:30 jack /kernel: Kernel Free SCB list: 15 16 17 18 19 0 1 2 4 5 6 7 8 9 13 12 11 10 Jan 3 08:44:30 jack /kernel: Untagged Q(6): 14 Jan 3 08:44:30 jack /kernel: sg[0] - Addr 0x1d919000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[1] - Addr 0x2cc7a000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[2] - Addr 0x19fea000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[3] - Addr 0x4944a000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[4] - Addr 0x204bd000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[5] - Addr 0x105be000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[6] - Addr 0x4411f000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[7] - Addr 0x23560000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[8] - Addr 0x43fe1000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[9] - Addr 0x420d4000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[10] - Addr 0x5fd94000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[11] - Addr 0x67ed4000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[12] - Addr 0x66da5000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[13] - Addr 0x76946000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[14] - Addr 0x53527000 : Length 4096 Jan 3 08:44:30 jack /kernel: sg[15] - Addr 0x13168000 : Length 4096 Jan 3 08:44:31 jack /kernel: (sa0:ahc1:0:6:0): Queuing a BDR SCB Jan 3 08:44:31 jack /kernel: (sa0:ahc1:0:6:0): Bus Device Reset Message Sent Jan 3 08:44:31 jack /kernel: (sa0:ahc1:0:6:0): no longer in timeout, status = 34b Jan 3 08:44:31 jack /kernel: ahc1: Bus Device Reset on A:6. 1 SCBs aborted Jan 3 09:25:38 jack /kernel: (sa0:ahc1:0:6:0): WRITE FILEMARKS. CDB: 10 0 0 0 2 0 Jan 3 09:25:38 jack /kernel: (sa0:ahc1:0:6:0): UNIT ATTENTION asc:29,0 Jan 3 09:25:38 jack /kernel: (sa0:ahc1:0:6:0): Power on, reset, or bus device reset occurred field replaceable unit: 30 Jan 3 09:25:38 jack /kernel: (sa0:ahc1:0:6:0): failed to write terminating filemark(s) Jan 3 09:25:38 jack /kernel: (sa0:ahc1:0:6:0): tape is now frozen- use an OFFLINE, REWIND or MTEOM command to clear this state.
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?D6578D60-5DB0-11D9-A9D2-000A959670A0>