From owner-aic7xxx@FreeBSD.ORG Tue Nov 18 07:58:04 2003 Return-Path: Delivered-To: aic7xxx@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 50E4716A4CE for ; Tue, 18 Nov 2003 07:58:04 -0800 (PST) Received: from helix.icgeb.org (helix.icgeb.trieste.it [140.105.9.2]) by mx1.FreeBSD.org (Postfix) with ESMTP id 17C4843FFB for ; Tue, 18 Nov 2003 07:58:00 -0800 (PST) (envelope-from kristian@icgeb.org) Received: from localhost (localhost.icgeb.trieste.it [127.0.0.1]) by helix.icgeb.org (Postfix) with ESMTP id 93E0E196784; Tue, 18 Nov 2003 16:57:58 +0100 (CET) Received: from helix.icgeb.org ([127.0.0.1]) by localhost (helix.icgeb.org [127.0.0.1]) (amavisd-new) with ESMTP id 06259-09; Tue, 18 Nov 2003 16:57:57 -0000 (CET) Received: by helix.icgeb.org (Postfix, from userid 48) id C8BD3196736; Tue, 18 Nov 2003 16:57:57 +0100 (CET) Received: from 140.105.9.78 (SquirrelMail authenticated user kristian) by www.icgeb.org with HTTP; Tue, 18 Nov 2003 16:57:57 +0100 (CET) Message-ID: <1348.140.105.9.78.1069171077.squirrel@www.icgeb.org> Date: Tue, 18 Nov 2003 16:57:57 +0100 (CET) From: "Kristian Vlahovicek" To: X-Priority: 3 Importance: Normal X-Mailer: SquirrelMail (version 1.2.11) MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_20031118165757_12505" X-Virus-Scanned: by amavisd-new amavisd-new-20020630 X-Razor-id: 166a9b21bbe2499891fd715464e8787987792f5c X-Content-Filtered-By: Mailman/MimeDel 2.1.1 cc: kajan@icgeb.org Subject: aic7xxx problems X-BeenThere: aic7xxx@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Adaptec Device Drivers in FreeBSD and Linux List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 18 Nov 2003 15:58:04 -0000 ------=_20031118165757_12505 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit Hi, We have a linux machine with Adaptec 29160 Ultra160 SCSI adapter (mounted in PCI-X slot) on oa TYAN Tiger MPX S2466 Motherboard with two Athlon MP processors. As of recently (after the RH7.3 -> 9.0 migration) we started having problems with our SCSI subsystem. We would appreciate any help from more experienced guys then we are to try to understand (and hopefully fix) the problem. SCSI behaves erraticaly, we have 5 QUANTUM 36GB disks and a HP tape mounted on the adaptec card: Vendor: QUANTUM Model: ATLAS10K3_36_WLS Rev: 020W Type: Direct-Access ANSI SCSI revision: 03 Vendor: QUANTUM Model: ATLAS10K3_36_WLS Rev: 020W Type: Direct-Access ANSI SCSI revision: 03 Vendor: QUANTUM Model: ATLAS10K3_36_WLS Rev: 020W Type: Direct-Access ANSI SCSI revision: 03 Vendor: HP Model: C5683A Rev: C005 Type: Sequential-Access ANSI SCSI revision: 02 Vendor: QUANTUM Model: ATLAS10K3_36_WLS Rev: 020W Type: Direct-Access ANSI SCSI revision: 03 Vendor: QUANTUM Model: ATLAS10K3_36_WLS Rev: 020W Type: Direct-Access ANSI SCSI revision: 03 the disks are all on the same channel, first we had them as 1 standalone (ID0) and 4 in RAID0 and one day the machine just froze, reporting errors in the aic7xxx driver (unfortunately no log of that remained). The disks were surface tested with the adaptec on-card SCSI utility and came out without errors. We did the OS upgrade, and installed the 2.4.20-8smp kernel (RH9 vanilla), the machine boots with aic7xxx dumping the whole lot of messages (see below). In parallel, we reconfigured RAID to RAID5 (sdb1, sdc1, sdd1, sde1) just in case it was RAID that misbehaved. An attempt to upgrade kernel to newer version (still the RH9 updated 2.4.20-20.9smp) results in boot process stopping just before the SCSI loading. We tried to update teh aic7xxx driver to v6.3.0 from Justin's website and that one again resulted in complete unability to boot (same stop point as above). Note that both kernels 2.4.20-20.9 and the 2.4.20-8 boot in single processor version with both the original aic7xxx drivers and the v6.3.0 DO boot. What we see currently are RAID messages about kicking disks from array due to IO erors (they look like hardware errors even though surface tests do not give anything!): ----------- Nov 18 11:50:33 hydra kernel: SCSI disk error : host 0 channel 0 id 1 lun 0 return code = 8000002 Nov 18 11:50:33 hydra kernel: Info fld=0x4005, Current sd08:11: sense key Hardware Error Nov 18 11:50:34 hydra kernel: I/O error: dev 08:11, sector 26908288 Nov 18 11:50:34 hydra kernel: raid5: Disk failure on sdb1, disabling device. Operation continuing on 2 devices ---------- Nov 18 15:38:45 hydra kernel: scsi0: ERROR on channel 0, id 8, lun 0, CDB: Read (10) 00 03 27 23 0f 00 00 f8 00 Nov 18 15:38:45 hydra kernel: Info fld=0x327231e, Current sd08:41: sense key Medium Error Nov 18 15:38:45 hydra kernel: I/O error: dev 08:41, sector 52896472 Nov 18 15:38:45 hydra kernel: raid5: Disk failure on sde1, disabling device. Operation continuing on 3 devices Nov 18 15:38:45 hydra kernel: raid5: parity resync was not fully finished, restarting next time. I'm attaching aldo the most recent dmesg, lspci, and a (longish) messages log of RAID5 fighting with disk failures. We reverted back to 2.4.18 kernel but the system seems to be unstable still... ANY info on where to go from here would be most appreciated! Thanks, Kristian ********************* dmesg ********************* Linux version 2.4.18-3smp (bhcompile@porky.devel.redhat.com) (gcc version 2.96 20000731 (Red Hat Linux 7.3 2.96-110)) #1 SMP Thu Apr 18 06:59:55 EDT 2002 BIOS-provided physical RAM map: BIOS-e820: 0000000000000000 - 000000000009f400 (usable) BIOS-e820: 000000000009f400 - 00000000000a0000 (reserved) BIOS-e820: 00000000000d0000 - 0000000000100000 (reserved) BIOS-e820: 0000000000100000 - 000000001ff80000 (usable) BIOS-e820: 000000001ff80000 - 0000000020000000 (reserved) BIOS-e820: 00000000fec00000 - 00000000fec04000 (reserved) BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved) BIOS-e820: 00000000fff80000 - 0000000100000000 (reserved) found SMP MP-table at 000f71e0 hm, page 000f7000 reserved twice. hm, page 000f8000 reserved twice. hm, page 0009f000 reserved twice. hm, page 000a0000 reserved twice. On node 0 totalpages: 130944 zone(0): 4096 pages. zone(1): 126848 pages. zone(2): 0 pages. Intel MultiProcessor Specification v1.4 Virtual Wire compatibility mode. OEM ID: TYAN Product ID: PAULANER APIC at: 0xFEE00000 Processor #1 Pentium(tm) Pro APIC version 16 Processor #0 Pentium(tm) Pro APIC version 16 I/O APIC #2 Version 17 at 0xFEC00000. Processors: 2 Kernel command line: ro root=LABEL=/ Initializing CPU#0 Detected 1600.067 MHz processor. Console: colour VGA+ 80x25 Calibrating delay loop... 3191.60 BogoMIPS Memory: 512876k/523776k available (1224k kernel code, 10512k reserved, 839k data, 304k init, 0k highmem) Dentry cache hash table entries: 65536 (order: 7, 524288 bytes) Inode cache hash table entries: 32768 (order: 6, 262144 bytes) Mount-cache hash table entries: 8192 (order: 4, 65536 bytes) Buffer cache hash table entries: 32768 (order: 5, 131072 bytes) Page-cache hash table entries: 131072 (order: 7, 524288 bytes) CPU: Before vendor init, caps: 0383fbff c1cbfbff 00000000, vendor = 2 CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line) CPU: L2 Cache: 256K (64 bytes/line) CPU: After vendor init, caps: 0383fbff c1cbfbff 00000000 00000000 Intel machine check architecture supported. Intel machine check reporting enabled on CPU#0. CPU: After generic, caps: 0383fbff c1cbfbff 00000000 00000000 CPU: Common caps: 0383fbff c1cbfbff 00000000 00000000 Enabling fast FPU save and restore... done. Enabling unmasked SIMD FPU exception support... done. Checking 'hlt' instruction... OK. POSIX conformance testing by UNIFIX mtrr: v1.40 (20010327) Richard Gooch (rgooch@atnf.csiro.au) mtrr: detected mtrr type: Intel CPU: Before vendor init, caps: 0383fbff c1cbfbff 00000000, vendor = 2 CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line) CPU: L2 Cache: 256K (64 bytes/line) CPU: After vendor init, caps: 0383fbff c1cbfbff 00000000 00000000 Intel machine check reporting enabled on CPU#0. CPU: After generic, caps: 0383fbff c1cbfbff 00000000 00000000 CPU: Common caps: 0383fbff c1cbfbff 00000000 00000000 CPU0: AMD Athlon(tm) MP 1900+ stepping 02 per-CPU timeslice cutoff: 731.19 usecs. task migration cache decay timeout: 10 msecs. masked ExtINT on CPU#0 ESR value before enabling vector: 00000000 ESR value after enabling vector: 00000000 Booting processor 1/0 eip 2000 Initializing CPU#1 masked ExtINT on CPU#1 ESR value before enabling vector: 00000000 ESR value after enabling vector: 00000000 Calibrating delay loop... 3198.15 BogoMIPS CPU: Before vendor init, caps: 0383fbff c1cbfbff 00000000, vendor = 2 CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line) CPU: L2 Cache: 256K (64 bytes/line) CPU: After vendor init, caps: 0383fbff c1cbfbff 00000000 00000000 Intel machine check reporting enabled on CPU#1. CPU: After generic, caps: 0383fbff c1cbfbff 00000000 00000000 CPU: Common caps: 0383fbff c1cbfbff 00000000 00000000 CPU1: AMD Athlon(tm) Processor stepping 02 Total of 2 processors activated (6389.76 BogoMIPS). ENABLING IO-APIC IRQs Setting 2 in the phys_id_present_map ....changing IO-APIC physical APIC ID to 2 ... ok. init IO_APIC IRQs IO-APIC (apicid-pin) 2-0, 2-22, 2-23 not connected. ...TIMER: vector=0x31 pin1=2 pin2=0 number of MP IRQ sources: 22. number of IO-APIC #2 registers: 24. testing the IO APIC....................... IO APIC #2...... ..... register #00: 02000000 ........ : physical APIC id: 02 ..... register #01: 00170011 ........ : max redirection entries: 0017 ........ : PRQ implemented: 0 ........ : IO APIC version: 0011 ..... register #02: 00000000 ........ : arbitration: 00 ..... IRQ redirection table: NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect: 00 000 00 1 0 0 0 0 0 0 00 01 003 03 0 0 0 0 0 1 1 39 02 003 03 0 0 0 0 0 1 1 31 03 003 03 0 0 0 0 0 1 1 41 04 003 03 0 0 0 0 0 1 1 49 05 003 03 1 1 0 1 0 1 1 51 06 003 03 0 0 0 0 0 1 1 59 07 003 03 0 0 0 0 0 1 1 61 08 003 03 0 0 0 0 0 1 1 69 09 003 03 1 1 0 1 0 1 1 71 0a 003 03 1 1 0 1 0 1 1 79 0b 003 03 1 1 0 1 0 1 1 81 0c 003 03 0 0 0 0 0 1 1 89 0d 003 03 0 0 0 0 0 1 1 91 0e 003 03 0 0 0 0 0 1 1 99 0f 003 03 0 0 0 0 0 1 1 A1 10 003 03 0 0 0 0 0 1 1 A9 11 003 03 0 0 0 0 0 1 1 B1 12 003 03 0 0 0 0 0 1 1 B9 13 003 03 0 0 0 0 0 1 1 C1 14 003 03 0 0 0 0 0 1 1 C9 15 003 03 1 1 0 1 0 1 1 D1 16 000 00 1 0 0 0 0 0 0 00 17 000 00 1 0 0 0 0 0 0 00 IRQ to pin mappings: IRQ0 -> 0:2 IRQ1 -> 0:1 IRQ3 -> 0:3 IRQ4 -> 0:4 IRQ5 -> 0:5 IRQ6 -> 0:6 IRQ7 -> 0:7 IRQ8 -> 0:8 IRQ9 -> 0:9 IRQ10 -> 0:10 IRQ11 -> 0:11 IRQ12 -> 0:12 IRQ13 -> 0:13 IRQ14 -> 0:14 IRQ15 -> 0:15 IRQ16 -> 0:16 IRQ17 -> 0:17 IRQ18 -> 0:18 IRQ19 -> 0:19 IRQ20 -> 0:20 IRQ21 -> 0:21 ..................................... done. Using local APIC timer interrupts. calibrating APIC timer ... ...... CPU clock speed is 1599.9606 MHz. ...... host bus clock speed is 266.6601 MHz. cpu: 0, clocks: 2666601, slice: 888867 CPU0 cpu: 1, clocks: 2666601, slice: 888867 CPU1 checking TSC synchronization across CPUs: passed. PCI: PCI BIOS revision 2.10 entry at 0xfd7e0, last bus=2 PCI: Using configuration type 1 PCI: Probing PCI hardware Unknown bridge resource 0: assuming transparent Unknown bridge resource 2: assuming transparent PCI: Using IRQ router default [1022/7443] at 00:07.3 BIOS failed to enable PCI standards compliance, fixing this error. isapnp: Scanning for PnP cards... isapnp: No Plug & Play device found Linux NET4.0 for Linux 2.4 Based upon Swansea University Computer Society NET3.039 Initializing RT netlink socket apm: BIOS version 1.2 Flags 0x03 (Driver version 1.16) apm: disabled - APM is not SMP safe. Starting kswapd VFS: Diskquotas version dquot_6.5.0 initialized pty: 2048 Unix98 ptys configured Serial driver version 5.05c (2001-07-08) with MANY_PORTS MULTIPORT SHARE_IRQ SERIAL_PCI ISAPNP enabled ttyS00 at 0x03f8 (irq = 4) is a 16550A ttyS01 at 0x02f8 (irq = 3) is a 16550A Real Time Clock Driver v1.10e block: 992 slots per queue, batch=248 Uniform Multi-Platform E-IDE driver Revision: 6.31 ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx AMD7441: IDE controller on PCI bus 00 dev 39 AMD7441: chipset revision 4 AMD7441: not 100% native mode: will probe irqs later AMD7441: disabling single-word DMA support (revision < C4) ide0: BM-DMA at 0xf000-0xf007, BIOS settings: hda:pio, hdb:DMA ide1: BM-DMA at 0xf008-0xf00f, BIOS settings: hdc:DMA, hdd:pio hdb: C/H/S=52969/225/117 from BIOS ignored hda: LG CD-ROM CRD-8522B, ATAPI CD/DVD-ROM drive hdb: Maxtor 6Y120L0, ATA DISK drive hdc: Maxtor 6Y120L0, ATA DISK drive ide0 at 0x1f0-0x1f7,0x3f6 on irq 14 ide1 at 0x170-0x177,0x376 on irq 15 blk: queue c03cd410, I/O limit 4095Mb (mask 0xffffffff) hdb: 240121728 sectors (122942 MB) w/2048KiB Cache, CHS=238216/16/63, UDMA(100) blk: queue c03cd628, I/O limit 4095Mb (mask 0xffffffff) hdc: 240121728 sectors (122942 MB) w/2048KiB Cache, CHS=238216/16/63, UDMA(100) ide-floppy driver 0.99.newide Partition check: hdb: hdb1 hdc: hdc1 Floppy drive(s): fd0 is 1.44M FDC 0 is a post-1991 82077 RAMDISK driver initialized: 16 RAM disks of 4096K size 1024 blocksize ide-floppy driver 0.99.newide md: md driver 0.90.0 MAX_MD_DEVS=256, MD_SB_DISKS=27 md: Autodetecting RAID arrays. md: autorun ... md: ... autorun DONE. NET4: Linux TCP/IP 1.0 for NET4.0 IP Protocols: ICMP, UDP, TCP, IGMP IP: routing cache hash table of 4096 buckets, 32Kbytes TCP: Hash tables configured (established 32768 bind 32768) Linux IP multicast router 0.06 plus PIM-SM NET4: Unix domain sockets 1.0/SMP for Linux NET4.0. RAMDISK: Compressed image found at block 0 Freeing initrd memory: 281k freed VFS: Mounted root (ext2 filesystem). SCSI subsystem driver Revision: 1.00 kmod: failed to exec /sbin/modprobe -s -k scsi_hostadapter, errno = 2 scsi0 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 6.2.5 aic7892: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs scsi0:0:0:0: Attempting to queue an ABORT message scsi0: Dumping Card State while idle, at SEQADDR 0x8 ACCUM = 0x0, SINDEX = 0x3, DINDEX = 0xe4, ARG_2 = 0x0 HCNT = 0x0 SCBPTR = 0x0 SCSISEQ = 0x12, SBLKCTL = 0xa DFCNTRL = 0x0, DFSTATUS = 0x89 LASTPHASE = 0x1, SCSISIGI = 0x0, SXFRCTL0 = 0x80 SSTAT0 = 0x0, SSTAT1 = 0x8 SCSIPHASE = 0x0 STACK == 0x3, 0x108, 0x160, 0x0 SCB count = 4 Kernel NEXTQSCB = 2 Card NEXTQSCB = 2 QINFIFO entries: Waiting Queue entries: Disconnected Queue entries: QOUTFIFO entries: Sequencer Free SCB List: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Sequencer SCB Info: 0(c 0x40, s 0x7, l 0, t 0xff) 1(c 0x0, s 0x7, l 0, t 0xff) 2(c 0x0, s 0x7, l 0, t 0xff) 3(c 0x0, s 0x7, l 0, t 0xff) 4(c 0x0, s 0x0, l 0, t 0xff) 5(c 0x0, s 0x7, l 0, t 0xff) 6(c 0x0, s 0x7, l 0, t 0xff) 7(c 0x0, s 0x7, l 0, t 0xff) 8(c 0x0, s 0x7, l 0, t 0xff) 9(c 0x0, s 0x7, l 0, t 0xff) 10(c 0x0, s 0x27, l 0, t 0xff) 11(c 0x0, s 0x7, l 0, t 0xff) 12(c 0x0, s 0x7, l 0, t 0xff) 13(c 0x0, s 0x7, l 0, t 0xff) 14(c 0x0, s 0x7, l 0, t 0xff) 15(c 0x0, s 0x7, l 0, t 0xff) 16(c 0x0, s 0x7, l 0, t 0xff) 17(c 0x0, s 0x7, l 0, t 0xff) 18(c 0x0, s 0x7, l 0, t 0xff) 19(c 0x0, s 0x7, l 0, t 0xff) 20(c 0x0, s 0x7, l 0, t 0xff) 21(c 0x0, s 0x7, l 0, t 0xff) 22(c 0x0, s 0x7, l 0, t 0xff) 23(c 0x0, s 0x7, l 0, t 0xff) 24(c 0x0, s 0x7, l 0, t 0xff) 25(c 0x0, s 0x7, l 0, t 0xff) 26(c 0x0, s 0x7, l 0, t 0xff) 27(c 0x0, s 0x7, l 0, t 0xff) 28(c 0x0, s 0x7, l 0, t 0xff) 29(c 0x0, s 0x7, l 0, t 0xff) 30(c 0x0, s 0x7, l 0, t 0xff) 31(c 0x0, s 0x7, l 0, t 0xff) Pending list: Kernel Free SCB list: 3 1 0 DevQ(0:0:0): 0 waiting scsi0:0:0:0: Command already completed aic7xxx_abort returns 0x2002 scsi0:0:0:0: Attempting to queue an ABORT message scsi0: Dumping Card State in Message-out phase, at SEQADDR 0x168 ACCUM = 0xa0, SINDEX = 0x61, DINDEX = 0xe4, ARG_2 = 0x0 HCNT = 0x0 SCBPTR = 0x0 SCSISEQ = 0x12, SBLKCTL = 0xa DFCNTRL = 0x0, DFSTATUS = 0x89 LASTPHASE = 0xa0, SCSISIGI = 0xb6, SXFRCTL0 = 0x88 SSTAT0 = 0x2, SSTAT1 = 0x1 SCSIPHASE = 0x4 STACK == 0x175, 0x160, 0x0, 0xe7 SCB count = 4 Kernel NEXTQSCB = 3 Card NEXTQSCB = 3 QINFIFO entries: Waiting Queue entries: Disconnected Queue entries: QOUTFIFO entries: Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Sequencer SCB Info: 0(c 0x40, s 0x7, l 0, t 0x2) 1(c 0x0, s 0x7, l 0, t 0xff) 2(c 0x0, s 0x7, l 0, t 0xff) 3(c 0x0, s 0x7, l 0, t 0xff) 4(c 0x0, s 0x0, l 0, t 0xff) 5(c 0x0, s 0x7, l 0, t 0xff) 6(c 0x0, s 0x7, l 0, t 0xff) 7(c 0x0, s 0x7, l 0, t 0xff) 8(c 0x0, s 0x7, l 0, t 0xff) 9(c 0x0, s 0x7, l 0, t 0xff) 10(c 0x0, s 0x27, l 0, t 0xff) 11(c 0x0, s 0x7, l 0, t 0xff) 12(c 0x0, s 0x7, l 0, t 0xff) 13(c 0x0, s 0x7, l 0, t 0xff) 14(c 0x0, s 0x7, l 0, t 0xff) 15(c 0x0, s 0x7, l 0, t 0xff) 16(c 0x0, s 0x7, l 0, t 0xff) 17(c 0x0, s 0x7, l 0, t 0xff) 18(c 0x0, s 0x7, l 0, t 0xff) 19(c 0x0, s 0x7, l 0, t 0xff) 20(c 0x0, s 0x7, l 0, t 0xff) 21(c 0x0, s 0x7, l 0, t 0xff) 22(c 0x0, s 0x7, l 0, t 0xff) 23(c 0x0, s 0x7, l 0, t 0xff) 24(c 0x0, s 0x7, l 0, t 0xff) 25(c 0x0, s 0x7, l 0, t 0xff) 26(c 0x0, s 0x7, l 0, t 0xff) 27(c 0x0, s 0x7, l 0, t 0xff) 28(c 0x0, s 0x7, l 0, t 0xff) 29(c 0x0, s 0x7, l 0, t 0xff) 30(c 0x0, s 0x7, l 0, t 0xff) 31(c 0x0, s 0x7, l 0, t 0xff) Pending list: 2(c 0x40, s 0x7, l 0) Kernel Free SCB list: 1 0 Untagged Q(0): 2 DevQ(0:0:0): 0 waiting scsi0:0:0:0: Device is active, asserting ATN Recovery code sleeping (scsi0:A:0): 160.000MB/s transfers (80.000MHz DT, offset 127, 16bit) (scsi0:A:0:0): Abort Message Sent (scsi0:A:0:0): SCB 2 - Abort Completed. Recovery SCB completes Recovery code awake aic7xxx_abort returns 0x2002 scsi0:0:0:0: Attempting to queue a TARGET RESET message scsi0:0:0:0: Command not found aic7xxx_dev_reset returns 0x2002 Vendor: QUANTUM Model: ATLAS10K3_36_WLS Rev: 020W Type: Direct-Access ANSI SCSI revision: 03 Vendor: QUANTUM Model: ATLAS10K3_36_WLS Rev: 020W Type: Direct-Access ANSI SCSI revision: 03 Vendor: QUANTUM Model: ATLAS10K3_36_WLS Rev: 020W Type: Direct-Access ANSI SCSI revision: 03 Vendor: HP Model: C5683A Rev: C005 Type: Sequential-Access ANSI SCSI revision: 02 Vendor: QUANTUM Model: ATLAS10K3_36_WLS Rev: 020W Type: Direct-Access ANSI SCSI revision: 03 Vendor: QUANTUM Model: ATLAS10K3_36_WLS Rev: 020W Type: Direct-Access ANSI SCSI revision: 03 scsi0:A:0:0: Tagged Queuing enabled. Depth 253 scsi0:A:1:0: Tagged Queuing enabled. Depth 253 scsi0:A:2:0: Tagged Queuing enabled. Depth 253 scsi0:A:4:0: Tagged Queuing enabled. Depth 253 scsi0:A:8:0: Tagged Queuing enabled. Depth 253 Attached scsi disk sda at scsi0, channel 0, id 0, lun 0 Attached scsi disk sdb at scsi0, channel 0, id 1, lun 0 Attached scsi disk sdc at scsi0, channel 0, id 2, lun 0 Attached scsi disk sdd at scsi0, channel 0, id 4, lun 0 Attached scsi disk sde at scsi0, channel 0, id 8, lun 0 SCSI device sda: 71833096 512-byte hdwr sectors (36779 MB) sda: sda1 sda2 sda3 sda4 < sda5 > (scsi0:A:1): 160.000MB/s transfers (80.000MHz DT, offset 127, 16bit) SCSI device sdb: 71833096 512-byte hdwr sectors (36779 MB) sdb: sdb1 (scsi0:A:2): 160.000MB/s transfers (80.000MHz DT, offset 127, 16bit) SCSI device sdc: 71833096 512-byte hdwr sectors (36779 MB) sdc: sdc1 (scsi0:A:4): 160.000MB/s transfers (80.000MHz DT, offset 127, 16bit) SCSI device sdd: 71833096 512-byte hdwr sectors (36779 MB) sdd: sdd1 (scsi0:A:8): 160.000MB/s transfers (80.000MHz DT, offset 127, 16bit) SCSI device sde: 71833096 512-byte hdwr sectors (36779 MB) sde: sde1 raid5: measuring checksumming speed 8regs : 2449.600 MB/sec 32regs : 1552.800 MB/sec pII_mmx : 3759.200 MB/sec p5_mmx : 4813.600 MB/sec raid5: using function: p5_mmx (4813.600 MB/sec) md: raid5 personality registered as nr 4 Journalled Block Device driver loaded md: Autodetecting RAID arrays. [events: 0000000b] [events: 00000010] [events: 00000010] [events: 00000000] md: invalid raid superblock magic on sde1 md: sde1 has invalid sb, not importing! md: could not import sde1! md: autorun ... md: considering sdd1 ... md: adding sdd1 ... md: adding sdc1 ... md: adding sdb1 ... md: created md0 md: bind md: bind md: bind md: running: md: sdd1's event counter: 00000010 md: sdc1's event counter: 00000010 md: sdb1's event counter: 0000000b md: superblock update time inconsistency -- using the most recent one md: freshest: sdd1 md: kicking non-fresh sdb1 from array! md: unbind md: export_rdev(sdb1) md0: max total readahead window set to 512k md0: 2 data-disks, max readahead per data-disk: 256k raid5: device sdd1 operational as raid disk 2 raid5: device sdc1 operational as raid disk 1 raid5: md0, not all disks are operational -- trying to recover array raid5: allocated 3291kB for md0 raid5: raid level 5 set md0 active with 2 out of 3 devices, algorithm 2 RAID5 conf printout: --- rd:3 wd:2 fd:1 disk 0, s:0, o:0, n:0 rd:0 us:1 dev:[dev 00:00] disk 1, s:0, o:1, n:1 rd:1 us:1 dev:sdc1 disk 2, s:0, o:1, n:2 rd:2 us:1 dev:sdd1 RAID5 conf printout: --- rd:3 wd:2 fd:1 disk 0, s:0, o:0, n:0 rd:0 us:1 dev:[dev 00:00] disk 1, s:0, o:1, n:1 rd:1 us:1 dev:sdc1 disk 2, s:0, o:1, n:2 rd:2 us:1 dev:sdd1 md: updating md0 RAID superblock on device md: sdd1 [events: 00000011]<6>(write) sdd1's sb offset: 35913152 md: recovery thread got woken up ... md0: no spare disk to reconstruct array! -- continuing in degraded mode md: recovery thread finished ... md: sdc1 [events: 00000011]<6>(write) sdc1's sb offset: 35913152 md: ... autorun DONE. raid5: switching cache buffer size, 4096 --> 1024 kjournald starting. Commit interval 5 seconds EXT3-fs: mounted filesystem with ordered data mode. Freeing unused kernel memory: 304k freed usb.c: registered new driver usbdevfs usb.c: registered new driver hub usb-ohci.c: USB OHCI at membase 0xe0898000, IRQ 10 usb-ohci.c: usb-02:00.0, Advanced Micro Devices [AMD] AMD-768 [??] USB usb.c: new USB bus registered, assigned bus number 1 hub.c: USB hub found hub.c: 4 ports detected usb.c: registered new driver hiddev usb.c: registered new driver hid hid-core.c: v1.8.1 Andreas Gal, Vojtech Pavlik hid-core.c: USB HID support drivers mice: PS/2 mouse device common for all mice EXT3 FS 2.4-0.9.17, 10 Jan 2002 on sd(8,5), internal journal Adding Swap: 1172736k swap-space (priority -1) Adding Swap: 1204864k swap-space (priority -2) kjournald starting. Commit interval 5 seconds EXT3 FS 2.4-0.9.17, 10 Jan 2002 on sd(8,1), internal journal EXT3-fs: mounted filesystem with ordered data mode. kjournald starting. Commit interval 5 seconds EXT3 FS 2.4-0.9.17, 10 Jan 2002 on ide1(22,1), internal journal EXT3-fs: mounted filesystem with ordered data mode. kjournald starting. Commit interval 5 seconds EXT3 FS 2.4-0.9.17, 10 Jan 2002 on ide0(3,65), internal journal EXT3-fs: mounted filesystem with ordered data mode. kjournald starting. Commit interval 5 seconds EXT3 FS 2.4-0.9.17, 10 Jan 2002 on sd(8,65), internal journal EXT3-fs: mounted filesystem with ordered data mode. st: Version 20020205, bufsize 32768, wrt 30720, max init. bufs 4, s/g segs 16 Attached scsi tape st0 at scsi0, channel 0, id 3, lun 0 ********************* lspci ********************* [root@hydra log]# lspci -vv 00:00.0 Host bridge: Advanced Micro Devices [AMD] AMD-760 MP [IGD4-2P] System Controller (rev 11) Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap+ 66Mhz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- SERR- 00:01.0 PCI bridge: Advanced Micro Devices [AMD] AMD-760 MP [IGD4-2P] AGP Bridge (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66Mhz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- SERR- Reset- FastB2B- 00:07.0 ISA bridge: Advanced Micro Devices [AMD] AMD-768 [Opus] ISA (rev 05) Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66Mhz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- SERR- TAbort- SERR- TAbort- SERR- TAbort- SERR- [disabled] [size=128K] Capabilities: [dc] Power Management version 2 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- 00:09.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5701 Gigabit Ethernet (rev 15) Subsystem: 3Com Corporation 3C996B-T 1000Base-T Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap+ 66Mhz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- SERR- TAbort- SERR- Reset- FastB2B- 01:05.0 VGA compatible controller: Matrox Graphics, Inc. MGA G550 AGP (rev 01) (prog-if 00 [VGA]) Subsystem: Matrox Graphics, Inc. Millennium G550 Dual Head DDR 32Mb Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap+ 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- SERR- [disabled] [size=128K] Capabilities: [dc] Power Management version 2 Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Capabilities: [f0] AGP version 2.0 Status: RQ=31 SBA+ 64bit- FW- Rate=x1,x2,x4 Command: RQ=31 SBA+ AGP+ 64bit- FW- Rate=x1 02:00.0 USB Controller: Advanced Micro Devices [AMD] AMD-768 [Opus] USB (rev 07) (prog-if 10 [OHCI]) Subsystem: Advanced Micro Devices [AMD] AMD-768 [Opus] USB Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- SERR- TAbort- SERR- [disabled] [size=128K] Capabilities: [dc] Power Management version 2 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=2 PME- ************** messages ************** attached in a file ------=_20031118165757_12505--