Date: Tue, 14 Mar 2000 10:55:21 +0000 From: Karl Pielorz <kpielorz@tdx.co.uk> To: scsi@freebsd.org Subject: timed out CCB already completed - DPT 3.4-STABLE (14/3/00) & Softupdates Message-ID: <38CE1A99.E8DAB07D@tdx.co.uk>
next in thread | raw e-mail | index | archive | help
Hi, We've got a number of HP Netservers. These run with FreeBSD 2.2.7 (waiting to be upgraded), with 2 x 9Gb Disks (Seagates), a DPT SmartRAID IV, w/64Mb of RAM on board, running as 1 x 9Gb RAID1 drive. We recently installed FreeBSD 3.4-STABLE onto one of these machines, running an identical config (down to amount / type of RAM, drives, firmware on DPT and machine etc.) - and using Sofupdates. The system is not very happy :( Being as they're HP 'hot-swap' drives, using the HP cabling loom - we're extreemly sure the cabling / termination is not at fault, as is confirmed by the way we've swapped things around (we have 3 'absolutely' identical machines). The 3.4-STABLE box died on it's first night, around 3am. The console indicated the swapper had timed out trying to get a block off the disk, and that the system had raised a number of CCB errors. We've increased the RAM in all 3 boxes to 192Mb's now, and set the "DPT_LOST_IRQ" option in the kernel on the 3.4 box (after reading the mailing list archives). The 3.4 box now 'lives', but it's still not happy... We get a lot (and I mean I lot) of: (da0:dpt:0:0:0): CCB 0xXXXXXXXX timed out CCB already complated (da0:dpt:0:0:0): CCB 0xXXXXXXXX timed out CCB (da0:dpt:0:0:0): CCB 0xXXXXXXXX timed out CCB already complated (da0:dpt:0:0:0): CCB 0xYYYYYYYY timed out CCB already complated (da0:dpt:0:0:0): CCB 0xYYYYYYYY timed out CCB (da0:dpt:0:0:0): CCB 0xYYYYYYYY timed out CCB already complated Each error is repeated with the same CCB number 3 times (twice as 'already completed' and once as 'timed out'). The machine 'lurches' around under disk access, and is generally not too well... Has anyone got any suggestions at all? We've tried moving the DPT around to different IRQ's / slots, but everything we've done so far (apart from put 2.2.7 back on it) has had no effect. I don't want to post the whole kernel config to the list unless really needed, we've taken a stock GENERIC kernel and make sure 'dpt' is enabled, as well as now 'DPT_LOST_IRQ'. The problem occurs in both SMP and UMP mode, I'd really like to get this fixed - as the 2.2.7 boxes (even when switched over to the same 'problem' 3.4-STABLE's hardware) doesn't exhibit this problem at all... dmesg output is enclosed (this is from an SMP boot, but the problem doesn't change under single CPU kernel either, also the Symbios and DPT appear to be on the same IRQ, this changes when we move the DPT around to different slots [obviously], but doesnt' affect the problem. The Symbios is not used at the moment, and has just an empty / terminated [by the HP's loom / termination units] bus on it. I've tried removing it from the Kernel config, and it doesn't make any difference (even if it's on a different Slot/IRQ). Regards, Karl --- Copyright (c) 1992-1999 FreeBSD Inc. Copyright (c) 1982, 1986, 1989, 1991, 1993 The Regents of the University of California. All rights reserved. FreeBSD 3.4-STABLE #2: Mon Mar 13 23:03:35 GMT 2000 root@:/usr/src/sys/compile/VIPER-SMP Timecounter "i8254" frequency 1193182 Hz CPU: Pentium II/Xeon/Celeron (686-class CPU) Origin = "GenuineIntel" Id = 0x652 Stepping = 2 Features=0x183fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR> real memory = 201326592 (196608K bytes) avail memory = 192987136 (188464K bytes) Programming 24 pins in IOAPIC #0 FreeBSD/SMP: Multiprocessor motherboard cpu0 (BSP): apic id: 1, version: 0x00040011, at 0xfee00000 cpu1 (AP): apic id: 0, version: 0x00040011, at 0xfee00000 io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000 Preloaded elf kernel "kernel" at 0xc029c000. Preloaded userconfig_script "/boot/kernel.conf" at 0xc029c09c. Pentium Pro MTRR support enabled Probing for devices on PCI bus 0: chip0: <Intel 82443BX host to PCI bridge (AGP disabled)> rev 0x02 on pci0.0.0 chip1: <Intel 82371AB PCI to ISA bridge> rev 0x02 on pci0.4.0 ide_pci0: <Intel PIIX4 Bus-master IDE controller> rev 0x01 on pci0.4.1 chip2: <Intel 82371AB Power management controller> rev 0x02 on pci0.4.3 chip3: <PCI to PCI bridge (vendor=1011 device=0024)> rev 0x03 on pci0.7.0 vga0: <Cirrus Logic GD5446 SVGA controller> rev 0x45 on pci0.13.0 Probing for devices on PCI bus 1: dpt0: <DPT Caching SCSI RAID Controller> rev 0x02 int a irq 18 on pci1.2.0 dpt0: DPT PM2044UW FW Rev. 07M1, 1 channel, 64 CCBs fxp0: <Intel EtherExpress Pro 10/100B Ethernet> rev 0x02 int a irq 19 on pci1.3.0 fxp0: Ethernet address 00:60:b0:67:89:6f ncr0: <ncr 53c895 fast40 wide scsi> rev 0x01 int a irq 18 on pci1.4.0 Probing for devices on the ISA bus: sc0 on isa sc0: VGA color <16 virtual consoles, flags=0x0> atkbdc0 at 0x60-0x6f on motherboard atkbd0 irq 1 on isa psm0 not found sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa sio0: type 16550A sio1 at 0x2f8-0x2ff irq 3 on isa sio1: type 16550A fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa fdc0: FIFO enabled, 8 bytes threshold fd0: 1.44MB 3.5in wdc0 at 0x1f0-0x1f7 irq 14 flags 0xa0ffa0ff on isa wdc0: unit 0 (atapi): <CD-ROM CDR-U240/1.10>, removable, accel, dma, iordis acd0: drive speed 1722 - 4037KB/sec, 256KB cache acd0: supported read types: CD-DA acd0: Audio: play, 256 volume levels acd0: Mechanism: ejectable tray acd0: Medium: no/blank disc inside, unlocked ppc0 at 0x378 irq 7 flags 0x40 on isa ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode lpt0: <generic printer> on ppbus 0 lpt0: Interrupt-driven port ppi0: <generic parallel i/o> on ppbus 0 plip0: <PLIP network interface> on ppbus 0 vga0 at 0x3b0-0x3df maddr 0xa0000 msize 131072 on isa npx0 on motherboard npx0: INT 16 interface APIC_IO: Testing 8254 interrupt delivery APIC_IO: Broken MP table detected: 8254 is not connected to IO APIC int pin 2 APIC_IO: routing 8254 via 8259 on pin 0 Waiting 2 seconds for SCSI devices to settle SMP: AP CPU #1 Launched! changing root device to da0s1a da0 at dpt0 bus 0 target 0 lun 0 da0: <DPT RAID-1 07M1> Fixed Direct Access SCSI-2 device da0: 8678MB (17773012 512 byte sectors: 255H 63S/T 1106C) To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-scsi" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?38CE1A99.E8DAB07D>