Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 14 Mar 2000 10:55:21 +0000
From:      Karl Pielorz <kpielorz@tdx.co.uk>
To:        scsi@freebsd.org
Subject:   timed out CCB already completed - DPT 3.4-STABLE (14/3/00) & Softupdates
Message-ID:  <38CE1A99.E8DAB07D@tdx.co.uk>

next in thread | raw e-mail | index | archive | help
Hi,

We've got a number of HP Netservers. These run with FreeBSD 2.2.7 (waiting to
be upgraded), with 2 x 9Gb Disks (Seagates), a DPT SmartRAID IV, w/64Mb of RAM
on board, running as 1 x 9Gb RAID1 drive.

We recently installed FreeBSD 3.4-STABLE onto one of these machines, running
an identical config (down to amount / type of RAM, drives, firmware on DPT and
machine etc.) - and using Sofupdates. The system is not very happy :(

Being as they're HP 'hot-swap' drives, using the HP cabling loom - we're
extreemly sure the cabling / termination is not at fault, as is confirmed by
the way we've swapped things around (we have 3 'absolutely' identical
machines).

The 3.4-STABLE box died on it's first night, around 3am. The console indicated
the swapper had timed out trying to get a block off the disk, and that the
system had raised a number of CCB errors. 

We've increased the RAM in all 3 boxes to 192Mb's now, and set the
"DPT_LOST_IRQ" option in the kernel on the 3.4 box (after reading the mailing
list archives). The 3.4 box now 'lives', but it's still not happy...

We get a lot (and I mean I lot) of:

(da0:dpt:0:0:0): CCB 0xXXXXXXXX timed out CCB already complated
(da0:dpt:0:0:0): CCB 0xXXXXXXXX timed out CCB
(da0:dpt:0:0:0): CCB 0xXXXXXXXX timed out CCB already complated

(da0:dpt:0:0:0): CCB 0xYYYYYYYY timed out CCB already complated
(da0:dpt:0:0:0): CCB 0xYYYYYYYY timed out CCB
(da0:dpt:0:0:0): CCB 0xYYYYYYYY timed out CCB already complated

Each error is repeated with the same CCB number 3 times (twice as 'already
completed' and once as 'timed out').

The machine 'lurches' around under disk access, and is generally not too
well...

Has anyone got any suggestions at all? We've tried moving the DPT around to
different IRQ's / slots, but everything we've done so far (apart from put
2.2.7 back on it) has had no effect.

I don't want to post the whole kernel config to the list unless really needed,
we've taken a stock GENERIC kernel and make sure 'dpt' is enabled, as well as
now 'DPT_LOST_IRQ'.

The problem occurs in both SMP and UMP mode, I'd really like to get this fixed
- as the 2.2.7 boxes (even when switched over to the same 'problem'
3.4-STABLE's hardware) doesn't exhibit this problem at all...

dmesg output is enclosed (this is from an SMP boot, but the problem doesn't
change under single CPU kernel either, also the Symbios and DPT appear to be
on the same IRQ, this changes when we move the DPT around to different slots
[obviously], but doesnt' affect the problem. The Symbios is not used at the
moment, and has just an empty / terminated [by the HP's loom / termination
units] bus on it. I've tried removing it from the Kernel config, and it
doesn't make any difference (even if it's on a different Slot/IRQ).

Regards,

Karl

---

Copyright (c) 1992-1999 FreeBSD Inc.
Copyright (c) 1982, 1986, 1989, 1991, 1993
        The Regents of the University of California. All rights reserved.
FreeBSD 3.4-STABLE #2: Mon Mar 13 23:03:35 GMT 2000
    root@:/usr/src/sys/compile/VIPER-SMP
Timecounter "i8254"  frequency 1193182 Hz
CPU: Pentium II/Xeon/Celeron (686-class CPU)
  Origin = "GenuineIntel"  Id = 0x652  Stepping = 2
 
Features=0x183fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR>
real memory  = 201326592 (196608K bytes)
avail memory = 192987136 (188464K bytes)
Programming 24 pins in IOAPIC #0
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id:  1, version: 0x00040011, at 0xfee00000
 cpu1 (AP):  apic id:  0, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id:  2, version: 0x00170011, at 0xfec00000
Preloaded elf kernel "kernel" at 0xc029c000.
Preloaded userconfig_script "/boot/kernel.conf" at 0xc029c09c.
Pentium Pro MTRR support enabled
Probing for devices on PCI bus 0:
chip0: <Intel 82443BX host to PCI bridge (AGP disabled)> rev 0x02 on pci0.0.0
chip1: <Intel 82371AB PCI to ISA bridge> rev 0x02 on pci0.4.0
ide_pci0: <Intel PIIX4 Bus-master IDE controller> rev 0x01 on pci0.4.1
chip2: <Intel 82371AB Power management controller> rev 0x02 on pci0.4.3
chip3: <PCI to PCI bridge (vendor=1011 device=0024)> rev 0x03 on pci0.7.0
vga0: <Cirrus Logic GD5446 SVGA controller> rev 0x45 on pci0.13.0
Probing for devices on PCI bus 1:
dpt0: <DPT Caching SCSI RAID Controller> rev 0x02 int a irq 18 on pci1.2.0
dpt0: DPT PM2044UW FW Rev. 07M1, 1 channel, 64 CCBs
fxp0: <Intel EtherExpress Pro 10/100B Ethernet> rev 0x02 int a irq 19 on
pci1.3.0
fxp0: Ethernet address 00:60:b0:67:89:6f
ncr0: <ncr 53c895 fast40 wide scsi> rev 0x01 int a irq 18 on pci1.4.0
Probing for devices on the ISA bus:
sc0 on isa
sc0: VGA color <16 virtual consoles, flags=0x0>
atkbdc0 at 0x60-0x6f on motherboard
atkbd0 irq 1 on isa
psm0 not found
sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa
sio0: type 16550A
sio1 at 0x2f8-0x2ff irq 3 on isa
sio1: type 16550A
fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
fdc0: FIFO enabled, 8 bytes threshold
fd0: 1.44MB 3.5in
wdc0 at 0x1f0-0x1f7 irq 14 flags 0xa0ffa0ff on isa
wdc0: unit 0 (atapi): <CD-ROM  CDR-U240/1.10>, removable, accel, dma, iordis
acd0: drive speed 1722 - 4037KB/sec, 256KB cache
acd0: supported read types: CD-DA
acd0: Audio: play, 256 volume levels
acd0: Mechanism: ejectable tray
acd0: Medium: no/blank disc inside, unlocked
ppc0 at 0x378 irq 7 flags 0x40 on isa
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
lpt0: <generic printer> on ppbus 0
lpt0: Interrupt-driven port
ppi0: <generic parallel i/o> on ppbus 0
plip0: <PLIP network interface> on ppbus 0
vga0 at 0x3b0-0x3df maddr 0xa0000 msize 131072 on isa
npx0 on motherboard
npx0: INT 16 interface
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IO APIC int pin 2
APIC_IO: routing 8254 via 8259 on pin 0
Waiting 2 seconds for SCSI devices to settle
SMP: AP CPU #1 Launched!
changing root device to da0s1a
da0 at dpt0 bus 0 target 0 lun 0
da0: <DPT RAID-1 07M1> Fixed Direct Access SCSI-2 device
da0: 8678MB (17773012 512 byte sectors: 255H 63S/T 1106C)


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-scsi" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?38CE1A99.E8DAB07D>