Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 18 Dec 2001 13:42:39 -0800 (PST)
From:      Matthew Dillon <dillon@apollo.backplane.com>
To:        "Justin T. Gibbs" <gibbs@scsiguy.com>
Cc:        freebsd-current@freebsd.org
Subject:   Hmm.... Adaptec SCB timeouts on -current, can reproduce at will
Message-ID:  <200112182142.fBILgdc84928@apollo.backplane.com>

next in thread | raw e-mail | index | archive | help
    I can reproduce this at will using the following test.  Note that the
    machine does seem to recover from the error eventually.

	(This is with -current patches at: http://apollo.backplane.com/FreeBSD/
	However, I do not believe the patches are related to the SCSI bus
	problems)

	Create a low-memory situation by artificially reducing the amount
	of memory in the machine, typically by adding this to
	/boot/loader.conf (and reboot):

	    hw.physmem="64m"

	Then run things which do a lot of I/O and eat memory.  NEWS machines
	are best, but baring that try running a whole lot of postmark
	processes (/usr/ports/benchmarks/postmark).  Run this on a local
	softupdates-enabled filesystem.  With restricted memory the machine 
	will slow to a crawl, by the way.

	#!/bin/tcsh
	#
	# runtest directory
	mkdir $argv
	chdir $argv
	postmark << EOF
	set size 262144 16000000
	set number 64 
	set seed 1
	set transactions 1000000000
	set subdirectories 8
	run
	EOF

	# END OF SCRIPT

	foreach i ( t1 t2 t3 t4 t5 )
	    ./runtest $i &
	end

					-Matt
					Matthew Dillon 
					<dillon@backplane.com>


Dec 18 13:32:05 test3 kernel: swap_pager: indefinite wait buffer: device: da0s1b, blkno: 11264, size: 12288
Dec 18 13:32:11 test3 kernel: swap_pager: indefinite wait buffer: device: da0s1b, blkno: 8728, size: 4096
Dec 18 13:32:13 test3 kernel: swap_pager: indefinite wait buffer: device: da0s1b, blkno: 11264, size: 12288
Dec 18 13:32:15 test3 kernel: swap_pager: indefinite wait buffer: device: da0s1b, blkno: 8728, size: 4096
Dec 18 13:32:16 test3 kernel: (da0:ahc0:0:0:0): SCB 0xe - timed out
Dec 18 13:32:18 test3 kernel: ahc0: Dumping Card State while idle, at SEQADDR 0x8
Dec 18 13:32:19 test3 kernel: ACCUM = 0x4, SINDEX = 0x64, DINDEX = 0x65, ARG_2 = 0x16
Dec 18 13:32:20 test3 kernel: HCNT = 0x0
Dec 18 13:32:20 test3 kernel: SCSISEQ = 0x12, SBLKCTL = 0xa
Dec 18 13:32:21 test3 kernel: DFCNTRL = 0x0, DFSTATUS = 0x89
Dec 18 13:32:21 test3 kernel: LASTPHASE = 0x1, SCSISIGI = 0x0, SXFRCTL0 = 0x80
Dec 18 13:32:21 test3 kernel: SSTAT0 = 0x0, SSTAT1 = 0x8
Dec 18 13:32:21 test3 kernel: SCSIPHASE = 0x0
Dec 18 13:32:22 test3 kernel: STACK == 0x3, 0x175, 0x160, 0x0
Dec 18 13:32:22 test3 kernel: SCB count = 140
Dec 18 13:32:22 test3 kernel: Kernel NEXTQSCB = 118
Dec 18 13:32:22 test3 kernel: Card NEXTQSCB = 118
Dec 18 13:32:22 test3 kernel: QINFIFO entries: 
Dec 18 13:32:22 test3 kernel: Waiting Queue entries: 
Dec 18 13:32:22 test3 kernel: Disconnected Queue entries: 21:88 31:127 17:50 30:19 16:33 13:38 29:49 14:97 22:36 4:25 25:115 0:108 9:52 1:26 5:8 19:9 12:99 8:58 7:21 24:78 23:30 18:119 11:80 27:86 26:109 28:48 6:40 3:110 10:45 15:12 2:17 20:14 
Dec 18 13:32:22 test3 kernel: QOUTFIFO entries: 
Dec 18 13:32:22 test3 kernel: Sequencer Free SCB List: 
Dec 18 13:32:22 test3 kernel: Pending list: 88, 16, 113, 51, 24, 127, 92, 104, 4, 18, 67, 106, 129, 7, 61, 56, 0, 102, 76, 95, 77, 55, 81, 31, 37, 28, 63, 72, 139, 103, 68, 3, 87, 5, 46, 126, 123, 84, 82, 100, 111, 75, 50, 19, 33, 38, 49, 97, 36, 25, 115, 108, 52, 26, 8, 9, 99, 58, 21, 78, 30, 119, 80, 86, 109, 48, 40, 110, 45, 23, 15, 11, 116, 91, 12, 17, 14
Dec 18 13:32:22 test3 kernel: Kernel Free SCB list: 79 41 96 93 53 6 65 121 114 42 32 89 105 62 2 20 107 90 66 73 125 122 27 85 13 10 94 34 39 83 64 44 74 54 60 71 69 1 35 124 70 43 112 22 59 101 117 98 128 57 47 29 120 138 137 136 135 134 133 132 131 130 
Dec 18 13:32:22 test3 kernel: sg[0] - Addr 0x3121000 : Length 4096
Dec 18 13:32:22 test3 kernel: sg[1] - Addr 0x2902000 : Length 2048
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): Queuing a BDR SCB
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): Bus Device Reset Message Sent
Dec 18 13:32:22 test3 kernel: swap_pager: indefinite wait buffer: device: da0s1b, blkno: 11264, size: 12288
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): no longer in timeout, status = 34b
Dec 18 13:32:22 test3 kernel: ahc0: Bus Device Reset on A:0. 77 SCBs aborted
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): WRITE(10). CDB: 2a 0 0 a8 0 6f 0 0 c 0 
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): CAM Status: SCSI Status Error
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): SCSI Status: Check Condition
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): UNIT ATTENTION asc:29,3
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): Bus device reset function occurred
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): Retrying Command
Dec 18 13:35:51 test3 su: dillon to root on /dev/ttyp1

pciconf -l

chip0@pci0:0:0: class=0x060000 card=0x80441043 chip=0x25308086 rev=0x02 hdr=0x00
pcib1@pci0:1:0: class=0x060400 card=0x00000000 chip=0x25328086 rev=0x02 hdr=0x01
pcib2@pci0:30:0:        class=0x060400 card=0x00000000 chip=0x244e8086 rev=0x02 hdr=0x01
isab0@pci0:31:0:        class=0x060100 card=0x00000000 chip=0x24408086 rev=0x02 hdr=0x00
atapci0@pci0:31:1:      class=0x010180 card=0x00000000 chip=0x244b8086 rev=0x02 hdr=0x00
none0@pci0:31:2:        class=0x0c0300 card=0x00000000 chip=0x24428086 rev=0x02 hdr=0x00
none1@pci0:31:3:        class=0x0c0500 card=0x00000000 chip=0x24438086 rev=0x02 hdr=0x00
none2@pci0:31:4:        class=0x0c0300 card=0x00000000 chip=0x24448086 rev=0x02 hdr=0x00
none3@pci0:31:5:        class=0x040100 card=0x80e4104d chip=0x24458086 rev=0x02 hdr=0x00
none4@pci1:0:0: class=0x030000 card=0x42111043 chip=0x011010de rev=0xa1 hdr=0x00
rl0@pci2:1:0:   class=0x020000 card=0x80da104d chip=0x813910ec rev=0x10 hdr=0x00
none5@pci2:2:0: class=0x0c0010 card=0x80d2104d chip=0x8020104c rev=0x00 hdr=0x00
ahc0@pci2:10:0: class=0x010000 card=0xa1809005 chip=0x00109005 rev=0x00 hdr=0x00
none6@pci2:11:0:        class=0x078000 card=0x040513e0 chip=0x044e11c1 rev=0x00 hdr=0x00

dmesg.boot

Copyright (c) 1992-2001 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD 5.0-CURRENT #15: Tue Dec 18 12:33:04 PST 2001
    dillon@apollo.backplane.com:/usr/obj/FreeBSD/FreeBSD-current/src/sys/DELL2550
Preloaded elf kernel "/boot/kernel/kernel" at 0xc03c5000.
Preloaded elf module "/boot/kernel/acpi.ko" at 0xc03c50a8.
Timecounter "i8254"  frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (1130.46-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x6b1  Stepping = 1
  Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory  = 67108864 (65536K bytes)
avail memory = 61403136 (59964K bytes)
Changing APIC ID for IO APIC #0 from 0 to 2 on chip
Changing APIC ID for IO APIC #1 from 0 to 3 on chip
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): apic id:  1, version: 0x00040011, at 0xfee00000
 cpu1 (AP):  apic id:  0, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id:  2, version: 0x000f0011, at 0xfec00000
 io1 (APIC): apic id:  3, version: 0x000f0011, at 0xfec01000
Pentium Pro MTRR support enabled
Using $PIR table, 8 entries at 0xc00fc270
npx0: <math processor> on motherboard
npx0: INT 16 interface
acpi0: <DELL   PE2550  > on motherboard
acpi0: power button is handled as a fixed feature programming model.
Timecounter "ACPI"  frequency 3579545 Hz
acpi_timer0: <32-bit timer at 3.579545MHz> port 0x808-0x80b on acpi0
acpi_cpu0: <CPU> on acpi0
acpi_cpu: CLK_VAL field overflows P_CNT register
acpi_cpu: CLK_VAL field overlaps THT_EN bit
acpi_cpu1: <CPU> on acpi0
acpi_pcib0: <Host-PCI bridge> on acpi0
pci0: <PCI bus> on acpi_pcib0
pci0: <display, VGA> at device 14.0 (no driver attached)
isab0: <PCI-ISA bridge> port 0x8a0-0x8af at device 15.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <ServerWorks ROSB4 ATA33 controller> port 0x8b0-0x8bf at device 15.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
acpi_pcib1: <Host-PCI bridge> on acpi0
IOAPIC #1 intpin 1 -> irq 2
pci1: <PCI bus> on acpi_pcib1
pci1: <network, ethernet> at device 8.0 (no driver attached)
acpi_pcib2: <Host-PCI bridge> on acpi0
IOAPIC #1 intpin 0 -> irq 5
pci2: <PCI bus> on acpi_pcib2
pcib3: <PCI-PCI bridge> at device 2.0 on pci2
IOAPIC #1 intpin 15 -> irq 10
IOAPIC #1 intpin 14 -> irq 11
pci3: <PCI bus> on pcib3
ahc0: <Adaptec aic7899 Ultra160 SCSI adapter> port 0xcc00-0xccff mem 0xfe7ff000-0xfe7fffff irq 10 at device 4.0 on pci3
aic7899: Ultra160 Wide Channel A, SCSI Id=7, 32/255 SCBs
ahc1: <Adaptec aic7899 Ultra160 SCSI adapter> port 0xc800-0xc8ff mem 0xfe7fe000-0xfe7fefff irq 11 at device 4.1 on pci3
aic7899: Ultra160 Wide Channel B, SCSI Id=7, 32/255 SCBs
fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xbcc0-0xbcff mem 0xfe500000-0xfe5fffff,0xfe900000-0xfe900fff irq 5 at device 4.0 on pci2
fxp0: Ethernet address 00:06:5b:19:be:6f
inphy0: <i82555 10/100 media interface> on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fdc0: <NEC 72065B or clone> port 0x3f7,0x3f0-0x3f5 irq 6 on acpi0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> port 0x64,0x60 irq 1 on acpi0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
sio0 port 0x3f8-0x3ff irq 4 on acpi0
sio0: type 16550A, console
sio1 port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
fdc-: fdc0 already exists, skipping it
ata-: ata0 already exists, skipping it
ata-: ata1 already exists, skipping it
atkbdc-: atkbdc0 already exists, skipping it
sio-: sio0 already exists, skipping it
sio-: sio1 already exists, skipping it
sc-: sc0 already exists, skipping it
vga-: vga0 already exists, skipping it
orm0: <Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xcdfff,0xec000-0xeffff on isa0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x100>
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 intpin 2
APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
acd0: CDROM <SAMSUNG CD-ROM SN-124> at ata0-master PIO4
Waiting 5 seconds for SCSI devices to settle
pass1 at ahc0 bus 0 target 6 lun 0
pass1: <DELL 1x4 U2W SCSI BP 1.27> Fixed Processor SCSI-2 device 
pass1: 3.300MB/s transfers
da0 at ahc0 bus 0 target 0 lun 0
da0: <FUJITSU MAJ3091MC 5509> Fixed Direct Access SCSI-3 device 
da0: 160.000MB/s transfers (80.000MHz, offset 127, 16bit), Tagged Queueing Enabled
da0: 8683MB (17783239 512 byte sectors: 255H 63S/T 1106C)
Mounting root from ufs:/dev/da0s1a
SMP: AP CPU #1 Launched!

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-current" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200112182142.fBILgdc84928>