Date: Tue, 18 Dec 2001 13:42:39 -0800 (PST)
From: Matthew Dillon <dillon@apollo.backplane.com>
To: "Justin T. Gibbs" <gibbs@scsiguy.com>
Cc: freebsd-current@freebsd.org
Subject: Hmm.... Adaptec SCB timeouts on -current, can reproduce at will
Message-ID: <200112182142.fBILgdc84928@apollo.backplane.com>

I can reproduce this at will using the following test. Note that the machine does seem to recover from the error eventually. (This is with the -current patches at http://apollo.backplane.com/FreeBSD/ applied; however, I do not believe the patches are related to the SCSI bus problems.)

Create a low-memory situation by artificially reducing the amount of memory in the machine, typically by adding this to /boot/loader.conf (and rebooting):

hw.physmem="64m"

Then run things which do a lot of I/O and eat memory. NEWS machines are best, but barring that, try running a whole lot of postmark processes (/usr/ports/benchmarks/postmark). Run this on a local softupdates-enabled filesystem. With restricted memory the machine will slow to a crawl, by the way.

#!/bin/tcsh
#
# runtest directory

mkdir $argv
chdir $argv
postmark << EOF
set size 262144 16000000
set number 64
set seed 1
set transactions 1000000000
set subdirectories 8
run
EOF

# END OF SCRIPT

foreach i ( t1 t2 t3 t4 t5 )
    ./runtest $i &
end

-Matt
Matthew Dillon <dillon@backplane.com>

Dec 18 13:32:05 test3 kernel: swap_pager: indefinite wait buffer: device: da0s1b, blkno: 11264, size: 12288
Dec 18 13:32:11 test3 kernel: swap_pager: indefinite wait buffer: device: da0s1b, blkno: 8728, size: 4096
Dec 18 13:32:13 test3 kernel: swap_pager: indefinite wait buffer: device: da0s1b, blkno: 11264, size: 12288
Dec 18 13:32:15 test3 kernel: swap_pager: indefinite wait buffer: device: da0s1b, blkno: 8728, size: 4096
Dec 18 13:32:16 test3 kernel: (da0:ahc0:0:0:0): SCB 0xe - timed out
Dec 18 13:32:18 test3 kernel: ahc0: Dumping Card State while idle, at SEQADDR 0x8
Dec 18 13:32:19 test3 kernel: ACCUM = 0x4, SINDEX = 0x64, DINDEX = 0x65, ARG_2 = 0x16
Dec 18 13:32:20 test3 kernel: HCNT = 0x0
Dec 18 13:32:20 test3 kernel: SCSISEQ = 0x12, SBLKCTL = 0xa
Dec 18 13:32:21 test3 kernel: DFCNTRL = 0x0, DFSTATUS = 0x89
Dec 18 13:32:21 test3 kernel: LASTPHASE = 0x1, SCSISIGI = 0x0, SXFRCTL0 = 0x80
Dec 18 13:32:21 test3 kernel: SSTAT0 = 0x0, SSTAT1 = 0x8
Dec 18 13:32:21 test3 kernel: SCSIPHASE = 0x0
Dec 18 13:32:22 test3 kernel: STACK == 0x3, 0x175, 0x160, 0x0
Dec 18 13:32:22 test3 kernel: SCB count = 140
Dec 18 13:32:22 test3 kernel: Kernel NEXTQSCB = 118
Dec 18 13:32:22 test3 kernel: Card NEXTQSCB = 118
Dec 18 13:32:22 test3 kernel: QINFIFO entries:
Dec 18 13:32:22 test3 kernel: Waiting Queue entries:
Dec 18 13:32:22 test3 kernel: Disconnected Queue entries: 21:88 31:127 17:50 30:19 16:33 13:38 29:49 14:97 22:36 4:25 25:115 0:108 9:52 1:26 5:8 19:9 12:99 8:58 7:21 24:78 23:30 18:119 11:80 27:86 26:109 28:48 6:40 3:110 10:45 15:12 2:17 20:14
Dec 18 13:32:22 test3 kernel: QOUTFIFO entries:
Dec 18 13:32:22 test3 kernel: Sequencer Free SCB List:
Dec 18 13:32:22 test3 kernel: Pending list: 88, 16, 113, 51, 24, 127, 92, 104, 4, 18, 67, 106, 129, 7, 61, 56, 0, 102, 76, 95, 77, 55, 81, 31, 37, 28, 63, 72, 139, 103, 68, 3, 87, 5, 46, 126, 123, 84, 82, 100, 111, 75, 50, 19, 33, 38, 49, 97, 36, 25, 115, 108, 52, 26, 8, 9, 99, 58, 21, 78, 30, 119, 80, 86, 109, 48, 40, 110, 45, 23, 15, 11, 116, 91, 12, 17, 14
Dec 18 13:32:22 test3 kernel: Kernel Free SCB list: 79 41 96 93 53 6 65 121 114 42 32 89 105 62 2 20 107 90 66 73 125 122 27 85 13 10 94 34 39 83 64 44 74 54 60 71 69 1 35 124 70 43 112 22 59 101 117 98 128 57 47 29 120 138 137 136 135 134 133 132 131 130
Dec 18 13:32:22 test3 kernel: sg[0] - Addr 0x3121000 : Length 4096
Dec 18 13:32:22 test3 kernel: sg[1] - Addr 0x2902000 : Length 2048
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): Queuing a BDR SCB
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): Bus Device Reset Message Sent
Dec 18 13:32:22 test3 kernel: swap_pager: indefinite wait buffer: device: da0s1b, blkno: 11264, size: 12288
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): no longer in timeout, status = 34b
Dec 18 13:32:22 test3 kernel: ahc0: Bus Device Reset on A:0. 77 SCBs aborted
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): WRITE(10). CDB: 2a 0 0 a8 0 6f 0 0 c 0
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): CAM Status: SCSI Status Error
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): SCSI Status: Check Condition
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): UNIT ATTENTION asc:29,3
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): Bus device reset function occurred
Dec 18 13:32:22 test3 kernel: (da0:ahc0:0:0:0): Retrying Command
Dec 18 13:35:51 test3 su: dillon to root on /dev/ttyp1

pciconf -l

chip0@pci0:0:0: class=0x060000 card=0x80441043 chip=0x25308086 rev=0x02 hdr=0x00
pcib1@pci0:1:0: class=0x060400 card=0x00000000 chip=0x25328086 rev=0x02 hdr=0x01
pcib2@pci0:30:0: class=0x060400 card=0x00000000 chip=0x244e8086 rev=0x02 hdr=0x01
isab0@pci0:31:0: class=0x060100 card=0x00000000 chip=0x24408086 rev=0x02 hdr=0x00
atapci0@pci0:31:1: class=0x010180 card=0x00000000 chip=0x244b8086 rev=0x02 hdr=0x00
none0@pci0:31:2: class=0x0c0300 card=0x00000000 chip=0x24428086 rev=0x02 hdr=0x00
none1@pci0:31:3: class=0x0c0500 card=0x00000000 chip=0x24438086 rev=0x02 hdr=0x00
none2@pci0:31:4: class=0x0c0300 card=0x00000000 chip=0x24448086 rev=0x02 hdr=0x00
none3@pci0:31:5: class=0x040100 card=0x80e4104d chip=0x24458086 rev=0x02 hdr=0x00
none4@pci1:0:0: class=0x030000 card=0x42111043 chip=0x011010de rev=0xa1 hdr=0x00
rl0@pci2:1:0: class=0x020000 card=0x80da104d chip=0x813910ec rev=0x10 hdr=0x00
none5@pci2:2:0: class=0x0c0010 card=0x80d2104d chip=0x8020104c rev=0x00 hdr=0x00
ahc0@pci2:10:0: class=0x010000 card=0xa1809005 chip=0x00109005 rev=0x00 hdr=0x00
none6@pci2:11:0: class=0x078000 card=0x040513e0 chip=0x044e11c1 rev=0x00 hdr=0x00

dmesg.boot

Copyright (c) 1992-2001 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD 5.0-CURRENT #15: Tue Dec 18 12:33:04 PST 2001
    dillon@apollo.backplane.com:/usr/obj/FreeBSD/FreeBSD-current/src/sys/DELL2550
Preloaded elf kernel "/boot/kernel/kernel" at 0xc03c5000.
Preloaded elf module "/boot/kernel/acpi.ko" at 0xc03c50a8.
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (1130.46-MHz 686-class CPU)
  Origin = "GenuineIntel" Id = 0x6b1 Stepping = 1
  Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory = 67108864 (65536K bytes)
avail memory = 61403136 (59964K bytes)
Changing APIC ID for IO APIC #0 from 0 to 2 on chip
Changing APIC ID for IO APIC #1 from 0 to 3 on chip
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): apic id: 1, version: 0x00040011, at 0xfee00000
 cpu1 (AP): apic id: 0, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id: 2, version: 0x000f0011, at 0xfec00000
 io1 (APIC): apic id: 3, version: 0x000f0011, at 0xfec01000
Pentium Pro MTRR support enabled
Using $PIR table, 8 entries at 0xc00fc270
npx0: <math processor> on motherboard
npx0: INT 16 interface
acpi0: <DELL PE2550 > on motherboard
acpi0: power button is handled as a fixed feature programming model.
Timecounter "ACPI" frequency 3579545 Hz
acpi_timer0: <32-bit timer at 3.579545MHz> port 0x808-0x80b on acpi0
acpi_cpu0: <CPU> on acpi0
acpi_cpu: CLK_VAL field overflows P_CNT register
acpi_cpu: CLK_VAL field overlaps THT_EN bit
acpi_cpu1: <CPU> on acpi0
acpi_pcib0: <Host-PCI bridge> on acpi0
pci0: <PCI bus> on acpi_pcib0
pci0: <display, VGA> at device 14.0 (no driver attached)
isab0: <PCI-ISA bridge> port 0x8a0-0x8af at device 15.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <ServerWorks ROSB4 ATA33 controller> port 0x8b0-0x8bf at device 15.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
acpi_pcib1: <Host-PCI bridge> on acpi0
IOAPIC #1 intpin 1 -> irq 2
pci1: <PCI bus> on acpi_pcib1
pci1: <network, ethernet> at device 8.0 (no driver attached)
acpi_pcib2: <Host-PCI bridge> on acpi0
IOAPIC #1 intpin 0 -> irq 5
pci2: <PCI bus> on acpi_pcib2
pcib3: <PCI-PCI bridge> at device 2.0 on pci2
IOAPIC #1 intpin 15 -> irq 10
IOAPIC #1 intpin 14 -> irq 11
pci3: <PCI bus> on pcib3
ahc0: <Adaptec aic7899 Ultra160 SCSI adapter> port 0xcc00-0xccff mem 0xfe7ff000-0xfe7fffff irq 10 at device 4.0 on pci3
aic7899: Ultra160 Wide Channel A, SCSI Id=7, 32/255 SCBs
ahc1: <Adaptec aic7899 Ultra160 SCSI adapter> port 0xc800-0xc8ff mem 0xfe7fe000-0xfe7fefff irq 11 at device 4.1 on pci3
aic7899: Ultra160 Wide Channel B, SCSI Id=7, 32/255 SCBs
fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xbcc0-0xbcff mem 0xfe500000-0xfe5fffff,0xfe900000-0xfe900fff irq 5 at device 4.0 on pci2
fxp0: Ethernet address 00:06:5b:19:be:6f
inphy0: <i82555 10/100 media interface> on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fdc0: <NEC 72065B or clone> port 0x3f7,0x3f0-0x3f5 irq 6 on acpi0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> port 0x64,0x60 irq 1 on acpi0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
sio0 port 0x3f8-0x3ff irq 4 on acpi0
sio0: type 16550A, console
sio1 port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
fdc-: fdc0 already exists, skipping it
ata-: ata0 already exists, skipping it
ata-: ata1 already exists, skipping it
atkbdc-: atkbdc0 already exists, skipping it
sio-: sio0 already exists, skipping it
sio-: sio1 already exists, skipping it
sc-: sc0 already exists, skipping it
vga-: vga0 already exists, skipping it
orm0: <Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xcdfff,0xec000-0xeffff on isa0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x100>
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 intpin 2
APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
acd0: CDROM <SAMSUNG CD-ROM SN-124> at ata0-master PIO4
Waiting 5 seconds for SCSI devices to settle
pass1 at ahc0 bus 0 target 6 lun 0
pass1: <DELL 1x4 U2W SCSI BP 1.27> Fixed Processor SCSI-2 device
pass1: 3.300MB/s transfers
da0 at ahc0 bus 0 target 0 lun 0
da0: <FUJITSU MAJ3091MC 5509> Fixed Direct Access SCSI-3 device
da0: 160.000MB/s transfers (80.000MHz, offset 127, 16bit), Tagged Queueing Enabled
da0: 8683MB (17783239 512 byte sectors: 255H 63S/T 1106C)
Mounting root from ufs:/dev/da0s1a
SMP: AP CPU #1 Launched!

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-current" in the body of the message
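
For anyone rerunning the test, the sequence in the kernel messages quoted earlier (swap_pager waits, an SCB timeout, a bus device reset, then a retry) can be tallied from a saved log. The following is only a rough sketch in plain sh, not something from the original post: the function name summarize_log and the output format are made up for illustration, and the grep patterns are taken directly from the message strings shown in this report, so they may not match other driver versions.

```shell
#!/bin/sh
# Hypothetical helper (not part of the original report): count the key
# events from an ahc timeout episode in a saved kernel log file.
# Usage: summarize_log /var/log/messages
summarize_log() {
    log=$1
    # grep -c prints the number of matching lines; it exits non-zero
    # when the count is 0, so "|| true" keeps a zero count from
    # aborting scripts that run under "set -e".
    waits=$(grep -c 'swap_pager: indefinite wait buffer' "$log" || true)
    timeouts=$(grep -c 'SCB 0x[0-9a-f]* - timed out' "$log" || true)
    resets=$(grep -c 'Bus Device Reset Message Sent' "$log" || true)
    retries=$(grep -c 'Retrying Command' "$log" || true)
    echo "swap_pager_waits=$waits scb_timeouts=$timeouts bdr_sent=$resets retries=$retries"
}
```

A non-zero retries count following a timeout would be consistent with the "machine does seem to recover" observation, though the counts alone do not prove that every aborted command was retried successfully.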