From owner-freebsd-scsi Fri Oct 20 0:37: 6 2000
Delivered-To: freebsd-scsi@freebsd.org
Received: from zibbi.mikom.csir.co.za (zibbi.mikom.csir.co.za [146.64.24.58]) by hub.freebsd.org (Postfix) with ESMTP id 4D5A837B479 for ; Fri, 20 Oct 2000 00:36:49 -0700 (PDT)
Received: (from jhay@localhost) by zibbi.mikom.csir.co.za (8.11.0/8.11.0) id e9K7aiQ58036 for freebsd-scsi@freebsd.org; Fri, 20 Oct 2000 09:36:44 +0200 (SAT) (envelope-from jhay)
From: John Hay
Message-Id: <200010200736.e9K7aiQ58036@zibbi.mikom.csir.co.za>
Subject: ahc still broken?
To: freebsd-scsi@freebsd.org
Date: Fri, 20 Oct 2000 09:36:44 +0200 (SAT)
X-Mailer: ELM [version 2.4ME+ PL54 (25)]
MIME-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
Sender: owner-freebsd-scsi@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

Hi,

Is the ahc driver still broken in -stable? I have a Dell machine that has
been building -stable snapshots every night for months without a problem
(using a kernel and user level of April 14). Two days ago we had a power
failure and I thought it might be a good time to upgrade the machine to
the latest -stable, but now it gives SCSI errors and panics when building
the -stable snapshots. Previously I hadn't seen a single SCSI error, and
the machine had been up for months at a time.

Here is the console output of the boot, the SCSI errors and the panic,
captured through the serial port.

John
-- 
John Hay -- John.Hay@icomtek.csir.co.za

Console: serial port
BIOS drive A: is disk0
BIOS drive C: is disk1
BIOS 640kB/130040kB available memory

FreeBSD/i386 bootstrap loader, Revision 0.8
(jhay@dolphin.mikom.csir.co.za, Wed Oct 18 13:30:42 SAST 2000)
Loading /boot/defaults/loader.conf
/kernel text=0x1824a1 data=0x427e0+0x1bc18 syms=[0x4+0x29420+0x4+0x2d924]

Hit [Enter] to boot immediately, or any other key for command prompt.
Copyright (c) 1992-2000 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD 4.1.1-STABLE #0: Wed Oct 18 14:15:22 SAST 2000
    jhay@dolphin.mikom.csir.co.za:/usr/src/sys/compile/DOLPHIN
Timecounter "i8254" frequency 1193146 Hz
Timecounter "TSC" frequency 348916883 Hz
CPU: Pentium II/Pentium II Xeon/Celeron (348.92-MHz 686-class CPU)
  Origin = "GenuineIntel" Id = 0x652 Stepping = 2
  Features=0x183f9ff
real memory = 134209536 (131064K bytes)
avail memory = 127447040 (124460K bytes)
Preloaded elf kernel "kernel" at 0xc033a000.
Pentium Pro MTRR support enabled
npx0: on motherboard
npx0: INT 16 interface
pcib0: on motherboard
pci0: on pcib0
pcib1: at device 1.0 on pci0
pci1: on pcib1
pci1: at 0.0 irq 11
isab0: at device 7.0 on pci0
isa0: on isab0
atapci0: port 0xffa0-0xffaf at device 7.1 on pci0
ata1: at 0x170 irq 15 on atapci0
pci0: at 7.2 irq 11
intpm0: port 0x850-0x85f irq 9 at device 7.3 on pci0
intpm0: I/O mapped 850
intpm0: intr IRQ 9 enabled revision 0
smbus0: on intsmb0
smb0: on smbus0
intpm0: PM I/O mapped 800
pcib2: at device 15.0 on pci0
pci2: on pcib2
ahc0: port 0xdc00-0xdcff mem 0xfafff000-0xfaffffff irq 11 at device 10.0 on pci2
aic7895: Wide Channel A, SCSI Id=7, 32/255 SCBs
ahc1: port 0xd800-0xd8ff mem 0xfaffe000-0xfaffefff irq 11 at device 10.1 on pci2
aic7895: Single Channel B, SCSI Id=7, 32/255 SCBs
xl0: <3Com 3c905B-TX Fast Etherlink XL> port 0xcc00-0xcc7f mem 0xff000000-0xff00007f irq 11 at device 17.0 on pci0
xl0: Ethernet address: 00:c0:4f:71:b3:ab
miibus0: on xl0
xlphy0: <3Com internal media interface> on miibus0
xlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fdc0: at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: at port 0x60,0x64 on isa0
atkbd0: flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
psm0: irq 12 on atkbdc0
psm0: model Generic PS/2 mouse, device ID 0
vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x100>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A, console
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
ppc0: at port 0x378-0x37f irq 7 on isa0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/8 bytes threshold
lpt0: on ppbus0
lpt0: Interrupt-driven port
ppi0: on ppbus0
plip0: on ppbus0
pcm0: at port 0x534-0x537,0x388-0x38b,0x220-0x22f irq 5 drq 1,0 on isa0
acd0: CDROM at ata1-master using WDMA2
Waiting 2 seconds for SCSI devices to settle
Mounting root from ufs:/dev/da0s1a
da0 at ahc0 bus 0 target 0 lun 0
da0: Fixed Direct Access SCSI-2 device
da0: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing Enabled
da0: 8709MB (17836668 512 byte sectors: 255H 63S/T 1110C)
WARNING: / was not properly dismounted
swapon: adding /dev/da0s1b as swap device
Automatic boot in progress...
/dev/da0s1a: 1443 files, 38661 used, 60522 free (522 frags, 7500 blocks, 0.5% fragmentation)
/dev/da0s1f: 101065 files, 1289928 used, 742695 free (41839 frags, 87607 blocks, 2.1% fragmentation)
/dev/da0s1g: UNREF FILE I=79410 OWNER=root MODE=100555
/dev/da0s1g: SIZE=445920 MTIME=Oct 19 06:10 2000 (CLEARED)
/dev/da0s1g: UNREF FILE I=517315 OWNER=root MODE=100644
/dev/da0s1g: SIZE=9311 MTIME=Oct 19 10:02 2000 (CLEARED)
/dev/da0s1g: FREE BLK COUNT(S) WRONG IN SUPERBLK (SALVAGED)
/dev/da0s1g: SUMMARY INFORMATION BAD (SALVAGED)
/dev/da0s1g: BLK(S) MISSING IN BIT MAPS (SALVAGED)
/dev/da0s1g: 103481 files, 1421325 used, 1627617 free (26329 frags, 200161 blocks, 0.9% fragmentation)
/dev/da0s1h: 142048 files, 1809221 used, 1057687 free (5335 frags, 131544 blocks, 0.2% fragmentation)
/dev/da0s1e: 900 files, 29997 used, 69186 free (1282 frags, 8488 blocks, 1.3% fragmentation)
Doing initial network setup: hostname.
xl0: flags=8843 mtu 1500
    inet 146.64.28.14 netmask 0xffffff00 broadcast 146.64.28.255
    inet6 fe80::2c0:4fff:fe71:b3ab%xl0 prefixlen 64 tentative scopeid 0x1
    ether 00:c0:4f:71:b3:ab
    media: autoselect (100baseTX )
    status: active
    supported media: autoselect 100baseTX 100baseTX 10baseT/UTP 10baseT/UTP 100baseTX
lo0: flags=8049 mtu 16384
    inet6 fe80::1%lo0 prefixlen 64 scopeid 0x9
    inet6 ::1 prefixlen 128
    inet 127.0.0.1 netmask 0xff000000
add net default: gateway 146.64.28.30
Additional routing options: tcp extensions=NO TCP keepalive=YES.
routing daemons:.
Doing IPv6 network setup:add net ::ffff:0.0.0.0: gateway ::1
add net ::0.0.0.0: gateway ::1
net.inet6.ip6.forwarding: 0 -> 0
net.inet6.ip6.accept_rtadv: 0 -> 1
add net fe80::: gateway fe80::2c0:4fff:fe71:b3ab%xl0
add net ff02::: gateway fe80::2c0:4fff:fe71:b3ab%xl0
IPv4 mapped IPv6 address support=YES.
clearing /tmp
additional daemons: syslogd.
checking for core dump...savecore: no core dump
Doing additional network setup: ntpd.
Starting final network daemons:.
setting ELF ldconfig path: /usr/lib /usr/lib/compat /usr/X11R6/lib /usr/local/lib
setting a.out ldconfig path: /usr/lib/aout /usr/lib/compat/aout /usr/X11R6/lib/aout
starting standard daemons: cron printer sendmail sshd.
Initial rc.i386 initialization:.
rc.i386 configuring syscons: blank_time moused.
additional ABI support: linux.
starting local daemons:starting local daemons:. .
Local package initialization:.
Additional TCP options:.
Thu Oct 19 10:20:05 SAST 2000
Oct 19 10:58:31 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
Oct 19 10:59:40 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
Oct 19 11:00:39 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
Oct 19 11:01:46 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
Oct 19 15:20:41 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
Oct 19 15:28:08 dolphin /kernel: /mnt: optimization changed from SPACE to TIME
Oct 19 15:29:24 dolphin last message repeated 2 times
(da0:ahc0:0:0:0): data overrun detected in Data-out phase. Tag == 0x32.
(da0:ahc0:0:0:0): Have seen Data Phase. Length = 65536. NumSGs = 16.
sg[0] - Addr 0x328e000 : Length 4096
sg[1] - Addr 0x440f000 : Length 4096
sg[2] - Addr 0x2f70000 : Length 4096
sg[3] - Addr 0xd91000 : Length 4096
sg[4] - Addr 0x3872000 : Length 4096
sg[5] - Addr 0x9f3000 : Length 4096
sg[6] - Addr 0x4394000 : Length 4096
sg[7] - Addr 0x3b55000 : Length 4096
sg[8] - Addr 0xcb6000 : Length 4096
sg[9] - Addr 0x33b7000 : Length 4096
sg[10] - Addr 0x40f8000 : Length 4096
sg[11] - Addr 0x2879000 : Length 4096
sg[12] - Addr 0x6d3a000 : Length 4096
sg[13] - Addr 0x56db000 : Length 4096
sg[14] - Addr 0x3d7c000 : Length 4096
sg[15] - Addr 0x735d000 : Length 4096
(da0:ahc0:0:0:0): data overrun detected in Data-out phase. Tag == 0x32.
(da0:ahc0:0:0:0): Have seen Data Phase. Length = 65536. NumSGs = 16.
sg[0] - Addr 0x328e000 : Length 4096
sg[1] - Addr 0x440f000 : Length 4096
sg[2] - Addr 0x2f70000 : Length 4096
sg[3] - Addr 0xd91000 : Length 4096
sg[4] - Addr 0x3872000 : Length 4096
sg[5] - Addr 0x9f3000 : Length 4096
sg[6] - Addr 0x4394000 : Length 4096
sg[7] - Addr 0x3b55000 : Length 4096
sg[8] - Addr 0xcb6000 : Length 4096
sg[9] - Addr 0x33b7000 : Length 4096
sg[10] - Addr 0x40f8000 : Length 4096
sg[11] - Addr 0x2879000 : Length 4096
sg[12] - Addr 0x6d3a000 : Length 4096
sg[13] - Addr 0x56db000 : Length 4096
sg[14] - Addr 0x3d7c000 : Length 4096
sg[15] - Addr 0x735d000 : Length 4096
(da0:ahc0:0:0:0): data overrun detected in Data-out phase. Tag == 0x4.
(da0:ahc0:0:0:0): Have seen Data Phase. Length = 65536. NumSGs = 16.
sg[0] - Addr 0x12fe000 : Length 4096
sg[1] - Addr 0x61bf000 : Length 4096
sg[2] - Addr 0x6a80000 : Length 4096
sg[3] - Addr 0x67a1000 : Length 4096
sg[4] - Addr 0x2f22000 : Length 4096
sg[5] - Addr 0x6e63000 : Length 4096
sg[6] - Addr 0x3d04000 : Length 4096
sg[7] - Addr 0x40c5000 : Length 4096
sg[8] - Addr 0x45c6000 : Length 4096
sg[9] - Addr 0x61a7000 : Length 4096
sg[10] - Addr 0x7b48000 : Length 4096
sg[11] - Addr 0x1009000 : Length 4096
sg[12] - Addr 0x44ea000 : Length 4096
sg[13] - Addr 0x49eb000 : Length 4096
sg[14] - Addr 0x16cc000 : Length 4096
sg[15] - Addr 0xded000 : Length 4096
(da0:ahc0:0:0:0): data overrun detected in Data-out phase. Tag == 0x30.
(da0:ahc0:0:0:0): Have seen Data Phase. Length = 65536. NumSGs = 16.
sg[0] - Addr 0x352e000 : Length 4096
sg[1] - Addr 0x1aef000 : Length 4096
sg[2] - Addr 0x3930000 : Length 4096
sg[3] - Addr 0x3ff1000 : Length 4096
sg[4] - Addr 0x71b2000 : Length 4096
sg[5] - Addr 0x63b3000 : Length 4096
sg[6] - Addr 0x2b14000 : Length 4096
sg[7] - Addr 0x56b5000 : Length 4096
sg[8] - Addr 0x77f6000 : Length 4096
sg[9] - Addr 0x69f7000 : Length 4096
sg[10] - Addr 0x6538000 : Length 4096
sg[11] - Addr 0x54d9000 : Length 4096
sg[12] - Addr 0x4c5a000 : Length 4096
sg[13] - Addr 0x141b000 : Length 4096
sg[14] - Addr 0x765c000 : Length 4096
sg[15] - Addr 0x4f1d000 : Length 4096
(da0:ahc0:0:0:0): data overrun detected in Data-out phase. Tag == 0x3a.
(da0:ahc0:0:0:0): Have seen Data Phase. Length = 65536. NumSGs = 16.
sg[0] - Addr 0x793e000 : Length 4096
sg[1] - Addr 0x1c3f000 : Length 4096
sg[2] - Addr 0xc40000 : Length 4096
sg[3] - Addr 0x2c61000 : Length 4096
sg[4] - Addr 0x7662000 : Length 4096
sg[5] - Addr 0x4163000 : Length 4096
sg[6] - Addr 0x78c4000 : Length 4096
sg[7] - Addr 0x1685000 : Length 4096
sg[8] - Addr 0x4906000 : Length 4096
sg[9] - Addr 0x2927000 : Length 4096
sg[10] - Addr 0x14c8000 : Length 4096
sg[11] - Addr 0x6e69000 : Length 4096
sg[12] - Addr 0x788a000 : Length 4096
sg[13] - Addr 0x1f6b000 : Length 4096
sg[14] - Addr 0x6fcc000 : Length 4096
sg[15] - Addr 0x366d000 : Length 4096
SCB 0x27 - timed out in Message-in phase, SEQADDR == 0x15b
sg[0] - Addr 0x322e000 : Length 4096
sg[1] - Addr 0x730f000 : Length 4096
sg[2] - Addr 0x35d0000 : Length 4096
sg[3] - Addr 0x6b71000 : Length 4096
sg[4] - Addr 0x5432000 : Length 4096
sg[5] - Addr 0x4453000 : Length 4096
sg[6] - Addr 0x3db4000 : Length 4096
sg[7] - Addr 0x6615000 : Length 4096
sg[8] - Addr 0x4556000 : Length 4096
sg[9] - Addr 0x5c77000 : Length 4096
sg[10] - Addr 0x98000 : Length 4096
sg[11] - Addr 0x2099000 : Length 4096
sg[12] - Addr 0x45da000 : Length 4096
sg[13] - Addr 0x263b000 : Length 4096
sg[14] - Addr 0x205c000 : Length 4096
sg[15] - Addr 0x52bd000 : Length 4096
(da0:ahc0:0:0:0): BDR message in message buffer
SCB 0x30 - timed out in Message-in phase, SEQADDR == 0x159
sg[0] - Addr 0x4d6b000 : Length 3072
panic: Disconnected List inconsistency. SCB index == 255, yet numscbs == 100.

syncing disks...

Fatal trap 12: page fault while in kernel mode
fault virtual address = 0x30
fault code = supervisor read, page not present
instruction pointer = 0x8:0xc01e4500
stack pointer = 0x10:0xc0285224
frame pointer = 0x10:0xc0285228
code segment = base 0x0, limit 0xfffff, type 0x1b
             = DPL 0, pres 1, def32 1, gran 1
processor eflags = interrupt enabled, resume, IOPL = 0
current process = Idle
interrupt mask = bio cam
trap number = 12
panic: page fault
Uptime: 19h12m3s
(da0:ahc0:0:0:0): Synchronize cache failed, status == 0xb, scsi status == 0x0

dumping to dev #da/0x20001, offset 761872
dump Aborting dump due to I/O error. status == 0xb, scsi status == 0x0
failed, reason: i/o error
Automatic reboot in 15 seconds - press a key on the console to abort
Rebooting...

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-scsi" in the body of the message