From owner-freebsd-scsi Mon Jun 2 08:36:58 1997
Return-Path:
Received: (from root@localhost) by hub.freebsd.org (8.8.5/8.8.5) id IAA24408 for freebsd-scsi-outgoing; Mon, 2 Jun 1997 08:36:58 -0700 (PDT)
Received: from rdsw.com (rdsw.com [198.211.39.70]) by hub.freebsd.org (8.8.5/8.8.5) with ESMTP id IAA24402 for ; Mon, 2 Jun 1997 08:36:44 -0700 (PDT)
Received: (from bob@localhost) by rdsw.com (8.7.5/8.7.3) id KAA28015 for scsi@freebsd.org; Mon, 2 Jun 1997 10:36:41 -0500 (CDT)
From: Bob Dunaway
Message-Id: <199706021536.KAA28015@rdsw.com>
Subject: SCSI crashes with ahc on 2.2-STABLE
To: scsi@freebsd.org
Date: Mon, 2 Jun 1997 10:36:41 -0500 (CDT)
X-Mailer: ELM [version 2.4 PL24]
Content-Type: text
Sender: owner-freebsd-scsi@freebsd.org
X-Loop: FreeBSD.org
Precedence: bulk

Here is an example of a system crash with a SCSI AHC 2842 controller
after disk errors. I am running 2.2-STABLE as of 04:00 CST May 5.

I added a Quantum 2 GB Capella disk drive as target 2 several days ago.
After putting about 400 MB of data on the drive, I ran a dump to a DAT
tape. I got 11 read errors and then the system crashed. Both AWRE and
ARRE are enabled on the drive.

I then tried a tar to /dev/null to determine which files had errors,
and had several more system crashes after read errors. Shown below is
the error log from one test run. There were two bad files: the first
showed unrecovered read errors, and the second showed only the abort
messages. After I removed all of the files with bad sectors, I reran
the dump to tape and had no problems.

Following the error messages are the verbose boot messages and then the
kernel configuration file.
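
The read test described above (reading every file and discarding the data to find the ones with bad sectors) can be sketched as follows. This is a minimal Python stand-in for the tar run, not the command that was actually used; the function name `find_unreadable_files` and the chunk size are illustrative assumptions. It only helps when the read error comes back to the process as an I/O error; it does nothing if the kernel itself goes down.

```python
import os


def find_unreadable_files(root, chunk_size=64 * 1024):
    """Read every regular file under `root` and return a list of
    (path, errno) pairs for files whose contents could not be read.

    Hypothetical helper: the data is read and thrown away, much like
    writing an archive to /dev/null.
    """
    bad = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Skip symlinks, FIFOs, device nodes and other non-regular files.
            if os.path.islink(path) or not os.path.isfile(path):
                continue
            try:
                with open(path, "rb") as fh:
                    while fh.read(chunk_size):
                        pass  # discard the data; only errors matter
            except OSError as exc:
                bad.append((path, exc.errno))
    return bad
```

Calling `find_unreadable_files("/mnt")` on the mounted file system (the mount point is an assumption) returns an empty list when every file reads cleanly.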

Thanks,
Bob Dunaway
bob@rdsw.com

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

May 29 13:28:25 rds1 /kernel: sd2(ahc0:2:0): MEDIUM ERROR info:2c297d asc:11,0 Unrecovered read error
May 29 13:28:26 rds1 /kernel: , retries:4
May 29 13:28:28 rds1 /kernel: sd2(ahc0:2:0): MEDIUM ERROR info:2c297d asc:11,0 Unrecovered read error
May 29 13:28:28 rds1 /kernel: , retries:3
May 29 13:28:31 rds1 /kernel: sd2(ahc0:2:0): MEDIUM ERROR info:2c297d asc:11,0 Unrecovered read error
May 29 13:28:31 rds1 /kernel: , retries:2
May 29 13:28:33 rds1 /kernel: sd2(ahc0:2:0): MEDIUM ERROR info:2c297d asc:11,0 Unrecovered read error
May 29 13:28:33 rds1 /kernel: , retries:1
May 29 13:28:36 rds1 /kernel: sd2(ahc0:2:0): MEDIUM ERROR info:2c297d asc:11,0 Unrecovered read error
May 29 13:28:36 rds1 /kernel: , FAILURE
May 29 13:36:31 rds1 /kernel: sd2(ahc0:2:0): SCB 0x0 - timed out while idle, LASTPHASE == 0x1, SCSISIGI == 0x0
May 29 13:36:31 rds1 /kernel: SEQADDR = 0x4 SCSISEQ = 0x12 SSTAT0 = 0x0 SSTAT1 = 0xa
May 29 13:36:31 rds1 /kernel: sd2(ahc0:2:0): Queueing an Abort SCB
May 29 13:36:31 rds1 /kernel: sd2(ahc0:2:0): Abort Message Sent
May 29 13:36:31 rds1 /kernel: sd2(ahc0:2:0): SCB 0 - Abort Completed.
May 29 13:36:31 rds1 /kernel: sd2(ahc0:2:0): no longer in timeout

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

May 29 11:46:28 rds1 /kernel: Copyright (c) 1992-1997 FreeBSD Inc.
May 29 11:46:28 rds1 /kernel: Copyright (c) 1982, 1986, 1989, 1991, 1993
May 29 11:46:28 rds1 /kernel: The Regents of the University of California. All rights reserved.
May 29 11:46:28 rds1 /kernel:
May 29 11:46:28 rds1 /kernel: FreeBSD 2.2-STABLE #0: Thu May 22 14:37:21 CDT 1997
May 29 11:46:28 rds1 /kernel: root@rds1.rdsw.com:/usr/src-r2_2.2/sys/compile/HTI
May 29 11:46:28 rds1 /kernel: Calibrating clock(s) ... i8254 clock: 1193461 Hz
May 29 11:46:28 rds1 /kernel: CLK_USE_I8254_CALIBRATION not specified - using default frequency
May 29 11:46:28 rds1 /kernel: CPU: i486 DX2 (486-class CPU)
May 29 11:46:28 rds1 /kernel: Origin = "GenuineIntel" Id = 0x435 Stepping=5
May 29 11:46:28 rds1 /kernel: Features=0x3
May 29 11:46:28 rds1 /kernel: real memory = 16777216 (16384K bytes)
May 29 11:46:29 rds1 /kernel: avail memory = 14671872 (14328K bytes)
May 29 11:46:29 rds1 /kernel: ahc0: at 0x1c00-0x1cff irq 11 on eisa0 slot 1
May 29 11:46:29 rds1 /kernel: ahc0: Using Edge Triggered Interrupts
May 29 11:46:29 rds1 /kernel: ahc0: aic7770 >= Rev E, Single Channel, SCSI Id=7, 4 SCBs
May 29 11:46:29 rds1 /kernel: ahc0: Resetting Channel A
May 29 11:46:29 rds1 /kernel: ahc0: Downloading Sequencer Program...ahc0: 369 instructions downloaded
May 29 11:46:29 rds1 /kernel: Done
May 29 11:46:29 rds1 /kernel: ahc0: Probing channel A
May 29 11:46:29 rds1 /kernel: Choosing drivers for scbus configured at 0
May 29 11:46:29 rds1 /kernel: ahc0 waiting for scsi devices to settle
May 29 11:46:29 rds1 /kernel: ahc0: target 0 synchronous at 10.0MHz, offset = 0xf
May 29 11:46:29 rds1 /kernel: (ahc0:0:0): "MICROP 2217-15MZ1001905 HQ30" type 0 fixed SCSI 2
May 29 11:46:29 rds1 /kernel: sd0(ahc0:0:0): Direct-Access 1685MB (3450902 512 byte sectors)
May 29 11:46:29 rds1 /kernel: sd0(ahc0:0:0): with 2372 cyls, 15 heads, and an average 96 sectors/track
May 29 11:46:29 rds1 /kernel: ahc0: target 1 synchronous at 10.0MHz, offset = 0xf
May 29 11:46:29 rds1 /kernel: (ahc0:1:0): "QUANTUM EMPIRE_1080S 1220" type 0 fixed SCSI 2
May 29 11:46:29 rds1 /kernel: sd1(ahc0:1:0): Direct-Access 1029MB (2109376 512 byte sectors)
May 29 11:46:29 rds1 /kernel: sd1(ahc0:1:0): with 2874 cyls, 8 heads, and an average 91 sectors/track
May 29 11:46:30 rds1 /kernel: ahc0: target 2 synchronous at 10.0MHz, offset = 0xf
May 29 11:46:30 rds1 /kernel: (ahc0:2:0): "QUANTUM VP32210 81H8" type 0 fixed SCSI 2
May 29 11:46:30 rds1 /kernel: sd2(ahc0:2:0): Direct-Access 2103MB (4308352 512 byte sectors)
May 29 11:46:30 rds1 /kernel: sd2(ahc0:2:0): with 4243 cyls, 8 heads, and an average 126 sectors/track
May 29 11:46:30 rds1 /kernel: ahc0: target 5 synchronous at 5.0MHz, offset = 0xf
May 29 11:46:30 rds1 /kernel: (ahc0:5:0): "ARCHIVE Python 00367-XXX 5.23" type 1 removable SCSI 2
May 29 11:46:30 rds1 /kernel: st is configured at 0
May 29 11:46:30 rds1 /kernel: st0(ahc0:5:0): Sequential-Access density code 0x13, drive empty
May 29 11:46:30 rds1 /kernel: ahc0: target 6 synchronous at 4.0MHz, offset = 0xf
May 29 11:46:30 rds1 /kernel: (ahc0:6:0): "TOSHIBA CD-ROM XM-3401TA 1094" type 5 removable SCSI 2
May 29 11:46:30 rds1 /kernel: cd0(ahc0:6:0): CD-ROM can't get the size
May 29 11:46:30 rds1 /kernel: pcibus_setup(1): mode 1 addr port (0x0cf8) is 0xffffffff
May 29 11:46:30 rds1 /kernel: pcibus_setup(2): mode 2 enable port (0x0cf8) is 0xff
May 29 11:46:30 rds1 /kernel: Probing for devices on the ISA bus:
May 29 11:46:30 rds1 /kernel: sc0: the current keyboard controller command byte 0045
May 29 11:46:30 rds1 /kernel: kbdio: RESET_KBD return code:00fa
May 29 11:46:30 rds1 /kernel: kbdio: RESET_KBD status:00aa
May 29 11:46:30 rds1 /kernel: sc0 at 0x60-0x6f irq 1 on motherboard
May 29 11:46:30 rds1 /kernel: sc0: BIOS video mode:3
May 29 11:46:31 rds1 /kernel: sc0: VGA registers upon power-up
May 29 11:46:31 rds1 /kernel: 50 18 10 00 10 00 03 00 02 67 5f 4f 50 82 55 81
May 29 11:46:31 rds1 /kernel: bf 1f 00 4f 0e 0f 00 00 ff ff 9c 8e 8f 28 1f 96
May 29 11:46:31 rds1 /kernel: b9 a3 ff 00 01 02 03 04 05 14 07 38 39 3a 3b 3c
May 29 11:46:31 rds1 /kernel: 3d 3e 3f 0c 00 0f 08 00 00 00 00 00 10 0e 00 ff
May 29 11:46:31 rds1 /kernel: sc0: video mode:24
May 29 11:46:31 rds1 /kernel: sc0: VGA color <16 virtual consoles, flags=0x0>
May 29 11:46:31 rds1 /kernel: sio0 at 0x3f8-0x3ff irq 4 on isa
May 29 11:46:31 rds1 /kernel: sio0: type 16450
May 29 11:46:31 rds1 /kernel: sio1 not found at 0x2f8
May 29 11:46:31 rds1 /kernel: sio2: disabled, not probed.
May 29 11:46:31 rds1 /kernel: sio3: disabled, not probed.
May 29 11:46:31 rds1 /kernel: lpt0 at 0x378-0x37f irq 7 on isa
May 29 11:46:31 rds1 /kernel: lpt0: Interrupt-driven port
May 29 11:46:31 rds1 /kernel: lp0: TCP/IP capable interface
May 29 11:46:31 rds1 /kernel: lpt1 not found at 0xffffffff
May 29 11:46:31 rds1 /kernel: mse0: wrong signature ff
May 29 11:46:31 rds1 /kernel: mse0 not found at 0x23c
May 29 11:46:32 rds1 /kernel: psm0: disabled, not probed.
May 29 11:46:32 rds1 /kernel: fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
May 29 11:46:32 rds1 /kernel: fdc0: NEC 72065B
May 29 11:46:32 rds1 /kernel: fd0: 1.44MB 3.5in
May 29 11:46:32 rds1 /kernel: wdc0 not found at 0x1f0
May 29 11:46:32 rds1 /kernel: wdc1 not found at 0x170
May 29 11:46:32 rds1 /kernel: 1 3C5x9 board(s) on ISA found at 0x300
May 29 11:46:32 rds1 /kernel: ep0 at 0x300-0x30f irq 10 on isa
May 29 11:46:32 rds1 /kernel: ep0: aui/utp/bnc[*UTP*] address 00:20:af:52:37:4c
May 29 11:46:32 rds1 /kernel: npx0 on motherboard
May 29 11:46:32 rds1 /kernel: npx0: INT 16 interface
May 29 11:46:32 rds1 /kernel: apm0: disabled, not probed.
May 29 11:46:32 rds1 /kernel: imasks: bio c0000840, tty c0030492, net c0030492
May 29 11:46:32 rds1 /kernel: BIOS Geometries:
May 29 11:46:32 rds1 /kernel: 0:03fe3f20 0..1022=1023 cylinders, 0..63=64 heads, 1..32=32 sectors
May 29 11:46:32 rds1 /kernel: 1:03fe3f20 0..1022=1023 cylinders, 0..63=64 heads, 1..32=32 sectors
May 29 11:46:32 rds1 /kernel: 0 accounted for
May 29 11:46:32 rds1 /kernel: Device configuration finished.
May 29 11:46:33 rds1 /kernel: Considering FFS root f/s.
May 29 11:46:33 rds1 /kernel: changing root device to sd0a
May 29 11:46:33 rds1 /kernel: configure() finished.
May 29 11:46:33 rds1 /kernel: sd0s1: type 0x6, start 32, end = 204799, size 204768 : OK
May 29 11:46:33 rds1 /kernel: sd0s2: type 0xa5, start 204800, end = 3450879, size 3246080 : OK
May 29 11:46:33 rds1 /kernel: WARNING: / was not properly dismounted.
May 29 11:46:33 rds1 /kernel: sd1s1: type 0xa5, start 0, end = 2109375, size 2109376 : OK
May 29 11:46:33 rds1 /kernel: sd2s1: type 0xa5, start 63, end = 4305419, size 4305357 : OK
May 29 11:46:33 rds1 /kernel: sd1s1: type 0xa5, start 0, end = 2109375, size 2109376 : OK
May 29 11:46:33 rds1 /kernel: sd2s1: type 0xa5, start 63, end = 4305419, size 4305357 : OK
May 29 11:46:36 rds1 lpd[106]: restarted

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

#
# GENERIC -- Generic machine with WD/AHx/NCR/BTx family disks
#
# For more information read the handbook part System Administration ->
# Configuring the FreeBSD Kernel -> The Configuration File.
# The handbook is available in /usr/share/doc/handbook or online as
# latest version from the FreeBSD World Wide Web server
#
#
# An exhaustive list of options and more detailed explanations of the
# device lines is present in the ./LINT configuration file. If you are
# in doubt as to the purpose or necessity of a line, check first in LINT.
#
# $Id: GENERIC,v 1.77.2.2 1997/02/03 22:53:24 gibbs Exp $

machine         "i386"
#cpu            "I386_CPU"
cpu             "I486_CPU"
cpu             "I586_CPU"
cpu             "I686_CPU"
ident           RDS
maxusers        30

options         MATH_EMULATE            #Support for x87 emulation
options         INET                    #InterNETworking
options         FFS                     #Berkeley Fast Filesystem
options         NFS                     #Network Filesystem
options         MSDOSFS                 #MSDOS Filesystem
options         "CD9660"                #ISO 9660 Filesystem
options         PROCFS                  #Process filesystem
options         "COMPAT_43"             #Compatible with BSD 4.3 [KEEP THIS!]
options         SCSI_DELAY=15           #Be pessimistic about Joe SCSI device
options         BOUNCE_BUFFERS          #include support for DMA bounce buffers
options         UCONSOLE                #Allow users to grab the console
options         FAILSAFE                #Be conservative
options         USERCONFIG              #boot -c editor
options         VISUAL_USERCONFIG       #visual boot -c editor
options         CHILD_MAX=512
options         OPEN_MAN=512
options         SYSVSHM
options         SYSVSEM
options         SYSVMSG

config          kernel root on wd0

controller      isa0
controller      eisa0
controller      pci0

controller      fdc0 at isa? port "IO_FD1" bio irq 6 drq 2 vector fdintr
disk            fd0 at fdc0 drive 0
disk            fd1 at fdc0 drive 1
tape            ft0 at fdc0 drive 2

controller      wdc0 at isa? port "IO_WD1" bio irq 14 vector wdintr
disk            wd0 at wdc0 drive 0
disk            wd1 at wdc0 drive 1

controller      wdc1 at isa? port "IO_WD2" bio irq 15 vector wdintr
disk            wd2 at wdc1 drive 0
disk            wd3 at wdc1 drive 1

#options        ATAPI                   #Enable ATAPI support for IDE bus
#options        ATAPI_STATIC            #Don't do it as an LKM
#device         wcd0                    #IDE CD-ROM

# A single entry for any of these controllers (ncr, ahb, ahc, amd) is
# sufficient for any number of installed devices.
#controller     ncr0
#controller     amd0
#controller     ahb0
controller      ahc0
#controller     bt0 at isa? port "IO_BT0" bio irq ? vector bt_isa_intr
#controller     uha0 at isa? port "IO_UHA0" bio irq ? drq 5 vector uhaintr
#controller     aha0 at isa? port "IO_AHA0" bio irq ? drq 5 vector ahaintr
#controller     aic0 at isa? port 0x340 bio irq 11 vector aicintr
#controller     nca0 at isa? port 0x1f88 bio irq 10 vector ncaintr
#controller     nca1 at isa? port 0x350 bio irq 5 vector ncaintr
#controller     sea0 at isa? bio irq 5 iomem 0xc8000 iosiz 0x2000 vector seaintr

controller      scbus0 at ahc0
tape            st0 at scbus0 target 5
tape            st1 at scbus0 target 4

controller      scbus0

device          sd0

device          od0                     #See LINT for possible `od' options.

device          st0

device          cd0                     #Only need one of these, the code dynamically grows

#device         wt0 at isa? port 0x300 bio irq 5 drq 1 vector wtintr
#device         mcd0 at isa? port 0x300 bio irq 10 vector mcdintr

#controller     matcd0 at isa? port 0x230 bio

#device         scd0 at isa? port 0x230 bio

# syscons is the default console driver, resembling an SCO console
device          sc0 at isa? port "IO_KBD" tty irq 1 vector scintr
# Enable this and PCVT_FREEBSD for pcvt vt220 compatible console driver
#device         vt0 at isa? port "IO_KBD" tty irq 1 vector pcrint
#options        PCVT_FREEBSD=210        # pcvt running on FreeBSD >= 2.0.5
#options        XSERVER                 # include code for XFree86
#options        FAT_CURSOR              # start with block cursor
# If you have a ThinkPAD, uncomment this along with the rest of the PCVT lines
#options        PCVT_SCANSET=2          # IBM keyboards are non-std

# Mandatory, don't remove
device          npx0 at isa? port "IO_NPX" irq 13 vector npxintr

#
# Laptop support (see LINT for more options)
#
device          apm0 at isa? disable    # Advanced Power Management
options         APM_BROKEN_STATCLOCK    # Workaround some buggy APM BIOS

# PCCARD (PCMCIA) support
#controller     crd0
#device         pcic0 at crd?
#device         pcic1 at crd?

device          sio0 at isa? port "IO_COM1" tty irq 4 vector siointr
device          sio1 at isa? port "IO_COM2" tty irq 3 vector siointr
device          sio2 at isa? disable port "IO_COM3" tty irq 5 vector siointr
device          sio3 at isa? disable port "IO_COM4" tty irq 9 vector siointr

device          lpt0 at isa? port? tty irq 7 vector lptintr
device          lpt1 at isa? port? tty

device          mse0 at isa? port 0x23c tty irq 5 vector mseintr

device          psm0 at isa? disable port "IO_KBD" conflicts tty irq 12 vector psmintr

# Order is important here due to intrusive probes, do *not* alphabetize
# this list of network interfaces until the probes have been fixed.
# Right now it appears that the ie0 must be probed before ep0. See
# revision 1.20 of this file.
#device         de0
#device         fxp0
#device         vx0

#device         ed0 at isa? port 0x280 net irq 5 iomem 0xd8000 vector edintr
#device         ed1 at isa? port 0x300 net irq 5 iomem 0xd8000 vector edintr
#device         ie0 at isa? port 0x360 net irq 7 iomem 0xd0000 vector ieintr
device          ep0 at isa? port 0x300 net irq 10 vector epintr
#device         fe0 at isa? port 0x300 net irq ? vector feintr
#device         ix0 at isa? port 0x300 net irq 10 iomem 0xd0000 iosiz 32768 vector ixintr
#device         le0 at isa? port 0x300 net irq 5 iomem 0xd0000 vector le_intr
#device         lnc0 at isa? port 0x280 net irq 10 drq 0 vector lncintr
#device         ze0 at isa? port 0x300 net irq 5 iomem 0xd8000 vector zeintr
#device         zp0 at isa? port 0x300 net irq 10 iomem 0xd8000 vector zpintr

pseudo-device   loop
pseudo-device   ether
pseudo-device   log
pseudo-device   sl 1
# ijppp uses tun instead of ppp device
#pseudo-device  ppp 1
pseudo-device   tun 1
pseudo-device   pty 64
pseudo-device   gzip                    # Exec gzipped a.out's

# KTRACE enables the system-call tracing facility ktrace(2).
# This adds 4 KB bloat to your kernel, and slightly increases
# the costs of each syscall.
options         KTRACE                  #kernel tracing
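
For anyone comparing notes, the MEDIUM ERROR lines in the error log above can be tallied with a short script. This is a rough Python sketch, not part of the original report. It assumes the `info:` field is a hexadecimal logical block number (the usual meaning of the sense information field for a failed read on a disk); the regular expression and the function name `failing_blocks` are illustrative only.

```python
import re

# Matches the kernel message format seen in the log, for example:
#   sd2(ahc0:2:0): MEDIUM ERROR info:2c297d asc:11,0 Unrecovered read error
MEDIUM_ERROR = re.compile(
    r"(?P<dev>\w+)\(\w+:\d+:\d+\): MEDIUM ERROR info:(?P<info>[0-9a-fA-F]+)"
)


def failing_blocks(log_text):
    """Return {(device, block): count} for each MEDIUM ERROR in log_text.

    Assumption: the info value is hexadecimal and names the failing block.
    """
    counts = {}
    for match in MEDIUM_ERROR.finditer(log_text):
        key = (match.group("dev"), int(match.group("info"), 16))
        counts[key] = counts.get(key, 0) + 1
    return counts
```

Under that assumption, every retry in the log above points at the same block on sd2: `info:2c297d` is block 2894205 in decimal, which lies inside the 4308352 sectors reported for the drive at boot.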