Date: Sun, 17 Feb 2008 16:02:00 +0000
From: "Niki Denev" <nike_d@cytexbg.com>
To: "Justin T. Gibbs" <gibbs@scsiguy.com>
Cc: freebsd-current@freebsd.org
Subject: Re: Adaptec AHD U320 operating as only U160
Message-ID: <2e77fc10802170802w16fa5d6oe1e03b4660a7e85@mail.gmail.com>
In-Reply-To: <47B8582A.3020402@scsiguy.com>
References: <200704182239.59842.gelsemap@superhero.nl> <20080211184453.GA5605@dragon.NUXI.org> <2e77fc10802111330p738d2c93i9eb1189b307d732d@mail.gmail.com> <47B0C34E.7040804@samsco.org> <20080212010836.GA14441@dragon.NUXI.org> <2e77fc10802161506x5d790149lb688d221b0a96222@mail.gmail.com> <47B7D534.7000204@scsiguy.com> <2e77fc10802170142r51719253r682365fb7d274116@mail.gmail.com> <2e77fc10802170623k181c807dh52b13b01c121cd9f@mail.gmail.com> <47B8582A.3020402@scsiguy.com>

On Feb 17, 2008 3:52 PM, Justin T. Gibbs <gibbs@scsiguy.com> wrote:
> Niki Denev wrote:
> > After some bonnie64 runs the first machine (8.0-CURRENT Dtrace
> > snapshot with aic79xx_osm.c v1.30) spat out this :
> >
> > ahd1: Recovery Initiated - Card was not paused
> >>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
> > ahd1: Dumping Card State at program address 0x38 Mode 0x11
> > ...
>
> > SAVED_MODE[0x11] DFFSTAT[0x31]:(CURRFIFO_1|FIFO0FREE|FIFO1FREE)
> > SCSISIGI[0x0]:(P_DATAOUT) SCSIPHASE[0x0] SCSIBUS[0x0]
> > LASTPHASE[0x1]:(P_DATAOUT|P_BUSFREE) SCSISEQ0[0x0]
>
> According to the controller, the bus is idle. However...
>
> > Pending list:
> > 325 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
> > 403 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
> > 439 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
> > 380 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
> > 495 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
> > 353 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
> > 478 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
> > Total 7
>
> we have 7 commands still pending on the drive at ID 1, and the drive hasn't
> provided status within the allowed timeout period. There were lots of
> problems early on with Seagate drives locking up like this when hit with
> more than 31 (I think that was the magic number) concurrent commands with
> WCE (write cache enable) set. It was supposedly fixed in later firmware
> releases. Do you have the latest firmware for your drives?

No, they still have the default factory firmware, which is probably years
old. I will try to upgrade them.

> You might be able to confirm this hypothesis by lowering the queued depth
> via camcontrol and/or disabling WCE.

I will definitely try this if I'm able to reproduce the error, because I'm
seeing a panic on this machine, probably DTrace-related, that I'm trying
to investigate.

> --
> Justin
>

--Niki
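
For anyone reading the dump above and wondering how the Pending list maps to "the drive at ID 1": a minimal Python sketch that counts pending commands per target. It assumes that SCB_SCSIID carries the target ID in the upper nibble and the controller's own ID in the lower nibble, which is consistent with 0x17 being read as target 1 above but is not stated in the thread. The function name and the sample text are illustrative only.

```python
import re

# Assumption (not stated in the thread): SCB_SCSIID holds the target ID in
# the upper four bits and the initiator's own ID in the lower four bits.
TARGET_MASK = 0xF0
TARGET_SHIFT = 4

# Matches one "Pending list" entry, with or without "> > " quote prefixes.
PENDING_ENTRY = re.compile(
    r"(\d+)\s+FIFO_USE\[.*?SCB_SCSIID\[(0x[0-9a-fA-F]+)\]"
)


def pending_by_target(dump_text):
    """Return {target_id: number of pending commands} for a card-state dump."""
    counts = {}
    for line in dump_text.splitlines():
        match = PENDING_ENTRY.search(line)
        if match is None:
            continue  # not a pending-list entry
        scsiid = int(match.group(2), 16)
        target = (scsiid & TARGET_MASK) >> TARGET_SHIFT
        counts[target] = counts.get(target, 0) + 1
    return counts


# Pending list copied from the dump quoted in this message.
SAMPLE = """\
Pending list:
325 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
403 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
439 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
380 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
495 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
353 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
478 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x17]
Total 7
"""

if __name__ == "__main__":
    for target, count in sorted(pending_by_target(SAMPLE).items()):
        print(f"target {target}: {count} pending")
```

Under that assumption, all seven entries fall on a single target, which matches the reading that one drive stopped returning status rather than the whole bus misbehaving.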