From owner-freebsd-current@FreeBSD.ORG Thu Jan 13 08:42:54 2011 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 443E21065670 for ; Thu, 13 Jan 2011 08:42:54 +0000 (UTC) (envelope-from barbara.xxx1975@libero.it) Received: from cp-out4.libero.it (cp-out4.libero.it [212.52.84.104]) by mx1.freebsd.org (Postfix) with ESMTP id AE34E8FC1C for ; Thu, 13 Jan 2011 08:42:53 +0000 (UTC) X-CTCH-Spam: Unknown X-CTCH-RefID: str=0001.0A0B0201.4D2EBB0B.01A9,ss=1,re=0.000,fgs=0 X-libjamoibt: 1419 Received: from wmail45 (172.31.0.235) by cp-out4.libero.it (8.5.133) (authenticated as barbara.xxx1975@libero.it) id 4D10BF990151486A; Thu, 13 Jan 2011 09:42:51 +0100 Message-ID: <31888702.2008831294908171515.JavaMail.defaultUser@defaultHost> Date: Thu, 13 Jan 2011 09:42:51 +0100 (CET) From: Barbara To: , MIME-Version: 1.0 Content-Type: text/plain;charset="UTF-8" Content-Transfer-Encoding: 7bit X-SenderIP: 79.3.217.162 Cc: freebsd-current@freebsd.org Subject: R: ahci timeout X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: Barbara List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 13 Jan 2011 08:42:54 -0000 > >>From : ndenev@gmail.com >> >>On 27 Dec, 2010, at 21:20 , Barbara wrote: >> >>> >>> As my old PATA hard disk was failing, I had to replace it with a new SATA >>> drive where I moved my FreeBSDs installations, as PATA drives are not easy >to >>> find these days. >>> So I had to move one of my data drive from a VIA8237A SATA controller to >the >>> last free SATA slot on a Marvell 88SX6121 to make room for the new hd. >>> The hd I moved was working perfectly when connected to the VIA controller. >>> Now, with the Marvell I'm getting messages like the following twos while >using >>> the disk: >>> ahcich0: Timeout on slot 10 >>> ahcich0: is 00000000 cs 3ffff800 ss 3ffffc00 rs 3ffffc00 tfd 50010040 >serr >>> 00000000 >>> >>> ahcich0: Timeout on slot 5 >>> ahcich0: is 00000000 cs 00000180 ss 000001e0 rs 000001e0 tfd 50040040 >serr >>> 00000000 >>> >>> This doesn't happen regularly. For example downloading from a slow website >on >>> it, so few kb/s, is ok. >>> But if I copy files from the disk attacked to the Marvell controller to >>> another another disk, or for example run md5 on some files, it's very >likely to >>> happen. >>> The process accessing the disk can not be killed even with -9, ^C does >>> nothing, and umount doesn't exit. >>> If I'm copying files on it from another disk it can't be unmounted too as >the >>> unkillable process has it in use. >>> On shutdown many disk doesn't get unmounted, so there are a lot of fsck on >>> boot, and on CURRENT (last built yesterday), FreeBSD enter debugger as it >fail >>> flushing disk caches. >>> >>> Relevant part from dmesg: >>> >>> atapci0: port 0xdc00-0xdc07,0xd880- >>> 0xd883,0xd800-0xd807,0xd480-0xd483,0xd400-0xd40f mem 0xfbdffc00- 0xfbdfffff >irq >>> 28 at device 0.0 on pci6 >>> ahci0: on atapci0 >>> ahci0: AHCI v1.00 with 2 3Gbps ports, Port Multiplier supported >>> ahcich0: at channel 0 on ahci0 >>> ahcich1: at channel 1 on ahci0 >>> ata2: on atapci0 >>> atapci1: port 0xbc00-0xbc07,0xb880-0xb883, >>> 0xb800-0xb807,0xb480-0xb483,0xb400-0xb40f,0xb000-0xb0ff irq 21 at device >15.0 >>> on pci0 >>> ata3: on atapci1 >>> ata4: on atapci1 >>> atapci2: port 0x1f0-0x1f7,0x3f6,0x170- 0x177, >>> 0x376,0xfc00-0xfc0f at device 15.1 on pci0 >>> ata0: on atapci2 >>> ata1: on atapci2 >>> >>> ada0 at ahcich0 bus 0 scbus0 target 0 lun 0 >>> ada0: ATA-8 SATA 2.x device >>> ada0: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes) >>> ada0: Command Queueing enabled >>> ada0: 953869MB (1953525168 512 byte sectors: 16H 63S/T 16383C) >>> ada1 at ata3 bus 0 scbus3 target 0 lun 0 >>> ada1: ATA-7 SATA 2.x device >>> ada1: 150.000MB/s transfers (SATA 1.x, UDMA5, PIO 8192bytes) >>> ada1: 238475MB (488397168 512 byte sectors: 16H 63S/T 16383C) >>> ada2 at ata4 bus 0 scbus4 target 0 lun 0 >>> ada2: ATA-8 SATA 1.x device >>> ada2: 150.000MB/s transfers (SATA 1.x, UDMA5, PIO 8192bytes) >>> ada2: 476940MB (976773168 512 byte sectors: 16H 63S/T 16383C) >>> ada3 at ata0 bus 0 scbus5 target 0 lun 0 >>> ada3: ATA-7 device >>> ada3: 100.000MB/s transfers (UDMA5, PIO 8192bytes) >>> ada3: 152627MB (312581808 512 byte sectors: 16H 63S/T 16383C) >>> >>> _______________________________________________ >>> freebsd-current@freebsd.org mailing list >>> http://lists.freebsd.org/mailman/listinfo/freebsd-current >>> To unsubscribe, send any mail to "freebsd-current-unsubscribe@freebsd.org" >> >>Just to add a "me too". >> >>I'm running -STABLE but have the same problems with Marvell 88SX6121 giving >"ahci timeout" messages. >> >>Regards, >>Nikolay >> > >Nikolay, thanks for the feedback, even if unfortunately it's negative... >I see that in both 8-STABLE (8.2-PRERELEASE now) and 9.0-CURRENT, both rebuilt >no more than a week ago. >I've also tried rebuilding the kernel with: > options CAMDEBUG > options CAM_DEBUG_BUS=0 > options CAM_DEBUG_TARGET=0 > options CAM_DEBUG_LUN=0 > options CAM_DEBUG_FLAGS=CAM_DEBUG_TRACE >(0 is the bus/target/lun of the marvell controller/attached hd) >and run a test computing md5 for about 60GB of files. >Obviously, as the debug options where active, it run successfully without any >problems :) >Is there any other info I should provide or any other test that I can do? > >Thanks >Barbara > Maybe the attached disk has some problems which aren't handled if it's attached to a 88SE6121 controller? >From what I can see, connecting it to the other internal sata controller, which is a VIA 8237A, or to an external PCIe Sil3132 controller using the siis driver, those timeouts aren't happening. Anyway I see something which I don't understand. I tried reading some files (~1 gb) from the same slice (1st one, ~200gb) while looking at gstat. Some files are being read at >100mb/s some others at about 4 mb/s. It seems that a "zone" of the disk is very slow. The disk is a Seagate 7200.12 and neither smartmontools nor SeaTools (Seagate diagnostic) report problems with it. Could this explain the timeouts on the Marvell controller? And BTW, does anyone knows which could be the problem with the disk? Thanks Barbara