From owner-freebsd-scsi@freebsd.org Tue Dec 8 16:05:24 2015 Return-Path: Delivered-To: freebsd-scsi@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 6F86B9D3A07; Tue, 8 Dec 2015 16:05:24 +0000 (UTC) (envelope-from prateekrootkey@gmail.com) Received: from mail-wm0-x22f.google.com (mail-wm0-x22f.google.com [IPv6:2a00:1450:400c:c09::22f]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (Client CN "smtp.gmail.com", Issuer "Google Internet Authority G2" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 0C6591E28; Tue, 8 Dec 2015 16:05:24 +0000 (UTC) (envelope-from prateekrootkey@gmail.com) Received: by wmww144 with SMTP id w144so35642876wmw.0; Tue, 08 Dec 2015 08:05:22 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=59VcFUEhMq8erY1qLzmkDEISAlSxFnCexh4CfRvveB4=; b=0Vc3wo8myPr9paNliD46eLgvZnLykoLbLkrNd4wUKfXOl4Ay2m59NCSGngCKeLN65g umJp55YbqBpNgniyr2f5O61J2dy9r5e7PRix+jo7tUiJCUI5w5DyieKMeU/qacE2OvnR 0fi6zQ/JYoU/x7g1eeJCpidyFOR5L/nfO3L0asLbBGJHWBo6rcICpSdtyLGW3SUhkf0J JQJWCrfnrBZtWRXioeBdIa7w6EY/21QC/6zsuLW4Ia1tgYxn5fQujZKySjchXEtSBaoI uWx58bslfZ/wVTUfDsAWrGd9SpGikAXSvLLTTmQoE/XHOXRAHM30IE+kh6x5QuLHahC5 bl6Q== MIME-Version: 1.0 X-Received: by 10.28.45.216 with SMTP id t207mr5661237wmt.89.1449590722048; Tue, 08 Dec 2015 08:05:22 -0800 (PST) Received: by 10.27.16.7 with HTTP; Tue, 8 Dec 2015 08:05:21 -0800 (PST) In-Reply-To: References: <6A7832F8-53EB-4641-8EF6-E0E6175EB52D@yahoo.com> Date: Tue, 8 Dec 2015 21:35:21 +0530 Message-ID: Subject: Re: bad disk discovery From: prateek sethi To: Michael Jung Cc: Scott Long , freebsd-scsi@freebsd.org, owner-freebsd-scsi@freebsd.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Content-Filtered-By: Mailman/MimeDel 2.1.20 X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 08 Dec 2015 16:05:24 -0000 Yes, I have tried that one but issue is still there. Other disk are working fine with the same configuration that means firmware should not be a problem. On Tue, Dec 8, 2015 at 6:56 PM, Michael Jung wrote: > On 2015-12-08 08:00, prateek sethi wrote: > >> Hi Scott, >> Thanks for the your quick response. >> >> I have different set of hardware . So that's why I want to know how I ca= n >> debug it myself . Is there anyway or procedure using that I can findout >> about the situation or the reason for CDB errors or disk command failure= ? >> >> Right now I am giving detail about the setup where I am getting this >> issue . >> >> I am using LSI SAS2008 controller and connected with supermicro Enclosur= e >> with freebsd 9.3. 16 different disks are there but only one disk is havi= ng >> problem. That means contoller and cable are fine. >> >> Faulty disk info are like:-. >> >> *smartctl output is:-* >> >> smartctl -x /dev/da23 >> >> =3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D >> Vendor: SEAGATE >> Product: ST3600057SS >> Revision: 000B >> Rotation Rate: 15000 rpm >> Form Factor: 3.5 inches >> Logical Unit id: 0x5000c5007725173f >> Serial number: 6SL8YLPC0000N5030DY7 >> Device type: disk >> Transport protocol: SAS >> Local Time is: Tue Dec 8 18:20:45 2015 IST >> *device is NOT READY (e.g. spun down, busy)* >> >> *Logs:-* >> >> Dec 8 14:12:01 N1 kernel: da23 at mps0 bus 0 scbus0 target 148 lun 0 >> Dec 8 14:12:01 N1 kernel: da23: Fixed Direct >> Access SCSI-5 device >> Dec 8 14:12:01 N1 kernel: da23: Serial Number 6SL8YLPC0000N5030DY7 >> Dec 8 14:12:01 N1 kernel: da23: 600.000MB/s transfers >> Dec 8 14:12:01 N1 kernel: da23: Command Queueing enabled >> Dec 8 14:12:01 N1 kernel: da23: *Attempt to query device size failed: N= OT >> READY, Logical unit not ready, cause n* >> Dec 8 14:12:01 N1 kernel: ses1: da23,pass26: Element descriptor: 'Slot >> 24' >> Dec 8 14:12:01 N1 kernel: ses1: da23,pass26: SAS Device Slot Element: 1 >> Phys at Slot 23 >> >> *driver versions:-* >> >> >> dev.mps.0.firmware_version: 15.00.00.00 >> dev.mps.0.driver_version: 16.00.00.00-fbsd >> >> >> >> >> >> >> On Tue, Dec 8, 2015 at 3:15 AM, Scott Long wrote: >> >> Hi, >>> >>> If your situation is accurate and the disk is not responding properly t= o >>> regular >>> commands then it=E2=80=99s unlikely that it will respond to SMART comma= nds >>> either. >>> Sometimes these situations are caused by a bad cable, bad controller, o= r >>> buggy software/firmware, and only rarely will the standard statistics i= n >>> SMART >>> pick up these kinds of errors. SMART is better at tracking wear rates >>> and >>> error rates on the physical media, both HDD and SSD, but even then it= =E2=80=99s >>> hard >>> for it to be accurately predictive or even accurately diagnostic. For >>> your case, >>> I recommend that you describe your hardware and software configuration = in >>> more detail, and look for physical abnormalities in the cabling and >>> connections. >>> Once that is ruled and and the rest of us know what kind of hardware >>> you=E2=80=99re >>> dealing with, we might be able to make better commendations. >>> >>> Scott >>> >>> > On Dec 7, 2015, at 11:07 AM, prateek sethi >>> wrote: >>> > >>> > Hi , >>> > >>> > Is there any way or tool to find out that a disk which is not >>> responding >>> > properly is really bad or not? Sometimes I have seen that there is lo= t >>> of >>> > CDB error for a drive and system reboot makes every thing fine. What >>> can >>> be >>> > reasons for such kind of scenarios? >>> > >>> > I know smartctl is the one which can help. I have some couple of >>> question >>> > regarding this . >>> > >>> > 1. What if disk does not support smartctl? >>> > 2. How I can do smartest use of smartctl command like which parameter= s >>> can >>> > tell that the disk is actually bad? >>> > 3. What other test I can perform to make it sure that disk has >>> completely >>> > gone? >>> > >>> > >>> > Please tell me correct place to ask this question if I am asking at >>> wrong >>> > place. >>> > _______________________________________________ >>> > freebsd-scsi@freebsd.org mailing list >>> > https://lists.freebsd.org/mailman/listinfo/freebsd-scsi >>> > To unsubscribe, send any mail to "freebsd-scsi-unsubscribe@freebsd.or= g >>> " >>> >>> >>> _______________________________________________ >> freebsd-scsi@freebsd.org mailing list >> https://lists.freebsd.org/mailman/listinfo/freebsd-scsi >> To unsubscribe, send any mail to "freebsd-scsi-unsubscribe@freebsd.org" >> > > > Have you simply moved the drive to another slot - does the problem follow > the drive? > Unlikely but it could be a backplane issue. > > I don't know about version 15 firmware, I have always used version 16 > firmware > with 9.x to match the driver version. > >