From owner-freebsd-questions@FreeBSD.ORG  Sun Mar 16 07:00:52 2014
Return-Path: <owner-freebsd-questions@FreeBSD.ORG>
Delivered-To: freebsd-questions@freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org
 [IPv6:2001:1900:2254:206a::19:1])
 (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits))
 (No client certificate requested)
 by hub.freebsd.org (Postfix) with ESMTPS id 55B0FCA7
 for <freebsd-questions@freebsd.org>; Sun, 16 Mar 2014 07:00:52 +0000 (UTC)
Received: from mail-pd0-x22e.google.com (mail-pd0-x22e.google.com
 [IPv6:2607:f8b0:400e:c02::22e])
 (using TLSv1 with cipher ECDHE-RSA-RC4-SHA (128/128 bits))
 (No client certificate requested)
 by mx1.freebsd.org (Postfix) with ESMTPS id 2B577865
 for <freebsd-questions@freebsd.org>; Sun, 16 Mar 2014 07:00:52 +0000 (UTC)
Received: by mail-pd0-f174.google.com with SMTP id y13so4264953pdi.19
 for <freebsd-questions@freebsd.org>; Sun, 16 Mar 2014 00:00:51 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113;
 h=mime-version:in-reply-to:references:date:message-id:subject:from:to
 :cc:content-type;
 bh=Q+AmVGeRv2tbn6Od6jEpENvSKFfQN+YsJfhE3yK0iXE=;
 b=Akd23uJFu02c3Zj/VJfvWhcgePwTvg72gn6kYNS6G8HOkP0t1jkKUyhL8+2duT4GzZ
 20H77TBVdMPOg98K86YFb929Sy+wUFK+9tlPaBeYYU8QlWOnWPFPi48Ys8ERy2bb2km/
 HO3DGzADYm+LnxZwaTDsy65V6igtk/MpxV6M9uAzBt73tOcOB7+Zs51yThJk1IumP5qF
 eahZr0jHlQG7TYLpLIn5sJFmvui9mMX6IwHnMVH2FzdYtXZzzcjCAdJzW8DcUjPqRtq0
 zNLF3p8aX+t1vYYUs5XtlN36YxPF6dtQoOwoWhPF4UI9ALzGODX9ui4BGKvO2skXie57
 ubvg==
MIME-Version: 1.0
X-Received: by 10.68.197.8 with SMTP id iq8mr19107098pbc.124.1394953251788;
 Sun, 16 Mar 2014 00:00:51 -0700 (PDT)
Received: by 10.68.157.73 with HTTP; Sun, 16 Mar 2014 00:00:51 -0700 (PDT)
In-Reply-To: <20140316142213.459009dc@X220.alogt.com>
References: <CAPYfQ9z-YUzKDAh3=V3_m1wmDtds4NzcewTq0wLUD9LWt3VaGA@mail.gmail.com>
 <20140316130936.3f2d18e0@X220.alogt.com>
 <CAPYfQ9ycxEr+-qPBC6qY6tvLrTMqT3guU+8q+bK2_RAj=WH1tw@mail.gmail.com>
 <20140316134309.2edc258a@X220.alogt.com>
 <CAPYfQ9ztmzYWSRoNLJk2Z-mTAdDti48ZOJrKT0LEEpuWf5SqHg@mail.gmail.com>
 <20140316142213.459009dc@X220.alogt.com>
Date: Sun, 16 Mar 2014 02:00:51 -0500
Message-ID: <CAPYfQ9yUOXG7uHh120vuERZLggo3QQSguck9RcJn62h8-yugyw@mail.gmail.com>
Subject: Re: Another case of the vanishing disk
From: cruxpot <cruxpot@gmail.com>
To: Erich Dollansky <erich@alogt.com>
Content-Type: text/plain; charset=ISO-8859-1
Cc: freebsd-questions@freebsd.org
X-BeenThere: freebsd-questions@freebsd.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: User questions <freebsd-questions.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/options/freebsd-questions>, 
 <mailto:freebsd-questions-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-questions/>
List-Post: <mailto:freebsd-questions@freebsd.org>
List-Help: <mailto:freebsd-questions-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-questions>, 
 <mailto:freebsd-questions-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Sun, 16 Mar 2014 07:00:52 -0000

Seek_Error_Rate, Hardware_ECC_Recovered, Raw_Read_Error_Rate are all
increasing steadily for all four disks. Does this have something to do
with the recent resilver of the disk or the ongoing scrub (16.5%
completed)?

On Sun, Mar 16, 2014 at 1:22 AM, Erich Dollansky <erich@alogt.com> wrote:
> Hi,
>
> On Sun, 16 Mar 2014 01:04:05 -0500
> cruxpot <cruxpot@gmail.com> wrote:
>
>> Back in December, it was the power supply. That was a cheap Rosewill
>> 300W PSU. The new is a Corsair CX500 (500W). The system basically just
>> has an old SCSI card and 4 Green Barracuda 2TB disks and a low end
>> pci-e video card and pci-e gigabit NIC. How can the PSU be the problem
>> since I replaced it and it's more than adequate?
>
> the power supply has to regulate the supplied voltages withing a given
> range. If this does not work, drives tend to have problems. Your
> problem will be that you do not have the tools to check for this.
>
> The problem is that it is a rare thing. It is as rare that four drives
> go together.
>
> Can you run the machine with another power supply to test? Store the
> SMART values of each disk when you start the test and compare after
> some time.
>
> Erich
>>
>> On Sun, Mar 16, 2014 at 12:43 AM, Erich Dollansky
>> <erichsfreebsdlist@alogt.com> wrote:
>> > Hi,
>> >
>> > On Sun, 16 Mar 2014 00:28:31 -0500
>> > cruxpot <cruxpot@gmail.com> wrote:
>> >
>> >> All four disks have similar smartctl stats as far as those alarms
>> >> go. Are you trying to tell me that all four of my disks are about
>> >> to die? The sudden crashes have already been happening.
>> >
>> > it also could a problem with the motherboard or power supply. It is
>> > only hard to believe that a problem from the motherboard affects raw
>> > error rate. It is a bit more likely that your power supply is just
>> > on its limits and small drops in the 5/12V supply lines cause the
>> > problem.
>> >
>> > Erich
>> >>
>> >> On Sun, Mar 16, 2014 at 12:09 AM, Erich Dollansky
>> >> <erichsfreebsdlist@alogt.com> wrote:
>> >> > Hi,
>> >> >
>> >> > get a new disk as fast as possible.
>> >> >
>> >> > On Sat, 15 Mar 2014 23:48:58 -0500
>> >> > cruxpot <cruxpot@gmail.com> wrote:
>> >> >
>> >> >> messages:Mar 13 03:03:11 bsdbox kernel: ata4: port is not ready
>> >> >> (timeout 15000ms) tfd = 0000ffff
>> >> >
>> >> > First alarm bell is on.
>> >> >
>> >> >> UPDATED  WHEN_FAILED RAW_VALUE
>> >> >>   1 Raw_Read_Error_Rate     0x000f   100   099   006    Pre-fail
>> >> >> Always       -       1476032
>> >> >
>> >> > Second alarm bell.
>> >> >
>> >> >>   7 Seek_Error_Rate         0x000f   078   060   030    Pre-fail
>> >> >> Always       -       64570250
>> >> >
>> >> > Third alarm bell.
>> >> >
>> >> >>   9 Power_On_Hours          0x0032   077   077   000    Old_age
>> >> >> Always       -       20524
>> >> >
>> >> > Warranty should be still on then.
>> >> >
>> >> >> 188 Command_Timeout         0x0032   100   097   000    Old_age
>> >> >> Always       -       50
>> >> >
>> >> > Fourth alarm bell.
>> >> >
>> >> >> 195 Hardware_ECC_Recovered  0x001a   037   004   000    Old_age
>> >> >> Always       -       1476032
>> >> >
>> >> > I think I cannot count that far.
>> >> >
>> >> > A disk with raw errors is not dead yet but it is a clear sign
>> >> > that something is wrong. Be prepared for a sudden crash.
>> >> >
>> >> > Erich
>> >> _______________________________________________
>> >> freebsd-questions@freebsd.org mailing list
>> >> http://lists.freebsd.org/mailman/listinfo/freebsd-questions
>> >> To unsubscribe, send any mail to
>> >> "freebsd-questions-unsubscribe@freebsd.org"
>> >
>> _______________________________________________
>> freebsd-questions@freebsd.org mailing list
>> http://lists.freebsd.org/mailman/listinfo/freebsd-questions
>> To unsubscribe, send any mail to
>> "freebsd-questions-unsubscribe@freebsd.org"
>