From owner-freebsd-questions@FreeBSD.ORG Sun Mar 16 07:00:52 2014 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 55B0FCA7 for ; Sun, 16 Mar 2014 07:00:52 +0000 (UTC) Received: from mail-pd0-x22e.google.com (mail-pd0-x22e.google.com [IPv6:2607:f8b0:400e:c02::22e]) (using TLSv1 with cipher ECDHE-RSA-RC4-SHA (128/128 bits)) (No client certificate requested) by mx1.freebsd.org (Postfix) with ESMTPS id 2B577865 for ; Sun, 16 Mar 2014 07:00:52 +0000 (UTC) Received: by mail-pd0-f174.google.com with SMTP id y13so4264953pdi.19 for ; Sun, 16 Mar 2014 00:00:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=Q+AmVGeRv2tbn6Od6jEpENvSKFfQN+YsJfhE3yK0iXE=; b=Akd23uJFu02c3Zj/VJfvWhcgePwTvg72gn6kYNS6G8HOkP0t1jkKUyhL8+2duT4GzZ 20H77TBVdMPOg98K86YFb929Sy+wUFK+9tlPaBeYYU8QlWOnWPFPi48Ys8ERy2bb2km/ HO3DGzADYm+LnxZwaTDsy65V6igtk/MpxV6M9uAzBt73tOcOB7+Zs51yThJk1IumP5qF eahZr0jHlQG7TYLpLIn5sJFmvui9mMX6IwHnMVH2FzdYtXZzzcjCAdJzW8DcUjPqRtq0 zNLF3p8aX+t1vYYUs5XtlN36YxPF6dtQoOwoWhPF4UI9ALzGODX9ui4BGKvO2skXie57 ubvg== MIME-Version: 1.0 X-Received: by 10.68.197.8 with SMTP id iq8mr19107098pbc.124.1394953251788; Sun, 16 Mar 2014 00:00:51 -0700 (PDT) Received: by 10.68.157.73 with HTTP; Sun, 16 Mar 2014 00:00:51 -0700 (PDT) In-Reply-To: <20140316142213.459009dc@X220.alogt.com> References: <20140316130936.3f2d18e0@X220.alogt.com> <20140316134309.2edc258a@X220.alogt.com> <20140316142213.459009dc@X220.alogt.com> Date: Sun, 16 Mar 2014 02:00:51 -0500 Message-ID: Subject: Re: Another case of the vanishing disk From: cruxpot To: Erich Dollansky Content-Type: text/plain; charset=ISO-8859-1 Cc: freebsd-questions@freebsd.org X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.17 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 16 Mar 2014 07:00:52 -0000 Seek_Error_Rate, Hardware_ECC_Recovered, Raw_Read_Error_Rate are all increasing steadily for all four disks. Does this have something to do with the recent resilver of the disk or the ongoing scrub (16.5% completed)? On Sun, Mar 16, 2014 at 1:22 AM, Erich Dollansky wrote: > Hi, > > On Sun, 16 Mar 2014 01:04:05 -0500 > cruxpot wrote: > >> Back in December, it was the power supply. That was a cheap Rosewill >> 300W PSU. The new is a Corsair CX500 (500W). The system basically just >> has an old SCSI card and 4 Green Barracuda 2TB disks and a low end >> pci-e video card and pci-e gigabit NIC. How can the PSU be the problem >> since I replaced it and it's more than adequate? > > the power supply has to regulate the supplied voltages withing a given > range. If this does not work, drives tend to have problems. Your > problem will be that you do not have the tools to check for this. > > The problem is that it is a rare thing. It is as rare that four drives > go together. > > Can you run the machine with another power supply to test? Store the > SMART values of each disk when you start the test and compare after > some time. > > Erich >> >> On Sun, Mar 16, 2014 at 12:43 AM, Erich Dollansky >> wrote: >> > Hi, >> > >> > On Sun, 16 Mar 2014 00:28:31 -0500 >> > cruxpot wrote: >> > >> >> All four disks have similar smartctl stats as far as those alarms >> >> go. Are you trying to tell me that all four of my disks are about >> >> to die? The sudden crashes have already been happening. >> > >> > it also could a problem with the motherboard or power supply. It is >> > only hard to believe that a problem from the motherboard affects raw >> > error rate. It is a bit more likely that your power supply is just >> > on its limits and small drops in the 5/12V supply lines cause the >> > problem. >> > >> > Erich >> >> >> >> On Sun, Mar 16, 2014 at 12:09 AM, Erich Dollansky >> >> wrote: >> >> > Hi, >> >> > >> >> > get a new disk as fast as possible. >> >> > >> >> > On Sat, 15 Mar 2014 23:48:58 -0500 >> >> > cruxpot wrote: >> >> > >> >> >> messages:Mar 13 03:03:11 bsdbox kernel: ata4: port is not ready >> >> >> (timeout 15000ms) tfd = 0000ffff >> >> > >> >> > First alarm bell is on. >> >> > >> >> >> UPDATED WHEN_FAILED RAW_VALUE >> >> >> 1 Raw_Read_Error_Rate 0x000f 100 099 006 Pre-fail >> >> >> Always - 1476032 >> >> > >> >> > Second alarm bell. >> >> > >> >> >> 7 Seek_Error_Rate 0x000f 078 060 030 Pre-fail >> >> >> Always - 64570250 >> >> > >> >> > Third alarm bell. >> >> > >> >> >> 9 Power_On_Hours 0x0032 077 077 000 Old_age >> >> >> Always - 20524 >> >> > >> >> > Warranty should be still on then. >> >> > >> >> >> 188 Command_Timeout 0x0032 100 097 000 Old_age >> >> >> Always - 50 >> >> > >> >> > Fourth alarm bell. >> >> > >> >> >> 195 Hardware_ECC_Recovered 0x001a 037 004 000 Old_age >> >> >> Always - 1476032 >> >> > >> >> > I think I cannot count that far. >> >> > >> >> > A disk with raw errors is not dead yet but it is a clear sign >> >> > that something is wrong. Be prepared for a sudden crash. >> >> > >> >> > Erich >> >> _______________________________________________ >> >> freebsd-questions@freebsd.org mailing list >> >> http://lists.freebsd.org/mailman/listinfo/freebsd-questions >> >> To unsubscribe, send any mail to >> >> "freebsd-questions-unsubscribe@freebsd.org" >> > >> _______________________________________________ >> freebsd-questions@freebsd.org mailing list >> http://lists.freebsd.org/mailman/listinfo/freebsd-questions >> To unsubscribe, send any mail to >> "freebsd-questions-unsubscribe@freebsd.org" >