From: Anthony Atkielski
Date: Sat, 19 Mar 2005 10:38:13 +0100
To: freebsd-questions@freebsd.org
Subject: Serious issue with SATA disks again
Message-ID: <583197724.20050319103813@wanadoo.fr>

I'm still getting errors like this:

ad10: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=5601695
ad10: FAILURE - WRITE_DMA timed out
ad10: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=4848803
ad10: FAILURE - WRITE_DMA timed out
ad10: WARNING - READ_DMA UDMA ICRC error (retrying request) LBA=5618815
ad10: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=4848959
ad10: FAILURE - WRITE_DMA timed out
ad10: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=4472607
ad10: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=4860959
ad10: FAILURE - WRITE_DMA timed out
ad10: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=4861087
ad10: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=4861695

Yesterday, for the first time, the system crashed (ungracefully) after
some of these errors occurred, and I had to reset the system manually;
fsck had to correct errors after boot.

I need to know what is causing these problems. They have been reported
for a year by various people on various configurations (different
motherboards and chipsets). I've seen lots of complaints and reports,
but no solutions. It's not hardware, so don't bother suggesting that
unless you can _prove_ that the OS is eliminated from consideration.

Doesn't anyone actually know how FreeBSD works? Someone wrote the code
that prints the above cryptic messages. What do they mean, _exactly_?

These errors occur most often while I'm running a Perl program
(awstats) to analyse web logs. That may explain why the LBAs seem to be
in the same region. ad10 contains /tmp and /var; ad12 (which doesn't
seem to show the error messages) contains /usr. The root and swap file
are on a different drive entirely.

I'm beginning to get the impression that support for disks is rather
weak in FreeBSD 5.x. I have mysterious SCSI errors on one machine that
nobody seems to have any clue about, and mysterious SATA errors on
another machine that nobody seems to have any clue about. I can't
really brag about the reliability or uptime of the OS if it crashes
once a week due to unresolved bugs in disk-handling code.

-- 
Anthony
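
To check the impression that the LBAs fall in the same region, the quoted messages can be tallied with a short script. This is only a minimal sketch and not part of the original report: the regular expressions are inferred solely from the twelve lines quoted above (not from the FreeBSD ata driver source), and `summarize` is a hypothetical helper name.

```python
import re
from collections import Counter

# The twelve kernel messages quoted in the report, one per line.
LOG = """\
ad10: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=5601695
ad10: FAILURE - WRITE_DMA timed out
ad10: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=4848803
ad10: FAILURE - WRITE_DMA timed out
ad10: WARNING - READ_DMA UDMA ICRC error (retrying request) LBA=5618815
ad10: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=4848959
ad10: FAILURE - WRITE_DMA timed out
ad10: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=4472607
ad10: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=4860959
ad10: FAILURE - WRITE_DMA timed out
ad10: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=4861087
ad10: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=4861695
"""

# Assumed line shape, based only on the samples above:
#   <device>: <LEVEL> - <COMMAND> <free text> [LBA=<n>]
LINE_RE = re.compile(
    r"^(?P<dev>ad\d+): (?P<level>TIMEOUT|FAILURE|WARNING) - "
    r"(?P<cmd>READ_DMA|WRITE_DMA)\b(?P<rest>.*)$"
)
LBA_RE = re.compile(r"LBA=(\d+)")


def summarize(text):
    """Count messages per (device, level, command) and collect LBAs per device."""
    counts = Counter()
    lbas = {}
    for line in text.splitlines():
        m = LINE_RE.match(line.strip())
        if m is None:
            continue  # skip anything that is not one of the known message shapes
        counts[(m["dev"], m["level"], m["cmd"])] += 1
        lba = LBA_RE.search(m["rest"])
        if lba:
            lbas.setdefault(m["dev"], []).append(int(lba.group(1)))
    return counts, lbas


if __name__ == "__main__":
    counts, lbas = summarize(LOG)
    for key, n in sorted(counts.items()):
        print(*key, n)
    for dev, values in lbas.items():
        print(dev, "LBA range:", min(values), "to", max(values))
```

Feeding it a longer excerpt of the system log instead of `LOG` would show whether the affected LBAs stay within one region of ad10 over time and whether ad12 ever appears at all.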