From owner-freebsd-stable@FreeBSD.ORG Wed Sep 13 12:09:26 2006 Return-Path: X-Original-To: freebsd-stable@FreeBSD.ORG Delivered-To: freebsd-stable@FreeBSD.ORG Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id B928416A40F for ; Wed, 13 Sep 2006 12:09:26 +0000 (UTC) (envelope-from spamd@kc8onw.net) Received: from smtp3.fuse.net (mail-out3.fuse.net [216.68.8.177]) by mx1.FreeBSD.org (Postfix) with ESMTP id 615BC43D77 for ; Wed, 13 Sep 2006 12:09:16 +0000 (GMT) (envelope-from spamd@kc8onw.net) Received: from gx6.fuse.net ([72.49.10.62]) by smtp3.fuse.net (InterMail vM.6.01.04.04 201-2131-118-104-20050224) with ESMTP id <20060913120911.JSZX20597.smtp3.fuse.net@gx6.fuse.net> for ; Wed, 13 Sep 2006 08:09:11 -0400 Received: from kb8fcl.kc8onw.net ([72.49.10.62]) by gx6.fuse.net (InterMail vG.1.02.00.02 201-2136-104-102-20041210) with ESMTP id <20060913120911.DFHU10743.gx6.fuse.net@kb8fcl.kc8onw.net> for ; Wed, 13 Sep 2006 08:09:11 -0400 Received: from mail.kc8onw.net (unknown [204.117.152.87]) by kb8fcl.kc8onw.net (Postfix) with ESMTP id 04D85A173F for ; Wed, 13 Sep 2006 08:09:08 -0400 (EDT) Received: by mail.kc8onw.net (Postfix, from userid 58) id 69BB2285EE; Wed, 13 Sep 2006 08:09:07 -0400 (EDT) X-Spam-Checker-Version: SpamAssassin 3.1.5 (2006-08-29) on server.kc8onw.net X-Spam-Level: X-Spam-Status: No, score=-1.4 required=5.0 tests=ALL_TRUSTED,AWL autolearn=ham version=3.1.5 Received: from [10.70.3.254] (unknown [10.70.3.254]) by mail.kc8onw.net (Postfix) with ESMTP id 09E5028568; Wed, 13 Sep 2006 08:09:05 -0400 (EDT) Message-ID: <4507F4DF.3000605@kc8onw.net> Date: Wed, 13 Sep 2006 08:09:03 -0400 From: Jonathan Stewart User-Agent: Thunderbird 1.5.0.5 (Windows/20060719) MIME-Version: 1.0 To: freebsd-stable@FreeBSD.ORG, jonathan@kc8onw.net References: <200609130912.k8D9CaUP063256@lurza.secnetix.de> In-Reply-To: <200609130912.k8D9CaUP063256@lurza.secnetix.de> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: Subject: Re: Anyone??? (was Reproducible data corruption on 6.1-Stable) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 Sep 2006 12:09:26 -0000 Oliver Fromme wrote: > Jonathan Stewart wrote: > > I set up a new server recently and transferred all the information from > > my old server over. I tried to use unison to synchronize the backup of > > pictures I have taken and noticed that a large number of pictures where > > marked as changed on the server. After checking the pictures by hand I > > confirmed that many of the pictures on the server were corrupted. I > > attempted to use unison to update the files on the server with the > > correct local copies but it would fail on almost all the files with the > > message "destination updated during synchronization." > > > > It appears the corruption happens during the read process because when I > > recompare the files in a graphical diff tool between cache flushes the > > differences move around!?!?!? The differences also appear to be very > > small for the most part, single bytes scattered throughout the file. I > > really have no idea what is causing the problem and would like to pin it > > down so I can either replace hardware if it's bad or fix whatever the > > bug is. > > That very much sounds like bad RAM, or overclocked CPU > or bus. I assume you do not overclock, so I recommend > you replace your RAM modules and check if the symptoms > are gone. > > Also check your BIOS settings for the RAM timings. > Setting the timings to more conservative values might > already solve the problem. Thanks for the suggestions but I have tried lowering the clock rate on the processor and and the RAM speed with no luck whatsoever. I appear to have forgotten to mention that the problem appears no matter how I read the file, unison, md5, etc. 1 out of maybe 100 times it will read correctly. I have another drive that I use for the OS and I have done many buildworlds/kernels without problems on that drive as well as compiling some very large software packages. I'm wondering if a possible cause is the controller ignoring read errors from the hard drive but I would think more than the occasional single byte would be changed. I'm thinking about maybe trying to dd the file from the raw device in an attempt to see if the problem is occurring in the filesystem code or is lower level yet. Any suggestions on how to locate the file on the disk or how to isolate the problem better are welcome. I don't mind doing the work I just have no idea where to look/what to try next. Thanks, Jonathan