From: Jonathan Stewart
Date: Thu, 14 Sep 2006 22:27:10 -0400
Message-ID: <450A0F7E.7020600@kc8onw.net>
CC: freebsd-stable@freebsd.org
References: <450752F6.4050109@kc8onw.net> <492332980.20060914225258@rulez.sk>
In-Reply-To: <492332980.20060914225258@rulez.sk>
Subject: Re: Anyone??? (was Reproducible data corruption on 6.1-Stable)

Daniel Gerzo wrote:
> Hello Jonathan,
>
> Wednesday, September 13, 2006, 2:38:14 AM, you wrote:
>
>> I set up a new server recently and transferred all the information from
>> my old server over. I tried to use unison to synchronize the backup of
>> pictures I have taken and noticed that a large number of pictures were
>> marked as changed on the server. After checking the pictures by hand I
>> confirmed that many of the pictures on the server were corrupted.
>
>> It appears the corruption happens during the read process because when I
>> recompare the files in a graphical diff tool between cache flushes the
>> differences move around!?!?!? The differences also appear to be very
>> small for the most part, single bytes scattered throughout the file. I
>> really have no idea what is causing the problem and would like to pin it
>> down so I can either replace hardware if it's bad or fix whatever the
>> bug is.
>
>> CPU: AMD Athlon(tm) XP 3200+ (2090.16-MHz 686-class CPU)
>>   Origin = "AuthenticAMD"  Id = 0x6a0  Stepping = 0
>
> I saw very similar symptoms on a P4 3.2 GHz. I was able to build world
> without any problems and the overall stability of the machine was
> completely good, but when I tried to install some ports, the md5
> sums didn't match the source and I was sure that they were all right.
>
> The following simple test demonstrates the problem I was hitting:
>
> root@[bigbang ~]# sha256 /usr/ports/distfiles/ruby/ruby-1.8.4.tar.gz
> SHA256 (/usr/ports/distfiles/ruby/ruby-1.8.4.tar.gz) = b95ddf27bc0ffa379c9aa881ca39e92a7d79e0d08999b4dff6d7d9547ee2a72d
> root@[bigbang ~]# sha256 /usr/ports/distfiles/ruby/ruby-1.8.4.tar.gz
> SHA256 (/usr/ports/distfiles/ruby/ruby-1.8.4.tar.gz) = 71432841b3965b7ab2d83f0dc7c3049195ea4e9267a8dc2d825a8a0466982930
> root@[bigbang ~]# sha256 /usr/ports/distfiles/ruby/ruby-1.8.4.tar.gz
> SHA256 (/usr/ports/distfiles/ruby/ruby-1.8.4.tar.gz) = 83e44f5301b3270e821850164c74d275f6721bed5d126480cf518a9fe5ca0d6c
> root@[bigbang ~]# md5 < /usr/ports/distfiles/ruby/ruby-1.8.4.tar.gz
> bd8c2e593e1fa4b01fd98eaf016329bb
> root@[bigbang ~]# md5 < /usr/ports/distfiles/ruby/ruby-1.8.4.tar.gz
> bd8c2e593e1fa4b01fd98eaf016329bb
> root@[bigbang ~]# md5 < /usr/ports/distfiles/ruby/ruby-1.8.4.tar.gz
> b9342bb213393238dd37322d4e2ee3fe
> root@[bigbang ~]# md5 < /usr/ports/distfiles/ruby/ruby-1.8.4.tar.gz
> 88efa7977fd3febaa8d260e3d5f21917
>
> The memtest didn't show any problems with RAM and we were unable to
> clarify what is really going on. Then we managed to get the machine
> replaced with completely new hardware and the problem was gone.
> Later, I was told that it is some kind of known bug in older P4
> BIOSes (and advised to update the BIOS, where it should have been fixed
> in the meantime) but we were unable to find any information about
> the problem. Fortunately the colo company replaced the hardware with
> no problems. So far so good and the box is running flawlessly.
>

I don't think it's quite the same as my problem, as I have to use dd on a
large file to flush the cache and force FreeBSD to go back to the disk
before the checksum changes. At this point I think I need to further narrow
down where the error is occurring, but I don't know what to try next. I am
99.999% sure memory and CPU are not the problem, but beyond that point I'm
getting into driver and filesystem code testing, which is a little
overwhelming to just dive into.

Jonathan
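
For illustration only, the repeated-checksum test discussed in this thread could be scripted roughly as below. This is a minimal sketch in Python, not something either poster ran. The function names (file_digest, repeated_digests, differing_offsets) and the between_rounds hook are hypothetical. The hook is only a placeholder for whatever cache-evicting step is used between reads, such as reading a large unrelated file with dd as described above. The script itself does not flush any cache.

```python
import hashlib
import os
import tempfile


def file_digest(path, algorithm="sha256", block_size=1 << 20):
    """Hash a file in fixed-size blocks and return the hex digest."""
    h = hashlib.new(algorithm)
    with open(path, "rb") as f:
        for block in iter(lambda: f.read(block_size), b""):
            h.update(block)
    return h.hexdigest()


def repeated_digests(path, rounds=5, between_rounds=None):
    """Hash the same file several times.

    between_rounds is a hypothetical hook: pass a callable that evicts the
    file from the buffer cache (this sketch does not do that itself).
    More than one distinct digest means the reads are not stable.
    """
    digests = []
    for _ in range(rounds):
        digests.append(file_digest(path))
        if between_rounds is not None:
            between_rounds()
    return digests


def differing_offsets(path_a, path_b, block_size=1 << 16):
    """Return byte offsets at which two files differ.

    Only the overlapping length is compared; a size mismatch should be
    checked separately by the caller.
    """
    offsets = []
    with open(path_a, "rb") as a, open(path_b, "rb") as b:
        base = 0
        while True:
            block_a = a.read(block_size)
            block_b = b.read(block_size)
            if not block_a or not block_b:
                break
            if block_a != block_b:
                for i, (x, y) in enumerate(zip(block_a, block_b)):
                    if x != y:
                        offsets.append(base + i)
            base += block_size
    return offsets


if __name__ == "__main__":
    # Self-contained demo on temporary files: a good copy and a copy with
    # two single-byte errors, similar in shape to the corruption described.
    workdir = tempfile.mkdtemp()
    good = os.path.join(workdir, "good.bin")
    bad = os.path.join(workdir, "bad.bin")
    data = bytes(range(256)) * 1000
    damaged = bytearray(data)
    damaged[5] ^= 0xFF
    damaged[70000] ^= 0x01
    with open(good, "wb") as f:
        f.write(data)
    with open(bad, "wb") as f:
        f.write(bytes(damaged))
    print("stable reads:", len(set(repeated_digests(good))) == 1)
    print("differing offsets:", differing_offsets(good, bad))
```

Comparing a suspect copy against a known-good copy with differing_offsets lists where the bad bytes land. If the offsets change from one run to the next, that would match the "differences move around" observation and point at the read path rather than at the data on disk.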