From: Jonathan Stewart
Date: Sat, 09 Sep 2006 16:45:16 -0400
To: freebsd-stable@freebsd.org
Subject: Reproducible data corruption on 6.1-Stable
Message-ID: <450327DC.5060202@kc8onw.net>

I set up a new server recently and transferred all the information from my old server over. At the time it appeared there were no problems. I just tried to use unison to synchronize the backup of pictures I have taken and noticed that a shockingly high number of pictures were marked as changed on the server. After checking the pictures by hand I confirmed that many of the pictures on the server were corrupted.

I attempted to use unison to update the files on the server with the correct local copies, but it would fail on almost all the files with the message "destination updated during synchronization." I have since tried copying the files over using both samba and rsync, and both exhibit corruption.

I do know the files are transferred correctly over the network, because a diff initially shows everything as identical. Only after I read enough other data to flush the cache, so that the files are read back from the disk, do I start seeing the corruption. Not every file gets corrupted, but it seems like >10% do each time, and it's not always the same files. The larger files seem to be corrupted more often, so it seems to be related more to the amount of data written than to the number of files.

I cvsuped and rebuilt world and kernel last night in the hope that it had been fixed recently, but with no luck. I have not seen any error messages on the console at all either.

I have a pair of 320GB SATA hard drives set up as RAID0 on a HighPoint RocketRaid 1520 card. This being a data corruption issue, I can afford any amount of downtime needed for troubleshooting, as it's not very useful to have the server up if everything is going to get corrupted.

Thank you,
Jonathan

uname -a:
FreeBSD XXXXX 6.1-STABLE FreeBSD 6.1-STABLE #1: Fri Sep 8 23:53:36 EDT 2006 root@XXXXXX:/usr/obj/usr/src/sys/GENERIC i386

dmesg:
Copyright (c) 1992-2006 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 6.1-STABLE #1: Fri Sep 8 23:53:36 EDT 2006
root@XXXXX:/usr/obj/usr/src/sys/GENERIC
mptable_probe: MP Config Table has bad signature: 4\^C\^_
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: AMD Athlon(tm) XP 3200+ (2090.17-MHz 686-class CPU)
Origin = "AuthenticAMD" Id = 0x6a0 Stepping = 0
Features=0x383fbff
AMD Features=0xc0400800
real memory = 1073676288 (1023 MB)
avail memory = 1041698816 (993 MB)
kbd1 at kbdmux0
ath_hal: 0.9.17.2 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413)
acpi0: on motherboard
acpi0: Power Button (fixed)
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x4008-0x400b on acpi0
cpu0: on acpi0
acpi_button0: on acpi0
pcib0: port 0xcf8-0xcff on acpi0
pci0: on pcib0
Correcting nForce2 C1 CPU disconnect hangs
agp0: mem 0xd8000000-0xdbffffff at device 0.0 on pci0
pci0: at device 0.1 (no driver attached)
pci0: at device 0.2 (no driver attached)
pci0: at device 0.3 (no driver attached)
pci0: at device 0.4 (no driver attached)
pci0: at device 0.5 (no driver attached)
isab0: at device 1.0 on pci0
isa0: on isab0
pci0: at device 1.1 (no driver attached)
ohci0: mem 0xe1085000-0xe1085fff irq 5 at device 2.0 on pci0
ohci0: [GIANT-LOCKED]
usb0: OHCI version 1.0, legacy support
usb0: on ohci0
usb0: USB revision 1.0
uhub0: nVidia OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 3 ports with 3 removable, self powered
ohci1: mem 0xe1082000-0xe1082fff irq 5 at device 2.1 on pci0
ohci1: [GIANT-LOCKED]
usb1: OHCI version 1.0, legacy support
usb1: on ohci1
usb1: USB revision 1.0
uhub1: nVidia OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 3 ports with 3 removable, self powered
ehci0: mem 0xe1083000-0xe10830ff irq 12 at device 2.2 on pci0
ehci0: [GIANT-LOCKED]
usb2: EHCI version 1.0
usb2: companion controllers, 4 ports each: usb0 usb1
usb2: on ehci0
usb2: USB revision 2.0
uhub2: nVidia EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub2: 6 ports with 6 removable, self powered
nve0: port 0xe400-0xe407 mem 0xe1084000-0xe1084fff irq 12 at device 4.0 on pci0
nve0: Ethernet address 00:0c:6e:7d:e0:79
miibus0: on nve0
rlphy0: on miibus0
rlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
nve0: Ethernet address: 00:0c:6e:7d:e0:79
pci0: at device 5.0 (no driver attached)
pci0: at device 6.0 (no driver attached)
pcib1: at device 8.0 on pci0
pci1: on pcib1
atapci0: port 0xa000-0xa007,0xa400-0xa403,0xa800-0xa807,0xac00-0xac03,0xb000-0xb0ff irq 11 at device 6.0 on pci1
ata2: on atapci0
ata3: on atapci0
pci1: at device 9.0 (no driver attached)
pci1: at device 9.1 (no driver attached)
atapci1: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xf000-0xf00f at device 9.0 on pci0
ata0: on atapci1
ata1: on atapci1
pcib2: at device 12.0 on pci0
pci2: on pcib2
xl0: <3Com 3c920B-EMB Integrated Fast Etherlink XL> port 0xc000-0xc07f mem 0xdd000000-0xdd00007f irq 5 at device 1.0 on pci2
miibus1: on xl0
acphy0: on miibus1
acphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
xl0: Ethernet address: 00:26:54:10:8c:0f
pcib3: at device 30.0 on pci0
pci3: on pcib3
pci3: at device 0.0 (no driver attached)
fdc0: port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fdc0: does not respond
device_attach: fdc0 attach returned 6
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
ppc0: port 0x378-0x37f,0x778-0x77b irq 7 drq 3 on acpi0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/16 bytes threshold
ppbus0: on ppc0
plip0: on ppbus0
lpt0: on ppbus0
lpt0: Interrupt-driven port
ppi0: on ppbus0
atkbdc0: port 0x60,0x64 irq 1 on acpi0
atkbd0: irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
fdc0: port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fdc0: does not respond
device_attach: fdc0 attach returned 6
pmtimer0 on isa0
orm0: at iomem 0xd0000-0xd17ff,0xd6000-0xd67ff on isa0
sc0: at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounter "TSC" frequency 2090170106 Hz quality 800
Timecounters tick every 1.000 msec
ad0: 194481MB at ata0-master UDMA133
acd0: DVDROM at ata0-slave UDMA33
ad4: 305245MB at ata2-master UDMA133
ad6: 305245MB at ata3-master UDMA133
ar0: 610490MB status: READY
ar0: disk0 READY using ad4 at ata2-master
ar0: disk1 READY using ad6 at ata3-master
Trying to mount root from ufs:/dev/ad0s1a
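
For anyone trying to reproduce this, the check described in the report (copy a tree to the array, then compare it against the originals once the copy is being read back from disk rather than from the cache) can be sketched roughly as follows. This is only an illustration, not the procedure actually used above: the function names are made up for the example, SHA-256 is an arbitrary choice of checksum, and the script does nothing to flush the cache itself, so the comparison is only meaningful after enough other data has been read to push the copy out of memory.

```python
import hashlib
import os
import sys


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of one file, read in 1 MB chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def tree_digests(root):
    """Map each file's path (relative to root) to its digest."""
    digests = {}
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            full = os.path.join(dirpath, name)
            digests[os.path.relpath(full, root)] = file_digest(full)
    return digests


def compare_trees(source_root, copy_root):
    """Return (corrupted, missing): files whose contents differ in the
    copy, and files present in the source but absent from the copy."""
    src = tree_digests(source_root)
    dst = tree_digests(copy_root)
    missing = sorted(p for p in src if p not in dst)
    corrupted = sorted(p for p in src if p in dst and src[p] != dst[p])
    return corrupted, missing


# Hypothetical command-line use: script.py <known-good dir> <copy on the array>
if __name__ == "__main__" and len(sys.argv) == 3:
    corrupted, missing = compare_trees(sys.argv[1], sys.argv[2])
    for path in corrupted:
        print("CORRUPT", path)
    for path in missing:
        print("MISSING", path)
```

Running it twice against the same copy, once right after the transfer and once after the cache has been flushed, would show whether the set of mismatching files changes between runs, which is the behaviour described in the report.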