From owner-freebsd-stable@FreeBSD.ORG Tue Apr 22 15:01:44 2008 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 741AF106566C for ; Tue, 22 Apr 2008 15:01:44 +0000 (UTC) (envelope-from mike@sentex.net) Received: from smarthost1.sentex.ca (smarthost1.sentex.ca [64.7.153.18]) by mx1.freebsd.org (Postfix) with ESMTP id 331AB8FC15 for ; Tue, 22 Apr 2008 15:01:44 +0000 (UTC) (envelope-from mike@sentex.net) Received: from lava.sentex.ca (pyroxene.sentex.ca [199.212.134.18]) by smarthost1.sentex.ca (8.14.2/8.14.2) with ESMTP id m3MF1hQk006497; Tue, 22 Apr 2008 11:01:43 -0400 (EDT) (envelope-from mike@sentex.net) Received: from mdt-xp.sentex.net (simeon.sentex.ca [192.168.43.27]) by lava.sentex.ca (8.13.8/8.13.3) with ESMTP id m3MF1guW092221 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Tue, 22 Apr 2008 11:01:42 -0400 (EDT) (envelope-from mike@sentex.net) Message-Id: <200804221501.m3MF1guW092221@lava.sentex.ca> X-Mailer: QUALCOMM Windows Eudora Version 7.1.0.9 Date: Tue, 22 Apr 2008 11:01:29 -0400 To: "Arno J. Klaassen" From: Mike Tancsa In-Reply-To: References: <20080421094718.GY25623@hub.freebsd.org> <200804211537.m3LFbaZA086977@lava.sentex.ca> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; format=flowed Cc: stable@freebsd.org Subject: Re: nfs-server silent data corruption X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 22 Apr 2008 15:01:44 -0000 At 05:57 PM 4/21/2008, Arno J. Klaassen wrote: > > Hi, > > How long does it take for the problem to show up ? > > >Less than an hour in general (running the same client script >simultanuously on a 100Mbps linux box and 1Gbps bds6-x86) I am running my nic at gig speeds only... I recompiled the kernel this morning to include cpufreq as well as made sure the cool&quiet was enabled in the BIOS. >for info, I test with args '38 999' (38M, try 999 times) on linux >(slightly adapted script BTW) and '138 999' on bsd. The best 'score' I >got was 'still 871 iterations to go' So far I have done 150 loops with an 80MB file and no issues and 200 loopswith a 160MB file. My nfe nic does not support MSI and has its own interrupt # vmstat -i interrupt total rate irq1: atkbd0 5 0 irq4: sio0 3049 1 irq16: twe0 327046 164 irq19: bge0 385147 194 irq21: atapci1 976355 492 irq23: nfe0 11876726 5986 cpu0: timer 3966420 1999 cpu1: timer 3964392 1998 I have powerd started up with powerd_enable="YES" powerd_flags="-a adaptive -b adaptive -n adaptive" FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): APIC ID: 0 cpu1 (AP): APIC ID: 1 ioapic0: Changing APIC ID to 2 ioapic0 irqs 0-23 on motherboard kbd1 at kbdmux0 acpi0: on motherboard acpi0: [ITHREAD] acpi0: Power Button (fixed) acpi0: reservation of 0, a0000 (3) failed acpi0: reservation of 100000, dfde0000 (3) failed Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x1008-0x100b on acpi0 cpu0: on acpi0 powernow0: on cpu0 cpu1: on acpi0 powernow1: on cpu1 acpi_button0: on acpi0 . . nfe0: port 0xb400-0xb407 mem 0xfebf9000-0xfebf9fff irq 23 at device 10.0 on pci0 miibus0: on nfe0 e1000phy0: PHY 1 on miibus0 e1000phy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX-FDX, auto nfe0: Ethernet address: 00:e0:81:58:91:6a nfe0: [FILTER] With the "sleep" in my test script, powerd does seem to be fiddling with frequencies as well during the inactivity. # sysctl dev. | grep -i fre dev.cpu.0.freq: 1800 dev.cpu.0.freq_levels: 2200/110000 2000/105600 1800/89100 1000/49000 dev.powernow.0.freq_settings: 2200/110000 2000/105600 1800/89100 1000/49000 dev.powernow.1.freq_settings: 2200/110000 2000/105600 1800/89100 1000/49000 dev.cpufreq.0.%driver: cpufreq dev.cpufreq.0.%parent: cpu0 dev.cpufreq.1.%driver: cpufreq dev.cpufreq.1.%parent: cpu1 # sysctl dev. | grep -i fre dev.cpu.0.freq: 2200 dev.cpu.0.freq_levels: 2200/110000 2000/105600 1800/89100 1000/49000 dev.powernow.0.freq_settings: 2200/110000 2000/105600 1800/89100 1000/49000 dev.powernow.1.freq_settings: 2200/110000 2000/105600 1800/89100 1000/49000 dev.cpufreq.0.%driver: cpufreq dev.cpufreq.0.%parent: cpu0 dev.cpufreq.1.%driver: cpufreq dev.cpufreq.1.%parent: cpu1 ---Mike