From owner-freebsd-net@FreeBSD.ORG Fri Jul 6 15:24:40 2007 Return-Path: X-Original-To: freebsd-net@FreeBSD.org Delivered-To: freebsd-net@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 3BE2A16A4E1 for ; Fri, 6 Jul 2007 15:24:40 +0000 (UTC) (envelope-from admin@lissyara.su) Received: from smtp.qwerty.ru (smtp.qwerty.ru [87.240.2.134]) by mx1.freebsd.org (Postfix) with ESMTP id 5287213C45E for ; Fri, 6 Jul 2007 15:24:38 +0000 (UTC) (envelope-from admin@lissyara.su) Received: from ussr.lissyara.int.otradno.ru (unknown [10.21.64.215]) by smtp.qwerty.ru (Spam Firewall) with ESMTP id 3AA801991B65 for ; Fri, 6 Jul 2007 19:07:00 +0400 (MSD) Message-ID: <468E5A94.3030509@lissyara.su> Date: Fri, 06 Jul 2007 19:07:00 +0400 From: Alex Keda User-Agent: Thunderbird 2.0.0.4 (X11/20070630) MIME-Version: 1.0 To: freebsd-net@FreeBSD.org Content-Type: text/plain; charset=KOI8-R; format=flowed Content-Transfer-Encoding: 7bit Cc: Subject: Fatal double fault while copy to NFS filesystems X-BeenThere: freebsd-net@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Networking and TCP/IP with FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 06 Jul 2007 15:24:40 -0000 When I copy files to NFS on another host kernel crash: Fatal double fault: eip = 0xc07e9e29 esp = 0xe31a3000 ebp = 0xe31a3000 cpuid = 1; apic id = 01 panic: double fault cpuid = 1 ======================= before this, I see on /var/log/messages nve0: device timeout ======================= how repeat problem: ussr# df -h Filesystem Size Used Avail Capacity Mounted on /dev/ad0s1a 72G 6.1G 60G 9% / devfs 1.0K 1.0K 0B 100% /dev ussr# dd if=/dev/zero of=file_20mb bs=1m count=20 ussr# mount 192.168.254.254:/shares /mnt/ ussr# df -h Filesystem Size Used Avail Capacity Mounted on /dev/ad0s1a 72G 6.1G 60G 9% / devfs 1.0K 1.0K 0B 100% /dev 192.168.254.254:/shares 271G 179G 89G 67% /mnt ussr# cp file_20mb /mnt/ then, after 3-5 second I see "device timeout", and later, after 5-7 seconds - system crash ===================== another information - this problem appearance after I upgrade remote machine (6.2-RELEASE-p5), I change CPU from Celeron 466 to PIII 800. interface on remote machine - 3com509b if I slow copy to remote machine (~100kb/s - 10% interface usage) - all good. System not crash... if I copy from remote machine - all good - system not crash... on logs on remote machine - all clean. ===================== 3 days ago I upgrade my system to 6.2-RELEASE-p5, but - problem exists... ussr# ifconfig fwe0: flags=108802 mtu 1500 options=8 ether 02:11:d8:ee:cc:8c ch 1 dma -1 nve0: flags=8843 mtu 1500 inet 192.168.254.193 netmask 0xffffff00 broadcast 192.168.254.255 ether 00:18:f3:97:3e:1b media: Ethernet autoselect (10baseT/UTP) status: active plip0: flags=108810 mtu 1500 lo0: flags=8049 mtu 16384 inet6 fe80::1%lo0 prefixlen 64 scopeid 0x4 inet6 ::1 prefixlen 128 inet 127.0.0.1 netmask 0xff000000 ussr# =========================== dmesg: Copyright (c) 1992-2007 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD is a registered trademark of The FreeBSD Foundation. FreeBSD 6.2-RELEASE #0: Fri Jan 12 11:05:30 UTC 2007 root@dessler.cse.buffalo.edu:/usr/obj/usr/src/sys/SMP Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: AMD Athlon(tm) 64 X2 Dual Core Processor 3800+ (2004.19-MHz 686-class CPU) Origin = "AuthenticAMD" Id = 0x40fb2 Stepping = 2 Features=0x178bfbff Features2=0x2001 AMD Features=0xea500800 AMD Features2=0x1f,,CR8> Cores per package: 2 real memory = 1039007744 (990 MB) avail memory = 1007513600 (960 MB) ACPI APIC Table: FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): APIC ID: 0 cpu1 (AP): APIC ID: 1 ioapic0: Changing APIC ID to 2 ioapic0 irqs 0-23 on motherboard kbd1 at kbdmux0 ath_hal: 0.9.17.2 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413) acpi0: on motherboard acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi0: Power Button (fixed) acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi0: reservation of 3dee0000, 20000 (3) failed Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x4008-0x400b on acpi0 cpu0: on acpi0 cpu1: on acpi0 acpi_button0: on acpi0 pcib0: port 0xcf8-0xcff on acpi0 pci0: on pcib0 pci0: at device 0.0 (no driver attached) pci0: at device 0.1 (no driver attached) pci0: at device 0.2 (no driver attached) pci0: at device 0.3 (no driver attached) pci0: at device 0.4 (no driver attached) pci0: at device 0.5 (no driver attached) pci0: at device 0.6 (no driver attached) pci0: at device 0.7 (no driver attached) pcib1: at device 2.0 on pci0 pci1: on pcib1 pcib2: at device 3.0 on pci0 pci2: on pcib2 pcib3: at device 4.0 on pci0 pci3: on pcib3 pci0: at device 5.0 (no driver attached) pci0: at device 9.0 (no driver attached) isab0: at device 10.0 on pci0 isa0: on isab0 pci0: at device 10.1 (no driver attached) pci0: at device 10.2 (no driver attached) ohci0: mem 0xfe02f000-0xfe02ffff irq 21 at device 11.0 on pci0 ohci0: [GIANT-LOCKED] usb0: OHCI version 1.0, legacy support usb0: SMM does not respond, resetting usb0: on ohci0 usb0: USB revision 1.0 uhub0: nVidia OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 8 ports with 8 removable, self powered ehci0: mem 0xfe02e000-0xfe02e0ff irq 22 at device 11.1 on pci0 ehci0: [GIANT-LOCKED] usb1: EHCI version 1.0 usb1: companion controller, 8 ports each: usb0 usb1: on ehci0 usb1: USB revision 2.0 uhub1: nVidia EHCI root hub, class 9/0, rev 2.00/1.00, addr 1 uhub1: 8 ports with 8 removable, self powered atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xf400-0xf40f at device 13.0 on pci0 ata0: on atapci0 ata1: on atapci0 atapci1: port 0x9f0-0x9f7,0xbf0-0xbf3,0x970-0x977,0xb70-0xb73,0xe000-0xe00f mem 0xfe02d000-0xfe02dfff irq 23 at device 14.0 on pci0 ata2: on atapci1 ata3: on atapci1 atapci2: port 0x9e0-0x9e7,0xbe0-0xbe3,0x960-0x967,0xb60-0xb63,0xcc00-0xcc0f mem 0xfe02c000-0xfe02cfff irq 20 at device 15.0 on pci0 ata4: on atapci2 ata5: on atapci2 pcib4: at device 16.0 on pci0 pci4: on pcib4 fwohci0: mem 0xfdbff000-0xfdbff7ff,0xfdbf8000-0xfdbfbfff irq 19 at device 5.0 on pci4 fwohci0: OHCI version 1.10 (ROM=1) fwohci0: No. of Isochronous channels is 4. fwohci0: EUI64 00:11:d8:00:00:ee:cc:8c fwohci0: Phy 1394a available S400, 2 ports. fwohci0: Link S400, max_rec 2048 bytes. firewire0: on fwohci0 fwe0: on firewire0 if_fwe0: Fake Ethernet address: 02:11:d8:ee:cc:8c fwe0: Ethernet address: 02:11:d8:ee:cc:8c fwe0: if_start running deferred for Giant sbp0: on firewire0 fwohci0: Initiate bus reset fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode firewire0: 1 nodes, maxhop <= 0, cable IRM = 0 (me) firewire0: bus manager 0 (me) pci0: at device 16.1 (no driver attached) nve0: port 0xc800-0xc807 mem 0xfe02b000-0xfe02bfff irq 22 at device 20.0 on pci0 nve0: Ethernet address 00:18:f3:97:3e:1b miibus0: on nve0 ukphy0: on miibus0 ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto nve0: Ethernet address: 00:18:f3:97:3e:1b acpi_tz0: on acpi0 fdc0: port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0 fdc0: [FAST] fd0: <1440-KB 3.5" drive> on fdc0 drive 0 sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0 sio1: type 16550A ppc0: port 0x378-0x37f,0x778-0x77b irq 7 drq 3 on acpi0 ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode ppc0: FIFO with 16/16/16 bytes threshold ppbus0: on ppc0 plip0: on ppbus0 lpt0: on ppbus0 lpt0: Interrupt-driven port ppi0: on ppbus0 atkbdc0: port 0x60,0x64 irq 1 on acpi0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] pmtimer0 on isa0 sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 ums0: A4Tech RF USB Mouse, rev 1.10/0.01, addr 2, iclass 3/1 ums0: 8 buttons and Z dir. Timecounters tick every 1.000 msec ad0: 76318MB at ata0-master UDMA100 acd0: DVDR at ata1-master UDMA33 SMP: AP CPU #1 Launched! Trying to mount root from ufs:/dev/ad0s1a WARNING: / was not properly dismounted ==================== uname -a FreeBSD ussr.lissyara.int.otradno.ru 6.2-RELEASE FreeBSD 6.2-RELEASE #0: Fri Jan 12 11:05:30 UTC 2007 root@dessler.cse.buffalo.edu:/usr/obj/usr/src/sys/SMP i386 ==================== kldstat Id Refs Address Size Name 1 3 0xc0400000 70794c kernel 2 1 0xc0b08000 59f20 acpi.ko