Skip site navigation (1)Skip section navigation (2)
Date:      Sat, 31 Jul 1999 11:13:19 -0400
From:      tcobb@staff.circle.net
To:        blapp@attic.ch, freebsd-current@freebsd.org
Cc:        freebsd-smp@freebsd.org
Subject:   RE: network got stuck during high nfs-load
Message-ID:  <A0CFA284C004D211B7EE0060082F32A41F2830@freya.circle.net>

next in thread | raw e-mail | index | archive | help
I'm now getting a similar thing with the latest -STABLE build.
NFS isn't involved, but this is on a heavily used webserver.
NIC is a 3COM 3C905b.

Ifconfig up and down works for about 5 packets, that's it.  netstat
was showing either errors inbound, or absolutely nothing.

This happens about once per day now, on this machine.  I've
had to switch back to a UP kernel to avoid this (and another
bug about processes getting wedged on a processor).

I think this is an SMP problem, so I've included freebsd-smp
in the reply.


-Troy Cobb
 Circle Net, Inc.
 http://www.circle.net

>   -----Original Message-----
>   From: Martin Blapp [mailto:blapp@attic.ch]
>   Sent: Saturday, July 31, 1999 9:57 AM
>   To: freebsd-current@freebsd.org
>   Subject: network got stuck during high nfs-load
>   
>   
>   
>   Hi driver gurus,
>   
>   Today I had heavy NFS-load on my CURRENT-SMP machine while compiling
>   world. Suddenly all NFS-Clients stopped working. I was't 
>   able to ping 
>   the server from inside my  /29 subnet, same with ping the 
>   others from
>   the server itself. I changed cables, plugged them  in and 
>   out, did some
>   ifconfig up down, deleted routes and added them again ... 
>   nothing.  But
>   the card was still sending from time to time (once a 
>   second) with 100%
>   load many 0 bytes - like some waves: 
>   
>   (tcp-dump from same subnet, MAC Adress 0:a0:cc:66:32:97 is 
>   the NFS-Server)
>   ------------------------------------------------------------
>   --------------
>   
>   13:53:41.673733 0:a0:cc:66:32:97 > 0:a0:cc:66:32:97 null 
>   test/C len=43
>                            0000 0000 0000 0000 0000 0000 0000 0000
>                            0000 0000 0000 0000 0000 0000 0000 0000
>                            0000 0000 0000 0000 0000 00
>   
>   13:53:41.773706 0:a0:cc:66:32:97 > 0:a0:cc:66:32:97 null 
>   test/C len=43
>                            0000 0000 0000 0000 0000 0000 0000 0000
>                            0000 0000 0000 0000 0000 0000 0000 0000
>                            0000 0000 0000 0000 0000 00    
>   
>   It looks like a endless loop somewhere in the de0-driver ...
>   
>   I was able to ping the machine from itself via loopback and 
>   via the local
>   ip. No stucked processes at all, the running make 
>   buildworld was still
>   running. Also the load was normal, I couldn't see some 
>   special behaviour ...
>   
>   I paniced the server and made a crashdump. If anybody is 
>   interested in
>   this strange behaviour, just write me back ... After a 
>   reboot, all worked
>   again, but that's not a good solution ;)
>   
>   Martin
>   
>   PS: cvsup is from yesterday (30.7.1999) at 22:00 CET and I included
>   a backtrace and a dmesg from my server ...
>   
>   backtrace from debug.kernel
>   ---------------------------
>   
>   #0  boot (howto=260) at ../../kern/kern_shutdown.c:291
>   #1  0xc01538c9 in panic (fmt=0xc0277c14 "from debugger")
>       at ../../kern/kern_shutdown.c:505
>   #2  0xc012df19 in db_panic (addr=-1071398025, have_addr=0, count=-1,
>       modif=0xff80dddc "") at ../../ddb/db_command.c:434
>   #3  0xc012deb7 in db_command (last_cmdp=0xc02a6f30, 
>   cmd_table=0xc02a6d90,
>       aux_cmd_tablep=0xc02c572c) at ../../ddb/db_command.c:334
>   #4  0xc012df7e in db_command_loop () at ../../ddb/db_command.c:456
>   #5  0xc012ffbb in db_trap (type=3, code=0) at ../../ddb/db_trap.c:71
>   #6  0xc023c0c2 in kdb_trap (type=3, code=0, regs=0xff80ded0)
>       at ../../i386/i386/db_interface.c:157
>   #7  0xc02524a0 in trap (frame={tf_fs = 24, tf_es = 16, 
>   tf_ds = 16, tf_edi
>   = 0,
>         tf_esi = -1070741376, tf_ebp = -8331496, tf_isp = -8331524,
>         tf_ebx = -1070741376, tf_edx = -1070689184, tf_ecx = 16777217,
>         tf_eax = 38, tf_trapno = 3, tf_err = 0, tf_eip = 
>   -1071398025, tf_cs
>   = 8,
>         tf_eflags = 582, tf_esp = -1071030749, tf_ss = -1071045718})
>       at ../../i386/i386/trap.c:534
>   #8  0xc023c377 in Debugger (msg=0xc02923aa "manual escape 
>   to debugger")
>       at machine/cpufunc.h:64
>   #9  0xc0236d7c in scgetc (sc=0xc02c3ca0, flags=2)
>       at ../../dev/syscons/syscons.c:3813
>   #10 0xc0232529 in sckbdevent (thiskbd=0xc02db0a0, event=0, 
>   arg=0xc02c3ca0)
>       at ../../dev/syscons/syscons.c:688
>   #11 0xc022bd77 in atkbd_intr (kbd=0xc02db0a0, arg=0x0)
>       at ../../dev/kbd/atkbd.c:535
>   #12 0xc02692d7 in atkbd_isa_intr (arg=0xc05a2100) at
>   ../../isa/atkbd_isa.c:125
>   
>   dmesg-output
>   ------------
>   
>   Copyright (c) 1992-1999 The FreeBSD Project.
>   Copyright (c) 1982, 1986, 1989, 1991, 1993
>           The Regents of the University of California. All 
>   rights reserved.
>   FreeBSD 4.0-CURRENT #1: Fri Jul 30 21:31:06 CEST 1999
>       root@fuchur.lan.attic.ch:/usr/src/sys/compile/FUCHUR 
>   Timecounter "i8254"  frequency 1193182 Hz
>   CPU: Pentium/P55C (586-class CPU)
>     Origin = "GenuineIntel"  Id = 0x543  Stepping = 3
>     Features=0x8003bf<FPU,VME,DE,PSE,TSC,MSR,MCE,CX8,APIC,MMX>
>   real memory  = 67108864 (65536K bytes)
>   avail memory = 61652992 (60208K bytes)
>   Programming 24 pins in IOAPIC #0
>   FreeBSD/SMP: Multiprocessor motherboard
>    cpu0 (BSP): apic id:  0, version: 0x00030010, at 0xfee00000
>    cpu1 (AP):  apic id:  1, version: 0x00030010, at 0xfee00000
>    io0 (APIC): apic id:  2, version: 0x00170011, at 0xfec00000
>   Preloaded elf kernel "kernel" at 0xc0340000.
>   Intel Pentium detected, installing workaround for F00F bug
>   Probing for PnP devices:
>   npx0: <math processor> on motherboard
>   npx0: INT 16 interface
>   pcib0: <Host to PCI bridge> on motherboard
>   pci0: <PCI bus> on pcib0
>   isab0: <Intel 82371SB PCI to ISA bridge> at device 7.0 on pci0
>   ide_pci0: <Intel PIIX3 Bus-master IDE controller> at device 
>   7.1 on pci0
>   de0: <Digital 21140A Fast Ethernet> irq 17 at device 9.0 on pci0
>   de0: 21140A [10-100Mb/s] pass 2.0
>   de0: address 00:a0:cc:66:32:97
>   vga-pci0: <Matrox MGA 1024SG/1064SG/1164SG graphics 
>   accelerator> irq 18 at
>   device 10.0 on pci0 
>   ahc0: <Adaptec aic7880 Ultra SCSI adapter> irq 19 at device 
>   12.0 on pci0
>   ahc0: Using left over BIOS settings
>   ahc0: aic7880 Wide Channel A, SCSI Id=7, 16/255 SCBs
>   isa0: <ISA bus> on motherboard
>   fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f7 irq 6 drq 2 on isa0
>   fdc0: FIFO enabled, 8 bytes threshold
>   fd0: <1440-KB 3.5" drive> on fdc0 drive 0
>   wdc0 at port 0x1f0-0x1f7 irq 14 on isa0
>   wdc0: unit 0 (atapi): <TOSHIBA CD-ROM XM-6202B/1110>, 
>   removable, accel,
>   ovlap, dma, iordis
>   wcd0: drive speed 5512KB/sec, 256KB cache
>   wcd0: supported read types: CD-R, CD-RW, CD-DA
>   wcd0: Audio: play, 255 volume levels
>   wcd0: Mechanism: ejectable tray
>   wcd0: Medium: CD-ROM 120mm data disc loaded, unlocked
>   wdc0: unit 1 (atapi): <IOMEGA  ZIP 100       ATAPI/23.D>, 
>   removable, intr,
>   iordis
>   wfd0: medium type unknown (no disk)
>   wfd0: buggy Zip drive, 64-block transfer limit set
>   atkbdc0: <keyboard controller (i8042)> at port 0x60-0x6f on isa0
>   atkbd0: <AT Keyboard> irq 1 on atkbdc0
>   vga0: <Generic ISA VGA> at port 0x3b0-0x3df iomem 
>   0xa0000-0xbffff on isa0  
>   sc0: VGA <16 virtual consoles, flags=0x200>
>   sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
>   sio0: type 16550A
>   sio1 at port 0x2f8-0x2ff irq 3 on isa0
>   sio1: type 16550A
>   ppc0 at port 0x378-0x37f irq 7 flags 0x40 on isa0
>   ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
>   ppc0: FIFO with 16/16/16 bytes threshold
>   ppb0: IEEE1284 device found /NIBBLE
>   Probing for PnP devices on ppbus0:
>   ppbus0: <Hewlett-Packard HP LaserJet 5MP> HP ENHANCED 
>   PCL5,PJL,POSTSCRIPT
>   plip0: <PLIP network interface> on ppbus 0
>   lpt0: <generic printer> on ppbus 0
>   lpt0: Interrupt-driven port
>   ppi0: <generic parallel i/o> on ppbus 0
>   APIC_IO: Testing 8254 interrupt delivery
>   APIC_IO: routing 8254 via pin 2
>   IP packet filtering initialized, divert enabled, rule-based 
>   forwarding
>   enabled, default to accept, logging limited to 100 packets/entry
>   DUMMYNET initialized (990504)
>   BRIDGE 981214, have 6 interfaces
>   -- index 1 de0 type 6 phy 0 addrl 6 addr 00.a0.cc.66.32.97
>   IP Filter: initialized.  Default = pass all, Logging = enabled
>   Waiting 15 seconds for SCSI devices to settle 
>   SMP: AP CPU #1 Launched!
>   de0: enabling 10baseT port   
>   
>   
>   
>   To Unsubscribe: send mail to majordomo@FreeBSD.org
>   with "unsubscribe freebsd-current" in the body of the message
>   


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-current" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?A0CFA284C004D211B7EE0060082F32A41F2830>