From owner-freebsd-current@FreeBSD.ORG Mon Oct 17 09:47:59 2005 Return-Path: X-Original-To: freebsd-current@freebsd.org Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id D73F216A420 for ; Mon, 17 Oct 2005 09:47:59 +0000 (GMT) (envelope-from dan@mail.neosystem.cz) Received: from gem.neosystem.cz (gem.neosystem.cz [81.95.103.8]) by mx1.FreeBSD.org (Postfix) with ESMTP id C65C043D49 for ; Mon, 17 Oct 2005 09:47:58 +0000 (GMT) (envelope-from dan@mail.neosystem.cz) Received: from localhost (localhost [127.0.0.1]) by gem.neosystem.cz (Postfix) with ESMTP id 69EDB38AF92 for ; Mon, 17 Oct 2005 11:47:59 +0200 (CEST) Received: from gem.neosystem.cz ([127.0.0.1]) by localhost (gem [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 09624-05 for ; Mon, 17 Oct 2005 11:47:54 +0200 (CEST) Received: from moon.kancelar.seznam.cz (proxy.seznam.cz [212.80.76.5]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by gem.neosystem.cz (Postfix) with ESMTP id 8D36638AF8D for ; Mon, 17 Oct 2005 11:47:54 +0200 (CEST) Date: Mon, 17 Oct 2005 11:46:55 +0200 From: Dan Bilik To: freebsd-current@freebsd.org Message-Id: <20051017114655.636eff01.dan@mail.neosystem.cz> Organization: neosystem.cz X-Mailer: Sylpheed version 2.0.2 (GTK+ 2.6.10; i386-portbld-freebsd6.0) Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="Multipart=_Mon__17_Oct_2005_11_46_55_+0200_ET8Kc55/Ev3NeaZs" X-Virus-Scanned: amavisd-new at neosystem.cz Subject: Possible fxp(4) problem in -CURRENT X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 17 Oct 2005 09:48:00 -0000 This is a multi-part message in MIME format. --Multipart=_Mon__17_Oct_2005_11_46_55_+0200_ET8Kc55/Ev3NeaZs Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Hi. Situation: Single-purpose machines only serving http requests for static content, running 6.0-BETA5 and now 6.0-RC1. Hardware is IBM eServer x330 (dual P3, 1GB RAM, 2 * SCSI disk, 2 * Intel fxp(4) network interface). In addition to base system there is running boa, cdpd, qmail a net-snmp. Kernel is custom, config and dmesg are attached. Machines are under quite heavy network load. Problem: After some time of serving requests the ethernet interface in the machine stops communicating on the wire. It does not respond to any packets (ping, http, nfs, ssh) and vmstat(8) shows stopped interrupt counter for that interface. Machine is running just fine, one can log in localy and even configure second interface to communicate on network through it. It seems to be the same problem as described in PR amd64/82425 with one exception - there is no 'device timeout' in logs. In fact there is nothing suspicious in logs when the system reaches such state. This leads me to that it also may be close to PR kern/87418 - interface just stops receiving (and/or sending) data. The period in which this happens varies a lot - some machines run for two weeks with no problem, some stop that way twice a day. It may be related to network traffic volume. We have tried to disable acpi(4) and turn on polling(4) but with no effect. Questions: Is this known problem (extensive googling returned nothing similar)? Is there any way I can help debug the problem? When I encounter the same problem again what data would be valuable for debugging? Dan --Multipart=_Mon__17_Oct_2005_11_46_55_+0200_ET8Kc55/Ev3NeaZs Content-Type: text/plain; name="IBM33x" Content-Disposition: attachment; filename="IBM33x" Content-Transfer-Encoding: 7bit machine i386 cpu I686_CPU ident IBM33x makeoptions MODULES_OVERRIDE="cd9660 cd9660_iconv udf udf_iconv msdosfs msdosfs_iconv procfs linprocfs pseudofs libmchain libiconv smbfs nwfs nfs4client nfsclient nfsserver ext2fs rl sis txp vr xl ti nge lge sk vx mii if_gif if_gre if_tun if_faith if_tap if_stf if_sl if_ppp snp plip ppi lpt vpo acpi apm linux i2c uart ugen uhid ukbd ulpt umass ums udbp ucom umodem pf ipfw nullfs unionfs netgraph crypto cryptodev sio rc4 md syscons isp ispfw fdc" maxusers 512 options NMBCLUSTERS=16384 options MAXFILES=65535 options TCP_DROP_SYNFIN options SCHED_4BSD options INET # InterNETworking options FFS # Berkeley Fast Filesystem options SOFTUPDATES # Enable FFS soft updates support options UFS_DIRHASH # Improve performance on big directories options GEOM_BSD # BSD disklabels options GEOM_GPT # GUID Partition Tables options GEOM_CONCAT # Disk concatenation. options GEOM_STRIPE # Disk striping. options GEOM_MIRROR # Disk mirroring. options COMPAT_43 # Compatible with BSD 4.3 [KEEP THIS!] options COMPAT_FREEBSD4 # Compatible with FreeBSD4 options SCSI_DELAY=5000 # Delay (in ms) before probing SCSI options KTRACE # ktrace(1) support options SYSVSHM # SYSV-style shared memory options SYSVMSG # SYSV-style message queues options SYSVSEM # SYSV-style semaphores options _KPOSIX_PRIORITY_SCHEDULING options KBD_INSTALL_CDEV # install a CDEV entry in /dev options ADAPTIVE_GIANT # Giant mutex is adaptive options AHC_REG_PRETTY_PRINT # Print register bitfields in debug # output. Adds ~128k to driver. options AHD_REG_PRETTY_PRINT # Print register bitfields in debug # output. Adds ~215k to driver. options CPU_FASTER_5X86_FPU options NO_F00F_HACK options SMP # Symmetric MultiProcessor Kernel options DEVICE_POLLING options HZ=1000 options IPFIREWALL_DEFAULT_TO_ACCEPT option IPSTEALTH # support for stealth forwarding options ALTQ options ALTQ_CBQ # Class Bases Queueing options ALTQ_RED # Random Early Drop options ALTQ_RIO # RED In/Out options ALTQ_HFSC # Hierarchical Packet Scheduler options ALTQ_CDNR # Traffic conditioner options ALTQ_PRIQ # Prioirity Queueing options ACCEPT_FILTER_HTTP options ACCEPT_FILTER_DATA # Mandatory: device apic # I/O apic device isa device pci # ATA and ATAPI devices device ata device atadisk # ATA disk drives device atapicd # ATAPI CDROM drives options ATA_STATIC_ID # Static device numbering # SCSI Controllers device ahc # AHA2940 and onboard AIC7xxx devices device ahd # AHA39320/29320 and onboard AIC79xx devices device mpt # LSI-Logic MPT/Fusion device sym # Symbios/Logic 53C8XX family of PCI-SCSI I/O processors # SCSI peripherals device scbus # SCSI bus (required) device da # Direct Access (disks) device cd # CD device pass # Passthrough device (direct SCSI access) # atkbdc0 controls both the keyboard and the PS/2 mouse device atkbdc # AT keyboard controller device atkbd # AT keyboard device vga # VGA video card driver device splash # Splash screen and screen saver support # syscons is the default console driver, resembling an SCO console device sc # Floating point support - do not disable. device npx # Parallel port device ppc device ppbus # Parallel port bus (required) # PCI Ethernet NICs that use the common MII bus controller code. # NOTE: Be sure to keep the 'device miibus' line in order to use these NICs! device miibus # MII bus support device fxp # Intel EtherExpress PRO/100B (82557, 82558) device em # Intel EtherExpress PRO/1000 device bge # Broadcom BCM570x (``Tigon III'') # Pseudo devices - the number indicates how many units to allocate. device random # Entropy device device loop # Network loopback device mem # Memory and kernel memory devices device io # I/O device device ether # Ethernet support device pty # Pseudo-ttys (telnet etc) # The `bpf'device enables the Berkeley Packet Filter. # Be aware of the administrative consequences of enabling this! device bpf # Berkeley packet filter # USB support device uhci # UHCI PCI->USB interface device ehci # EHCI PCI->USB interface device usb # USB Bus (required) --Multipart=_Mon__17_Oct_2005_11_46_55_+0200_ET8Kc55/Ev3NeaZs Content-Type: text/plain; name="dmesg.txt" Content-Disposition: attachment; filename="dmesg.txt" Content-Transfer-Encoding: 7bit Copyright (c) 1992-2005 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 6.0-RC1 #0: Wed Oct 12 18:23:43 CEST 2005 root@machine.test:/usr/obj/usr/src/sys/IBM33x Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Intel(R) Pentium(R) III CPU family 1266MHz (1261.31-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x6b1 Stepping = 1 Features=0x383fbff real memory = 1073659904 (1023 MB) avail memory = 1039339520 (991 MB) MPTable: FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): APIC ID: 3 cpu1 (AP): APIC ID: 0 ioapic0: Assuming intbase of 0 ioapic1: Assuming intbase of 16 ioapic1 irqs 16-31 on motherboard ioapic0 irqs 0-15 on motherboard npx0: [FAST] npx0: on motherboard npx0: INT 16 interface cpu0 on motherboard cpu1 on motherboard pcib0: pcibus 0 on motherboard pci0: on pcib0 pci0: at device 1.0 (no driver attached) fxp0: port 0x2200-0x223f mem 0xfeb7f000-0xfeb7ffff,0xfea00000-0xfeafffff irq 27 at device 2.0 on pci0 fxp0: Disabling dynamic standby mode in EEPROM fxp0: New EEPROM ID: 0x48a0 fxp0: EEPROM checksum @ 0x3f: 0xf0f7 -> 0xf0f7 miibus0: on fxp0 inphy0: on miibus0 inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto fxp0: Ethernet address: 00:02:55:c6:e8:25 fxp1: port 0x2240-0x227f mem 0xfeb7e000-0xfeb7efff,0xfe900000-0xfe9fffff irq 25 at device 10.0 on pci0 fxp1: Disabling dynamic standby mode in EEPROM fxp1: New EEPROM ID: 0x48a0 fxp1: EEPROM checksum @ 0x3f: 0x5ffa -> 0x5ffa miibus1: on fxp1 inphy1: on miibus1 inphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto fxp1: Ethernet address: 00:02:55:c6:e8:26 isab0: port 0x440-0x44f at device 15.0 on pci0 isa0: on isab0 atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x700-0x70f at device 15.1 on pci0 ata0: on atapci0 ata1: on atapci0 pci0: at device 15.2 (no driver attached) pcib1: pcibus 1 on motherboard pci1: on pcib1 ahc0: port 0x2300-0x23ff mem 0xeffff000-0xefffffff irq 28 at device 3.0 on pci1 ahc0: [GIANT-LOCKED] aic7892: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs orm0: at iomem 0xc0000-0xcafff,0xcb000-0xcc7ff,0xcc800-0xd0fff on isa0 atkbdc0: at port 0x60,0x64 on isa0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] ppc0: parallel port not found. sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 ppc1: parallel port not found. unknown: can't assign resources (memory) unknown: can't assign resources (port) Timecounters tick every 1.000 msec acd0: CDROM at ata0-master UDMA33 Waiting 5 seconds for SCSI devices to settle pass2 at ahc0 bus 0 target 8 lun 0 pass2: Fixed Processor SCSI-2 device pass2: 3.300MB/s transfers da0 at ahc0 bus 0 target 0 lun 0 da0: Fixed Direct Access SCSI-3 device da0: 160.000MB/s transfers (80.000MHz, offset 63, 16bit), Tagged Queueing Enabled da0: 34715MB (71096640 512 byte sectors: 255H 63S/T 4425C) da1 at ahc0 bus 0 target 1 lun 0 da1: Fixed Direct Access SCSI-3 device da1: 160.000MB/s transfers (80.000MHz, offset 63, 16bit), Tagged Queueing Enabled da1: 34715MB (71096640 512 byte sectors: 255H 63S/T 4425C) SMP: AP CPU #1 Launched! GEOM_STRIPE: Device st0 created (id=3594278667). GEOM_STRIPE: Disk da0s1g attached to st0. GEOM_STRIPE: Disk da1s1g attached to st0. GEOM_STRIPE: Device st0 activated. Trying to mount root from ufs:/dev/da0s1a --Multipart=_Mon__17_Oct_2005_11_46_55_+0200_ET8Kc55/Ev3NeaZs--