From owner-freebsd-net@FreeBSD.ORG Tue Aug 24 22:11:32 2010 Return-Path: Delivered-To: freebsd-net@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 50F4C10656AA for ; Tue, 24 Aug 2010 22:11:32 +0000 (UTC) (envelope-from universite@ukr.net) Received: from otrada.od.ua (otrada.od.ua [89.209.81.54]) by mx1.freebsd.org (Postfix) with ESMTP id B319B8FC14 for ; Tue, 24 Aug 2010 22:11:31 +0000 (UTC) Received: from [10.0.0.10] (phenom.otrada.od.ua [10.0.0.10]) (authenticated bits=0) by otrada.od.ua (8.14.3/8.14.3) with ESMTP id o7OLvnOZ058241 for ; Wed, 25 Aug 2010 00:57:49 +0300 (EEST) (envelope-from universite@ukr.net) X-Authentication-Warning: otrada.od.ua: Host phenom.otrada.od.ua [10.0.0.10] claimed to be [10.0.0.10] Message-ID: <4C744053.6010403@ukr.net> Date: Wed, 25 Aug 2010 00:57:39 +0300 From: "Vladislav V. Prodan" User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; ru; rv:1.9.2.8) Gecko/20100802 Thunderbird/3.1.2 MIME-Version: 1.0 To: freebsd-net@freebsd.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-0.5 required=5.0 tests=ALL_TRUSTED,AWL autolearn=failed version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on mary-teresa.otrada.od.ua X-Virus-Scanned: clamav-milter 0.95.3 at mary-teresa.otrada.od.ua X-Virus-Status: Clean Subject: Error: em0: Watchdog timeout -- resetting X-BeenThere: freebsd-net@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Networking and TCP/IP with FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 24 Aug 2010 22:11:32 -0000 The server is sometimes off the network card. It helps just to restart via KVM-IPMI. MotherBoard: X8SIL/X8SIL-F BIOS Version: 1.0c Build Date: 02/05/10 OS: FreeBSD 8.1-RELEASE, FreeBSD 8.1-STABLE, FreeBSD 9.0-CURRENT What would you recommend to address the problem? # uname -a FreeBSD solo.XXX.biz 9.0-CURRENT FreeBSD 9.0-CURRENT #0: Tue Aug 24 15:52:21 EEST 2010 root@solo.XXX.biz:/usr/obj/usr/src/sys/solo.2 amd64 #pciconf -lv ... em0@pci0:2:0:0: class=0x020000 card=0x060515d9 chip=0x10d38086 rev=0x00 hdr=0x00 vendor = 'Intel Corporation' device = 'Intel 82574L Gigabit Ethernet Controller (82574L)' class = network subclass = ethernet em1@pci0:3:0:0: class=0x020000 card=0x060515d9 chip=0x10d38086 rev=0x00 hdr=0x00 vendor = 'Intel Corporation' device = 'Intel 82574L Gigabit Ethernet Controller (82574L)' class = network subclass = ethernet ... In /usr/src/sys/dev/e1000/if_em.c: static void em_local_timer(void *arg) { struct adapter *adapter = arg; struct ifnet *ifp = adapter->ifp; struct tx_ring *txr = adapter->tx_rings; EM_CORE_LOCK_ASSERT(adapter); em_update_link_status(adapter); em_update_stats_counters(adapter); /* Reset LAA into RAR[0] on 82571 */ if (e1000_get_laa_state_82571(&adapter->hw) == TRUE) e1000_rar_set(&adapter->hw, adapter->hw.mac.addr, 0); /* ** Check for time since any descriptor was cleaned */ for (int i = 0; i < adapter->num_queues; i++, txr++) { EM_TX_LOCK(txr); if (txr->watchdog_check == FALSE) { EM_TX_UNLOCK(txr); continue; } if ((ticks - txr->watchdog_time) > EM_WATCHDOG) goto hung; EM_TX_UNLOCK(txr); } callout_reset(&adapter->timer, hz, em_local_timer, adapter); return; hung: device_printf(adapter->dev, "Watchdog timeout -- resetting\n"); ifp->if_drv_flags &= ~IFF_DRV_RUNNING; adapter->watchdog_events++; EM_TX_UNLOCK(txr); em_init_locked(adapter); } Someone will finish the piece for debugging, to further diagnose the error?