From: "EXT-Glatting, Dennis P"
To: Karl Pielorz, "freebsd-net@freebsd.org"
Date: Fri, 10 Aug 2012 12:18:54 -0700
Subject: RE: FreeBSD 9.0-R em0 issues?
Message-ID: <0457799DC5A3A5479336B9A1F2E7FC6450E4FF2F5F@XCH-NW-21V.nw.nos.boeing.com>
In-Reply-To: <26ABB168B4C8E32E7D127EF9@Octca64MkIV.tdx.co.uk>
List-Id: Networking and TCP/IP with FreeBSD

I've seen this too on an H8DG6-F but changed to the igb interfaces. I am also
seeing a problem under ESXi 5.0 on an unpatched system but haven't had the
opportunity to patch it (ESXi). There I see these messages under a FreeBSD
instance:

nms# uname -a
FreeBSD nms 9.1-PRERELEASE FreeBSD 9.1-PRERELEASE #9: Mon Aug  6 22:02:46 PDT 2012 root@nms:/sys/amd64/compile/SMUNI amd64

nms# grep em0 /var/log/messages
Jul  9 15:50:39 nms kernel: em0: port 0x2000-0x203f mem 0xd1020000-0xd103ffff,0xd1000000-0xd100ffff irq 18 at device 0.0 on pci2
Jul  9 15:50:39 nms kernel: em0: Memory Access and/or Bus Master bits were not set!
Jul  9 15:50:39 nms kernel: em0: Ethernet address: 00:0c:29:a2:36:c2
Jul 10 09:28:49 nms kernel: em0: Watchdog timeout -- resetting
Jul 18 16:53:30 nms kernel: em0: Watchdog timeout -- resetting
Jul 26 22:37:16 nms kernel: em0: Watchdog timeout -- resetting
Aug  6 20:55:01 nms kernel: em0: port 0x2000-0x203f mem 0xd1020000-0xd103ffff,0xd1000000-0xd100ffff irq 18 at device 0.0 on pci2
Aug  6 20:55:01 nms kernel: em0: Memory Access and/or Bus Master bits were not set!
Aug  6 20:55:01 nms kernel: em0: Ethernet address: 00:0c:29:a2:36:c2
Aug  6 22:16:34 nms kernel: em0: port 0x2000-0x203f mem 0xd1020000-0xd103ffff,0xd1000000-0xd100ffff irq 18 at device 0.0 on pci2
Aug  6 22:16:34 nms kernel: em0: Memory Access and/or Bus Master bits were not set!
Aug  6 22:16:34 nms kernel: em0: Ethernet address: 00:0c:29:a2:36:c2
Aug  6 22:58:35 nms kernel: em0: port 0x2000-0x203f mem 0xd1020000-0xd103ffff,0xd1000000-0xd100ffff irq 18 at device 0.0 on pci2
Aug  6 22:58:35 nms kernel: em0: Memory Access and/or Bus Master bits were not set!
Aug  6 22:58:35 nms kernel: em0: Ethernet address: 00:0c:29:a2:36:c2
Aug  8 12:51:50 nms kernel: em0: Watchdog timeout -- resetting
Aug  8 22:49:58 nms kernel: em0: port 0x2000-0x203f mem 0xd1020000-0xd103ffff,0xd1000000-0xd100ffff irq 18 at device 0.0 on pci2
Aug  8 22:49:58 nms kernel: em0: Memory Access and/or Bus Master bits were not set!
Aug  8 22:49:58 nms kernel: em0: Ethernet address: 00:0c:29:a2:36:c2

-----Original Message-----
From: owner-freebsd-net@freebsd.org [mailto:owner-freebsd-net@freebsd.org] On Behalf Of Karl Pielorz
Sent: Friday, August 10, 2012 11:55 AM
To: freebsd-net@freebsd.org
Subject: FreeBSD 9.0-R em0 issues?

Hi,

Apologies for posting to -net as well - I originally posted this to
-hackers, but was advised to re-post it here...

A FreeBSD 9.0-R amd64 box - based on a SuperMicro X8DTL-IF Rev. 2.01
w/Intel L5630 & 6GB of RAM - seems to have issues with its onboard NIC (em
driver based - i.e. em0).

The machine runs fine - but then suddenly loses all network connectivity.
Nothing is logged on the console, or in /var/log/messages.

Doing an 'ifconfig em0 down' then up doesn't fix it. Rebooting the box
does fix it "for a while". Having dug around Google, I've now set
"hw.em.enable_msix=0" - the box ran the whole of the day with that set,
before em0 wedged up again.

When it does this 'netstat -n -i' returns "silly" figures - i.e.
if I catch it even moments after it's done it - it'll claim to have suffered
billions of input/output and collision errors (huge amounts more than the
amount of traffic that machine would have handled) - e.g.

"
Name  Mtu Network Address            Ipkts          Ierrs Idrop  Opkts         Oerrs          Coll
em0  1500         00:25:90:31:82:46 355482 10612864185945     0 291109 3032246910270 1516123455135
"

Before it locks up, all ierr,oerr,coll are zero (i.e. only ipkts/opkts have
incremented).

Running sysctl dev.em.0.debug=1 spits out on the console:

Interface is RUNNING and INACTIVE
em0: hw tdh = -1, hw tdt = -1
em0: hw rdh = -1, hw rdt = -1
em0: Tx Queue Status = 1
em0: Tx descriptors avail = 986
em0: Tx Descriptors avail failure = 0
em0: RX discarded packets = 0
em0: RX Next to Check = 844
em0: RX Next to Refresh = 843

(I don't like the look of the -1's but I don't know enough about what that
sysctl dumps out to know if that's "bad").

The HP switch it's connected to doesn't seem to log any errors for the port.

Any suggestions on how I can debug this further, or any ideas to try and
fix it?

You can find the dmesg output here:

(ifconfig -a is tacked on the end)

Regards,

-Karl
_______________________________________________
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to "freebsd-net-unsubscribe@freebsd.org"
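
A quick arithmetic check on the netstat counters quoted above, offered as a
sketch rather than a diagnosis. The -1 values for tdh/tdt/rdh/rdt are what
0xFFFFFFFF looks like when printed as a signed 32-bit integer, and a read of
all ones from a memory-mapped register is commonly what comes back when a PCI
device has stopped responding. If the error counters were being built up from
such reads, they should be exact multiples of 0xFFFFFFFF. The Python below
tests only that arithmetic on the three numbers from the netstat output. The
helper name all_ones_multiple is made up for this example, and the idea that
the driver periodically adds register values into these counters is an
assumption that the logs do not confirm.

```python
# 32-bit all-ones value; printed as a signed int this is -1.
ALL_ONES = 0xFFFFFFFF

# Error counters copied from the 'netstat -n -i' output in the original post.
counters = {
    "Ierrs": 10612864185945,
    "Oerrs": 3032246910270,
    "Coll": 1516123455135,
}


def all_ones_multiple(value):
    """Hypothetical helper: return value // 0xFFFFFFFF when value is a
    non-zero exact multiple of 0xFFFFFFFF, otherwise None."""
    if value and value % ALL_ONES == 0:
        return value // ALL_ONES
    return None


multiples = {name: all_ones_multiple(v) for name, v in counters.items()}

for name, factor in multiples.items():
    print(name, factor)
# Ierrs 2471
# Oerrs 706
# Coll 353
```

All three divide evenly, and the quotients are in a 7:2:1 ratio (2471 = 7 x
353, 706 = 2 x 353). That would fit, again only as an assumption, a driver
summing seven, two and one all-ones register reads into Ierrs, Oerrs and Coll
on each of 353 statistics updates, which points at the NIC no longer answering
register reads rather than at real traffic errors.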