Date: Sat, 17 Aug 2013 20:19:04 -0700 From: aurfalien <aurfalien@gmail.com> To: iamatt <iamatt@gmail.com> Cc: FreeBSD Mailing List <freebsd-questions@freebsd.org> Subject: Re: Myrinet 10Gb odd behavior - SOLVED Message-ID: <D8FB1DBC-9B68-4556-A366-868AC5065097@gmail.com> In-Reply-To: <CAEeRwNU%2BhuENqY26z1uRpJ139hwWAOo=Og96Vrk-_6Zp_7L7Vg@mail.gmail.com> References: <297A8244-3756-4126-9F23-B772B81127C7@gmail.com> <52F83CC6-2D87-43E7-9F9C-9D16ED637064@gmail.com> <EF8734BC-1E7B-402B-B62A-EFE07A414EFB@gmail.com> <CAEeRwNU%2BhuENqY26z1uRpJ139hwWAOo=Og96Vrk-_6Zp_7L7Vg@mail.gmail.com>
next in thread | previous in thread | raw e-mail | index | archive | help
Spoke to soon. Fine for a while (doing a 5 day rsync of 38TB) but = getting those errors every 7 min. And I'm only getting 1.24Gb/s over a = 10Gb jumbo link. Definitely causing connection issues. Using it for ethernet. Gonna go in tomorrow and give my Solarflare another shot as it was = giving me issues but the rel notes say to try this, so I will; - The driver uses mbufs to store packet data which come from a set of = pools of limted size. See man 7 tuning for more details. The following = command can display the number of used and free mbufs within the pools the = Solarflare driver uses # vmstat -z | head -n 1; vmstat -z | grep mbuf ITEM SIZE LIMIT USED FREE REQUESTS = FAILURES mbuf_cluster: 2048, 25600, 1408, 658, 31604, = 0 mbuf_jumbo_page: 4096, 12800, 0, 76, 2063, = 0 mbuf_jumbo_9k: 9216, 6400, 0, 0, 0, = 0 mbuf_jumbo_16k: 16384, 3200, 0, 0, 0, = 0 If a pool is exhausted (i.e. the failure count in the right hand = column is non-zero, networking applications may hang or received packets may be = dropped. Hence you may need to increase these limits using the following = sysctls: kern.ipc.nmbclusters (for mbuf_cluster) kern.ipc.nmbjumbop (for mbuf_jumbo_page) kern.ipc.nmbjumbo9 (for mbuf_jumbo_9k) kern.ipc.nmbjumbo16 (for mbuf_jumbo_16k) - aurf On Aug 17, 2013, at 8:14 PM, iamatt wrote: > Wow myricom still around... used to use the lanai stuff never on bsd = though. All FDR Infiniband these days. Are you using the myrinet = protocol or ethernet, just curious. Glad you got it working! >=20 > On Aug 16, 2013 8:12 PM, "aurfalien" <aurfalien@gmail.com> wrote: >=20 > On Aug 16, 2013, at 8:47 AM, aurfalien wrote: >=20 > > Forgot to mention my loader.conf; > > > > if_mxge_load=3D"YES" > > mxge_ethp_z8e_load=3D"YES" > > mxge_eth_z8e_load=3D"YES" > > mxge_rss_ethp_z8e_load=3D"YES" > > mxge_rss_eth_z8e_load=3D"YES" > > > > > > I blindly added these w/o thinking what they do. > > > > Should I simply only load the first line? > > > > - aurf > > > > > > On Aug 16, 2013, at 8:18 AM, aurfalien wrote: > > > >> Hi, > >> > >> I've been suspecting my NIC is not up to par and notice this in the = logs every few minutes; > >> > >> Aug 16 08:05:06 prometheus kernel: mxge0: slice 0 struck? ring = state: > >> Aug 16 08:05:06 prometheus kernel: mxge0: tx.req=3D1914503981 = tx.done=3D1914503810, tx.queue_active=3D0 > >> Aug 16 08:05:06 prometheus kernel: mxge0: tx.activate=3D0 = tx.deactivate=3D0 > >> Aug 16 08:05:06 prometheus kernel: mxge0: pkt_done=3D1824019832 = fw=3D1824019931 > >> Aug 16 08:05:06 prometheus kernel: mxge0: Watchdog reset! > >> Aug 16 08:05:06 prometheus kernel: mxge0: NIC did not reboot, not = resetting > >> > >> Could tis be effecting throughput? > >> > >> My card is a Myri-10G-PCIE-8A > >> > >> I did install the Myrinet dev tools for FreeBSD and ran myri_info = which yields; > >> > >> pci-dev at 05:00.0 vendor:product(rev)=3D14c1:0008(00) > >> behind bridge root-port: 00:03.0 8086:3c08 (x8.1/x16.3) > >> Myri-10G-PCIE-8A -- Link x8 > >> EEPROM String-spec: > >> MAC=3D00:60:dd:45:73:23 > >> SN=3D413665 > >> PWR=3D100 > >> PC=3D10G-PCIE-8A-R > >> PN=3D09-03852 > >> XFI=3DAEL1010 > >> TAG=3Dze_tools-1_4_45 > >> > >> EEPROM MCP, PRESENT, length =3D 103384, crc=3D0x119daf46 > >> ETHZ::1.4.45 2009/08/22 18:57:06 self extracting firmware > >> Bundle: exec_len=3D72144, PCI-ROM-len =3D 31232 > >> Running MCP: > >> ETH ::1.4.55 -P- 2012/04/21 01:48:34 myri10ge firmware > >> > >> Any insights are appreciated. > >> > >> - aurf >=20 >=20 > Did the ole RTFM and re programmed the firmware, all good now. >=20 > - aurf > _______________________________________________ > freebsd-questions@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-questions > To unsubscribe, send any mail to = "freebsd-questions-unsubscribe@freebsd.org"
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?D8FB1DBC-9B68-4556-A366-868AC5065097>