Date: Sat, 17 Aug 2013 20:19:04 -0700 From: aurfalien <aurfalien@gmail.com> To: iamatt <iamatt@gmail.com> Cc: FreeBSD Mailing List <freebsd-questions@freebsd.org> Subject: Re: Myrinet 10Gb odd behavior - SOLVED Message-ID: <D8FB1DBC-9B68-4556-A366-868AC5065097@gmail.com> In-Reply-To: <CAEeRwNU%2BhuENqY26z1uRpJ139hwWAOo=Og96Vrk-_6Zp_7L7Vg@mail.gmail.com> References: <297A8244-3756-4126-9F23-B772B81127C7@gmail.com> <52F83CC6-2D87-43E7-9F9C-9D16ED637064@gmail.com> <EF8734BC-1E7B-402B-B62A-EFE07A414EFB@gmail.com> <CAEeRwNU%2BhuENqY26z1uRpJ139hwWAOo=Og96Vrk-_6Zp_7L7Vg@mail.gmail.com>
next in thread | previous in thread | raw e-mail | index | archive | help
Spoke to soon. Fine for a while (doing a 5 day rsync of 38TB) but =
getting those errors every 7 min. And I'm only getting 1.24Gb/s over a =
10Gb jumbo link.
Definitely causing connection issues.
Using it for ethernet.
Gonna go in tomorrow and give my Solarflare another shot as it was =
giving me issues but the rel notes say to try this, so I will;
- The driver uses mbufs to store packet data which come from a set of =
pools
of limted size. See man 7 tuning for more details. The following =
command
can display the number of used and free mbufs within the pools the =
Solarflare
driver uses
# vmstat -z | head -n 1; vmstat -z | grep mbuf
ITEM SIZE LIMIT USED FREE REQUESTS =
FAILURES
mbuf_cluster: 2048, 25600, 1408, 658, 31604, =
0
mbuf_jumbo_page: 4096, 12800, 0, 76, 2063, =
0
mbuf_jumbo_9k: 9216, 6400, 0, 0, 0, =
0
mbuf_jumbo_16k: 16384, 3200, 0, 0, 0, =
0
If a pool is exhausted (i.e. the failure count in the right hand =
column is
non-zero, networking applications may hang or received packets may be =
dropped.
Hence you may need to increase these limits using the following =
sysctls:
kern.ipc.nmbclusters (for mbuf_cluster)
kern.ipc.nmbjumbop (for mbuf_jumbo_page)
kern.ipc.nmbjumbo9 (for mbuf_jumbo_9k)
kern.ipc.nmbjumbo16 (for mbuf_jumbo_16k)
- aurf
On Aug 17, 2013, at 8:14 PM, iamatt wrote:
> Wow myricom still around... used to use the lanai stuff never on bsd =
though. All FDR Infiniband these days. Are you using the myrinet =
protocol or ethernet, just curious. Glad you got it working!
>=20
> On Aug 16, 2013 8:12 PM, "aurfalien" <aurfalien@gmail.com> wrote:
>=20
> On Aug 16, 2013, at 8:47 AM, aurfalien wrote:
>=20
> > Forgot to mention my loader.conf;
> >
> > if_mxge_load=3D"YES"
> > mxge_ethp_z8e_load=3D"YES"
> > mxge_eth_z8e_load=3D"YES"
> > mxge_rss_ethp_z8e_load=3D"YES"
> > mxge_rss_eth_z8e_load=3D"YES"
> >
> >
> > I blindly added these w/o thinking what they do.
> >
> > Should I simply only load the first line?
> >
> > - aurf
> >
> >
> > On Aug 16, 2013, at 8:18 AM, aurfalien wrote:
> >
> >> Hi,
> >>
> >> I've been suspecting my NIC is not up to par and notice this in the =
logs every few minutes;
> >>
> >> Aug 16 08:05:06 prometheus kernel: mxge0: slice 0 struck? ring =
state:
> >> Aug 16 08:05:06 prometheus kernel: mxge0: tx.req=3D1914503981 =
tx.done=3D1914503810, tx.queue_active=3D0
> >> Aug 16 08:05:06 prometheus kernel: mxge0: tx.activate=3D0 =
tx.deactivate=3D0
> >> Aug 16 08:05:06 prometheus kernel: mxge0: pkt_done=3D1824019832 =
fw=3D1824019931
> >> Aug 16 08:05:06 prometheus kernel: mxge0: Watchdog reset!
> >> Aug 16 08:05:06 prometheus kernel: mxge0: NIC did not reboot, not =
resetting
> >>
> >> Could tis be effecting throughput?
> >>
> >> My card is a Myri-10G-PCIE-8A
> >>
> >> I did install the Myrinet dev tools for FreeBSD and ran myri_info =
which yields;
> >>
> >> pci-dev at 05:00.0 vendor:product(rev)=3D14c1:0008(00)
> >> behind bridge root-port: 00:03.0 8086:3c08 (x8.1/x16.3)
> >> Myri-10G-PCIE-8A -- Link x8
> >> EEPROM String-spec:
> >> MAC=3D00:60:dd:45:73:23
> >> SN=3D413665
> >> PWR=3D100
> >> PC=3D10G-PCIE-8A-R
> >> PN=3D09-03852
> >> XFI=3DAEL1010
> >> TAG=3Dze_tools-1_4_45
> >>
> >> EEPROM MCP, PRESENT, length =3D 103384, crc=3D0x119daf46
> >> ETHZ::1.4.45 2009/08/22 18:57:06 self extracting firmware
> >> Bundle: exec_len=3D72144, PCI-ROM-len =3D 31232
> >> Running MCP:
> >> ETH ::1.4.55 -P- 2012/04/21 01:48:34 myri10ge firmware
> >>
> >> Any insights are appreciated.
> >>
> >> - aurf
>=20
>=20
> Did the ole RTFM and re programmed the firmware, all good now.
>=20
> - aurf
> _______________________________________________
> freebsd-questions@freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-questions
> To unsubscribe, send any mail to =
"freebsd-questions-unsubscribe@freebsd.org"
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?D8FB1DBC-9B68-4556-A366-868AC5065097>
