FreeBSD Mail Archives

Date:      Sat, 17 Aug 2013 20:19:04 -0700
From:      aurfalien <aurfalien@gmail.com>
To:        iamatt <iamatt@gmail.com>
Cc:        FreeBSD Mailing List <freebsd-questions@freebsd.org>
Subject:   Re: Myrinet 10Gb odd behavior - SOLVED
Message-ID:  <D8FB1DBC-9B68-4556-A366-868AC5065097@gmail.com>
In-Reply-To: <CAEeRwNU%2BhuENqY26z1uRpJ139hwWAOo=Og96Vrk-_6Zp_7L7Vg@mail.gmail.com>
References:  <297A8244-3756-4126-9F23-B772B81127C7@gmail.com> <52F83CC6-2D87-43E7-9F9C-9D16ED637064@gmail.com> <EF8734BC-1E7B-402B-B62A-EFE07A414EFB@gmail.com> <CAEeRwNU%2BhuENqY26z1uRpJ139hwWAOo=Og96Vrk-_6Zp_7L7Vg@mail.gmail.com>

Spoke to soon. Fine for a while (doing a 5 day rsync of 38TB) but =
getting those errors every 7 min.  And I'm only getting 1.24Gb/s over a =
10Gb jumbo link.

Definitely causing connection issues.

Using it for ethernet.

Gonna go in tomorrow and give my Solarflare another shot as it was =
giving me issues but the rel notes say to try this, so I will;

 - The driver uses mbufs to store packet data which come from a set of =
pools
   of limted size. See man 7 tuning for more details. The following =
command
   can display the number of used and free mbufs within the pools the =
Solarflare
   driver uses

# vmstat -z | head -n 1; vmstat -z | grep mbuf
ITEM                     SIZE     LIMIT      USED      FREE  REQUESTS  =
FAILURES
mbuf_cluster:            2048,    25600,     1408,      658,    31604,   =
     0
mbuf_jumbo_page:         4096,    12800,        0,       76,     2063,   =
     0
mbuf_jumbo_9k:           9216,     6400,        0,        0,        0,   =
     0
mbuf_jumbo_16k:         16384,     3200,        0,        0,        0,   =
     0

  If a pool is exhausted (i.e. the failure count in the right hand =
column is
  non-zero, networking applications may hang or received packets may be =
dropped.
  Hence you may need to increase these limits using the following =
sysctls:
     kern.ipc.nmbclusters (for mbuf_cluster)
     kern.ipc.nmbjumbop   (for mbuf_jumbo_page)
     kern.ipc.nmbjumbo9   (for mbuf_jumbo_9k)
     kern.ipc.nmbjumbo16  (for mbuf_jumbo_16k)


- aurf



On Aug 17, 2013, at 8:14 PM, iamatt wrote:

> Wow myricom still around...  used to use the lanai stuff never on bsd =
though.  All FDR Infiniband these days.  Are you using the myrinet =
protocol or ethernet,  just curious.  Glad you got it working!
>=20
> On Aug 16, 2013 8:12 PM, "aurfalien" <aurfalien@gmail.com> wrote:
>=20
> On Aug 16, 2013, at 8:47 AM, aurfalien wrote:
>=20
> > Forgot to mention my loader.conf;
> >
> > if_mxge_load=3D"YES"
> > mxge_ethp_z8e_load=3D"YES"
> > mxge_eth_z8e_load=3D"YES"
> > mxge_rss_ethp_z8e_load=3D"YES"
> > mxge_rss_eth_z8e_load=3D"YES"
> >
> >
> > I blindly added these w/o thinking what they do.
> >
> > Should I simply only load the first line?
> >
> > - aurf
> >
> >
> > On Aug 16, 2013, at 8:18 AM, aurfalien wrote:
> >
> >> Hi,
> >>
> >> I've been suspecting my NIC is not up to par and notice this in the =
logs every few minutes;
> >>
> >> Aug 16 08:05:06 prometheus kernel: mxge0: slice 0 struck? ring =
state:
> >> Aug 16 08:05:06 prometheus kernel: mxge0: tx.req=3D1914503981 =
tx.done=3D1914503810, tx.queue_active=3D0
> >> Aug 16 08:05:06 prometheus kernel: mxge0: tx.activate=3D0 =
tx.deactivate=3D0
> >> Aug 16 08:05:06 prometheus kernel: mxge0: pkt_done=3D1824019832 =
fw=3D1824019931
> >> Aug 16 08:05:06 prometheus kernel: mxge0: Watchdog reset!
> >> Aug 16 08:05:06 prometheus kernel: mxge0: NIC did not reboot, not =
resetting
> >>
> >> Could tis be effecting throughput?
> >>
> >> My card is a Myri-10G-PCIE-8A
> >>
> >> I did install the Myrinet dev tools for FreeBSD and ran myri_info =
which yields;
> >>
> >> pci-dev at 05:00.0 vendor:product(rev)=3D14c1:0008(00)
> >>      behind bridge root-port: 00:03.0 8086:3c08 (x8.1/x16.3)
> >> Myri-10G-PCIE-8A -- Link x8
> >>  EEPROM String-spec:
> >>      MAC=3D00:60:dd:45:73:23
> >>      SN=3D413665
> >>      PWR=3D100
> >>      PC=3D10G-PCIE-8A-R
> >>      PN=3D09-03852
> >>      XFI=3DAEL1010
> >>      TAG=3Dze_tools-1_4_45
> >>
> >>  EEPROM MCP, PRESENT, length =3D 103384, crc=3D0x119daf46
> >>      ETHZ::1.4.45 2009/08/22 18:57:06 self extracting firmware
> >>      Bundle: exec_len=3D72144, PCI-ROM-len =3D 31232
> >>  Running MCP:
> >>      ETH ::1.4.55 -P- 2012/04/21 01:48:34 myri10ge firmware
> >>
> >> Any insights are appreciated.
> >>
> >> - aurf
>=20
>=20
> Did the ole RTFM and re programmed the firmware, all good now.
>=20
> - aurf
> _______________________________________________
> freebsd-questions@freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-questions
> To unsubscribe, send any mail to =
"freebsd-questions-unsubscribe@freebsd.org"

Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?D8FB1DBC-9B68-4556-A366-868AC5065097>

Header And Logo

Peripheral Links

Site Navigation

Header And Logo

Peripheral Links

Search

Site Navigation