Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 22 Sep 2016 08:09:40 -0700
From:      Gleb Smirnoff <glebius@FreeBSD.org>
To:        Steven Hartland <killing@multiplay.co.uk>
Cc:        Ryan Stone <rysto32@gmail.com>, Kubilay Kocak <koobs@freebsd.org>, freebsd-net <freebsd-net@freebsd.org>, Karl Pielorz <kpielorz_lst@tdx.co.uk>
Subject:   Re: lagg Interfaces - don't do Gratuitous ARP?
Message-ID:  <20160922150940.GK1018@cell.glebi.us>
In-Reply-To: <348d534d-ef87-f90c-aa43-cc65c2f6283c@multiplay.co.uk>
References:  <0D84203FAAFD0A8E7BBB24A3@10.12.30.106> <bc33560b-59bc-01be-6a5d-7994ac121258@multiplay.co.uk> <6E574F1B61786E6032824A88@10.12.30.106> <2c62f5f0-3fb4-f513-2a8f-02de3a1d552f@FreeBSD.org> <20160921235703.GG1018@cell.glebi.us> <CAFMmRNwZBEJ9Me4FSh=W7fRNjm4344jiUGuJqX8KUB_0sWcajA@mail.gmail.com> <20160922025856.GH1018@cell.glebi.us> <348d534d-ef87-f90c-aa43-cc65c2f6283c@multiplay.co.uk>

next in thread | previous in thread | raw e-mail | index | archive | help
On Thu, Sep 22, 2016 at 08:21:08AM +0100, Steven Hartland wrote:
S> On 22/09/2016 03:58, Gleb Smirnoff wrote:
S> > On Wed, Sep 21, 2016 at 09:12:11PM -0400, Ryan Stone wrote:
S> > R> > IMHO, the original patch was absolutely evil hack touching multiple
S> > R> > layers, for the sake of a very special problem.
S> > R> >
S> > R> > I think, that in order to kick forwarding table on switches, lagg
S> > R> > should:
S> > R> >
S> > R> > - allocate an mbuf itself
S> > R> > - set its source hardware address to its own
S> > R> > - set destination hardware to broadcast
S> > R> > - put some payload in there, to make packet of valid size. Why should it be
S> > R> >   gratuitous ARP? A machine can be running IPv6 only, or may even use
S> > R> > whatever
S> > R> >   higher level protocol, e.g. PPPoE. We shouldn't involve IP into this
S> > R> > Layer 2
S> > R> >   problem at all.
S> > R> > - Finally, send the prepared mbuf down the lagg member(s).
S> > R> >
S> > R> > And please don't hack half of the network stack to achieve that :)
S> > R>
S> > R> The original report in this thread is about a system where it takes almost
S> > R> 15 minutes for the network to start working again after a failover.  That
S> > R> does not sound to me like a switch problem.  That sounds to me like the ARP
S> > R> cache on the remote system.  To fix such a case we have to touch L3.
S> >
S> > Does lagg(4) hardware address change when it failovers?
S> >
S> It moves the address between interfaces which typically moves it between 
S> switches too.

So, the address doesn't change, which means ARP cache doesn't need to
change as well. If it moves between switches, all that needs to be done
is to send whatever packet from proper hardware address to broadcast.

-- 
Totus tuus, Glebius.



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20160922150940.GK1018>