Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 22 Sep 2016 17:50:09 +0100
From:      Steven Hartland <killing@multiplay.co.uk>
To:        Gleb Smirnoff <glebius@FreeBSD.org>
Cc:        Ryan Stone <rysto32@gmail.com>, Kubilay Kocak <koobs@freebsd.org>, freebsd-net <freebsd-net@freebsd.org>, Karl Pielorz <kpielorz_lst@tdx.co.uk>
Subject:   Re: lagg Interfaces - don't do Gratuitous ARP?
Message-ID:  <80fd962a-fba3-d71e-a1cb-2b09181d3925@multiplay.co.uk>
In-Reply-To: <20160922160840.GP1018@cell.glebi.us>
References:  <6E574F1B61786E6032824A88@10.12.30.106> <2c62f5f0-3fb4-f513-2a8f-02de3a1d552f@FreeBSD.org> <20160921235703.GG1018@cell.glebi.us> <CAFMmRNwZBEJ9Me4FSh=W7fRNjm4344jiUGuJqX8KUB_0sWcajA@mail.gmail.com> <20160922025856.GH1018@cell.glebi.us> <348d534d-ef87-f90c-aa43-cc65c2f6283c@multiplay.co.uk> <20160922150940.GK1018@cell.glebi.us> <f4100561-4977-0b19-c245-0cd09438943d@multiplay.co.uk> <20160922154144.GO1018@cell.glebi.us> <0c678da4-bf72-5a81-aee1-d82a873661b7@multiplay.co.uk> <20160922160840.GP1018@cell.glebi.us>

next in thread | previous in thread | raw e-mail | index | archive | help
On 22/09/2016 17:08, Gleb Smirnoff wrote:
> On Thu, Sep 22, 2016 at 04:52:35PM +0100, Steven Hartland wrote:
> S> > S> > S> > Does lagg(4) hardware address change when it failovers?
> S> > S> > S> >
> S> > S> > S> It moves the address between interfaces which typically moves it between
> S> > S> > S> switches too.
> S> > S> >
> S> > S> > So, the address doesn't change, which means ARP cache doesn't need to
> S> > S> > change as well. If it moves between switches, all that needs to be done
> S> > S> > is to send whatever packet from proper hardware address to broadcast.
> S> > S> >
> S> > S> That would be nice but unfortunately in the wild that won't work as
> S> > S> without GARP devices can and do ignore :(
> S> >
> S> > You can create a fake gratious ARP packet, if you want. Switches must not
> S> > require IP addresses matching the reality in the packet.
> S> >
> S> > P.S. I always read GARP as Generic Attribute Registration Protocol.
> S> >
> S> We could but then what happens when its IPv6 or $other protocol that
> S> needs to know? That would require lagg to be edited with all the special
> S> cases instead of allowing the protocol to handle it they way it needs.
>
> You just said that "without GARP devices can and do ignore", didn't you?
> Let's take this as truth, although I doubt. So, if this is the truth, that
> means that if you are running IPv6 only, the switches won't recondigure
> theirselves due to lack of gratious ARP.
Not sure I follow you, gratuitous ARP is required for IPv4 to work, for 
IPv6 you need an unsolicited neighbour announcement.
> Other protocols, where PPPoE is good example simply doesn't have any
> analogs of ARP or ND. So what would your switches do in that case? And
> what other layers are you going to hack, if you are going to run PPPoE
> service with lagg failover?
Good question, surely that's a good reason to have each protocol handle 
it and not to teach LAGG about every possible protocol?
> In reality, a layer 2 device must forward layer 2 traffic, and must
> reconfigure its forwarding table based on source addresses seen on ports.
> And that's what all devices I've seen do. So what if we actually try
> the approach, I suggested? I can write the patch for you if you want.
The main problem with LAGG in failover mode is ensuring the traffic is 
sent to the correct port.

When you have the scenario where a switch stack believes MAC XYZ is 
accessible by port ABC then unless you tell it otherwise it will 
continue to believe that and hence send traffic to said port. I'm sure 
we'll agree that the standard for doing this for IPv4 is ARP and for 
IPv6 is NA.

When using LAGG and we loose the master port we need correct the 
connected devices view (both direct and remote) of the world such that 
traffic is now sent to a different physical port.

Back in the day, when switches weren't so "smart", sending a correctly 
address packet from the new port would potentially help, but with 
smarter switches and stacking in the mix sticking to the "standards" 
helps maintain compatibility and hence functionality with things like LAGG.

Having tested with a number of vendor switches Cisco, Extreme and more 
recently Arista only sending gratuitous ARP for IPv4 and unsolicited NA 
for IPv6 reliably resulted in rapid failover between LAGG ports.

Other methods like sending correctly addressed output from the new port 
helped, we tested this with outbound pings from IPMI, but still resulted 
in noticeable recovery delay.

> S> Overall, while the proposed change (https://reviews.freebsd.org/D4111)
> S> does involve changes to multiple layers it still feels like the right
> S> approach as it has the right layer dealing with the change instead of
> S> hard-coded assumptions.
>
> Sorry, it doesn't feel like the right approach. :(
Out of interest why has your opinion changed since your post here: 
https://lists.freebsd.org/pipermail/freebsd-net/2012-February/031340.html ?



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?80fd962a-fba3-d71e-a1cb-2b09181d3925>