Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 10 Apr 2019 18:49:43 +0000
From:      bugzilla-noreply@freebsd.org
To:        net@FreeBSD.org
Subject:   [Bug 221146] [ixgbe] Problem with second laggport
Message-ID:  <bug-221146-7501-cS45QIU0SI@https.bugs.freebsd.org/bugzilla/>
In-Reply-To: <bug-221146-7501@https.bugs.freebsd.org/bugzilla/>
References:  <bug-221146-7501@https.bugs.freebsd.org/bugzilla/>

next in thread | previous in thread | raw e-mail | index | archive | help
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D221146

Johan Str=C3=B6m <johan@stromnet.se> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |johan@stromnet.se

--- Comment #26 from Johan Str=C3=B6m <johan@stromnet.se> ---
Hi,

not sure if this is fully related, but I've had issues with carp and lagg t=
oo.
If changing the carp status, i.e. plugging in one of the configured interfa=
ces
(that was how I first noticed it), then the lagg0 interface went down and u=
p,
but carp failed to catch up on this. This happens on reboot too (but not su=
re
it has happened *every time*, this is a new setup).

The net.inet.carp.demotion counter went to 2160 ( /240 =3D 9, which is the =
number
of VLANs with CARP on the lagg), but got stuck there and never came back do=
wn
to 0:

Apr 10 20:23:00 gw1 kernel: ix1: link state changed to UP
Apr 10 20:23:07 gw1 kernel: carp: demoted by 240 to 240 (send error 50 on
vlan14)
Apr 10 20:23:07 gw1 kernel: carp: demoted by 240 to 480 (send error 50 on
vlan11)
Apr 10 20:23:07 gw1 kernel: carp: demoted by 240 to 720 (send error 50 on
vlan17)
Apr 10 20:23:07 gw1 kernel: carp: demoted by 240 to 960 (send error 50 on
vlan16)
Apr 10 20:23:07 gw1 kernel: carp: demoted by 240 to 1200 (send error 50 on
vlan13)
Apr 10 20:23:07 gw1 kernel: carp: demoted by 240 to 1440 (send error 50 on
vlan15)
Apr 10 20:23:07 gw1 kernel: carp: demoted by 240 to 1680 (send error 50 on
vlan10)
Apr 10 20:23:07 gw1 kernel: carp: demoted by 240 to 1920 (send error 50 on
vlan1)
Apr 10 20:23:07 gw1 kernel: carp: demoted by 240 to 2160 (send error 50 on
vlan18)
Apr 10 20:23:08 gw1 kernel: carp: 10@vlan10: MASTER -> BACKUP (more frequent
advertisement received)
Apr 10 20:23:08 gw1 kernel: ifa_maintain_loopback_route: deletion failed for
interface vlan10: 3
Apr 10 20:23:08 gw1 kernel: carp: 13@vlan13: MASTER -> BACKUP (more frequent
advertisement received)
Apr 10 20:23:08 gw1 kernel: ifa_maintain_loopback_route: deletion failed for
interface vlan13: 3
Apr 10 20:23:08 gw1 kernel: carp: 5@vlan15: MASTER -> BACKUP (more frequent
advertisement received)
Apr 10 20:23:08 gw1 kernel: ifa_maintain_loopback_route: deletion failed for
interface vlan15: 3
Apr 10 20:23:08 gw1 kernel: carp: 18@vlan18: MASTER -> BACKUP (more frequent
advertisement received)
Apr 10 20:23:08 gw1 kernel: ifa_maintain_loopback_route: deletion failed for
interface vlan18: 3
Apr 10 20:23:08 gw1 kernel: carp: 11@vlan11: MASTER -> BACKUP (more frequent
advertisement received)
Apr 10 20:23:08 gw1 kernel: arp: 172.28.2.1 moved from 00:22:4d:6b:b1:5b to
00:00:5e:00:01:0b on vlan11
Apr 10 20:23:08 gw1 kernel: ifa_maintain_loopback_route: deletion failed for
interface vlan11: 3
Apr 10 20:23:08 gw1 kernel: carp: 17@vlan17: MASTER -> BACKUP (more frequent
advertisement received)
Apr 10 20:23:08 gw1 kernel: ifa_maintain_loopback_route: deletion failed for
interface vlan17: 3
Apr 10 20:23:08 gw1 kernel: carp: 1@vlan1: MASTER -> BACKUP (more frequent
advertisement received)
Apr 10 20:23:08 gw1 kernel: ifa_maintain_loopback_route: deletion failed for
interface vlan1: 3
Apr 10 20:23:08 gw1 kernel: carp: 14@vlan14: MASTER -> BACKUP (more frequent
advertisement received)
Apr 10 20:23:08 gw1 kernel: ifa_maintain_loopback_route: deletion failed for
interface vlan14: 3
Apr 10 20:23:08 gw1 kernel: carp: 16@vlan16: MASTER -> BACKUP (more frequent
advertisement received)
Apr 10 20:23:08 gw1 kernel: ifa_maintain_loopback_route: deletion failed for
interface vlan16: 3


Then I manually removed the demote via sysctl:

Apr 10 20:23:21 gw1 kernel: carp: demoted by -2160 to 0 (sysctl)
Apr 10 20:23:23 gw1 kernel: carp: 11@vlan11: BACKUP -> MASTER (preempting a
slower master)
Apr 10 20:23:23 gw1 kernel: carp: 17@vlan17: BACKUP -> MASTER (preempting a
slower master)
Apr 10 20:23:23 gw1 kernel: carp: 14@vlan14: BACKUP -> MASTER (preempting a
slower master)
Apr 10 20:23:23 gw1 kernel: carp: 1@vlan1: BACKUP -> MASTER (preempting a
slower master)
Apr 10 20:23:23 gw1 kernel: carp: 13@vlan13: BACKUP -> MASTER (preempting a
slower master)
Apr 10 20:23:23 gw1 kernel: carp: 10@vlan10: BACKUP -> MASTER (preempting a
slower master)
Apr 10 20:23:23 gw1 kernel: arp: 172.28.4.129 moved from 00:00:5e:00:01:11 =
to
00:22:4d:6b:b1:5b on vlan17
Apr 10 20:23:23 gw1 kernel: carp: 5@vlan15: BACKUP -> MASTER (preempting a
slower master)
Apr 10 20:23:23 gw1 kernel: carp: 16@vlan16: BACKUP -> MASTER (preempting a
slower master)
Apr 10 20:23:23 gw1 kernel: carp: 18@vlan18: BACKUP -> MASTER (preempting a
slower master)


(Also a bit interesting that it mentions those ARP changes..  Why would eit=
her
of the nodes announce the CARPed IP on the nic mac rather than the CARP ip,=
 at
any time?)

The "other" carp node (not using lagg) is 11.2-RELEASE-p7, this node with l=
agg
is 11.2-RELEASE-p8.
The lagg'ed nic's are ix0-ix4 "<Intel(R) PRO/10GbE PCI-Express Network Driv=
er,
Version - 3.2.12-k> mem 0xddc00000-0xdddfffff,0xdde04000-0xdde07fff at devi=
ce
0.0 on pci6" on a Supermicro A2SDi-4C-HLN4F.

On both nodes, net.inet.carp.preempt=3D1, and advbase 1, advskew 100 on this
node, 200 on the other.


To add another dimension to this. If I set net.inet.carp.preempt=3D0 (which=
 I had
initially), I cannot get the interfaces out of BACKUP at all:
...
Apr 10 20:45:23 gw1 kernel: carp: 1@vlan1: MASTER -> BACKUP (more frequent
advertisement received)
Apr 10 20:45:23 gw1 kernel: ifa_maintain_loopback_route: deletion failed for
interface vlan1: 3
Apr 10 20:45:36 gw1 kernel: carp: demoted by -2160 to 0 (sysctl)

and then nothing more.. Setting it to 1 again, immediately makes it master.


Anyway, not sure if this is related to the ixgbe 3.2.12-k driver, or lagg, =
or
carp..  But I thought I'd write it down here anyway.

--=20
You are receiving this mail because:
You are the assignee for the bug.=



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?bug-221146-7501-cS45QIU0SI>