Date:      Fri, 28 Oct 2005 14:28:18 -0400
From:      John Baldwin <jhb@freebsd.org>
To:        Brooks Davis <brooks@one-eyed-alien.net>
Cc:        Tom McLaughlin <tmclaugh@sdf.lonestar.org>, freebsd-current@freebsd.org
Subject:   Re: kern/87506 : [PATCH] Fix alias support on vr interfaces
Message-ID:  <200510281428.19462.jhb@freebsd.org>
In-Reply-To: <20051028180420.GA17096@odin.ac.hmc.edu>
References:  <41473.192.216.27.32.1129818223.squirrel@192.216.27.32> <200510281357.05802.jhb@freebsd.org> <20051028180420.GA17096@odin.ac.hmc.edu>

On Friday 28 October 2005 02:04 pm, Brooks Davis wrote:
> On Fri, Oct 28, 2005 at 01:57:04PM -0400, John Baldwin wrote:
> > On Thursday 27 October 2005 04:14 pm, Brooks Davis wrote:
> > > On Thu, Oct 27, 2005 at 04:02:26PM -0400, Anish Mistry wrote:
> > > > On Thursday 20 October 2005 05:29 pm, Brooks Davis wrote:
> > > > > On Thu, Oct 20, 2005 at 04:51:21PM -0400, John Baldwin wrote:
> > > > > > On Thursday 20 October 2005 02:27 pm, Anish Mistry wrote:
> > > > > > > On Thursday 20 October 2005 11:15 am, John Baldwin wrote:
> > > > > > > > On Thursday 20 October 2005 10:23 am, Tom McLaughlin wrote:
> > > > > > > > > Hi, would someone be available to look at kern PR 87506?  I
> > > > > > > > > found the problem while building a jail on my 6.0-RC1 box.
> > > > > > > > > When setting an alias on a vr device with ifconfig the
> > > > > > > > > alias address replaces the initial address of the device.
> > > > > > > > > marcus@ took a look at the problem for me and provided the
> > > > > > > > > patch for the vr interface which I tested and it resolved
> > > > > > > > > the problem. Without the patch I can't create a jail that
> > > > > > > > > is network accessible.
> > > > > > > > >
> > > > > > > > > Marcus also notes in the PR that the rl, sis, ti, and tl
> > > > > > > > > may also be affected by this bug.  If anyone has any other
> > > > > > > > > questions or needs someone to test any changes to the patch
> > > > > > > > > just let me know. I only have vr interfaces on my FreeBSD
> > > > > > > > > machines though.  Thanks.
> > > > > > > > >
> > > > > > > > > Tom
> > > > > > > >
> > > > > > > > I'm not sure that fix is really the right fix.  The patch
> > > > > > > > just makes vr(4) ignore changes to if_flags while the driver
> > > > > > > > is up. Probably there is a bug in vr(4)'s handling of alias
> > > > > > > > addresses.  I did just reproduce this on my laptop's rl(4)
> > > > > > > > interface though. I'll see if I can't figure out what is
> > > > > > > > happening.
> > > > > > >
> > > > > > > I'm seeing this too, along with the following.
> > > > > > >
> > > > > > > I'm not sure if this is related, but I'm seeing it on
> > > > > > > RELENG_6 and CURRENT, while an older RELENG_5 from ~2 months
> > > > > > > ago doesn't show the problem.
> > > > > > > I'm trying to set up my workstation with a normal DHCP'd address
> > > > > > > and an alias IP for a jail running on the system, but the alias
> > > > > > > setting wipes out all the other addresses on the interface.
> > > > > > > in /etc/rc.conf:
> > > > > > > ifconfig_rl0="DHCP"
> > > > > > > ifconfig_rl0_alias0="inet 192.168.1.10 netmask 255.255.255.255"
> > > > > > >
> > > > > > > I've checked the rc boot order (on RELENG_6 and CURRENT) and it
> > > > > > > seems correct:
> > > > > > > netif
> > > > > > > dhclient
> > > > > > > netif
> > > > > > >
> > > > > > > I narrowed it down to:
> > > > > > > dhclient rl0
> > > > > > > ifconfig rl0 inet 192.168.1.11 netmask 0xffffffff alias
> > > > > > > [dhclient prints a message here saying connection closed and
> > > > > > > exiting]
> > > > > > >
> > > > > > > All of the other addresses on the card are removed.
> > > > > > > I'm also seeing this on dc.  So thinking it to be a problem in
> > > > > > > ifconfig I copied over the version from my RELENG_5 box, and
> > > > > > > that did the same thing...so this seems to be present in
> > > > > > > several of the network drivers in RELENG_6/CURRENT.
> > > > > >
> > > > > > Yes, it seems to be an issue with dhclient.  If I turn dhclient
> > > > > > off and manually configure my NIC then the alias works fine:
> > > > > > rl0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1500
> > > > > >         options=8<VLAN_MTU>
> > > > > >         inet6 fe80::290:f5ff:fe0e:c8e5%rl0 prefixlen 64 scopeid 0x2
> > > > > >         inet 10.50.41.234 netmask 0xfffffe00 broadcast 10.50.41.255
> > > > > >         inet 10.50.41.101 netmask 0xffffffff broadcast 10.50.41.101
> > > > > >         ether 00:90:f5:0e:c8:e5
> > > > > >         media: Ethernet autoselect (100baseTX <full-duplex>)
> > > > > >         status: active
> > > > >
> > > > > I believe the problem is that adding an address to a NIC causes a
> > > > > call to ifp->if_init() which resets the media and triggers a
> > > > > LINK_DOWN event in all too many cases.  Using if_init here is
> > > > > rather like driving finishing nails with a sledge hammer.
> > > >
> > > > Has there been any resolution of this issue?  6.0 is just around
> > > > the corner.
> > >
> > > No.  It's going to require non-trivial changes to the design of drivers
> > > and eventually some changes to the driver API to really fix this so
> > > it's really too late in the game.
> > >
> > > I think we need to document the lack of support for this configuration
> > > and move on.  Frankly, it's a configuration I see no compelling reason
> > > to support.  If you are setting aliases, you should know what your
> > > address is so you don't need dhcp.  I'm aware there are other reasons
> > > to want dhcp, but I personally put this sort of thing in the "that
> > > would be nice" category.  We should try to fix it, but it's nowhere
> > > near the top of my list.
> >
> > Actually, I think the best fix would be to add some de-bounce logic to
> > dhclient so that when it gets a link down state, it waits a second before
> > acting on it, and if it gets a link up event before the second expires it
> > just cancels the pending link down action.  That change would probably go
> > a long way to fixing several people's problems with spurious link state
> > changes.
>
> True.  Dhclient should however complain loudly if it happens with any
> kind of frequency.  Note that one second probably won't cut it with many
> gigabit nics. :(

Just make the timeout configurable and people can tweak it for a given 
setup. :)
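
For illustration, here is a minimal sketch of the de-bounce idea with a configurable timeout. It is not dhclient code: it is written in Python rather than C, and the class name, method names, and the `holddown` parameter are all hypothetical. Times are passed in explicitly so the logic can be followed without real timers.

```python
class LinkDebouncer:
    """Delay acting on link-down events so that short flaps are ignored.

    Illustrative only: every name here is hypothetical, not a dhclient API.
    """

    def __init__(self, holddown=1.0):
        # Configurable hold-down window in seconds (hypothetical knob).
        self.holddown = holddown
        self.down_since = None  # time of the pending link-down, if any
        self.flaps = 0          # link-downs cancelled by a quick link-up

    def link_down(self, now):
        # Record the event but do not act yet; keep the earliest timestamp.
        if self.down_since is None:
            self.down_since = now

    def link_up(self, now):
        # A link-up inside the window cancels the pending link-down action.
        # A link-down that already outlasted the window stays pending so
        # that poll() still reports it.
        if self.down_since is not None and now - self.down_since < self.holddown:
            self.down_since = None
            self.flaps += 1

    def poll(self, now):
        # Return True exactly once when a link-down has outlasted the window.
        if self.down_since is not None and now - self.down_since >= self.holddown:
            self.down_since = None
            return True
        return False


if __name__ == "__main__":
    d = LinkDebouncer(holddown=1.0)
    d.link_down(0.0)
    d.link_up(0.3)   # flap shorter than the window: cancelled
    print("act on link down?", d.poll(2.0), "flaps seen:", d.flaps)
```

The `flaps` counter is there so a caller could warn when cancelled link-downs happen frequently, and a NIC that needs longer to renegotiate would simply get a larger `holddown`.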

-- 
John Baldwin <jhb@FreeBSD.org>  <><  http://www.FreeBSD.org/~jhb/
"Power Users Use the Power to Serve"  =  http://www.FreeBSD.org