Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 10 Jan 2013 16:48:18 +0100
From:      "O. Hartmann" <ohartman@zedat.fu-berlin.de>
To:        Gleb Smirnoff <glebius@FreeBSD.org>
Cc:        freebsd-current@FreeBSD.org
Subject:   Re: loopback interface broken on current
Message-ID:  <50EEE2C2.8050404@zedat.fu-berlin.de>
In-Reply-To: <20130109085907.GJ66284@glebius.int.ru>
References:  <201301011040.r01Ae37A043153@pozo.com> <20130101194803.GB25661@glebius.int.ru> <201301012042.r01Kgq6E001548@pozo.com> <20130102201330.GC25661@glebius.int.ru> <201301022325.r02NPKEE076633@pozo.com> <89160.1357182550@pf2.ed.niigata-u.ac.jp> <201301030355.r033t4aI001542@pozo.com> <90299.1357212618@pf2.ed.niigata-u.ac.jp> <20130104094532.4a921e47@ernst.jennejohn.org> <50ED30B9.7040101@zedat.fu-berlin.de> <20130109085907.GJ66284@glebius.int.ru>

next in thread | previous in thread | raw e-mail | index | archive | help
This is an OpenPGP/MIME signed message (RFC 2440 and 3156)
--------------enig27CE62C2833F3C0E9520B7A6
Content-Type: text/plain; charset=KOI8-R
Content-Transfer-Encoding: quoted-printable

Am 01/09/13 09:59, schrieb Gleb Smirnoff:
> On Wed, Jan 09, 2013 at 09:56:25AM +0100, Hartmann, O. wrote:
> H> Same here.
> H> The OS: FreeBSD 10.0-CURRENT/amd64 r245218. Since I have three boxes=

> H> running approximately the same configurations (I share my configs
> H> between lab and home), but different hardware, I'm confused.
> H>=20
> H> The symptoms in my case are:
> H>=20
> H> Booting the box is all right until it comes to the start of nfsuserd=
=2E
> H> Prior to that, ntp adjusts the clock properly with an external time
> H> server  - so this implies network connectivity. Start of nfsuserd is=

> H> stuck forever.
> H> Interrupting the start of nfsuserd restarts several other services, =
but
> H> winbindd and slapd (OpenLDAP) get stuck again. In case I also interr=
upt
> H> them, there are other services which will not start.
> H>=20
> H> Trying to login as root on the console fails - I never get a passwor=
d
> H> tag after having issued the root login name. Since this machine is b=
ound
> H> to a local and remote OpenLDAP backend, I'm used to have an emergenc=
y
> H> local user which usually works - but this time, neither root nor thi=
s
> H> user can login!
> H>=20
> H> Bringing up the box in SINGLEUSER allows me to login. Investigating =
the
> H> network interfaces with ifconfig reveals, that the loopback did not =
get
> H> assigned to any inet 127.0.0.1 address. Sometimes there is only inet=
6
> H> linklocal address, some nd6 options, but sometimes even IPv6 assignm=
ents
> H> do not show up.
> H>=20
> H> In a desperate move I tried to recompile a kernel. In /etc/src.conf,=
 I
> H> recompile also the kernel module for the most recent virtual-box ker=
nel
> H> module. While the kernel and module (*.ko) get installed properly, t=
he
> H> recompilation of the VirtualBox port gets stuck when the system unfo=
lds
> H> the source tarball. Hitting Ctrl-T say "sbwait" for the process. Oth=
er
> H> processes seem to have trouble getting a proper ownership or UID for=
 a
> H> file - this is my naiv interpretation what I see at the surface.
> H>=20
> H> The funny thing is, that after several reboots, the box gets up as n=
ormal.
> H>=20
> H> I revealed this issue approx. two weeks ago when out of the sudden t=
he
> H> amd automounter stopped working and the NFSv4 network drives didn't
> H> attach properly and made the whole box being stuck.
> H>=20
> H> Sorry for the more superficial description of the problem ...
> H>=20
> H> Has the problem been identified? Is there a solution? Since it affec=
ts
> H> only my very modern hardware (i7-3930, 32GB RAM, ASUS P9X79 WS
> H> mainboard), while a very same setup on older hardware (our local ser=
ver
> H> is Intel Q6600 with 8GB RAM and and oldish Intel P45-chipset based
> H> mainboard), both systems do have Intel NICs, I'm a bit confused.
>=20
> This looks unrelated to the problem discussed, because r245218 is
> later than r244989 which backed out my change.
>=20
> Can you do a binary search to identify which revision broke things?
>=20

Sorry for the delay.

Today I realized that the problems occured and described are due to the
fact that I use jumbo frames (mtu 9100 or mtu 6150) on the em0-device
(em0@pci0:0:25:0:        class=3D0x020000 card=3D0x844e1043 chip=3D0x1503=
8086
rev=3D0x05 hdr=3D0x00
    vendor     =3D 'Intel Corporation'
    device     =3D '82579V Gigabit Network Connection'
    class      =3D network
    subclass   =3D ethernet
). Using the default of mtu 1500 does not make the problem occur.

My time constraints disallow to do further or deeper investigations - at
least until the end of next week, I'm sorry. The problems arose around
Christmas, it might be even earlier, since I didn't access the machine
since 10th December.

I have other boxes with Intel NICs - different chiptypes. They do not
show this problem.

Oliver


--------------enig27CE62C2833F3C0E9520B7A6
Content-Type: application/pgp-signature; name="signature.asc"
Content-Description: OpenPGP digital signature
Content-Disposition: attachment; filename="signature.asc"

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.19 (FreeBSD)

iQEcBAEBAgAGBQJQ7uLKAAoJEOgBcD7A/5N8+B0H/AyeT0dNahSyuCbp5OiEWTyH
LfKccvosiplls/sIeqeV5Wp6GlF/2PqDHO7mhXM/c/i07dXZpGdUttYxZbfEmYV9
/8DcVhNEYpYCyzyo1a0xIrMv4niyl+sZhdyla0hGbr0jaPOlu9KLTsDQH0E0EvmB
UfAlhuAeZA8VcCoc4H/q31DVU83OIy2sey9KvCThEObvN7M8deNSf3qlkvuVMNUZ
kF9fNy4cKujPh7GeAebyvAid9n27vQH6L9y1PSC+aPmp0TUXU3vULY66uLIv5SUW
KWzYyF7JzdDGy09TJjvtrgV6hhHvdkuYeGm4QvZvFTc7HEPAUDfwtI0NkthKBGc=
=xAVw
-----END PGP SIGNATURE-----

--------------enig27CE62C2833F3C0E9520B7A6--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?50EEE2C2.8050404>