Date: Thu, 19 May 2016 19:04:12 +0200
From: "O. Hartmann"
To: freebsd-current@freebsd.org
Subject: Re: boot broken on VMWare somewhere between r300069 and r300176
Message-ID: <20160519190412.68752f55.ohartman@zedat.fu-berlin.de>
Organization: FU Berlin

On Thu, 19 May 2016 14:04:59 +0300 Andriy Gapon wrote:

> On 19/05/2016 13:50, Boris Samorodov wrote:
> > 19.05.16 09:28, K. Macy wrote:
> >> I did an IFC on my drm-next-4.6 branch yesterday at r300069. I just
> >> did an IFC to r300176 and boot will hang right after printing out
> >> "setting hostid: ". ^T just shows sh [piperd]. ddb just shows the
> >> shell as hanging in piperead. Diffing between those two revisions I
> >> don't see any obvious offenders so I'm hoping that individuals who
> >> have committed in the last 24 hours will have some idea of their
> >> changes having such an impact.
> >
> > For me (BIOS boot at DELL notebook) is broken after jump
> > from r300062 to r300158. CapsLock works, but ^T shows nothing.
> > Here is a photo (sorry for the quality):
> > ftp://ftp.wart.ru/pub/misc/boot_broken.jpg
> >
> > Boot with r300062 works fine.
>
> A wild guess (not really), try to revert r300113
>

We updated about ten bare-metal systems of different ages and CPU generations to r300158.
All of them failed to boot. According to the kernel messages, most got stuck after the USB subsystem had been probed; some got stuck after the message about the generation of the UUID. Pressing the power button starts a clean shutdown, so the system still seems to be alive. Normally the power then turns off, but this time the box stays stuck at the uptime message.

On random reboots some of the boxes do boot, but then the disaster starts. The network is highly unstable and flaky: I can ping hosts and resolve their IP addresses via DNS, but I cannot log in via ssh, and the web services and databases (PostgreSQL) on the affected machines are inaccessible.

It gets more frustrating: I can't update or go back with svn (either /usr/bin/svn or /usr/local/bin/svn) in the source tree to get out of this mess. In all cases, svn times out. Accessing the web from clients running the broken CURRENT code is also a guessing game: sometimes a connection to a service can be established, sometimes it times out. On one box I could obtain only a small fragment of the code in /usr/src via "svn update -r 300005" (r300005 was, in my case, the starting point at which everything was up and running).

In short: reverting to r300113 isn't possible on most systems!

The problem appeared immediately after r300158 had been checked out and buildworld/buildkernel had been performed. Since different hardware, including different NICs, is affected, it cannot be narrowed down to a specific NIC, CPU type or other hardware. I boot via UEFI as well as BIOS, and the problem occurs with both.

That leads me to the question of whether the code committed to CURRENT gets tested or not. If there were a test, I guess the problem would have revealed itself immediately.

With svn not working as described, how am I supposed to revert to the supposedly working revision r300113?
Kind regards and thanks in advance for your suggestions,

O. Hartmann

P.S. I'm using IPFW on all systems. Disabling IPFW (ipfw disable firewall) seems to relieve the symptoms a bit, no matter whether custom scripts or the settings from rc.conf were used.