From: Bryan Drewery
To: David Wolfskill, "current@freebsd.org"
Cc: sjg@freebsd.org
Subject: Re: Apparent race in buildworld (head/amd64, r322214 -> r322304)
Date: Wed, 9 Aug 2017 12:22:20 -0700
Organization: FreeBSD
Message-ID: <2d6ecd49-2bcc-0c24-8854-63079c0eef6b@FreeBSD.org>
In-Reply-To: <20170809175724.GC1244@albert.catwhisker.org>
References: <20170809120436.GY1244@albert.catwhisker.org> <689a8aa1-c8a3-a8e3-bc01-8bec5c212b41@FreeBSD.org> <20170809175724.GC1244@albert.catwhisker.org>
List-Id: Discussions about the use of FreeBSD-current

> /usr/obj/usr/src/tmp/usr/bin/ld: cannot find -lgcc_s
On 8/9/2017 10:57 AM, David Wolfskill wrote:
> On Wed, Aug 09, 2017 at 10:49:04AM -0700, Bryan Drewery wrote:
>> ...
>>> on one machine, but the other never had an issue.  On the "failing" one,
>>> a re-start of the buildworld completed (apparently) successfully.
>>
>> Yeah, I've gotten reports of this one for years.  I fixed a few problems
>> with it in the past but something else must have crept in.
>
> Or I just got "lucky." :-)
>
>> I don't believe it is related to META_MODE though.
>
> Fair enough; I pointed it out just in case it might be relevant.  (I try
> to avoid hiding possibly-relevant information when I'm trying to work
> with someone to solve a problem.  I know that's weird, but... :-} )
>
>> The last time I fixed this (AFAIK) it was related to an early error
>> being ignored.  I'll review your log to see if I can find anything like
>> that.
>
> Cool.  FWIW, the scheduler will see 8 cores on each machine, so the
> "make buildworld" will have been "make -j16 buildworld" (on each).
>
>> ....
>

This should fix it:
https://people.freebsd.org/~bdrewery/patches/gcc_s-install-race.diff

The problem has consistently been, from your reports, that gcc_s is
being installed to WORLDTMP *while* something is trying to link to it.
> --- gnu/lib/libgcc__L ---
> Building /common/S4/obj/usr/src/world32/usr/src/gnu/lib/libgcc/_libinstall
> --- kerberos5/lib/libhx509__L ---
> Building /common/S4/obj/usr/src/world32/usr/src/kerberos5/lib/libhx509/keyset.So
> --- secure/lib/libssl__L ---
> /usr/obj/usr/src/tmp/usr/bin/ld: cannot find -lgcc_s
>
>
> Building /common/S3/obj/usr/src/world32/usr/src/gnu/lib/libgcc/_libinstall
> --- lib/ncurses/ncursesw__L ---
> Building /common/S3/obj/usr/src/world32/usr/src/lib/ncurses/ncursesw/nc_panel.po
> --- lib/ncurses/ncurses__L ---
> Building /common/S3/obj/usr/src/world32/usr/src/lib/ncurses/ncurses/comp_parse.po
> --- lib/ncurses/ncursesw__L ---
> Building /common/S3/obj/usr/src/world32/usr/src/lib/ncurses/ncursesw/resizeterm.po
> --- lib/libc++__L ---
> /usr/obj/usr/src/tmp/usr/bin/ld: cannot find -lgcc_s
>
> --- lib/libgcc_s__L ---
> Building /common/S4/obj/usr/src/world32/usr/src/lib/libgcc_s/_libinstall
> --- kerberos5/lib/libwind__L ---
> --- obj ---
> --- secure/lib/libcrypto__L ---
> --- all_subdir_secure/lib/libcrypto/engines/libatalla ---
> /usr/obj/usr/src/tmp/usr/bin/ld: cannot find -lgcc_s
> cc: error: linker command failed with exit code 1 (use -v to see invocation)
> --- all_subdir_secure/lib/libcrypto/engines/libsureware ---
> /usr/obj/usr/src/tmp/usr/bin/ld: cannot find -lgcc_s

By default, 'install' unlinks the existing file and then copies over the
new file, so there is a window in which the library does not exist.
With PRECIOUSLIB set we get the -S flag to install, which makes the
installation atomic.

Note that the patch is not what I will commit.  At Isilon we changed our
install to always use -S for library installation, but not to force schg
on.  I am considering making that change the default, to use -S for all
libraries.
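To make the difference between the two install behaviours concrete, here
is a minimal Python sketch.  It is only an illustration of the idea, not
the install(1) source and not the patch; the function names and the
".inst." temporary-file prefix are hypothetical.  The first function
mimics the default unlink-then-copy behaviour, the second mimics a safe
copy that writes a temporary file in the target directory and renames it
over the target.

```python
import os
import shutil
import tempfile


def install_unlink_then_copy(src, dst):
    """Default-style install (illustrative): remove dst, then copy src to it."""
    if os.path.exists(dst):
        os.unlink(dst)
    # Race window: until the copy below finishes, a concurrent linker that
    # looks for dst finds either no file or a partially written one.
    shutil.copyfile(src, dst)


def install_safe_copy(src, dst):
    """Safe-copy-style install (illustrative): copy to a temp file, then rename."""
    dst_dir = os.path.dirname(os.path.abspath(dst))
    # The temporary file must live in the same directory (same filesystem)
    # as dst so that the final rename is a single atomic operation.
    fd, tmp = tempfile.mkstemp(dir=dst_dir, prefix=".inst.")
    try:
        with os.fdopen(fd, "wb") as out, open(src, "rb") as inp:
            shutil.copyfileobj(inp, out)
            out.flush()
            os.fsync(out.fileno())
        # os.replace() maps to rename(2) on POSIX: readers see either the
        # old file or the complete new one, never a missing file.
        os.replace(tmp, dst)
    except BaseException:
        # Leave the existing target untouched if anything went wrong.
        if os.path.exists(tmp):
            os.unlink(tmp)
        raise
```

With the second approach a parallel link step that opens the library at
any moment still finds a complete file, which is the property the -S
flag is being relied on for here.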
-- 
Regards,
Bryan Drewery