From owner-freebsd-threads@FreeBSD.ORG Thu Feb 12 22:59:12 2015 Return-Path: Delivered-To: freebsd-threads@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id BD7E747D; Thu, 12 Feb 2015 22:59:12 +0000 (UTC) Received: from thyme.infocus-llc.com (thyme.infocus-llc.com [199.15.120.10]) (using TLSv1 with cipher DHE-RSA-CAMELLIA256-SHA (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 9745BA30; Thu, 12 Feb 2015 22:59:12 +0000 (UTC) Received: from draco.over-yonder.net (c-75-65-60-66.hsd1.ms.comcast.net [75.65.60.66]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by thyme.infocus-llc.com (Postfix) with ESMTPSA id 96DA937B593; Thu, 12 Feb 2015 16:59:10 -0600 (CST) Received: by draco.over-yonder.net (Postfix, from userid 100) id 3kjtZ609VSz2Mm; Thu, 12 Feb 2015 16:59:10 -0600 (CST) Date: Thu, 12 Feb 2015 16:59:09 -0600 From: "Matthew D. Fuller" To: freebsd-threads@freebsd.org Subject: Troubles with BIND + r276630 (/r277317 MFC) Message-ID: <20150212225909.GB15127@over-yonder.net> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline X-Editor: vi X-OS: FreeBSD User-Agent: Mutt/1.5.23-fullermd.4 (2014-03-12) X-Virus-Scanned: clamav-milter 0.98.6 at thyme.infocus-llc.com X-Virus-Status: Clean Cc: kib@freebsd.org X-BeenThere: freebsd-threads@freebsd.org X-Mailman-Version: 2.1.18-1 Precedence: list List-Id: Threading on FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 12 Feb 2015 22:59:12 -0000 I've been setting up a stable/10 system (currently r278451, ~early this week), and had some disturbing troubles out of BIND (net/bind910, but also tested bind99). Sporadically, on restart, I get errors like Feb 12 10:42:37 dns named[4071]: dns_master_load: /usr/local/etc/namedb/master/empty.db:1: isc_lex_gettoken() failed: I/O error Feb 12 10:42:37 dns named[4071]: dns_master_load: /usr/local/etc/namedb/master/empty.db:1: I/O error Feb 12 10:42:37 dns named[4071]: zone 112.100.in-addr.arpa/IN: loading from master file /usr/local/etc/namedb/master/empty.db failed: I/O error Feb 12 10:42:37 dns named[4071]: zone 112.100.in-addr.arpa/IN: not loaded due to errors. and similar. Nothing wrong with the files, and it's wildly inconsistent. Easy to reproduce; just hammering 'service named restart' while tail -f'ing the messages log will show some startups going problem-free, while others will get random errors in random files. I can also reproduce it on my -CURRENT workstation (r278359, ~late last week), but not on several stable/10 systems of late-Dec vintage. In some testing on the -CURRENT box, if I build libc/libthr as of r276629 and do LD_LIBRARY_PATH hackery (and obviously not the 'service' wrapper) to use them, I can kick the reload without ever getting troubles (tried at least a hundred times). Going to 276630, or using the system libs, it'll happen sporadically (maybe once a dozen or so on average?). So far, I haven't seen anything odd in anything but BIND. I was running a mid-Jan -CURRENT before this for a couple weeks too, so there should have been time for Firefox to throw a hissy fit or something, but no hint of trouble. So it's not clear whether the problem is in the libc/libthr changes or in BIND 9.9 and 9.10, but certainly something between the two is unhappy... -- Matthew Fuller (MF4839) | fullermd@over-yonder.net Systems/Network Administrator | http://www.over-yonder.net/~fullermd/ On the Internet, nobody can hear you scream.