Date: Wed, 24 May 2017 18:33:07 -0400
From: Adam McDougall
To: royger@FreeBSD.org
Cc: stable@freebsd.org
Subject: Boot hang on Xen after r318347/(310418)

Hello,

Recently I made a new build of 11-STABLE but encountered a boot hang at this state:

http://www.egr.msu.edu/~mcdouga9/pics/r318347-smp-hang.png

It is easy to reproduce: I can just boot from any 11 or 12 ISO that contains the commit. I compiled various svn revisions to confirm that r318347 caused the issue and that r318346 is fine. With r318347 or later, including the latest 11-STABLE, the system will only boot with one virtual CPU in XenServer. With any more CPUs it hangs. I also tried a 12 kernel from head this afternoon and got the same hang.

I had this issue on XenServer 7 (Xen 4.7) and XenServer 6.5 (Xen 4.4). I did most of my testing on 7. I also did much of my testing with a GENERIC kernel to try to rule out kernel configuration mistakes. When it hangs, the performance monitoring in Xen tells me at least one CPU is pegged. r318674 boots fine on physical hardware without Xen involved.

Looking at r318347, which mentions EARLY_AP_STARTUP, and later seeing r318763, which enables EARLY_AP_STARTUP in GENERIC, I tried adding it to my kernel, but that turned the hang into a panic that occurs with any number of CPUs:

http://www.egr.msu.edu/~mcdouga9/pics/r318347-early-ap-startup-panic.png

I think I verified that this happens with EARLY_AP_STARTUP before r318347 too, so I'll assume it is a different problem.

I may need to do some experimentation to figure out how to get the console to pass through hotkeys so I can drop into a kernel debugger. I could also try modifying the kernel config if I can make it print information about the hang.

Is there anything else I can provide that might help? Would you prefer this be entered in a bugzilla report?

Thanks.