From owner-freebsd-current@FreeBSD.ORG Mon Jul 12 20:13:32 2004 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id C39EF16A4CE for ; Mon, 12 Jul 2004 20:13:32 +0000 (GMT) Received: from mail6.speakeasy.net (mail6.speakeasy.net [216.254.0.206]) by mx1.FreeBSD.org (Postfix) with ESMTP id 9472143D53 for ; Mon, 12 Jul 2004 20:13:32 +0000 (GMT) (envelope-from jhb@FreeBSD.org) Received: (qmail 26079 invoked from network); 12 Jul 2004 20:13:32 -0000 Received: from dsl027-160-063.atl1.dsl.speakeasy.net (HELO server.baldwin.cx) ([216.27.160.63]) (envelope-sender ) encrypted SMTP for ; 12 Jul 2004 20:13:31 -0000 Received: from 10.50.41.229 (gw1.twc.weather.com [216.133.140.1]) by server.baldwin.cx (8.12.11/8.12.11) with ESMTP id i6CKDRYG011082; Mon, 12 Jul 2004 16:13:28 -0400 (EDT) (envelope-from jhb@FreeBSD.org) From: John Baldwin To: "Willem Jan Withagen" Date: Mon, 12 Jul 2004 16:14:40 -0400 User-Agent: KMail/1.6 References: <20040712024044.GA24706@xor.obsecurity.org> <03b401c467e9$70d4c650$471b3dd4@digiware.nl> In-Reply-To: <03b401c467e9$70d4c650$471b3dd4@digiware.nl> MIME-Version: 1.0 Content-Disposition: inline Content-Type: text/plain; charset="windows-1252" Content-Transfer-Encoding: 7bit Message-Id: <200407121614.40864.jhb@FreeBSD.org> X-Spam-Checker-Version: SpamAssassin 2.63 (2004-01-11) on server.baldwin.cx cc: current@FreeBSD.org cc: Kris Kennaway Subject: Re: page fault/panic: mi_switch: switch in a critical section X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 12 Jul 2004 20:13:33 -0000 On Monday 12 July 2004 04:22 am, Willem Jan Withagen wrote: > From: "Kris Kennaway" > > > panic: page fault > > panic messages: > > --- > > Fatal trap 12: page fault while in kernel mode > > fault virtual address = 0x104 > > fault code = supervisor read, page not present > > instruction pointer = 0x8:0xc058a8cf > > stack pointer = 0x10:0xdcb34cc4 > > frame pointer = 0x10:0xdcb34cec > > code segment = base 0x0, limit 0xfffff, type 0x1b > > = DPL 0, pres 1, def32 1, gran 1 > > processor eflags = resume, IOPL = 0 > > current process = 50 (schedcpu) > > trap number = 12 > > panic: page fault > > > > syncing disks, buffers remaining... panic: mi_switch: switch in a > > critical > > section > > > addr2line says the panic was in kern/sched_4bsd.c:327 > > > > /* > > * The kse slptimes are not touched in > > wakeup * because the thread may not HAVE a KSE. */ > > if (ke->ke_state == KES_ONRUNQ) { > > awake = 1; > > ke->ke_flags &= ~KEF_DIDRUN; > > ---> } else if ((ke->ke_state == KES_THREAD) > > && (TD_IS_RUNNING(ke->ke_thread))) { awake = 1; > > > > gdb -k got confused and couldn't make anything out of the backtrace. > > That is only a few lines from what is bugging me already quite a few days > on amd64. Note that it only occurs with me when I'm running SMP. Disableing > ACPI also disables the second CPU, and the box stays up for as long as I > leave it on. I have not tried running with sched_bsd. > > panic: mi_switch: called by old code > cpuid = 1; > Stack backtrace: > backtrace() at backtrace+0x17 > panic() at panic+0x1d2 > mi_switch() at mi_switch+0xcf > maybe_preempt() at maybe_preempt+0xd0 > sched_add() at sched_add+0x2dd > kseq_assign() at kseq_assign+0x45 > sched_choose() at sched_choose+0x5b > choosethread() at choosethread+0x3d > sched_switch() at sched_switch+0x126 > mi_switch() at mi_switch+0x23b > ast() at ast+0x35f > Xfast_syscall() at Xfast_syscall+0xdd > --- syscall (0), rip = 0x20069afdc, rsp = 0x7fffffffe8a8, rbp = This should be fixed in HEAD as kseq_assign() now calls sched_add_internal() which no longer recursively mi_switch()'s. -- John Baldwin <>< http://www.FreeBSD.org/~jhb/ "Power Users Use the Power to Serve" = http://www.FreeBSD.org