From owner-freebsd-stable@FreeBSD.ORG Tue Nov 11 18:14:07 2003 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 91A8616A4CE for ; Tue, 11 Nov 2003 18:14:07 -0800 (PST) Received: from carver.gumbysoft.com (carver.gumbysoft.com [66.220.23.50]) by mx1.FreeBSD.org (Postfix) with ESMTP id C4E1B43F75 for ; Tue, 11 Nov 2003 18:14:06 -0800 (PST) (envelope-from dwhite@gumbysoft.com) Received: by carver.gumbysoft.com (Postfix, from userid 1000) id B273972DB5; Tue, 11 Nov 2003 18:14:06 -0800 (PST) Received: from localhost (localhost [127.0.0.1]) by carver.gumbysoft.com (Postfix) with ESMTP id B071C72DA8; Tue, 11 Nov 2003 18:14:06 -0800 (PST) Date: Tue, 11 Nov 2003 18:14:06 -0800 (PST) From: Doug White To: Lefteris Tsintjelis In-Reply-To: <3FAE5958.6B162FAF@ene.asda.gr> Message-ID: <20031111180355.A66327@carver.gumbysoft.com> References: <3FAE5958.6B162FAF@ene.asda.gr> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII cc: stable@freebsd.org Subject: Re: Machine Freezes with SMP+APIC enabled. X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 12 Nov 2003 02:14:07 -0000 On Sun, 9 Nov 2003, Lefteris Tsintjelis wrote: > I have been experiencing random machine freezes when using SMP (Hyper > threading) the past week. There are no core dumps or error messages > displayed anywhere. Load is very minimum. When SMP/Hyperthreading is > disabled machine works with no problems at full load. Hm, smells like Giant deadlock. dwhite's Form Letter on Debugging Giant Deadlocks If you are experiencing problems with CURRENT locking up hard, it may be due to a deadlock against the Giant mutex, which controls large parts of the kernel. Symptoms include: . No response to any input . System video console . Network (ping) To debug this, you will need to set up a serial console with some special kernel options. Instructions for booting with serial console are in the Handbook, but you will have to compile with the following kernel options: options DDB options BREAK_TO_DEBUGGER options WITNESS options INVARIANTS options INVARIANTS_SUPPORT Make sure your serial console is capable of sending a Break signal. If not, use "ALT_BREAK_TO_DEBUGGER" instead of "BREAK_TO_DEBUGGER". Enable the serial console and boot the system. Turn on terminal logging. In loader, stop the boot and type "boot -v" at the OK prompt to get additional info during the boot process. Once the system is up, trigger the hang. When the system hangs, issue the Break signal (or if you have used ALT_BREAK_TO_DEBUGGER, press Enter ~ ^E b (tilde, Ctrl-E, b)). If you get the db> prompt, then your hang is probably due to a Giant deadlock. If not, then something else may be at fault. Once in db>, run the following two commands and capture their output using your terminal's logging capability: show locks tr Take these and the boot -v output, put them on a webpage, and send a message to current@freebsd.org carefully explaining what you did to trigger the hang. Good luck! > > Best, > Lefteris Tsintjelis > > # dmesg > Copyright (c) 1992-2003 The FreeBSD Project. > Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 > The Regents of the University of California. All rights reserved. > FreeBSD 4.9-STABLE #0: Sat Nov 8 01:02:24 EET 2003 > ASUS@P4T533-C:/usr/obj/usr/src/sys/CUSTOM-SMP > Timecounter "i8254" frequency 1193182 Hz > CPU: Intel(R) Pentium(R) 4 CPU 3.06GHz (3073.65-MHz 686-class CPU) > Origin = "GenuineIntel" Id = 0xf27 Stepping = 7 > Features=0xbfebfbff > Hyperthreading: 2 logical CPUs > real memory = 536788992 (524208K bytes) > avail memory = 517226496 (505104K bytes) > Programming 24 pins in IOAPIC #0 > IOAPIC #0 intpin 2 -> irq 0 > FreeBSD/SMP: Multiprocessor motherboard: 2 CPUs > cpu0 (BSP): apic id: 0, version: 0x00050014, at 0xfee00000 > cpu1 (AP): apic id: 1, version: 0x00050014, at 0xfee00000 > io0 (APIC): apic id: 4, version: 0x00178020, at 0xfec00000 > Preloaded elf kernel "kernel" at 0xc04da000. > Preloaded elf module "agp.ko" at 0xc04da09c. > Warning: Pentium 4 CPU: PSE disabled > VESA: v3.0, 131072k memory, flags:0x1, mode table:0xc04308c2 (1000022) > VESA: NVidia > Pentium Pro MTRR support enabled > md0: Malloc disk > Using $PIR table, 9 entries at 0xc00f1be0 > npx0: on motherboard > npx0: INT 16 interface > pcib0: on motherboard > IOAPIC #0 intpin 19 -> irq 2 > IOAPIC #0 intpin 23 -> irq 5 > IOAPIC #0 intpin 17 -> irq 9 > pci0: on pcib0 > agp0: mem 0xf8000000-0xfbffffff at device 0.0 on pci0 > pcib1: at device 1.0 on pci0 > IOAPIC #0 intpin 16 -> irq 10 > pci1: on pcib1 > pci1: at 0.0 irq 10 > pcib2: at device 30.0 on pci0 > IOAPIC #0 intpin 21 -> irq 11 > IOAPIC #0 intpin 22 -> irq 16 > pci2: on pcib2 > ohci0: mem 0xed800000-0xed800fff irq 11 at device 4.0 on pci2 > usb0: OHCI version 1.0 > usb0: on ohci0 > usb0: USB revision 1.0 > uhub0: NEC OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub0: 3 ports with 3 removable, self powered > ohci1: mem 0xed000000-0xed000fff irq 16 at device 4.1 on pci2 > usb1: OHCI version 1.0 > usb1: on ohci1 > usb1: USB revision 1.0 > uhub1: NEC OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub1: 2 ports with 2 removable, self powered > pci2: at 4.2 irq 5 > ahc0: port 0xd800-0xd8ff mem 0xec000000-0xec000fff irq 16 at device 10.0 on pci2 > aic7892: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs > fxp0: port 0xd400-0xd43f mem 0xeb000000-0xeb0fffff,0xeb800000-0xeb800fff irq 5 at device 11.0 on pci2 > fxp0: Ethernet address 00:d0:b7:23:19:8c > inphy0: on miibus0 > inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto > isab0: at device 31.0 on pci0 > isa0: on isab0 > atapci0: port 0xb800-0xb80f at device 31.1 on pci0 > ata0: at 0x1f0 irq 14 on atapci0 > ata1: at 0x170 irq 15 on atapci0 > uhci0: port 0xb400-0xb41f irq 2 at device 31.2 on pci0 > usb2: on uhci0 > usb2: USB revision 1.0 > uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub2: 2 ports with 2 removable, self powered > uhci1: port 0xb000-0xb01f irq 5 at device 31.4 on pci0 > usb3: on uhci1 > usb3: USB revision 1.0 > uhub3: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub3: 2 ports with 2 removable, self powered > pcm0: port 0xa400-0xa43f,0xa800-0xa8ff irq 9 at device 31.5 on pci0 > pcm0: > orm0: