From: Mark Saad
Date: Tue, 31 Jul 2012 20:50:23 -0400
To: Julian Elischer
Cc: "freebsd-hackers@freebsd.org"
Subject: Re: How to diagnose system freezes?
References: <501871FD.601@rawbw.com> <50187853.7080206@freebsd.org>
In-Reply-To: <50187853.7080206@freebsd.org>

On Jul 31, 2012, at 8:29 PM, Julian Elischer wrote:

> On 7/31/12 5:02 PM, Yuri wrote:
>> One of my 9.1-BETA1 systems periodically freezes. If sound was playing, it would usually cycle with a very short period. And system stops being sensitive to keyboard/mouse. Also ping of this system doesn't get a response.
>> I would normally think that this is the faulty memory. But memory was recently replaced and tested with memtest+ for hours both before and after freezes and it passes all tests.
>> One out of the ordinary thing that is running on this system is nvidia driver. But the freezes happen even when there is no graphics activity.
>> Another out of the ordinary thing is that the kernel is built for DTrace. But DTrace was never used in the sessions that had a freeze.
>>
>> What is the way to diagnose this problem?
> The answer depends on a number of things but an NMI can be useful if you have some way of
> generating them.
> (some IPMI implementations can allow you to generate them and some motherboards have
> jumpers to allow you to attach a 'nmi-button'.)
>
> The fact that ping is not responsive is important, as that is done at a very low level but
> it may still be alive down there somewhere.
>
> Make sure you have debugging enabled in your kernel. That will catch quite a few 'hangs'.
>
> as also mentioned by others... a serial console and DDB may also be useful in some hangs.
>
>
> Julian
>> CPU: i7 CPU 920 @ 2.67GHz
>> Memory: 24GB
>> MB: P2T
>>
>> Yuri
>>

Yuri,

Install sysutils/mcelog and try running the example included. While it is not a complete, definitive hardware test, it can report other hardware issues that memtest86+ misses, and it can be run online, in multiuser mode, and via cron.

---
Mark saad | mark.saad@longcount.org
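
For reference, the "debugging enabled in your kernel" and "serial console and DDB" suggestions quoted above could translate into a custom kernel configuration roughly like the one below. This is only a minimal sketch, assuming a FreeBSD 9.x custom kernel config file; the particular selection of options is an assumption and was not spelled out in the thread.

```
# Hypothetical additions to a custom kernel config (e.g. a copy of GENERIC).
# The option selection is an assumption, not taken from the thread.
options 	KDB			# kernel debugger framework
options 	DDB			# interactive in-kernel debugger
options 	BREAK_TO_DEBUGGER	# a BREAK on the serial console enters the debugger
options 	ALT_BREAK_TO_DEBUGGER	# CR ~ ^B on the serial console also enters it
options 	INVARIANTS		# extra run-time consistency checks
options 	INVARIANT_SUPPORT	# required by INVARIANTS
options 	WITNESS			# lock order verification
options 	WITNESS_SKIPSPIN	# skip spin mutexes to reduce WITNESS overhead
options 	DEADLKRES		# deadlock resolver, panics on long-blocked threads
```

With KDB and DDB compiled in, an NMI should normally drop the machine into the debugger (the machdep.kdb_on_nmi sysctl controls this). Note that WITNESS and INVARIANTS slow the system down noticeably, so they are probably best kept only for the duration of the hunt.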