From owner-freebsd-hackers@FreeBSD.ORG Thu Mar 29 16:27:42 2012 Return-Path: Delivered-To: freebsd-hackers@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 00D851065670; Thu, 29 Mar 2012 16:27:42 +0000 (UTC) (envelope-from feld@feld.me) Received: from feld.me (unknown [IPv6:2607:f4e0:100:300::2]) by mx1.freebsd.org (Postfix) with ESMTP id C9D508FC12; Thu, 29 Mar 2012 16:27:41 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=feld.me; s=blargle; h=In-Reply-To:Message-Id:From:Mime-Version:Date:References:Subject:Cc:To:Content-Type; bh=OgUiSRDFY6uvWK0DlzrRv2SWbHw2dnaKH4UPxJtCkRo=; b=Oq5W/har4j+fd6KPf4INeY+S8mqAkezW9PoLU2VtsdPPcQhNRKGFTITFzaqDfDxmSsaac5JlCIp39kS3V7NhzDRVPtkONaznlyAXFbVf3OFVT27NAyjDMbYxVIGflP54; Received: from localhost ([127.0.0.1] helo=mwi1.coffeenet.org) by feld.me with esmtp (Exim 4.77 (FreeBSD)) (envelope-from ) id 1SDICG-000IA5-9P; Thu, 29 Mar 2012 11:27:41 -0500 Received: from feld@feld.me by mwi1.coffeenet.org (Archiveopteryx 3.1.4) with esmtpa id 1333038455-20726-20725/5/20; Thu, 29 Mar 2012 16:27:35 +0000 Content-Type: text/plain; charset=utf-8; format=flowed; delsp=yes To: freebsd-hackers@freebsd.org, freebsd-questions@FreeBSD.org References: <201203291549.q2TFnUc7080406@aurora.sol.net> <201203291755.36651.hselasky@c2i.net> Date: Thu, 29 Mar 2012 11:27:35 -0500 Mime-Version: 1.0 From: Mark Felder Message-Id: In-Reply-To: <201203291755.36651.hselasky@c2i.net> User-Agent: Opera Mail/11.62 (FreeBSD) X-SA-Score: -1.5 Cc: Hans Petter Selasky Subject: Re: Please help me diagnose this crazy VMWare/FreeBSD 8.x crash X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 29 Mar 2012 16:27:42 -0000 On Thu, 29 Mar 2012 10:55:36 -0500, Hans Petter Selasky wrote: > > It almost sounds like the lost interrupt issue I've seen with USB EHCI > devices, though disk I/O should have a retry timeout? > > What does "wmstat -i" output? > > --HPS Here's a server that has a week uptime and is due for a crash any hour now: root@server:/# vmstat -i interrupt total rate irq1: atkbd0 34 0 irq6: fdc0 9 0 irq15: ata1 34 0 irq16: em1 778061 1 irq17: mpt0 19217711 31 irq18: em0 283674769 460 cpu0: timer 246571507 400 Total 550242125 892