From: "Daniel Eriksson"
To: "'Ariff Abdullah'"
Date: Thu, 16 Sep 2004 23:05:22 +0200
Subject: RE: ad0: TIMEOUT - READ_DMA retrying (2 retries left) LBA=207594611
In-Reply-To: <20040917043653.419a8e0e.skywizard@MyBSD.org.my>
List-Id: Discussions about the use of FreeBSD-current

Ariff Abdullah wrote:
> One more thing to consider, the default scheduler.
> I found that those
> errors occured with SCHED_4BSD (PREEMPTION or NOT), while SCHED_ULE
> (of course without PREEMPTION, or *else*), nothing such that.

I am using SCHED_4BSD without PREEMPTION. Given the recent stability
problems with ULE, I haven't even toyed with the idea of trying it, since
the machine in question is a production machine.

I'm curious whether this problem is in any way related to my other
problem: I cannot use ataraid and gstripe at the same time. If I start
gstripe I get an interrupt storm followed by timeouts that tear down the
ataraid arrays. Next time I feel lucky I'll try gstripe again (Pawel Jakub
Dawidek told me how to get some more debugging output from it). The only
reason I haven't done it yet is that the machine has been very nice and
stable for the last two weeks.

Which reminds me: there was an interrupt storm preceding the messages I
got. I missed copying that line in my original "me-too" message:

Interrupt storm detected on "irq9: acpi0"; throttling interrupt source
ad0: TIMEOUT - READ_DMA retrying (2 retries left) LBA=75822564
ad0: WARNING - READ_DMA no interrupt but good status
...

/Daniel Eriksson