From owner-freebsd-current@FreeBSD.ORG Thu Sep 16 20:47:22 2004 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id B2E9F16A4CE for ; Thu, 16 Sep 2004 20:47:22 +0000 (GMT) Received: from postal1.es.net (postal1.es.net [198.128.3.205]) by mx1.FreeBSD.org (Postfix) with ESMTP id 9CE2443D1D for ; Thu, 16 Sep 2004 20:47:22 +0000 (GMT) (envelope-from oberman@es.net) Received: from ptavv.es.net ([198.128.4.29]) by postal1.es.net (Postal Node 1) with ESMTP (SSL) id IBA74465; Thu, 16 Sep 2004 13:47:22 -0700 Received: from ptavv (localhost [127.0.0.1]) by ptavv.es.net (Tachyon Server) with ESMTP id E59CB5D04; Thu, 16 Sep 2004 13:47:21 -0700 (PDT) To: Ariff Abdullah In-reply-to: Your message of "Fri, 17 Sep 2004 04:36:53 +0800." <20040917043653.419a8e0e.skywizard@MyBSD.org.my> Date: Thu, 16 Sep 2004 13:47:21 -0700 From: "Kevin Oberman" Message-Id: <20040916204721.E59CB5D04@ptavv.es.net> cc: daniel_k_eriksson@telia.com cc: sos@DeepCore.dk cc: current@freebsd.org Subject: Re: ad0: TIMEOUT - READ_DMA retrying (2 retries left) LBA=207594611 X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 16 Sep 2004 20:47:22 -0000 > Date: Fri, 17 Sep 2004 04:36:53 +0800 > From: Ariff Abdullah > > On Thu, 16 Sep 2004 13:13:42 -0700 > "Kevin Oberman" wrote: > > > From: "Daniel Eriksson" > > > Date: Thu, 16 Sep 2004 21:17:45 +0200 > > > Sender: owner-freebsd-current@freebsd.org > > > > > > > > > This is a me-too report: > > > > > > After upgrading a 6-CURRENT kernel and world from > > > 2004.09.09.08.00.00 to 2004.09.16.13.00.00, I am now getting these > > > messages on a machine that previously worked just fine: > > > > Thanks! This should greatly simplify my search for the change that > > is causing the problem. I can start with 9/9/04 and work forward. I > > saw the problem on 9/11/04 in RELENG_5, so I now have a window. I > > could shrink it to less than nothing if I was REALLY confident that > > nothing was MT5 in less than 3 days, so something the was moved to > > RELENG_5 after a very short time is suspect. > > > > I am currently building a 9/9/04 RELENG_5 kernel and I'll test it > > later today (if nobody beats me to it.) > > One more thing to consider, the default scheduler. I found that those > errors occured with SCHED_4BSD (PREEMPTION or NOT), while SCHED_ULE > (of course without PREEMPTION, or *else*), nothing such that. > > Note that this is *my* case, your mileage may vary. I can confirm I'm using 4BSD. If anyone is using ULE and seeing this EXACT problem, please holler? adX: TIMEOUT - READ_DMA retrying (2 retries left) LBA= or adX: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA= It should be differing only in the ad unit number and the specified LBA. -- R. Kevin Oberman, Network Engineer Energy Sciences Network (ESnet) Ernest O. Lawrence Berkeley National Laboratory (Berkeley Lab) E-mail: oberman@es.net Phone: +1 510 486-8634