From owner-freebsd-amd64@FreeBSD.ORG Sun May 9 09:30:12 2004 Return-Path: Delivered-To: freebsd-amd64@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 0F2BC16A4CF for ; Sun, 9 May 2004 09:30:12 -0700 (PDT) Received: from transport.cksoft.de (transport.cksoft.de [62.111.66.27]) by mx1.FreeBSD.org (Postfix) with ESMTP id E6F4843D4C for ; Sun, 9 May 2004 09:30:10 -0700 (PDT) (envelope-from bzeeb-lists@lists.zabbadoz.net) Received: from transport.cksoft.de (localhost [127.0.0.1]) by transport.cksoft.de (Postfix) with ESMTP id DAC171FFDD6; Sun, 9 May 2004 18:30:08 +0200 (CEST) Received: by transport.cksoft.de (Postfix, from userid 66) id E04AE1FFDD4; Sun, 9 May 2004 18:30:06 +0200 (CEST) Received: by mail.int.zabbadoz.net (Postfix, from userid 1060) id 7FB73154F8; Sun, 9 May 2004 16:25:22 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by mail.int.zabbadoz.net (Postfix) with ESMTP id 74F55154E2; Sun, 9 May 2004 16:25:23 +0000 (UTC) Date: Sun, 9 May 2004 16:25:23 +0000 (UTC) From: "Bjoern A. Zeeb" X-X-Sender: bz@e0-0.zab2.int.zabbadoz.net To: Adriaan de Groot In-Reply-To: <200405091721.33399.adridg@cs.kun.nl> Message-ID: References: <20040302031226.GA670@xor.obsecurity.org> <4044297F.1080701@DeepCore.dk> <200405091721.33399.adridg@cs.kun.nl> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Virus-Scanned: by AMaViS cksoft-s20020300-20031204bz on transport.cksoft.de cc: freebsd-amd64@freebsd.org Subject: Re: NFS or ATA driver causes FS corruption? X-BeenThere: freebsd-amd64@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Porting FreeBSD to the AMD64 platform List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 09 May 2004 16:30:12 -0000 On Sun, 9 May 2004, Adriaan de Groot wrote: > On Saturday 08 May 2004 20:25, Bjoern A. Zeeb wrote: > > On Mon, 1 Mar 2004, it was written: > > > Kris Kennaway wrote: > > > > ad0: WARNING - WRITE_DMA interrupt was seen but timeout fired LBA=9440 > > > > ad0: WARNING - WRITE_DMA interrupt was seen but timeout fired LBA=20904 > > > > > > The above means that *something* is stomping on the taskqueue that > > > should take care of returning finished requests to the system (they are > > I see this (timeouts fired) as well on my Asus K8V with a single S-ATA disk; > it happens only when the machine is ridiculously loaded (like feeding a > 5500-message mbox file into sa-learn while doing two different make -j6 > compiles and also cvsupping the FBSD tree along with another 2G source repo). > Haven't noticed any averse effects, though - the machine chokes for 30 > seconds or so and then carries on. I have to hard reset the machine here :( I can reproduce it with not too much IO. copying sources; make buildworld, ... I tried to do everything mentioned in the errata for 5.2(.1) but nothing helped. It also happened with DMA disbaled (an d I thing it logged PIO errors at that time). I am now going to build a world from NFS src to NFS obj to not have ATA traffic and to slow down things so this will not happen (hopefully). [ obj dir could be oon the same machine I suspect but the installworld will then most likely make the machine go berserk again ]. Are you running Release or HEAD ? If this isn't fix in HEAD yet I am very interested in every patches/things to try to fix this. The machine in question should become my new in house server and I do not like another IO problem once I night when backup is running... -- Greetings Bjoern A. Zeeb bzeeb at Zabbadoz dot NeT 56 69 73 69 74 http://www.zabbadoz.net/