Date: Fri, 10 Feb 2006 09:03:50 +0100
From: Wilko Bulte
To: Søren Schmidt
Cc: stable@freebsd.org
Subject: Re: Showstopper ATA bug in 6.1-PRE?
Message-ID: <20060210080350.GA5978@freebie.xs4all.nl>
In-Reply-To: <20060209220824.GA1499@freebie.xs4all.nl>

On Thu, Feb 09, 2006 at 11:08:24PM +0100, Wilko Bulte wrote..
> On Thu, Feb 09, 2006 at 09:24:23PM +0100, Søren Schmidt wrote..
> > Wilko Bulte wrote:
> > >On Thu, Feb 09, 2006 at 03:45:53PM +0100, Søren Schmidt wrote..
> > >>Wilko Bulte wrote:
> > >>>On Thu, Feb 09, 2006 at 03:37:07PM +0100, Søren Schmidt wrote..
> > >>>>Wilko Bulte wrote:
> > >>>>>On Wed, Feb 08, 2006 at 10:44:05PM +0100, Søren Schmidt wrote..
> > >>>>>>Wilko Bulte wrote:
> > >>>>>>>On Wed, Feb 08, 2006 at 10:02:08PM +0100, Søren Schmidt wrote..
> > >>>>>>>>Wilko Bulte wrote:
> > >>>>>>>>>Hi Soren,
> > >>>>>>>>>
> > >>>>>>>>>I just went to 6.1-PRE on my main machine, coming from 6.0-STABLE
> > >>>>>>>>>of roughly end of december.
> > >>>>>>>>>
> > >>>>>>>>>And I hit some stuff that really worries me:
> > >>>>>>>>>
> > >>>>>>>>>- the freshly built kernel keels over with (hand transcribed):
> > >>>>>>>>>
> > >>>>>>>>>ata3: reiniting channel SATA connect ...
> > >>>>>>>>>SATA connected
> > >>>>>>>>>sata_connect_devices 0x1
> > >>>>>>>>>
> > >>>>>>>>>ad6: req=0xC35ba0c8 SETFEATURES SETTRANSFERMODE semaphore timeout
> > >>>>>>>>>!! DANGER Will RObinson !!
> > >>>>>>>>>
> > >>>>>>>>>(... is where I cannot read my own handwriting, it scrolled quite
> > >>>>>>>>>fast on the screen..)
> > >>>>>>>>>
> > >>>>>>>>>Boot device is a SATA RAID1 on a Promise 2300.
> > >>>>>>>>Hmm, that should not happen. Could you try to backstep just ATA to
> > >>>>>>>>before the MFC, that is 24/1/06 and let me know if that helps
> > >>>>>>>>please ?
> > >>>>>>>First impression is that the problem is gone. None of the
> > >>>>>>>previously reported errors are seen. I am running a level 0 dump
> > >>>>>>>from disk to disk to see if the box remains stable. Given that
> > >>>>>>>this is my primary machine I sure hope it will be :-)
> > >>>>>>>
> > >>>>>>>>>Another snag is that my ad10 disk on 6.0-STABLE suddenly became
> > >>>>>>>>>ad12 on 6.1-PRE
> > >>>>>>>>Hmm that is because there is only 2 ports on your promise which is
> > >>>>>>>>now correctly identified, before it was erroneously found as 3 ports.
> > >>>>>>>Ah, OK. I would suggest a note to the Release Note writers would be
> > >>>>>>>a good thing, devices changing location after an upgrade in the
> > >>>>>>>-stable branch is unnerving ;-)
> > >>>>>>Well, the good thing is that I can reproduce the error here, the bad
> > >>>>>>thing is that it slipped through testing on -current...
> > >>>>>>Oh, well, I'll look into it ASAP...
> > >>>>>Thank you Soren!
> > >>>>OK, had a few this afternoon, could you try this patch and let me know
> > >>>>if it helps, at least it makes the problem go away on my testbed..
> > >>>Is this relative to HEAD or RELENG_6? I cannot / will not go to HEAD
> > >>>with this machine (my main production box.. :-)
> > >>Doesn't matter, ATA is the same on both...
> > >
> > >OK, I was not sure if they were 100% identical.
> > >
> > >The patch at first impression seems to have eliminated the problem.
> >
> > Good, seems I'm on the right track at least.
> >
> > >Interestingly enough ad10 remained ad10 with the patch applied?
> >
> > Yeah, that's intentional, I thought we better not break POLA here..
>
> I agree :-)
>
> > >I'll put some load on to see what happens.
> >
> > Let me know how that turns out, I'll clean things up a bit and get it
> > committed to -current, then get permission to MFC when we are sure it
> > fixes the problem...
>
> I ran a 44GB disk-to-disk dump without incidents (source on the RAID1,
> target on the JBOD). No problems whatsoever.
>
> Looks like things behave much better now. Tonight the machine will
> run a daily full dump to DLT tape, I'll know how that turns out tomorrow.

Backup ran without problems.

-- 
Wilko Bulte				wilko@FreeBSD.org