From owner-freebsd-scsi@FreeBSD.ORG Wed Jan 8 06:44:40 2014 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 6EC297AA for ; Wed, 8 Jan 2014 06:44:40 +0000 (UTC) Received: from mail-qa0-x236.google.com (mail-qa0-x236.google.com [IPv6:2607:f8b0:400d:c00::236]) (using TLSv1 with cipher ECDHE-RSA-RC4-SHA (128/128 bits)) (No client certificate requested) by mx1.freebsd.org (Postfix) with ESMTPS id 2BD8A150A for ; Wed, 8 Jan 2014 06:44:40 +0000 (UTC) Received: by mail-qa0-f54.google.com with SMTP id j7so1497748qaq.27 for ; Tue, 07 Jan 2014 22:44:39 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date:message-id:subject :from:to:cc:content-type:content-transfer-encoding; bh=P8qBMFjvY6ft08jt+d3kwqcT+bLhrchwwifmwop+3Vo=; b=JHV6TbduIrUUO2CaYxaT9i85KwkzHsf3q4o46IrCV1u2ig2/RsmBXb8QYSeL7KJSDv Mc5D/ME3krNvvvM+NjgSy5ImfyT/DNhqXxmzv0aiVFmQA/c9Xt3wI6wZrO7Aq8XcgIiL itBEpKtqFeFMQv4fRCgeWwgPpNw77JmAaWGudoRe1SiwLKCKaQmQb48D94MKOK8Tk13m dMOKbjwuWoJp/Kj4n1Kh041VpakXpeMW7WmFY93LUW9zrJAWFiNpb0+T2VAuQ7lfw9cN 3+oQ6YjwIrmGPeaNvi3AfjXRs+KZ2WcPOu+I3hPnQo7Bg88VnlOeddiLP7ByGfoTxqZK Om5w== MIME-Version: 1.0 X-Received: by 10.49.59.83 with SMTP id x19mr10837717qeq.47.1389163479422; Tue, 07 Jan 2014 22:44:39 -0800 (PST) Sender: benlaurie@gmail.com Received: by 10.96.72.196 with HTTP; Tue, 7 Jan 2014 22:44:39 -0800 (PST) In-Reply-To: <84D23688-DDC6-421E-9D21-3DA646229038@scsiguy.com> References: <84D23688-DDC6-421E-9D21-3DA646229038@scsiguy.com> Date: Wed, 8 Jan 2014 06:44:39 +0000 X-Google-Sender-Auth: 56hwxsa3MWCOd786M4ZTICXNaR4 Message-ID: Subject: Re: Dropped interrupts From: Ben Laurie To: "Justin T. Gibbs" Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Cc: freebsd-scsi@freebsd.org X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.17 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 08 Jan 2014 06:44:40 -0000 On 7 January 2014 18:11, Justin T. Gibbs wrote: > On Jan 7, 2014, at 12:36 AM, Ben Laurie wrote: > >> Attached. >> >> On 7 January 2014 05:46, Justin T. Gibbs wrote: >>> On Jan 6, 2014, at 3:01 PM, Ben Laurie wrote: >>> >>>> Not subscribed to the list, so please cc on replies. >>>> >>>> I'm using Bacula with an LTO-2 SCSI drive. >>>> >>>> With increasing frequency lately, I've been getting errors like this >>>> from bacula: >>>> >>>> backup-sd JobId 13092: Error: block.c:608 Write error at 23:6772 on >>>> device "Ultrium" (/dev/nsa0). ERR=3DOperation not permitted. >>>> >>>> Associated with this, I see in dmesg: >>>> >>>> ahc0: Recovery Initiated >>>> >>>> [a lot of dump info, including=85] >>> >>> If you provide the dump info, I may be able to tell you why recovery is= starting. >>> >>> The dmesg information from a boot of the system would be good to have t= oo. >>> >>> =97 >>> Justin > > The target is keeping us in command phase for some reason. No parity or = other > errors are being reported. My guess is that the tape drive does not like= the command > that was issued for some reason. > > Attached are two totally untested/uncompiled changes for you to try out. = The first > should give more information about the command that timed out so we can b= etter > determine if it is well formed. The second is an attempted fix for spuri= ous > =93Interrupts may not be functioning=94 warnings. Can you attempt to rep= licate this > again with these changes? Rebuilding now - you had a ; missing in the patch :-) Of course, now I've done this, it'll not fail for a month (its been failing multiple times per day recently, but on average its a lot rarer than that!). Will let you know when I get a fresh failure.