From owner-freebsd-bugs Tue Jul 28 01:01:21 1998 Return-Path: Received: (from majordom@localhost) by hub.freebsd.org (8.8.8/8.8.8) id BAA26657 for freebsd-bugs-outgoing; Tue, 28 Jul 1998 01:01:21 -0700 (PDT) (envelope-from owner-freebsd-bugs@FreeBSD.ORG) Received: from freefall.freebsd.org (freefall.FreeBSD.ORG [204.216.27.21]) by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id BAA26644 for ; Tue, 28 Jul 1998 01:01:14 -0700 (PDT) (envelope-from gnats@FreeBSD.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.8.8/8.8.5) id BAA14228; Tue, 28 Jul 1998 01:00:01 -0700 (PDT) Received: from flea.best.net (root@flea.best.net [206.184.139.131]) by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id AAA25575 for ; Tue, 28 Jul 1998 00:51:48 -0700 (PDT) (envelope-from dillon@flea.best.net) Received: (from dillon@localhost) by flea.best.net (8.9.0/8.9.0/best.fl) id AAA15626; Tue, 28 Jul 1998 00:51:17 -0700 (PDT) Message-Id: <199807280751.AAA15626@flea.best.net> Date: Tue, 28 Jul 1998 00:51:17 -0700 (PDT) From: Matt Dillon Reply-To: dillon@best.net To: FreeBSD-gnats-submit@FreeBSD.ORG X-Send-Pr-Version: 3.2 Subject: kern/7424: biodone: buffer not busy panics Sender: owner-freebsd-bugs@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.org >Number: 7424 >Category: kern >Synopsis: Machine crashes do not occur very often, but when they do occur it's usually a panic on biodone: buffer not busy. >Confidential: no >Severity: serious >Priority: low >Responsible: freebsd-bugs >State: open >Quarter: >Keywords: >Date-Required: >Class: sw-bug >Submitter-Id: current-users >Arrival-Date: Tue Jul 28 01:00:01 PDT 1998 >Last-Modified: >Originator: Matt Dillon >Organization: Best Internet Communications >Release: FreeBSD 2.2.6-STABLE i386 >Environment: FreeBSD-stable from CVS (somewhere inbetween 2.2.6, 2.2.7). FreeBSD-current All of our boxes use Adaptec 2940UW boards and nearly all of them are single-cpu PPro-200 motherboards. Boot information is similar to as shown below. Tagged queueing is enabled. options AHC_TAGENABLE options AHC_ALLOW_MEMIO ahc0 rev 0 int a irq 11 on pci0:12:0 ahc0: aic7880 Wide Channel, SCSI Id=7, 16 SCBs ahc0 waiting for scsi devices to settle ahc0: target 0 Tagged Queuing Device (ahc0:0:0): "SEAGATE ST34371W 0484" type 0 fixed SCSI 2 sd0(ahc0:0:0): Direct-Access 4148MB (8496884 512 byte sectors) sd0(ahc0:0:0): with 5172 cyls, 10 heads, and an average 164 sectors/track ahc0: target 1 Tagged Queuing Device (ahc0:1:0): "SEAGATE ST19171W 0023" type 0 fixed SCSI 2 sd1(ahc0:1:0): Direct-Access 8683MB (17783112 512 byte sectors) sd1(ahc0:1:0): with 5268 cyls, 20 heads, and an average 168 sectors/track ahc0: target 2 Tagged Queuing Device (ahc0:2:0): "SEAGATE ST19171W 0023" type 0 fixed SCSI 2 sd2(ahc0:2:0): Direct-Access 8683MB (17783112 512 byte sectors) sd2(ahc0:2:0): with 5268 cyls, 20 heads, and an average 168 sectors/track >Description: I'm submitting this bug report even though I don't have a hard backtrace. Unfortuntely, as you can see, the nature of the panic generally precludes getting a dump. I figure it's good to keep the PR in the bug list. I've set the severity to to serious since it is a crash, but the priority to low because it only happens once a month per machine or so... but it's an important stability issue because 'biodone: buffer not busy' panics are responsible for most of the crashes we get these days. If it could be fixed, it would considerably increase machine reliability. Our -stable machines, around 40 of them, each tend to crash around once a month (so we get about a crash a day). i.e. they do not crash very often. But when they do, many of the crashes are due to biodone: buffer not busy panics. These crashes are sometimes preceded by kernel printf's relating to the SCSI subsystem. We have also seen this crash on our FreeBSD-current test box. -Matt (FROM CONSOLE LOGS) ahc0: WARNING no command for scb 4 (cmdcmplt) QOUTCNT == 8 panic: biodone: buffer not busy Debugger("panic") db> trace _Debugger(f0113258) at _Debugger+0x35 _panic(f012ebe9,f1adb080,f10a7c00,f3b51114,f1aabfd0) at _panic+0x5a _biodone(f3b51114,f1adb080,f10a7c00,1,f1adb080) at _biodone+0x30 _scsi_done(f1adb080,f1aafa60,f1ab0800,40000,f01db4e1) at _scsi_done+0x84 _ahc_done(f1ab0800,f1aafa60) at _ahc_done+0x155 _ahc_intr(f1ab0800,0,27,efbf0027,40000) at _ahc_intr+0x1c7 Xresume11() at Xresume11+0x2b --- interrupt, eip = 0xa5c4, esp = 0xefbffff0, ebp = 0xefbfb8b8 --- db> pani panic: from debugger dumping to dev 401, offset 786432 dump panic: biodone: buffer not busy dumping to dev 401, offset 786432 dump device not ready Automatic reboot in 15 seconds - press a key on the console to abort >How-To-Repeat: The panics cannot be deterministically reproduced, but occur around once a month per machine. >Fix: >Audit-Trail: >Unformatted: To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-bugs" in the body of the message