Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 3 Dec 2001 10:43:35 +0000
From:      Josh Paetzel <friar_josh@webwarrior.net>
To:        Tim <timmcm@optushome.com.au>
Cc:        freebsd-questions@FreeBSD.ORG
Subject:   Re: strange kernel panics under high disk i/o - ffs_blkfree
Message-ID:  <20011203104335.A446@twincat.vladsempire.net>
In-Reply-To: <EFEMJNJMJJDEJPHJFNAAMEDMGHAA.timmcm@optushome.com.au>; from timmcm@optushome.com.au on Mon, Dec 03, 2001 at 08:59:59PM %2B1100
References:  <EFEMJNJMJJDEJPHJFNAAMEDMGHAA.timmcm@optushome.com.au>

next in thread | previous in thread | raw e-mail | index | archive | help
On Mon, Dec 03, 2001 at 08:59:59PM +1100, Tim wrote:
> Hi All,
> 
> Im wondering whether any one has got a solution to this problem
> I have found plenty of info about it in google groups etc.. but cant seem to
> find a solid fix
> 
> We are running FreeBSD 4.3 with the following hardware:
> 
> Gigabyte  GA-6EXDR MB (with on board LSI SCSI card disabled in the bios for
> FreeBSD incompatability problems)
> Adaptec 29160 SCSI card
> 3 X Seagate 73 GIG SCSI ST373405LC HDD
> All in a 1ru chasis
> 
> 
> and get these errors
> 
> dev = #da/0x20004, block = 43584, fs = /disk/00
> panic: ffs_blkfree: freeing free block
> mp_lock = 00000001; cpuid = 0; lapic.id = 00000000
> boot() called on cpu#0
> 
> syncing disks... 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3
> giving up on 3 buffers
> Uptime: 25d5h24m50s
> 
> 
> We updated to the latest mother board bios , and the problem is still as
> often , but now its a new error
> 
> This is under exactly the same circumsances
> 
> 
> /kernel: panic: ffs_clusteralloc: map mismatch
> /kernel: mp_lock = 00000001; cpuid = 0; lapic.id = 00000000
> /kernel: boot() called on cpu#0
> /kernel:
> /kernel: syncing disks... 155 155 74 39 35 35 35 35 35 35 35 35 35 35 35 35
> 35 35 35 35
> /kernel: giving up on 25 buffers
> /kernel: Uptime: 2h30m20s
> /kernel: Automatic reboot in 15 seconds - press a key on the console to
> abort
> 
> This is fairly random (it could be up for 2 hours or 2 days)
> It just doesnt happen on one machine  it happens on up to 10 seperate we
> have (all the same hardware)
> But happens always with high SCSI I/O and writing big files
> We can simulate it fairly easily testing with Bonnie writing 1-2 gig files
> I have tried FreeBSD 4.4 with no difference plus different revisions of
> adaptec 29160 card
> Also with soft updates on or off
> 
> Any ideas
> Please email me if you want further info
> 
> Thanks
> Tim

Just a hunch here, but I bet it's related to the SMP.  If you running 
these with dual processors and SMP kernels, try doing sysctl 
machdep.smp_active=0 and run bonnie.  If it still panics try booting a 
generic kernel and testing, and if it still panics, remove a 
processor, boot generic, and test.  My guess is that somewhere down 
the line in this process the panics will go away.  At that point take 
the info you have collected and do a send-pr.  You have multiple 
machines with reproducable errors, it should be tracked down quickly.  
You might as well build a kernel with debug symbols in it as well, 
whoever gets to handle this will probably want a stack trace and 
kernel dump.

Josh


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-questions" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20011203104335.A446>