From owner-freebsd-stable@FreeBSD.ORG Fri Jul 2 08:23:55 2004
From: Danny Braniss
To: freebsd-stable@freebsd.org
cc: Soren Schmidt
Subject: problems with RAID0 and Intel/SE7501WV2/Promise
Date: Fri, 02 Jul 2004 11:22:39 +0300
Message-Id: <20040702082354.C1C5F43D2D@mx1.FreeBSD.org>
List-Id: Production branch of FreeBSD source code

Hi,

The RAID info keeps disappearing, and I'm not convinced it's hardware.
I've tried it on two different but similar boxes. After some time, the
kernel reports something like:

Jul 2 10:18:35 cs7.cs.huji.ac.il /kernel: ad6: hard error writing fsbn 0 (ad6 bn 0; cn 0 tn 0 sn 0) trying PIO mode
Jul 2 10:18:35 cs7.cs.huji.ac.il /kernel: ad4: hard error writing fsbn 0 (ad4 bn 0; cn 0 tn 0 sn 0) trying PIO mode
Jul 2 10:18:35 cs7.cs.huji.ac.il /kernel: ad6: hard error writing fsbn 0 (ad6 bn 0; cn 0 tn 0 sn 0) status=51 error=10
Jul 2 10:18:35 cs7.cs.huji.ac.il /kernel: ar0: ERROR - array broken
Jul 2 10:18:35 cs7.cs.huji.ac.il /kernel: ad4: hard error writing fsbn 0 (ad4 bn 0; cn 0 tn 0 sn 0) status=51 error=10
Jul 2 10:18:35 cs7.cs.huji.ac.il /kernel: ar0: ERROR - array broken
Jul 2 10:18:35 cs7.cs.huji.ac.il /kernel: ar0: ERROR - array broken

From dmesg:

...
FreeBSD 4.10-STABLE #7: Fri Jul 2 09:57:10 IDT 2004
...
ar0: 381564MB [48642/255/63] status: READY subdisks:
 0 READY ad4: 190782MB [387621/16/63] at ata2-master UDMA100
 1 READY ad6: 190782MB [387621/16/63] at ata3-master UDMA100

I've partitioned the disk like so:

#          size     offset    fstype   [fsize bsize bps/cpg]
  a:    1024000          0    4.2BSD     2048 16384    90   # (Cyl.    0 - 63*)
  b:    8388608    1024000      swap                        # (Cyl.   63*- 585*)
  c:  781433667          0    unused        0     0         # (Cyl.    0 - 48641*)
  d:    1024000    9412608    4.2BSD        0     0     0   # (Cyl.  585*- 649*)
  h:  770997059   10436608    4.2BSD        0     0     0   # (Cyl.  649*- 48641*)

The machine boots diskless, so just to check the disk I ran newfs -U on
/dev/ar0s1a and then restored a root image onto it, with no problems.

The h partition holds a big Postgres database. Starting Postgres
produces the error above. Note that the error looks a bit suspicious:
fsbn 0 ( ... bn 0; cn 0; tn 0; sn 0).

Using the FastTrak/Promise BIOS I reconfigured the RAID and tried the
above again, with the same results.

BTW, on a different host with the same motherboard and the same type of
disks, but an older kernel, the machine panics. The disk error is the
same, and the array info is lost there too.

Is any more info or help needed to track this down?

Thanks,
	danny
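
For reference, the status=51 error=10 pair in the kernel messages can be
broken into individual register bits. The following is only a rough
sketch: it assumes the driver prints both values in hexadecimal and that
the bit names follow the standard ATA status and error register layout.
The names STATUS_BITS, ERROR_BITS and decode are made up for this
illustration and are not taken from the FreeBSD source.

```python
# Bit names follow the standard ATA task-file register layout.
# Assumption: the kernel logs these two values in hexadecimal.
STATUS_BITS = {
    0x80: "BSY",    # device busy
    0x40: "DRDY",   # device ready
    0x20: "DF",     # device fault
    0x10: "DSC",    # seek complete
    0x08: "DRQ",    # data request
    0x04: "CORR",   # corrected data
    0x02: "IDX",    # index
    0x01: "ERR",    # error, see the error register
}

ERROR_BITS = {
    0x80: "ICRC/BBK",  # interface CRC error / bad block
    0x40: "UNC",       # uncorrectable data
    0x20: "MC",        # media changed
    0x10: "IDNF",      # sector ID not found
    0x08: "MCR",       # media change requested
    0x04: "ABRT",      # command aborted
    0x02: "TK0NF",     # track 0 not found
    0x01: "AMNF",      # address mark not found
}


def decode(value, names):
    """Return the names of all bits set in value, highest bit first."""
    return [name for bit, name in names.items() if value & bit]


if __name__ == "__main__":
    # Values copied from the log lines, read as hex (assumption).
    print("status:", decode(0x51, STATUS_BITS))
    print("error: ", decode(0x10, ERROR_BITS))
```

Under those assumptions, status 0x51 has the ready, seek-complete and
error bits set, and error 0x10 is the "ID not found" bit, which a drive
typically raises when it cannot locate the requested sector address.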