From owner-freebsd-stable Mon Jan 1 14:58:29 2001 From owner-freebsd-stable@FreeBSD.ORG Mon Jan 1 14:58:25 2001 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from wantadilla.lemis.com (wantadilla.lemis.com [192.109.197.80]) by hub.freebsd.org (Postfix) with ESMTP id EFC4037B400 for ; Mon, 1 Jan 2001 14:58:22 -0800 (PST) Received: by wantadilla.lemis.com (Postfix, from userid 1004) id 1093E6A913; Tue, 2 Jan 2001 09:28:20 +1030 (CST) Date: Tue, 2 Jan 2001 09:28:20 +1030 From: Greg Lehey To: Roman Shterenzon Cc: freebsd-stable@FreeBSD.ORG Subject: Re: vinum malfunction! Message-ID: <20010102092819.K44043@wantadilla.lemis.com> References: <20001231101426.K99806@wantadilla.lemis.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.2.5i In-Reply-To: ; from roman@harmonic.co.il on Mon, Jan 01, 2001 at 12:32:22PM +0200 Organization: LEMIS, PO Box 460, Echunga SA 5153, Australia Phone: +61-8-8388-8286 Fax: +61-8-8388-8725 Mobile: +61-418-838-708 WWW-Home-Page: http://www.lemis.com/~grog X-PGP-Fingerprint: 6B 7B C3 8C 61 CD 54 AF 13 24 52 F8 6D A4 95 EF Sender: owner-freebsd-stable@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG On Monday, 1 January 2001 at 12:32:22 +0200, Roman Shterenzon wrote: > On Sun, 31 Dec 2000, Greg Lehey wrote: > >> [Format recovered--see http://www.lemis.com/email/email-format.html] >> >> Please limit your lines to < 80 characters. >> >> On Saturday, 30 December 2000 at 19:00:27 +0100, O. Hartmann wrote: >>> Dear Sirs. >>> Today I run a make world and compiled a new kernel for our server, >>> which uses a vinum RAID 0. At the same time I reconfigured the >>> kernel to use SC console driver and VESA instead of VT console. >>> >>> Since the new kernel got compiled loading of vinum fails! I do not >>> know why, I removed all kernel changes, but it does not help. >>> >>> Does anyone know why? >> >> Of course not. You haven't given *any* details. >> >> Take a look at http://www.vinumvm.org/vinum/how-to-debug.html and give >> some information, and I'll probably be able to help. > > I've submitted kern/22021 which was left untouched, it has all the needed > details. Please don't jump to conclusions. O. Hartmann has, as I mentioned, given *no* information. If you can find the cause of the problem from what he has done, please go ahead and do so. All I can see here is that he is talking about RAID-0, while the problem you failed to describe in 22021 occurs only with RAID-5. kern/22021 was closed one day after it was issued. To quote: Greg Lehey, 17 October 2000 I'm sorry, this PR is so completely mutilated that I can't read it. It appears to have been sent with quoted-printable, but without declaring the fact. As a result, the information is almost completely illegible. It's not so completely mutilated, however, that I don't note that it includes dmesg and kernel config information, information that I specifically ask *not* to include in problem reports. I'm closing this PR because it'll be easier to resubmit it than to fix it. Please read http://www.vinumvm.org/vinum/how-to-debug.html and http://www.lemis.com/email.html before doing so. Since you mention this PR, I suppose I should amplify on this comment. First, it is by *far* the most illegible PR I have ever seen. For example: #39 0xc0236a0d in trap_pfault (frame=3D0xc0276a70, usermode=3D0, eva=3D84) at ../../i386/i386/trap.c:820 #40 0xc023660b in trap (frame=3D{tf_fs =3D -1071185904, tf_es =3D -10723655= 52,=20 tf_ds =3D 7077904, tf_edi =3D -1049722848, tf_esi =3D -1049722880,=20 tf_ebp =3D -1071158580, tf_isp =3D -1071158628, tf_ebx =3D -103960512= 0,=20 tf_edx =3D 0, tf_ecx =3D 90505217, tf_eax =3D -7113793, tf_trapno =3D= 12,=20 tf_err =3D 2, tf_eip =3D -1051657113, tf_cs =3D 8, tf_eflags =3D 6611= 8,=20 tf_esp =3D -1049722848, tf_ss =3D -1051996160}) at ../../i386/i386/tr= ap.c:426 #41 0xc150fc67 in ?? () #42 0xc0178d6b in biodone (bp=3D0xc16e8020) at ../../kern/vfs_bio.c:2637 #43 0xc0126bb9 in dadone (periph=3D0xc14ca700, done_ccb=3D0xc1658c00) at ../../cam/scsi/scsi_da.c:1246 Apart from the fact that it's barely legible, this also shows: 1. You did a backtrace without loading Vinum symbols. 2. It's related to a known, but rare, RAID-5 problem which I am trying to get information on. You don't mention that you resubmitted this PR as 22103. Despite my requests, 1. You insisted on including the config file and dmesg, which just bloat the PR and make it more difficult to read. 2. The backtrace still does not include Vinum symbols, making it pretty useless. 3. It includes (without comment) a mail exchange with Andy Newman, which have been twice mutilated, once before I received the message, once after. After fighting my way through this PR, I don't have any additional information. Dealing with this kind of PR is an absolute pain. It's bloated, illegible, full of irrelevant information and lacking the information I need. It is, however, one of the few cases I've seen of a particular buffer header corruption which occurs only in RAID-5, and I would like to fix it. If you want to help, instead of sending "me too" messages about what is obviously a completely different problem, help me find the problem you're having. Greg -- Finger grog@lemis.com for PGP public key See complete headers for address and phone numbers To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-stable" in the body of the message