From owner-freebsd-questions Fri Jan 12 17:31:53 2001 Delivered-To: freebsd-questions@freebsd.org Received: from danube.mindmaker.com (danube.mindmaker.com [209.66.98.152]) by hub.freebsd.org (Postfix) with ESMTP id EC3A037B401; Fri, 12 Jan 2001 17:31:23 -0800 (PST) Received: from hera.int.mindmaker.com (hera.int.mindmaker.com [192.168.10.16]) by danube.mindmaker.com (Postfix) with ESMTP id 60B6AF83E; Fri, 12 Jan 2001 17:31:25 -0800 (PST) Received: from mindmaker.com (joe2000.int.mindmaker.com [192.168.10.49]) by hera.int.mindmaker.com (8.11.1/8.11.1) with ESMTP id f0D1Y0Y01156; Fri, 12 Jan 2001 17:34:00 -0800 (PST) (envelope-from jfilla@mindmaker.com) Message-ID: <3A5FAFED.6F305826@mindmaker.com> Date: Fri, 12 Jan 2001 17:31:25 -0800 From: Joseph Filla Organization: Mindmaker Inc. X-Mailer: Mozilla 4.76 [en] (Windows NT 5.0; U) X-Accept-Language: en MIME-Version: 1.0 To: Greg Lehey Cc: freebsd-questions@FreeBSD.ORG, freebsd-stable@FreeBSD.ORG Subject: Re: vinum create causes my 4.2-Stable SMP machine to reboot References: <3A5F9BFB.9C24C85F@mindmaker.com> <20010113110111.B66238@wantadilla.lemis.com> Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Sender: owner-freebsd-questions@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG Greg Lehey wrote: > > [Format recovered--see http://www.lemis.com/email/email-format.html] > > Please don't wrap log output. I've turned off wrapping of lines. Netscape 4.76 won't allow me to set outgoing message wrap to 0 characters. I've set it to 999. > > On Friday, 12 January 2001 at 16:06:19 -0800, Joseph Filla wrote: > > System information: > > 4.2-STABLE FreeBSD 4.2-STABLE #3: Wed Dec 6 17:46:52 PST 2000 > > SMP > > 2 Promise ATA66 controllers > > 4 IDE drives: > > ad4 > > ad6 > > ad8 > > ad10 > > All dangerously dedicated > > > > > > For the last few months, I have had vinum running in a raid 10 > > configuration: > > > > su-2.03# vinum list > > 4 drives: > > D a State: up Device /dev/ad4s1e Avail: 0/11500 MB (0%) > > D b State: up Device /dev/ad6s1e Avail: 0/11500 MB (0%) > > D c State: up Device /dev/ad8s1f Avail: 0/11500 MB (0%) > > D d State: up Device /dev/ad10s1f Avail: 0/11500 MB (0%) > > > > 4 volumes: > > V swap State: up Plexes: 2 Size: 649 MB > > V var State: up Plexes: 2 Size: 1999 MB > > V usr State: up Plexes: 2 Size: 1999 MB > > V home State: up Plexes: 2 Size: 17 GB > > > > > > > > After a few months of use on this box, I decided to use the remaining > > free space on the drives by creating another raid 10 volume. Through > > Sysinstall, I created a partition on each of the four drives, and > > changed the disklabels on all 4 new partitions to type vinum. > > > > I created another vinum config file vinum_home2.conf: > > drive e device /dev/ad4s1f > > drive f device /dev/ad6s1f > > drive g device /dev/ad8s1g > > drive h device /dev/ad10s1g > > > > volume home2 > > plex org striped 257k > > sd length 7705m drive e > > sd length 7705m drive f > > plex org striped 257k > > sd length 7705m drive g > > sd length 7705m drive h > > > > I know putting another volume on another partition in the same slice > > isn't optimal but I don't have any other drives. > > You really need to make that a single partition. But that shouldn't > be the problem. > > > I ran 'vinum create /etc/vinum_home2.conf' and the machine rebooted. > > Spontaneously, with no messages? I was on an ssh connection to the server both times. The first time, I got a few lines back but the second time nothing. By the time I walked over to the server, it had rebooted. > > > vinum_history doesn't show the the create command at all. > > No, it wouldn't. You had a crash. > > > The system rebooted into single user mode and I was eventually able > > to get back to a running system. vinum -l showed the new volume > > home2, the new plexes and the new subdisks but after running init > > and start, I could never get the subdisks past faulty. One of the > > striped plexes was always in state crashed. > > Where's the list? In the frustration of the moment, I didn't save a copy of the list. My bad. After reading some postings to -questions, I looked in /dev/vinum and there were no devices in drive, plex, sd etc for my new volumes. So I ran vinum -makedev and tried to init and start the new volumes. From /var/log/messages: Jan 9 17:04:07 hera /kernel: vinum: drive e is up Jan 9 17:04:07 hera /kernel: vinum: drive f is up Jan 9 17:04:07 hera /kernel: vinum: drive g is up Jan 9 17:04:07 hera /kernel: vinum: drive h is up Jan 9 17:04:07 hera /kernel: vinum: removing 80 blocks of partial stripe at the end o f home2.p0 Running makedev didn't solve my problem. I rebooted and here is /var/log/messages: Jan 9 17:57:10 hera /kernel: vinum: loaded Jan 9 17:57:10 hera /kernel: vinum: reading configuration from /dev/ad10s1f Jan 9 17:57:10 hera /kernel: vinum: home2.p0.s0 is crashed Jan 9 17:57:10 hera /kernel: vinum: home2.p0 is faulty Jan 9 17:57:10 hera /kernel: vinum: home2 is down Jan 9 17:57:10 hera /kernel: vinum: home2.p0.s1 is crashed Jan 9 17:57:10 hera /kernel: vinum: home2.p1.s0 is up Jan 9 17:57:10 hera /kernel: vinum: home2.p1 is up Jan 9 17:57:10 hera /kernel: vinum: home2 is up Jan 9 17:57:10 hera /kernel: vinum: home2.p1.s0 is up Jan 9 17:57:10 hera /kernel: vinum: home2.p1.s1 is up Jan 9 17:57:10 hera /kernel: vinum: updating configuration from /dev/ad8s1f Jan 9 17:57:10 hera /kernel: vinum: updating configuration from /dev/ad4s1e Jan 9 17:57:10 hera /kernel: vinum: updating configuration from /dev/ad6s1e Jan 9 17:57:10 hera /kernel: vinum: updating configuration from /dev/ad6s1f Jan 9 17:57:10 hera /kernel: vinum_scandisk: /dev/ad6s1f is down Jan 9 17:57:10 hera /kernel: vinum: Can't read device /dev/ad6s1f, error 5 Jan 9 17:57:10 hera /kernel: vinum: updating configuration from /dev/ad8s1g Jan 9 17:57:10 hera /kernel: vinum_scandisk: /dev/ad8s1g is down Jan 9 17:57:10 hera /kernel: vinum: Can't read device /dev/ad8s1g, error 5 Jan 9 17:57:10 hera /kernel: vinum: updating configuration from /dev/ad4s1f Jan 9 17:57:10 hera /kernel: vinum_scandisk: /dev/ad4s1f is down Jan 9 17:57:10 hera /kernel: vinum: Can't read device /dev/ad4s1f, error 5 Jan 9 17:57:10 hera /kernel: vinum: updating configuration from /dev/ad10s1g Jan 9 17:57:10 hera /kernel: vinum_scandisk: /dev/ad10s1g is down Jan 9 17:57:10 hera /kernel: vinum: Can't read device /dev/ad10s1g, error 5 Jan 9 17:57:10 hera /kernel: vinum: couldn't read configuration Similar results as I wrestled with it for a day before eventually doing a rm -f on it all > > > I eventually had to detach and rm all remnants of home2 and got my > > system back to the original state. Thinking I may have done > > something incorrectly, I retried the vinum create config and the > > system rebooted again. This time vinum_history recorded the > > following: > > > > 12 Jan 2001 14:05:27.960421 *** vinum started *** > > 12 Jan 2001 14:05:27.961106 create vinum_home2.conf > > The ten seconds here are hardly enough time for a reboot. Let's > intersperse these messages: > > > 12 Jan 2001 14:05:37.293978 *** vinum started *** > > 12 Jan 2001 14:12:43.507023 *** vinum started *** > > 12 Jan 2001 14:12:44.694607 l > > ... > > > > This time the system came backup correctly, however this time, running > > vinum -l show me nothing regarding my desired volume home2. Here is the > > relevant stuff from /var/log/messages: > > Jan 10 16:03:34 hera /kernel: vinum: loaded > > Jan 10 16:03:34 hera /kernel: vinum: reading configuration from /dev/ad8s1f > > Jan 10 16:03:34 hera /kernel: vinum: updating configuration from /dev/ad10s1f > > Jan 10 16:03:34 hera /kernel: vinum: updating configuration from /dev/ad4s1e > > Jan 10 16:03:34 hera /kernel: vinum: updating configuration from /dev/ad6s1e > > 12 Jan 2001 14:05:27.960421 *** vinum started *** > > 12 Jan 2001 14:05:27.961106 create vinum_home2.conf > > 12 Jan 2001 14:05:37.293978 *** vinum started *** > > Jan 12 14:05:37 hera /kernel: vinum: drive e is up > > Jan 12 14:05:37 hera /kernel: vinum: drive f is up > > Jan 12 14:05:37 hera /kernel: vinum: drive g is up > > Jan 12 14:05:37 hera /kernel: vinum: drive h is up > > Jan 12 14:10:37 hera /kernel: vinum: loaded > > Jan 12 14:10:37 hera /kernel: vinum: reading configuration from /dev/ad8s1f > > Jan 12 14:10:37 hera /kernel: vinum: updating configuration from /dev/ad10s1f > > Jan 12 14:10:37 hera /kernel: vinum: updating configuration from /dev/ad6s1e > > Jan 12 14:10:37 hera /kernel: vinum: updating configuration from /dev/ad4s1e > > Jan 12 14:10:37 hera /kernel: vinum: updating configuration from /dev/ad6s1f > > Jan 12 14:10:37 hera /kernel: vinum_scandisk: /dev/ad6s1f is down > > Jan 12 14:10:37 hera /kernel: vinum: Can't read device /dev/ad6s1f, error 5 > > Jan 12 14:10:37 hera /kernel: vinum: updating configuration from /dev/ad8s1g > > Jan 12 14:10:37 hera /kernel: vinum_scandisk: /dev/ad8s1g is down > > Jan 12 14:10:37 hera /kernel: vinum: Can't read device /dev/ad8s1g, error 5 > > Jan 12 14:10:37 hera /kernel: vinum: updating configuration from /dev/ad4s1f > > Jan 12 14:10:37 hera /kernel: vinum_scandisk: /dev/ad4s1f is down > > Jan 12 14:10:37 hera /kernel: vinum: Can't read device /dev/ad4s1f, error 5 > > Jan 12 14:10:37 hera /kernel: vinum: updating configuration from /dev/ad10s1g > > Jan 12 14:10:37 hera /kernel: vinum_scandisk: /dev/ad10s1g is down > > Jan 12 14:10:37 hera /kernel: vinum: Can't read device /dev/ad10s1g, error 5 > > Jan 12 14:10:37 hera /kernel: vinum: couldn't read configuration > > > > Neither reboots created any dumps in /var/crash > > Do you have dumps enabled? I do now... I can always run the create command again and catch it. > > The best I can guess here is that your partition layout is funny in > some way. Let's have a look at the disklabel output for each > spindle. Just the end, like this: > > 8 partitions: > # size offset fstype [fsize bsize bps/cpg] > c: 4124640 0 unused 0 0 # (Cyl. 0 - 1006*) > h: 4124640 0 vinum # (Cyl. 0 - 1006*) disklabel ad4: 8 partitions: # size offset fstype [fsize bsize bps/cpg] a: 253952 0 4.2BSD 0 0 0 # (Cyl. 0 - 403*) c: 40088160 0 unused 0 0 # (Cyl. 0 - 63631) e: 23552000 253952 vinum # (Cyl. 403*- 37787*) f: 16282208 23805952 vinum # (Cyl. 37787*- 63631*) disklabel ad6: 8 partitions: # size offset fstype [fsize bsize bps/cpg] b: 253952 0 swap # (Cyl. 0 - 403*) c: 40088160 0 unused 0 0 # (Cyl. 0 - 63631) e: 23552000 253952 vinum # (Cyl. 403*- 37787*) f: 16282208 23805952 vinum # (Cyl. 37787*- 63631*) disklabel ad8: 8 partitions: # size offset fstype [fsize bsize bps/cpg] c: 40088160 0 unused 0 0 # (Cyl. 0 - 63631) e: 251904 0 4.2BSD 0 0 0 # (Cyl. 0 - 399*) f: 23552000 251904 vinum # (Cyl. 399*- 37783*) g: 16284256 23803904 vinum # (Cyl. 37783*- 63631*) disklabel ad10: 8 partitions: # size offset fstype [fsize bsize bps/cpg] c: 40088160 0 unused 0 0 # (Cyl. 0 - 63631) e: 253952 0 4.2BSD 0 0 0 # (Cyl. 0 - 403*) f: 23552000 253952 vinum # (Cyl. 403*- 37787*) g: 16282208 23805952 vinum # (Cyl. 37787*- 63631*) Note how ad8s1g has a different offset and size from the other related partitions. I noticed that before running create and I tried to make my lengths accordingly. I didn't intentionally do it this way. > > Greg > -- > When replying to this message, please copy the original recipients. > If you don't, I may ignore the reply. > For more information, see http://www.lemis.com/questions.html > Finger grog@lemis.com for PGP public key > See complete headers for address and phone numbers > > To Unsubscribe: send mail to majordomo@FreeBSD.org > with "unsubscribe freebsd-questions" in the body of the message -- Joe Filla Systems Administrator Mindmaker, Inc. (408) 467-0468 To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-questions" in the body of the message