From owner-freebsd-geom@FreeBSD.ORG Thu Nov 12 21:18:33 2009 Return-Path: Delivered-To: freebsd-geom@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id AB8EE10656CE for ; Thu, 12 Nov 2009 21:18:33 +0000 (UTC) (envelope-from gcubfg-freebsd-geom@m.gmane.org) Received: from lo.gmane.org (lo.gmane.org [80.91.229.12]) by mx1.freebsd.org (Postfix) with ESMTP id 3C8258FC16 for ; Thu, 12 Nov 2009 21:18:33 +0000 (UTC) Received: from list by lo.gmane.org with local (Exim 4.50) id 1N8h3n-00014p-JM for freebsd-geom@freebsd.org; Thu, 12 Nov 2009 22:18:31 +0100 Received: from 93-138-63-251.adsl.net.t-com.hr ([93.138.63.251]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 12 Nov 2009 22:18:31 +0100 Received: from ivoras by 93-138-63-251.adsl.net.t-com.hr with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 12 Nov 2009 22:18:31 +0100 X-Injected-Via-Gmane: http://gmane.org/ To: freebsd-geom@freebsd.org From: Ivan Voras Date: Thu, 12 Nov 2009 22:18:02 +0100 Lines: 53 Message-ID: References: <20091112151753.GA20087@crosswinds.net> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: 93-138-63-251.adsl.net.t-com.hr User-Agent: Thunderbird 2.0.0.21 (X11/20090612) In-Reply-To: <20091112151753.GA20087@crosswinds.net> Sender: news Subject: Re: GJournal too Small? X-BeenThere: freebsd-geom@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GEOM-specific discussions and implementations List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 12 Nov 2009 21:18:33 -0000 Tony Holmes wrote: > I have a 894GB gstripe that I've put gjournal on top of. Since it was > an unused stripe I placed the data+journal into the same partition. > The server is fairly heavily used and hung suddenly. On reboot I got > this in the dmesg: > > GEOM_JOURNAL: Journal 3472355975: mirror/gm0s1e contains journal. > GEOM_STRIPE: Device st0 created (id=2649322337). > GEOM_STRIPE: Disk mirror/gm0s1f attached to st0. > GEOM_MIRROR: Device mirror/gm1 launched (1/1). > GEOM_STRIPE: Disk mirror/gm1s1f attached to st0. > GEOM_STRIPE: Device st0 activated. > GEOM_JOURNAL: Journal 2630378703: stripe/st0 contains data. > GEOM_JOURNAL: Journal 2630378703: stripe/st0 contains journal. > GEOM_JOURNAL: Journal stripe/st0 clean. > GEOM_JOURNAL: Timeout. Journal gjournal 3472355975 cannot be completed. > > That last line worries me. > >>From a quick google, it appears that the gjournal is too small. Since > I created it with the single partition, I would have expected the journal > to be autosized correctly. The message "Timeout. Journal %s cannot be completed." is printed when the gjournal composite device is created with data and journal on separate providers. It means that a timeout occurred while gjournal waited for both providers to come online. Your message contains something that looks like a journal ID (3472355975) which isn't in the code in 7-stable and 8-stable/head. How did you get that line? Assuming magic has happened and the journal ID (3472355975) is correct then it means you have two gjournal devices, one of those created on the stripe st0 (2630378703). It could mean that somehow, there is still recognizable metadata on your drives and/or partitions which confuses gjournal. > I know the OS is a little out of date but has been working very well until > the past couple months. I have 1 hang approximately every month. > > Information about system: > > FreeBSD fs.cwahi.net 7.1-PRERELEASE FreeBSD 7.1-PRERELEASE #0: Mon Dec 1 09:12:42 EST 2008 root@app.cwahi.com:/usr/obj/usr/src/sys/CWahi amd64 > > fs# gjournal list > Geom name: gjournal 1493846988 > ID: 1493846988 This, on the other hand, is a third gjournal ID. Assuming that somehow all this information is correct, you should probably send the output of sysctl -b kern.geom.confxml before anyone can unravel what has happened :)