From owner-freebsd-geom@FreeBSD.ORG Mon Oct 22 10:57:51 2007 Return-Path: Delivered-To: freebsd-geom@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 3EC5116A420 for ; Mon, 22 Oct 2007 10:57:51 +0000 (UTC) (envelope-from felipe@neuwald.biz) Received: from itacaiunas.cepatec.org.br (itacaiunas.cepatec.org.br [200.152.208.51]) by mx1.freebsd.org (Postfix) with ESMTP id 8209313C4E1 for ; Mon, 22 Oct 2007 10:57:50 +0000 (UTC) (envelope-from felipe@neuwald.biz) Received: from localhost (vermelho [10.0.0.5]) by itacaiunas.cepatec.org.br (Postfix) with ESMTP id 483CD11562D; Mon, 22 Oct 2007 08:57:38 -0200 (BRST) X-Virus-Scanned: amavisd-new at cepatec.org.br Received: from itacaiunas.cepatec.org.br ([10.0.0.3]) by localhost (vermelho.cepatec.org.br [10.0.0.5]) (amavisd-new, port 10024) with ESMTP id jHKQyGBCqXUN; Mon, 22 Oct 2007 07:57:36 -0300 (BRT) Received: from [192.168.0.152] (unknown [200.199.198.61]) by itacaiunas.cepatec.org.br (Postfix) with ESMTP id 4900011562B; Mon, 22 Oct 2007 08:57:35 -0200 (BRST) Message-ID: <471C821F.2090101@neuwald.biz> Date: Mon, 22 Oct 2007 08:57:35 -0200 From: Felipe Neuwald User-Agent: Thunderbird 1.5.0.13 (X11/20070824) To: Ulf Lilleengen References: <4718ECB2.9050207@neuwald.biz> <20071019200041.GA16812@stud.ntnu.no> In-Reply-To: <20071019200041.GA16812@stud.ntnu.no> Content-Transfer-Encoding: 7bit MIME-Version: 1.0 Content-Type: text/plain; charset="ISO-8859-1" X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: freebsd-geom@freebsd.org Subject: Re: gvinum - problem on hard disk X-BeenThere: freebsd-geom@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GEOM-specific discussions and implementations List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 22 Oct 2007 10:57:51 -0000 Hi Ulf, Thank you for your information. As you can see, it worked: [root@fileserver ~]# gvinum list 4 drives: D a State: up /dev/ad4 A: 0/238474 MB (0%) D b State: up /dev/ad5 A: 0/238475 MB (0%) D c State: up /dev/ad6 A: 0/238475 MB (0%) D d State: up /dev/ad7 A: 0/238475 MB (0%) 1 volume: V data State: up Plexes: 1 Size: 931 GB 1 plex: P data.p0 S State: up Subdisks: 4 Size: 931 GB 4 subdisks: S data.p0.s3 State: up D: d Size: 232 GB S data.p0.s2 State: up D: c Size: 232 GB S data.p0.s1 State: up D: b Size: 232 GB S data.p0.s0 State: up D: a Size: 232 GB [root@fileserver ~]# fsck -t ufs -y /dev/gvinum/data ** /dev/gvinum/data ** Last Mounted on /data ** Phase 1 - Check Blocks and Sizes ** Phase 2 - Check Pathnames ** Phase 3 - Check Connectivity ** Phase 4 - Check Reference Counts ** Phase 5 - Check Cyl groups 258700 files, 419044280 used, 53985031 free (39599 frags, 6743179 blocks, 0.0% fragmentation) ***** FILE SYSTEM MARKED CLEAN ***** [root@fileserver ~]# mount -t ufs /dev/gvinum/data /data [root@fileserver ~]# mount /dev/ad0s1a on / (ufs, local) devfs on /dev (devfs, local) /dev/ad0s1d on /tmp (ufs, local, soft-updates) /dev/ad0s1e on /usr (ufs, local, soft-updates) /dev/ad0s1f on /var (ufs, local, soft-updates) /dev/gvinum/data on /data (ufs, local) [root@fileserver ~]# Now, I have to advice the customer again to make a backup file server. Thank you very much, Felipe Neuwald. Ulf Lilleengen escreveu: On fre, okt 19, 2007 at 03:43:14 -0200, Felipe Neuwald wrote: Hi folks, I have one gvinum raid on a FreeBSD 6.1-RELEASE machine. There are 4 disks running, as you can see: [root@fileserver ~]# gvinum list 4 drives: D a State: up /dev/ad4 A: 0/238474 MB (0%) D b State: up /dev/ad5 A: 0/238475 MB (0%) D c State: up /dev/ad6 A: 0/238475 MB (0%) D d State: up /dev/ad7 A: 0/238475 MB (0%) 1 volume: V data State: down Plexes: 1 Size: 931 GB 1 plex: P data.p0 S State: down Subdisks: 4 Size: 931 GB 4 subdisks: S data.p0.s3 State: stale D: d Size: 232 GB S data.p0.s2 State: up D: c Size: 232 GB S data.p0.s1 State: up D: b Size: 232 GB S data.p0.s0 State: up D: a Size: 232 GB But, as you can see, the data.p0.s3 is "stale". What should I do to try recover this and get the raid up again (and recover information) Hello, Since your plex organization is RAID0 (striping), recovering after a drive failure is a problem since you don't have any redundancy, but if you didn't replace any drives etc, this could just be gvinum fooling around. In that case, doing a 'gvinum setstate -f up data.p0.s3' should get the volume up again. From owner-freebsd-geom@FreeBSD.ORG Mon Oct 22 11:07:05 2007 Return-Path: Delivered-To: freebsd-geom@hub.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id BEDBE16A420 for ; Mon, 22 Oct 2007 11:07:05 +0000 (UTC) (envelope-from owner-bugmaster@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:4f8:fff6::28]) by mx1.freebsd.org (Postfix) with ESMTP id AD2E013C491 for ; Mon, 22 Oct 2007 11:07:05 +0000 (UTC) (envelope-from owner-bugmaster@FreeBSD.org) Received: from freefall.freebsd.org (localhost [127.0.0.1]) by freefall.freebsd.org (8.14.1/8.14.1) with ESMTP id l9MB75BE079942 for ; Mon, 22 Oct 2007 11:07:05 GMT (envelope-from owner-bugmaster@FreeBSD.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.1/8.14.1/Submit) id l9MB74Qm079938 for freebsd-geom@FreeBSD.org; Mon, 22 Oct 2007 11:07:05 GMT (envelope-from owner-bugmaster@FreeBSD.org) Date: Mon, 22 Oct 2007 11:07:05 GMT Message-Id: <200710221107.l9MB74Qm079938@freefall.freebsd.org> X-Authentication-Warning: freefall.freebsd.org: gnats set sender to owner-bugmaster@FreeBSD.org using -f From: FreeBSD bugmaster To: freebsd-geom@FreeBSD.org Cc: Subject: Current problem reports assigned to freebsd-geom@FreeBSD.org X-BeenThere: freebsd-geom@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GEOM-specific discussions and implementations List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 22 Oct 2007 11:07:05 -0000 Current FreeBSD problem reports Critical problems Serious problems S Tracker Resp. Description -------------------------------------------------------------------------------- o kern/73177 geom kldload geom_* causes panic due to memory exhaustion o kern/76538 geom [gbde] nfs-write on gbde partition stalls and continue o kern/83464 geom [geom] [patch] Unhandled malloc failures within libgeo o kern/84556 geom [geom] GBDE-encrypted swap causes panic at shutdown o kern/87544 geom [gbde] mmaping large files on a gbde filesystem deadlo o kern/89102 geom [geom_vfs] [panic] panic when forced unmount FS from u o bin/90093 geom fdisk(8) incapable of altering in-core geometry o kern/90582 geom [geom_mirror] [panic] Restore cause panic string (ffs_ o kern/98034 geom [geom] dereference of NULL pointer in acd_geom_detach o kern/104389 geom [geom] [patch] sys/geom/geom_dump.c doesn't encode XML o kern/113419 geom [geom] geom fox multipathing not failing back o misc/113543 geom [geom] [patch] geom(8) utilities don't work inside the o kern/113957 geom [gmirror] gmirror is intermittently reporting a degrad o kern/115572 geom [gbde] gbde partitions fail at 28bit/48bit LBA address 14 problems total. Non-critical problems S Tracker Resp. Description -------------------------------------------------------------------------------- o bin/78131 geom gbde "destroy" not working. o kern/79251 geom [2TB] newfs fails on 2.6TB gbde device o kern/94632 geom [geom] Kernel output resets input while GELI asks for f kern/105390 geom [geli] filesystem on a md backed by sparse file with s o kern/107707 geom [geom] [patch] add new class geom_xbox360 to slice up p bin/110705 geom gmirror control utility does not exit with correct exi o kern/113837 geom [geom] unable to access 1024 sector size storage o kern/113885 geom [geom] [patch] improved gmirror balance algorithm o kern/114532 geom GEOM_MIRROR shows up in kldstat even if compiled in th o kern/115547 geom [geom] [patch] for GEOM Eli to get password from stdin 10 problems total. From owner-freebsd-geom@FreeBSD.ORG Tue Oct 23 14:46:56 2007 Return-Path: Delivered-To: freebsd-geom@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 3BC4F16A41B; Tue, 23 Oct 2007 14:46:56 +0000 (UTC) (envelope-from avg@icyb.net.ua) Received: from falcon.cybervisiontech.com (falcon.cybervisiontech.com [217.20.163.9]) by mx1.freebsd.org (Postfix) with ESMTP id BEF1813C4C8; Tue, 23 Oct 2007 14:46:55 +0000 (UTC) (envelope-from avg@icyb.net.ua) Received: from localhost (localhost [127.0.0.1]) by falcon.cybervisiontech.com (Postfix) with ESMTP id 53E83744006; Tue, 23 Oct 2007 14:13:57 +0300 (EEST) X-Virus-Scanned: Debian amavisd-new at falcon.cybervisiontech.com Received: from falcon.cybervisiontech.com ([127.0.0.1]) by localhost (falcon.cybervisiontech.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id CJark6ABQDCn; Tue, 23 Oct 2007 14:13:57 +0300 (EEST) Received: from [10.2.1.87] (gateway.cybervisiontech.com.ua [88.81.251.18]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by falcon.cybervisiontech.com (Postfix) with ESMTP id E19D7744003; Tue, 23 Oct 2007 14:13:56 +0300 (EEST) Message-ID: <471DD76F.6000402@icyb.net.ua> Date: Tue, 23 Oct 2007 14:13:51 +0300 From: Andriy Gapon User-Agent: Thunderbird 2.0.0.6 (X11/20070803) MIME-Version: 1.0 To: Pawel Jakub Dawidek References: <4715C3D1.3070308@icyb.net.ua> <4715F4EE.9020104@freebsd.org> <47161FD1.5010501@icyb.net.ua> In-Reply-To: <47161FD1.5010501@icyb.net.ua> References: <4715C3D1.3070308@icyb.net.ua> <4715F4EE.9020104@freebsd.org> <47161FD1.5010501@icyb.net.ua> <20071017201235.GD50219@garage.freebsd.pl> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Cc: freebsd-geom@freebsd.org Subject: Re: gjournal: FLUSHCACHE timed out X-BeenThere: freebsd-geom@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GEOM-specific discussions and implementations List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 23 Oct 2007 14:46:56 -0000 It seems that setting switch_time to 5 helped only against small loads and I still saw timeouts in some cases. Unfortunately I lost Pavel's reply but I saw it in the web archives and I applied the patch from: http://www.freebsd.org/cgi/cvsweb.cgi/src/sys/dev/ata/ata-disk.c.diff?r1=1.201;r2=1.202 and everything is perfect now! I set switch_time back to 10 and I don't see any timeouts even under heavy loads. Thank you, Pavel! Also, according to the given advices I disabled softupdates and set async mount option. Thank you all again. -- Andriy Gapon From owner-freebsd-geom@FreeBSD.ORG Thu Oct 25 22:29:06 2007 Return-Path: Delivered-To: freebsd-geom@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 432AC16A417 for ; Thu, 25 Oct 2007 22:29:06 +0000 (UTC) (envelope-from mario.pranjic@gmail.com) Received: from wa-out-1112.google.com (wa-out-1112.google.com [209.85.146.182]) by mx1.freebsd.org (Postfix) with ESMTP id 9BBA813C494 for ; Thu, 25 Oct 2007 22:29:05 +0000 (UTC) (envelope-from mario.pranjic@gmail.com) Received: by wa-out-1112.google.com with SMTP id k17so819390waf for ; Thu, 25 Oct 2007 15:29:05 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:mime-version:content-type; bh=tfV0tHd77uzE7wxAJtpvRVf6W5FpYgAnHoTEvz4joZY=; b=XeG7cIQF8HHdEyXWYn76GTu2rYACAjhAYiwHnYMAhv4EzP65bk1E3YwD77QszTfeh05kXxoz1U5CGGqdFOra+JAtPi9s5ZrM9JRKwWy3iTxy3kMAKebKZTRUYKKpycTc5hEjG1G8iVqKwQ7rwU7GiIP7CeVk49eZYC/HcLk1GI4= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:mime-version:content-type; b=phIbqxOai/PE4cE0ib5mLqCNIMdAoNYImIH+Pcv2FLQ91kxGFjUeiph/L5MZ7adNNqFeLD2p+nlY2TolJ6GdA92LIH/eRd/rmuh5ykqdytTEsWyVeE78BQW6iqBfMYRAP+dwJxz5am4cwJcYCtfCiCTotMnG0LbTHaygIehHrRg= Received: by 10.115.15.1 with SMTP id s1mr2636229wai.1193349760971; Thu, 25 Oct 2007 15:02:40 -0700 (PDT) Received: by 10.114.13.14 with HTTP; Thu, 25 Oct 2007 15:02:40 -0700 (PDT) Message-ID: Date: Fri, 26 Oct 2007 00:02:40 +0200 From: "Mario Pranjic" To: freebsd-geom@freebsd.org MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Subject: gmirror weird behaviour X-BeenThere: freebsd-geom@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GEOM-specific discussions and implementations List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 25 Oct 2007 22:29:06 -0000 Hi! I have a system with 2 250 GB disks in mirror: [root@haerbe ~]# gmirror status Name Status Components mirror/gmo COMPLETE ad0 ad2 system info: FreeBSD haerbe.net 6.2-RELEASE-p7 FreeBSD 6.2-RELEASE-p7 #0: Wed Sep 5 06:24:11 CEST 2007 root@haerbe.net:/usr/obj/usr/src/sys/HAERBE i386 At some point ad0 disk was detached from the system: Oct 25 06:43:10 haerbe ad0: FAILURE - device detached Oct 25 06:43:10 haerbe subdisk0: detached Oct 25 06:43:10 haerbe ad0: detached Oct 25 06:43:10 haerbe GEOM_MIRROR: Device gmo: provider ad0 disconnected. Oct 25 06:43:10 haerbe g_vfs_done():mirror/gmos2e[READ(offset=10598187008, length=16384)]error = 6 Oct 25 06:43:10 haerbe g_vfs_done():mirror/gmos2e[READ(offset=30252744704, length=16384)]error = 6 Mirror status: [root@haerbe /www/_backups/hrbwiki/2007-10-22]# gmirror status Name Status Components mirror/gmo DEGRADED ad2 [root@haerbe /www/_backups/hrbwiki/2007-10-22]# gmirror list Geom name: gmo State: DEGRADED Components: 2 Balance: round-robin Slice: 4096 Flags: NONE GenID: 0 SyncID: 2 ID: 3221909623 Providers: 1. Name: mirror/gmo Mediasize: 250059349504 (233G) Sectorsize: 512 Mode: r8w8e10 Consumers: 1. Name: ad2 Mediasize: 250059350016 (233G) Sectorsize: 512 Mode: r1w1e1 State: ACTIVE Priority: 0 Flags: NONE GenID: 0 SyncID: 2 ID: 1951693847 I cleaned the mirror: [root@haerbe]# gmirror forget gmo [root@haerbe ~]# gmirror status Name Status Components mirror/gmo COMPLETE ad2 After reboot the system is aware of both disks: ad0: 238475MB at ata0-master UDMA100 ad2: 238475MB at ata1-master UDMA100 but after that: GEOM_MIRROR: Cannot add disk ad2 to gmo (error=22). Root mount waiting for: GMIRROR Root mount waiting for: GMIRROR Root mount waiting for: GMIRROR GEOM_MIRROR: Force device gmo start due to timeout. GEOM_MIRROR: Device gmo: provider ad0 activated. GEOM_MIRROR: Device gmo: provider mirror/gmo launched. And the system rebooted with ad0 in mirror and not ad2! [root@haerbe ~]# gmirror list Geom name: gmo State: DEGRADED Components: 2 Balance: round-robin Slice: 4096 Flags: NONE GenID: 0 SyncID: 2 ID: 3221909623 Providers: 1. Name: mirror/gmo Mediasize: 250059349504 (233G) Sectorsize: 512 Mode: r8w8e10 Consumers: 1. Name: ad0 Mediasize: 250059350016 (233G) Sectorsize: 512 Mode: r1w1e1 State: ACTIVE Priority: 0 Flags: DIRTY GenID: 0 SyncID: 2 ID: 2195860303 [root@haerbe ~]# gmirror status Name Status Components mirror/gmo DEGRADED ad0 Now I synced mirror again successfuly but this is weird. It should not be happening. Any ideas what is going on? Thnx! -- Mario. http://keeper.sharanet.org/ http://keeper.haerbe.net/ http://keeper.nymphomatic.org/ From owner-freebsd-geom@FreeBSD.ORG Fri Oct 26 07:58:23 2007 Return-Path: Delivered-To: freebsd-geom@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 8EE5516A419 for ; Fri, 26 Oct 2007 07:58:23 +0000 (UTC) (envelope-from pjd@garage.freebsd.pl) Received: from mail.garage.freebsd.pl (arm132.internetdsl.tpnet.pl [83.17.198.132]) by mx1.freebsd.org (Postfix) with ESMTP id DDE0413C4A3 for ; Fri, 26 Oct 2007 07:58:22 +0000 (UTC) (envelope-from pjd@garage.freebsd.pl) Received: by mail.garage.freebsd.pl (Postfix, from userid 65534) id 78D03467E4; Fri, 26 Oct 2007 09:58:20 +0200 (CEST) Received: from localhost (pjd.wheel.pl [10.0.1.1]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by mail.garage.freebsd.pl (Postfix) with ESMTP id 5398F45FE9; Fri, 26 Oct 2007 09:58:15 +0200 (CEST) Date: Fri, 26 Oct 2007 09:57:56 +0200 From: Pawel Jakub Dawidek To: Mario Pranjic Message-ID: <20071026075756.GA1219@garage.freebsd.pl> References: Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="IS0zKkzwUGydFO0o" Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4.2.3i X-PGP-Key-URL: http://people.freebsd.org/~pjd/pjd.asc X-OS: FreeBSD 7.0-CURRENT i386 X-Spam-Checker-Version: SpamAssassin 3.0.4 (2005-06-05) on mail.garage.freebsd.pl X-Spam-Level: X-Spam-Status: No, score=-5.9 required=3.0 tests=ALL_TRUSTED,BAYES_00 autolearn=ham version=3.0.4 Cc: freebsd-geom@freebsd.org Subject: Re: gmirror weird behaviour X-BeenThere: freebsd-geom@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GEOM-specific discussions and implementations List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 26 Oct 2007 07:58:23 -0000 --IS0zKkzwUGydFO0o Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Fri, Oct 26, 2007 at 12:02:40AM +0200, Mario Pranjic wrote: > Hi! >=20 > I have a system with 2 250 GB disks in mirror: > [root@haerbe ~]# gmirror status > Name Status Components > mirror/gmo COMPLETE ad0 > ad2 >=20 >=20 > system info: > FreeBSD haerbe.net 6.2-RELEASE-p7 FreeBSD 6.2-RELEASE-p7 #0: Wed Sep 5 > 06:24:11 CEST 2007 root@haerbe.net:/usr/obj/usr/src/sys/HAERBE i386 >=20 >=20 > At some point ad0 disk was detached from the system: > Oct 25 06:43:10 haerbe ad0: FAILURE - device detached > Oct 25 06:43:10 haerbe subdisk0: detached > Oct 25 06:43:10 haerbe ad0: detached > Oct 25 06:43:10 haerbe GEOM_MIRROR: Device gmo: provider ad0 disconnected. > Oct 25 06:43:10 haerbe g_vfs_done():mirror/gmos2e[READ(offset=3D105981870= 08, > length=3D16384)]error =3D 6 > Oct 25 06:43:10 haerbe g_vfs_done():mirror/gmos2e[READ(offset=3D302527447= 04, > length=3D16384)]error =3D 6 >=20 > Mirror status: > [root@haerbe /www/_backups/hrbwiki/2007-10-22]# gmirror status > Name Status Components > mirror/gmo DEGRADED ad2 > [root@haerbe /www/_backups/hrbwiki/2007-10-22]# gmirror list > Geom name: gmo > State: DEGRADED > Components: 2 > Balance: round-robin > Slice: 4096 > Flags: NONE > GenID: 0 > SyncID: 2 > ID: 3221909623 > Providers: > 1. Name: mirror/gmo > Mediasize: 250059349504 (233G) > Sectorsize: 512 > Mode: r8w8e10 > Consumers: > 1. Name: ad2 > Mediasize: 250059350016 (233G) > Sectorsize: 512 > Mode: r1w1e1 > State: ACTIVE > Priority: 0 > Flags: NONE > GenID: 0 > SyncID: 2 > ID: 1951693847 >=20 >=20 > I cleaned the mirror: > [root@haerbe]# gmirror forget gmo >=20 > [root@haerbe ~]# gmirror status > Name Status Components > mirror/gmo COMPLETE ad2 >=20 >=20 > After reboot the system is aware of both disks: > ad0: 238475MB at ata0-master UDMA100 > ad2: 238475MB at ata1-master UDMA100 >=20 >=20 > but after that: > GEOM_MIRROR: Cannot add disk ad2 to gmo (error=3D22). > Root mount waiting for: GMIRROR > Root mount waiting for: GMIRROR > Root mount waiting for: GMIRROR > GEOM_MIRROR: Force device gmo start due to timeout. > GEOM_MIRROR: Device gmo: provider ad0 activated. > GEOM_MIRROR: Device gmo: provider mirror/gmo launched. >=20 > And the system rebooted with ad0 in mirror and not ad2! > [root@haerbe ~]# gmirror list > Geom name: gmo > State: DEGRADED > Components: 2 > Balance: round-robin > Slice: 4096 > Flags: NONE > GenID: 0 > SyncID: 2 > ID: 3221909623 > Providers: > 1. Name: mirror/gmo > Mediasize: 250059349504 (233G) > Sectorsize: 512 > Mode: r8w8e10 > Consumers: > 1. Name: ad0 > Mediasize: 250059350016 (233G) > Sectorsize: 512 > Mode: r1w1e1 > State: ACTIVE > Priority: 0 > Flags: DIRTY > GenID: 0 > SyncID: 2 > ID: 2195860303 >=20 > [root@haerbe ~]# gmirror status > Name Status Components > mirror/gmo DEGRADED ad0 >=20 >=20 > Now I synced mirror again successfuly but this is weird. It should not be > happening. >=20 > Any ideas what is going on? No idea, but in the future could you add kern.geom.mirror.debug=3D1 to your /boot/loader.conf? This will tell us why you get EINVAL when gmirror tries to attach a disk. --=20 Pawel Jakub Dawidek http://www.wheel.pl pjd@FreeBSD.org http://www.FreeBSD.org FreeBSD committer Am I Evil? Yes, I Am! --IS0zKkzwUGydFO0o Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.4 (FreeBSD) iD8DBQFHIZ4EForvXbEpPzQRAs3zAKCzqptXExoKFaK1yVMpma1vmhQt4gCbBt46 wwN/me2COJXF9MRctg6nHYU= =Er+2 -----END PGP SIGNATURE----- --IS0zKkzwUGydFO0o-- From owner-freebsd-geom@FreeBSD.ORG Fri Oct 26 18:44:11 2007 Return-Path: Delivered-To: freebsd-geom@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id BBE0516A419 for ; Fri, 26 Oct 2007 18:44:11 +0000 (UTC) (envelope-from mario.pranjic@gmail.com) Received: from wa-out-1112.google.com (wa-out-1112.google.com [209.85.146.178]) by mx1.freebsd.org (Postfix) with ESMTP id 8F6D813C49D for ; Fri, 26 Oct 2007 18:44:11 +0000 (UTC) (envelope-from mario.pranjic@gmail.com) Received: by wa-out-1112.google.com with SMTP id k17so1151340waf for ; Fri, 26 Oct 2007 11:44:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; bh=30DXkX6jdx9WoRORFeeBIsfipOzPICckvZRD4pxRWSE=; b=NwOm4mAzgBCNUAO1qkfkd17nmpESTJIkv2Q4RJo/zfKjY49JfJ8or7Tt99dBa+4Js8SBLAgXO4gSGMLbjhF2gdOhXaUIWkt+k+0G0D2imXF73Z4BeJO12jskyQ9y6MXRbHm2Kq/IpEPCvhZ1MMPKhGryRFf0rMliNll0Qhn9SKo= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=BM08fOO47gncgUOrD54i+E/GFhiTdrs5g6K0SlXcfgc6QsQLXRx+Dgxd5+THb0ymqCeJRNg1nE7687Jxx1PIMJ0Zt2nmNdGMD1QiCHww7rQMZwEi5+meUEACWEmkNdETN46JiAAozJZUroNmzNVGhArUx2ChCX3kYCTpVBLAfFI= Received: by 10.114.200.2 with SMTP id x2mr3788609waf.1193424250793; Fri, 26 Oct 2007 11:44:10 -0700 (PDT) Received: by 10.114.13.14 with HTTP; Fri, 26 Oct 2007 11:44:10 -0700 (PDT) Message-ID: Date: Fri, 26 Oct 2007 20:44:10 +0200 From: "Mario Pranjic" To: "Pawel Jakub Dawidek" , freebsd-geom@freebsd.org In-Reply-To: MIME-Version: 1.0 References: <20071026075756.GA1219@garage.freebsd.pl> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: Subject: Re: gmirror weird behaviour X-BeenThere: freebsd-geom@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GEOM-specific discussions and implementations List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 26 Oct 2007 18:44:11 -0000 On 26/10/2007, Mario Pranjic wrote: > > > If problem reoccurs, I will do that and maybe I will have some more info > to share. I got the same situation half an hour ago: ad0 was detached from the system.I rebooted the machine and now ad2 was in mirror(as it should be when ad0 failed). The only thing different is the fact that I didn't do 'gmirror gmo forget' before reboot. Now, I'm syncing the mirror again (ad0 is visible again after reboot) and I set up geom debug as you suggested: sysctl kern.geom.mirror.debug=1 Now we will see what info I will get through syslog when ad0 fails again (it shouldn't be long now). However, in the mean time, I set-ups smartd and it registred ad0 problem: SMART error (FailedOpenDevice) detected on host Device: /dev/ad0, unable to open device At the moment there is no errors from smartd, both disks are visible and running normally. -- Mario. http://keeper.sharanet.org/ http://keeper.haerbe.net/ http://keeper.nymphomatic.org/ From owner-freebsd-geom@FreeBSD.ORG Fri Oct 26 19:23:22 2007 Return-Path: Delivered-To: freebsd-geom@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 7378716A41B for ; Fri, 26 Oct 2007 19:23:22 +0000 (UTC) (envelope-from mario.pranjic@gmail.com) Received: from wa-out-1112.google.com (wa-out-1112.google.com [209.85.146.181]) by mx1.freebsd.org (Postfix) with ESMTP id 4138513C4AA for ; Fri, 26 Oct 2007 19:23:22 +0000 (UTC) (envelope-from mario.pranjic@gmail.com) Received: by wa-out-1112.google.com with SMTP id k17so1162275waf for ; Fri, 26 Oct 2007 12:23:21 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; bh=Vl7D/ftx7XJ0SyMOFfEkUwWaH7a/AfgltTroR9P9Wd0=; b=mfD0iXEoZRqPyF3USrJs36fAKiQHVj4DAbZnM/kyvgDupGEEIxFIig07kerqvB7UhqaJGfcF/bdppjCPrHtDdzqlPbDyZr2V6xyhMu95D4C4JR6gpDPx5L3V0eg0dplzHSAqJGIYyylUwxrHwPgZJiv4gCChSTKbxQK/b5yaU9o= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=rwDKI16adMP6397+BmMRu+t3OeVxe++zk6VV6+dbddrqdYrPDCwEQ+IJUI7UGsuOYoIuGRh05IZ3PmouT4xy9qwbF41+88Z4ilLHU7WjoZ/cSHF70TCRks0R/xX2CiFlvD+Y2akI3U/3aFQfUdvZy0PkGya7KRY5H7aul42XndI= Received: by 10.114.148.1 with SMTP id v1mr3824250wad.1193426601733; Fri, 26 Oct 2007 12:23:21 -0700 (PDT) Received: by 10.114.13.14 with HTTP; Fri, 26 Oct 2007 12:23:21 -0700 (PDT) Message-ID: Date: Fri, 26 Oct 2007 21:23:21 +0200 From: "Mario Pranjic" To: "Pawel Jakub Dawidek" , freebsd-geom@freebsd.org In-Reply-To: MIME-Version: 1.0 References: <20071026075756.GA1219@garage.freebsd.pl> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: Subject: Re: gmirror weird behaviour X-BeenThere: freebsd-geom@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GEOM-specific discussions and implementations List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 26 Oct 2007 19:23:22 -0000 And it broke again (during mirror resync): Oct 26 21:18:28 haerbe ad0: TIMEOUT - WRITE_DMA48 retrying (1 retry left) LBA=273046528 Oct 26 21:18:28 haerbe ad0: FAILURE - WRITE_DMA48 status=51 error=10 LBA=273046528 Oct 26 21:18:28 haerbe GEOM_MIRROR[0]: Synchronization request failed (error=5). ad0[WRITE(offset=139799822336, length=131072)] Oct 26 21:18:28 haerbe GEOM_MIRROR[1]: Disk ad0 state changed from SYNCHRONIZING to DISCONNECTED (device gmo). Oct 26 21:18:28 haerbe GEOM_MIRROR[0]: Device gmo: provider ad0 disconnected. Oct 26 21:18:28 haerbe GEOM_MIRROR[0]: Device gmo: rebuilding provider ad0 stopped. Oct 26 21:18:28 haerbe GEOM_MIRROR[1]: Consumer mirror/gmo destroyed. Oct 26 21:18:28 haerbe GEOM_MIRROR[1]: Device gmo: genid bumped to 2. Oct 26 21:18:28 haerbe GEOM_MIRROR[1]: Consumer ad0 destroyed. I see this as a indicator that ad0 is a bad disk and need to be replaced, but maybe I'm mistaking. Any help is welcomed! Thnx! -- Mario. http://keeper.sharanet.org/ http://keeper.haerbe.net/ http://keeper.nymphomatic.org/