From owner-freebsd-stable@FreeBSD.ORG Fri Aug 30 22:49:35 2013 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTP id 57B17B31; Fri, 30 Aug 2013 22:49:35 +0000 (UTC) (envelope-from zbeeble@gmail.com) Received: from mail-ve0-x230.google.com (mail-ve0-x230.google.com [IPv6:2607:f8b0:400c:c01::230]) (using TLSv1 with cipher ECDHE-RSA-RC4-SHA (128/128 bits)) (No client certificate requested) by mx1.freebsd.org (Postfix) with ESMTPS id E3F6124AF; Fri, 30 Aug 2013 22:49:34 +0000 (UTC) Received: by mail-ve0-f176.google.com with SMTP id b10so1758749vea.7 for ; Fri, 30 Aug 2013 15:49:34 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=+Pnx3j+8tWRjEtV42HZJDDTecoCy/+GCjWmPmpx4iSw=; b=QBnlw7TjGo3kFH5XW7kIoAzjwCIaQ+J3xGDaXupuPJW1jpbmB/KTG9UGSU7ydjnbQs JYHCWdCXTvUISAKVR7UG+2hznwF1lnEw+2EylGVVlcXa/IYKGUVMrM8zK2OC7i+um9Im Sc/eO6OpPf/UXFu6UMb4mIKOYTVrmcI2Yvuncl2Gd0+4noLUQkeHKyIxZit4amqjQq70 rxAsy1rApZ+058X92CyXQcvRHKorfnmkUFxw2+suYpeZM3pdtnEufvK2N8AYJRcNwn0h KsokHBMRm6aIgig38efhqzrSYxeMTWCCJy22CgFgsFkgCleQpTZrQ8K/ABa3hqIjn9ra FGIw== MIME-Version: 1.0 X-Received: by 10.52.117.68 with SMTP id kc4mr8007274vdb.0.1377902974054; Fri, 30 Aug 2013 15:49:34 -0700 (PDT) Received: by 10.220.122.1 with HTTP; Fri, 30 Aug 2013 15:49:33 -0700 (PDT) In-Reply-To: References: <370A25C8-7747-4B96-A506-EB92FD0F77CF@FreeBSD.org> <1377895898.1111.341.camel@revolution.hippie.lan> Date: Fri, 30 Aug 2013 18:49:33 -0400 Message-ID: Subject: Re: gmirror crash writing to disk? Or is it su+j crash? From: Zaphod Beeblebrox To: Ian Lepore Content-Type: text/plain; charset=ISO-8859-2 Content-Transfer-Encoding: quoted-printable X-Content-Filtered-By: Mailman/MimeDel 2.1.14 Cc: FreeBSD Stable , =?ISO-8859-2?Q?Edward_Tomasz_Napiera=B3a?= X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 30 Aug 2013 22:49:35 -0000 Because someone said that there would be no logging of unerlying ATA errors without verbose, I rebooted with verbose and tried the same make -j4 again... and here is the relatively similar core.txt.5 https://uk.eicat.ca/owncloud/public.php?service=3Dfiles&t=3Dd99648ef5876b91= c5957148445e60c87 Looking at it, gmirror is dropping the same error and the underlying hardware is not causing the error... On Fri, Aug 30, 2013 at 6:09 PM, Zaphod Beeblebrox wrote= : > My bad. New link for the core.txt.4: > > > https://uk.eicat.ca/owncloud/public.php?service=3Dfiles&t=3Df471e5afae483= 342cd20dc390e9c2dd7 > > > > > On Fri, Aug 30, 2013 at 4:51 PM, Ian Lepore wrote: > >> On Fri, 2013-08-30 at 21:50 +0200, Edward Tomasz Napiera=B3a wrote: >> > Wiadomo=B6=E6 napisana przez Zaphod Beeblebrox w d= niu >> 29 sie 2013, o godz. 23:35: >> > > So I have a system running: >> > > >> > > FreeBSD walk.dclg.ca 9.2-RC3 FreeBSD 9.2-RC3 # r254952: Wed Aug 28 >> 03:02:55 >> > > EDT 2013 root@walk.dclg.ca:/usr/obj/usr/src/sys/STRIKE i386 >> > > >> > > and it has two 2T SATA disks. To keep this post short, the crash.tx= t >> is >> > > here. >> > > >> > > >> https://uk.eicat.ca/owncloud/public.php?service=3Dfiles&t=3Dfea9d25579fe= 0c4afb808859e80e1493 >> > >> > Login error. >> > >> > > now curiously, while running a "make -j4 buildkernel" ... almost >> every time >> > > ... it crashes with: >> > > >> > > g_vfs_done():mirror/walke[WRITE(offset=3D516764794880, >> length=3D65536)]error =3D >> > > 11 >> > > /usr: got error 11 while accessing filesystem >> > > panic: softdep_deallocate_dependencies: unrecovered I/O error >> > >> > This is softupdates panic caused by write operation returning error 11= , >> which, >> > according to 'man errno', is EDEADLK. >> > >> > To be honest, I have no idea why gmirror might be returning this error= . >> > >> > > ... no error report from the hard drives, simply an error report fro= m >> the >> > > mirror. >> > >> > Note that ahci(4) does not log errors unless you're running with >> bootverbose. >> > >> > > The filesystem is ufs with su+j... but I'm not sure this matters her= e. >> > >> > It does, kind of - without soft updates/SUJ, the error would be >> non-fatal - it >> > wouldn't panic the box, but it would (probably) cause data corruption. >> >> One of the few places in the kernel that uses EDEADLK is in geom_io.c >> (line 642 in -current) in g_io_transient_map_bio()... >> >> g_io_deliver(bp, EDEADLK/* XXXKIB */); >> >> -- Ian >> >> >> >