Date: Thu, 9 Aug 2018 01:16:47 +0300 From: Konstantin Belousov <kostikbel@gmail.com> To: Rick Macklem <rmacklem@uoguelph.ca> Cc: "freebsd-current@FreeBSD.org" <freebsd-current@FreeBSD.org>, peter@holm.cc Subject: Re: ffs_truncate3 panics Message-ID: <20180808221647.GH1884@kib.kiev.ua> In-Reply-To: <YTOPR0101MB18207C97903D3058A15091FFDD260@YTOPR0101MB1820.CANPRD01.PROD.OUTLOOK.COM> References: <YTOPR0101MB18206289DDED97BE9DD38D14DD270@YTOPR0101MB1820.CANPRD01.PROD.OUTLOOK.COM> <20180807131445.GC1884@kib.kiev.ua> <YTOPR0101MB18207C97903D3058A15091FFDD260@YTOPR0101MB1820.CANPRD01.PROD.OUTLOOK.COM>
next in thread | previous in thread | raw e-mail | index | archive | help
On Wed, Aug 08, 2018 at 12:30:54PM +0000, Rick Macklem wrote: > Konstantin Belousov wrote: > >On Tue, Aug 07, 2018 at 12:28:33PM +0000, Rick Macklem wrote: > >> Hi, > >> > >> During testing of the pNFS server I get an ffs_truncate3 panic every once in a while. > >> A few things that might be relevant: > >> - Seems to happen more often when soft update journaling is enabled, but will > >> happen when it is disabled. > >> - Normally happens when a fairly large subtree of the file system is being removed. > - Oh, and this is an old i386 with 256Mbytes (not one of them new fangled computers, > where memory is in Gbytes;-) > > >> > >> These file systems are a bit odd, since all the regular files in them are empty but > >> have extended attributes that are accessed during the subtree removal. (The > >> extended attributes tell the server where the data files are.) > >> > >> I replaced the panic() with a printf() and every time the printf() happens... > >> bo->bo_dirty.bv_cnt == 0 and bo->bo_clean.bv_cnt == 1. > >> After one of these printf()s, the system continues to run ok. When the file > >> system is fsck'd after this has occurred, it passes fine and I haven't seen and > >> indication of file system corruption after running with this file system for > >> quite a while after the printf()s first occurred. > >The lack of corruption is, most likely, because the files are removed. > >Would the files truncated to zero length and then extended, I am almost > >sure that a corruption occur. > > > >Can you print the only buffer on the clean queue when the panic occur ? > ffst3 vtyp=1 bodirty=0 boclean=1 > buf at 0x428a110 > b_flags = 0x20001020<vmio,reuse,cache>, b_xflags=0x2<clean>, b_vflags=0x0 > b_error = 0, b_bufsize = 4096, b_bcount = 4096, b_resid = 0 > b_bufobj = (0xfd8ba94), b_data = 0x5170000, b_blkno = -1, b_lblkno = -1, b_dep = 0 > b_kvabase = 0x5170000, b_kvasize = 32768 So the buffer was indeed for extended attrs, and never written to the disk. I am quite interested what was the inode content prior to the truncation, esp. the di_extsize. Could you try to formulate a way to reproduce the panic so that Peter can recreate it, please ? > > >Also, it is interesting to know the initial length of the file. > Since they are regular files, they are 0 length. (Just inodes with extended attributes.) > > >> > >> Since the panic() only occurs when "options INVARIANTS" is enabled and I don't > >> see evidence of file system corruption, I'm wondering if this panic() is valid and > >> needed? > > rick
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20180808221647.GH1884>