From owner-freebsd-stable@FreeBSD.ORG Fri Jul 19 13:52:52 2013 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by hub.freebsd.org (Postfix) with ESMTP id 06D9CBF8; Fri, 19 Jul 2013 13:52:52 +0000 (UTC) (envelope-from c.kworr@gmail.com) Received: from mail-lb0-x22c.google.com (mail-lb0-x22c.google.com [IPv6:2a00:1450:4010:c04::22c]) by mx1.freebsd.org (Postfix) with ESMTP id 5F81BB51; Fri, 19 Jul 2013 13:52:51 +0000 (UTC) Received: by mail-lb0-f172.google.com with SMTP id v20so3493511lbc.3 for ; Fri, 19 Jul 2013 06:52:50 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=message-id:date:from:user-agent:mime-version:to:cc:subject :references:in-reply-to:content-type:content-transfer-encoding; bh=bgM3m8esa5/XraltbhB5pSqLrutuTLAVM8S3aIkkBfg=; b=KlZVpCBoP95A/aIPcuUO1JZLZ/2cGbbxpOF25d6Rh1VcVSOMCm1yuqxTu2M1/AMDoQ RehASXurcEUlEyljT67n/Tfv3vzfaEVWEqR9oK1pCtPjwmWpS6ZEhbtADLjvwhesyA31 vhBWIpywJHLKqEmd1MWqI4QEA1w9PFOf1BYW+dGG8q/Vtx88M5POXVhSgH0/I26/1ta6 /3pHrOV3nFUnLUpPlmaEzXrQAcwahHTg46D3hzcIejfN+Nutfxy1zNsRAswbQCGgNAfz Vzq9JjKaAUNezbtogxdb8v9xjeBr1Xyh6klpn6KUN/52IQjq0jlat/EI/0A66WyIjkJW uNKg== X-Received: by 10.112.143.162 with SMTP id sf2mr7072684lbb.1.1374241970326; Fri, 19 Jul 2013 06:52:50 -0700 (PDT) Received: from [192.168.1.139] (mau.donbass.com. [92.242.127.250]) by mx.google.com with ESMTPSA id p7sm6145025lbi.15.2013.07.19.06.52.49 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Fri, 19 Jul 2013 06:52:49 -0700 (PDT) Message-ID: <51E944B0.5080409@gmail.com> Date: Fri, 19 Jul 2013 16:52:48 +0300 From: Volodymyr Kostyrko User-Agent: Mozilla/5.0 (X11; FreeBSD amd64; rv:17.0) Gecko/20130710 Thunderbird/17.0.7 MIME-Version: 1.0 To: olivier Subject: Re: 9.2PRERELEASE ZFS panic in lzjb_compress References: In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit Cc: "freebsd-stable@freebsd.org" , zfs-devel@freebsd.org X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 19 Jul 2013 13:52:52 -0000 19.07.2013 07:04, olivier wrote: > Hi, > Running 9.2-PRERELEASE #19 r253313 I got the following panic > > Fatal trap 12: page fault while in kernel mode > cpuid = 22; apic id = 46 > fault virtual address = 0xffffff827ebca30c > fault code = supervisor read data, page not present > instruction pointer = 0x20:0xffffffff81983055 > stack pointer = 0x28:0xffffffcf75bd60a0 > frame pointer = 0x28:0xffffffcf75bd68f0 > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, long 1, def32 0, gran 1 > processor eflags = interrupt enabled, resume, IOPL = 0 > current process = 0 (zio_write_issue_hig) > trap number = 12 > panic: page fault > cpuid = 22 > KDB: stack backtrace: > db_trace_self_wrapper() at db_trace_self_wrapper+0x2a/frame > 0xffffffcf75bd5b30 > kdb_backtrace() at kdb_backtrace+0x37/frame 0xffffffcf75bd5bf0 > panic() at panic+0x1ce/frame 0xffffffcf75bd5cf0 > trap_fatal() at trap_fatal+0x290/frame 0xffffffcf75bd5d50 > trap_pfault() at trap_pfault+0x211/frame 0xffffffcf75bd5de0 > trap() at trap+0x344/frame 0xffffffcf75bd5fe0 > calltrap() at calltrap+0x8/frame 0xffffffcf75bd5fe0 > --- trap 0xc, rip = 0xffffffff81983055, rsp = 0xffffffcf75bd60a0, rbp = > 0xffffffcf75bd68f0 --- > lzjb_compress() at lzjb_compress+0x185/frame 0xffffffcf75bd68f0 > zio_compress_data() at zio_compress_data+0x92/frame 0xffffffcf75bd6920 > zio_write_bp_init() at zio_write_bp_init+0x24b/frame 0xffffffcf75bd6970 > zio_execute() at zio_execute+0xc3/frame 0xffffffcf75bd69b0 > taskqueue_run_locked() at taskqueue_run_locked+0x74/frame 0xffffffcf75bd6a00 > taskqueue_thread_loop() at taskqueue_thread_loop+0x46/frame > 0xffffffcf75bd6a20 > fork_exit() at fork_exit+0x11f/frame 0xffffffcf75bd6a70 > fork_trampoline() at fork_trampoline+0xe/frame 0xffffffcf75bd6a70 > --- trap 0, rip = 0, rsp = 0xffffffcf75bd6b30, rbp = 0 --- > > lzjb_compress+0x185 corresponds to line 85 in > 80 cpy = src - offset; > 81 if (cpy >= (uchar_t *)s_start && cpy != src && > 82 src[0] == cpy[0] && src[1] == cpy[1] && src[2] == cpy[2]) { > 83 *copymap |= copymask; > 84 for (mlen = MATCH_MIN; mlen < MATCH_MAX; mlen++) > 85 if (src[mlen] != cpy[mlen]) > 86 break; > 87 *dst++ = ((mlen - MATCH_MIN) << (NBBY - MATCH_BITS)) | > 88 (offset >> NBBY); > 89 *dst++ = (uchar_t)offset; > > I think it's the first time I've seen this panic. It happened while doing a > send/receive. I have two pools with lzjb compression; I don't know which of > these pools caused the problem, but one of them was the source of the > send/receive. > > I only have a textdump but I'm happy to try to provide more information > that could help anyone look into this. > Thanks > Olivier Oh, I can add to this one. I have a full core dump of the same problem caused by copying large set of files from lzjb compressed pool to lz4 compressed pool. vfs.zfs.recover was set. #1 0xffffffff8039d954 in kern_reboot (howto=260) at /usr/src/sys/kern/kern_shutdown.c:449 #2 0xffffffff8039ddce in panic (fmt=) at /usr/src/sys/kern/kern_shutdown.c:637 #3 0xffffffff80620a6a in trap_fatal (frame=, eva=) at /usr/src/sys/amd64/amd64/trap.c:879 #4 0xffffffff80620d25 in trap_pfault (frame=0x0, usermode=0) at /usr/src/sys/amd64/amd64/trap.c:700 #5 0xffffffff806204f6 in trap (frame=0xffffff821ca43600) at /usr/src/sys/amd64/amd64/trap.c:463 #6 0xffffffff8060a032 in calltrap () at /usr/src/sys/amd64/amd64/exception.S:232 #7 0xffffffff805a9367 in vm_page_alloc (object=0xffffffff80a34030, pindex=16633, req=97) at /usr/src/sys/vm/vm_page.c:1445 #8 0xffffffff8059c42e in kmem_back (map=0xfffffe00010000e8, addr=18446743524021862400, size=16384, flags=) at /usr/src/sys/vm/vm_kern.c:362 #9 0xffffffff8059c2ac in kmem_malloc (map=0xfffffe00010000e8, size=16384, flags=257) at /usr/src/sys/vm/vm_kern.c:313 #10 0xffffffff80595104 in uma_large_malloc (size=, wait=257) at /usr/src/sys/vm/uma_core.c:994 #11 0xffffffff80386b80 in malloc (size=16384, mtp=0xffffffff80ea7c40, flags=0) at /usr/src/sys/kern/kern_malloc.c:492 #12 0xffffffff80c9e13c in lz4_compress (s_start=0xffffff80d0b19000, d_start=0xffffff8159445000, s_len=131072, d_len=114688, n=-2) at /usr/src/sys/modules/zfs/../../cddl/contrib/opensolaris/uts/common/fs/zfs/lz4.c:843 #13 0xffffffff80cdde25 in zio_compress_data (c=, src=, dst=0xffffff8159445000, s_len=131072) at /usr/src/sys/modules/zfs/../../cddl/contrib/opensolaris/uts/common/fs/zfs/zio_compress.c:109 #14 0xffffffff80cda012 in zio_write_bp_init (zio=0xfffffe0143a12000) at /usr/src/sys/modules/zfs/../../cddl/contrib/opensolaris/uts/common/fs/zfs/zio.c:1107 #15 0xffffffff80cd8ec6 in zio_execute (zio=0xfffffe0143a12000) at /usr/src/sys/modules/zfs/../../cddl/contrib/opensolaris/uts/common/fs/zfs/zio.c:1305 #16 0xffffffff803e25e6 in taskqueue_run_locked (queue=0xfffffe00060ca300) at /usr/src/sys/kern/subr_taskqueue.c:312 #17 0xffffffff803e2e38 in taskqueue_thread_loop (arg=) at /usr/src/sys/kern/subr_taskqueue.c:501 #18 0xffffffff8036f40a in fork_exit ( callout=0xffffffff803e2da0 , arg=0xfffffe00060cc3d0, frame=0xffffff821ca43a80) at /usr/src/sys/kern/kern_fork.c:988 #19 0xffffffff8060a56e in fork_trampoline () at /usr/src/sys/amd64/amd64/exception.S:606 I have a full crash dump in case someone wants to look at it. -- Sphinx of black quartz, judge my vow.