From owner-freebsd-stable@FreeBSD.ORG Mon Sep 29 21:05:13 2014 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 7A04CE93 for ; Mon, 29 Sep 2014 21:05:13 +0000 (UTC) Received: from mout.gmx.net (mout.gmx.net [212.227.17.20]) (using TLSv1.2 with cipher DHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client CN "mout.gmx.net", Issuer "TeleSec ServerPass DE-1" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 1181381C for ; Mon, 29 Sep 2014 21:05:12 +0000 (UTC) Received: from walrus.pepperland ([81.217.76.60]) by mail.gmx.com (mrgmx102) with ESMTPSA (Nemesis) id 0MVJze-1XlLD60GTu-00YgDL; Mon, 29 Sep 2014 23:05:10 +0200 Message-ID: <5429C985.4090806@gmx.net> Date: Mon, 29 Sep 2014 23:05:09 +0200 From: Stefan Ehmann User-Agent: Mozilla/5.0 (X11; FreeBSD amd64; rv:31.0) Gecko/20100101 Thunderbird/31.1.2 MIME-Version: 1.0 To: Mark Felder Subject: Re: xzgrep: incomplete results on larger files References: <541DE9FC.2090003@gmx.net> <1412000043.1250852.172976197.0B46058D@webmail.messagingengine.com> In-Reply-To: <1412000043.1250852.172976197.0B46058D@webmail.messagingengine.com> Content-Type: text/plain; charset=windows-1252; format=flowed Content-Transfer-Encoding: 7bit X-Provags-ID: V03:K0:N5ck9Jf7JoON5HZispTppUb3oqHbDS0esRn+59nkfnxW7KdOouX dnti3KRPqVJv3G/Ivf0ugGIYH3v/V4/HqWC2VPRGqgREjNvAHeyRGgRIIJr7ieV/WZ7CvR7 NKZnBHDPJpXtiUrZRYtlk2sumddVfeK8d3Eq/r0lCuBcLz4OfZRpchLwmtoiTLuadDkspdq PkZRnE9utjVnxuYa1DskQ== X-UI-Out-Filterresults: notjunk:1; Cc: freebsd-stable@freebsd.org, gabor@FreeBSD.org X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.18-1 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 29 Sep 2014 21:05:13 -0000 On 29.09.2014 16:14, Mark Felder wrote: > > > On Sat, Sep 20, 2014, at 15:56, Stefan Ehmann wrote: >> I observed the following behavior on 10.1-BETA1 r271683M (amd64): >> >> xzgrep doesn't search the complete file: >> $ seq 10000 | xz > seq.xz >> $ xzgrep -c . seq.xz >> 6775 >> >> Using regular grep works as expected: >> $ xzcat seq.xz | grep -c . >> 10000 >> >> Processing seems to stop after 32KB (uncompressed). >> > > Wow, this is bizarre... Compression with xz is getting more and more > popular. This may have bit me a few times and I didn't even know it! > > I'll see what I can to do bring this to someone's attention. I've looked further into this issue. The original approach could never have worked since the xz library was used incorrectly. I came up with this patch today: https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=186861 It's getting late. So it's not tested very much and might contain bugs. -- Stefan