Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 30 Mar 2017 19:54:18 +0000
From:      bugzilla-noreply@freebsd.org
To:        freebsd-bugs@FreeBSD.org
Subject:   [Bug 218203] Implement AVX2 accelerated Fletcher algorithms
Message-ID:  <bug-218203-8-jrN8pllyYD@https.bugs.freebsd.org/bugzilla/>
In-Reply-To: <bug-218203-8@https.bugs.freebsd.org/bugzilla/>
References:  <bug-218203-8@https.bugs.freebsd.org/bugzilla/>

next in thread | previous in thread | raw e-mail | index | archive | help

https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=218203

--- Comment #4 from Adam Stylinski <kungfujesus06@gmail.com> ---
Created attachment 181326
  --> https://bugs.freebsd.org/bugzilla/attachment.cgi?id=181326&action=edit
SSE4 implementation

Also not as impactful, but definitely measurable is an SSE4 variant.  This will
actually run on Atoms and a much larger number of other pre-Haswell CPUs.  

The ZFS On Linux version does 8 padds per loop iteration because they are
avoiding the pmovzxdq instruction.  This is probably to be compatible with
SSSE3 and not require SSE4.

-- 
You are receiving this mail because:
You are the assignee for the bug.


Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?bug-218203-8-jrN8pllyYD>