Date: Thu, 30 Mar 2017 19:54:18 +0000 From: bugzilla-noreply@freebsd.org To: freebsd-bugs@FreeBSD.org Subject: [Bug 218203] Implement AVX2 accelerated Fletcher algorithms Message-ID: <bug-218203-8-jrN8pllyYD@https.bugs.freebsd.org/bugzilla/> In-Reply-To: <bug-218203-8@https.bugs.freebsd.org/bugzilla/> References: <bug-218203-8@https.bugs.freebsd.org/bugzilla/>
next in thread | previous in thread | raw e-mail | index | archive | help
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D218203 --- Comment #4 from Adam Stylinski <kungfujesus06@gmail.com> --- Created attachment 181326 --> https://bugs.freebsd.org/bugzilla/attachment.cgi?id=3D181326&action= =3Dedit SSE4 implementation Also not as impactful, but definitely measurable is an SSE4 variant. This = will actually run on Atoms and a much larger number of other pre-Haswell CPUs.= =20=20 The ZFS On Linux version does 8 padds per loop iteration because they are avoiding the pmovzxdq instruction. This is probably to be compatible with SSSE3 and not require SSE4. --=20 You are receiving this mail because: You are the assignee for the bug.=
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?bug-218203-8-jrN8pllyYD>