Date: Mon, 26 Sep 2022 08:20:15 -0600 From: Warner Losh <imp@bsdimp.com> To: Lev Serebryakov <lev@freebsd.org> Cc: FreeBSD Current <freebsd-current@freebsd.org> Subject: Re: Did clang 14 lose some intrinsics support? Message-ID: <CANCZdfqnAFYCvg8k88peh2bKB%2B9G9GX1BCQvhUoggYcJL0Z0Aw@mail.gmail.com> In-Reply-To: <8cd08b95-396a-b8a3-e33a-b0482a7f23e5@FreeBSD.org> References: <YzCL/%2BUEhLy7kHaL@lorvorc.mips.inka.de> <zgen-6zwv-wny@FreeBSD.org> <1A903FD8-D904-4B91-ABC4-2F704F0E2CF4@FreeBSD.org> <YzDKQsfciGd%2BJBQ5@lorvorc.mips.inka.de> <430A830E-3473-4EF4-9605-039F8254999C@FreeBSD.org> <8cd08b95-396a-b8a3-e33a-b0482a7f23e5@FreeBSD.org>
index | next in thread | previous in thread | raw e-mail
[-- Attachment #1 --] On Mon, Sep 26, 2022, 7:54 AM Lev Serebryakov <lev@freebsd.org> wrote: > On 26.09.2022 13:03, Dimitry Andric wrote: > > > Sure, but if you are compiling without -mavx, why would you want the AVX > > intrinsics? You cannot use AVX intrinsics anyway, if AVX is not enabled. > Because autovectorization (generation of SSE or AVX instructions by > compiler itself, without intrinsics) can pessimize code. > > Sometimes it is valuable to know exactly where AVX is used. I don't > have examples on hands, but I've seen situations, when autovectorized code > was much slower than scalar code. > The detection method that dim@ outline will work just fine for the autodetect script. It just replaces the internal, charging undocumented names for standard ones. How you later compile individual bits of code is orthogonal. Warner > [-- Attachment #2 --] <div dir="auto"><div><br><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Mon, Sep 26, 2022, 7:54 AM Lev Serebryakov <<a href="mailto:lev@freebsd.org">lev@freebsd.org</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">On 26.09.2022 13:03, Dimitry Andric wrote:<br> <br> > Sure, but if you are compiling without -mavx, why would you want the AVX<br> > intrinsics? You cannot use AVX intrinsics anyway, if AVX is not enabled.<br> Because autovectorization (generation of SSE or AVX instructions by compiler itself, without intrinsics) can pessimize code.<br> <br> Sometimes it is valuable to know exactly where AVX is used. I don't have examples on hands, but I've seen situations, when autovectorized code was much slower than scalar code.<br></blockquote></div></div><div dir="auto"><br></div><div dir="auto">The detection method that dim@ outline will work just fine for the autodetect script. It just replaces the internal, charging undocumented names for standard ones.</div><div dir="auto"><br></div><div dir="auto">How you later compile individual bits of code is orthogonal. </div><div dir="auto"><br></div><div dir="auto">Warner</div><div dir="auto"><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"> </blockquote></div></div></div>help
Want to link to this message? Use this
URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CANCZdfqnAFYCvg8k88peh2bKB%2B9G9GX1BCQvhUoggYcJL0Z0Aw>
