Failure to vectorize @llvm.lrint.i64.f32 #55208

RKSimon · 2022-05-01T11:13:08Z

Pulled out of Issue #55202

#include <cmath>
void testrint( const float * __restrict arg, float *out) {
    *out++ = std::rint( *arg++ );
    *out++ = std::rint( *arg++ );
    *out++ = std::rint( *arg++ );
    *out++ = std::rint( *arg++ );
}
void testlrint( const float * __restrict arg, long *out) {
    *out++ = std::lrint( *arg++ );
    *out++ = std::lrint( *arg++ );
    *out++ = std::lrint( *arg++ );
    *out++ = std::lrint( *arg++ );
}

while rint gets vectorized, lrint doesn't:

testrint(float const*, float*):                       # @testrint(float const*, float*)
        vroundps        $4, (%rdi), %xmm0
        vmovups %xmm0, (%rsi)
        retq
testlrint(float const*, long*):                      # @testlrint(float const*, long*)
        vcvtss2si       (%rdi), %rax
        movq    %rax, (%rsi)
        vcvtss2si       4(%rdi), %rax
        movq    %rax, 8(%rsi)
        vcvtss2si       8(%rdi), %rax
        movq    %rax, 16(%rsi)
        vcvtss2si       12(%rdi), %rax
        movq    %rax, 24(%rsi)
        retq

The text was updated successfully, but these errors were encountered:

The issue llvm#55208 describes a current deficiency of the SLPVectorizer, namely that it doesn't vectorize code written with lrint, while similar code written with rint is vectorized. Add a test corresponding to this issue for the RISC-V target.

The issue #55208 describes a current deficiency of the SLPVectorizer, namely that it doesn't vectorize code written with lrint, while similar code written with rint is vectorized. Add a test corresponding to this issue for the RISC-V target.

The issue llvm#55208 noticed that std::rint is vectorized by the SLPVectorizer, but a very similar function, std::lrint, is not. std::lrint corresponds to ISD::LRINT in the SelectionDAG, and std::llrint is a familiar cousin corresponding to ISD::LLRINT. Now, neither ISD::LRINT nor ISD::LLRINT have a corresponding vector variant, and the LangRef makes this clear in the documentation of llvm.lrint.* and llvm.llrint.*. This patch extends the LangRef to include vector variants of llvm.lrint.* and llvm.llrint.*, and lays the necessary ground-work of scalarizing it for all targets. However, this patch would be devoid of motivation unless we show the utility of these new vector variants. Hence, the RISCV target has been chosen to implement a custom lowering to the vfcvt.x.f.v instruction. The patch also includes a CostModel for RISCV, and a trivial follow-up can potentially enable the SLPVectorizer to vectorize std::lrint and std::llrint, fixing llvm#55208. The patch includes tests, obviously for the RISCV target, but also for the X86, AArch64, and PowerPC targets to justify the addition of the vector variants to the LangRef.

…#66924) The issue #55208 noticed that std::rint is vectorized by the SLPVectorizer, but a very similar function, std::lrint, is not. std::lrint corresponds to ISD::LRINT in the SelectionDAG, and std::llrint is a familiar cousin corresponding to ISD::LLRINT. Now, neither ISD::LRINT nor ISD::LLRINT have a corresponding vector variant, and the LangRef makes this clear in the documentation of llvm.lrint.* and llvm.llrint.*. This patch extends the LangRef to include vector variants of llvm.lrint.* and llvm.llrint.*, and lays the necessary ground-work of scalarizing it for all targets. However, this patch would be devoid of motivation unless we show the utility of these new vector variants. Hence, the RISCV target has been chosen to implement a custom lowering to the vfcvt.x.f.v instruction. The patch also includes a CostModel for RISCV, and a trivial follow-up can potentially enable the SLPVectorizer to vectorize std::lrint and std::llrint, fixing #55208. The patch includes tests, obviously for the RISCV target, but also for the X86, AArch64, and PowerPC targets to justify the addition of the vector variants to the LangRef.

With the recent change 98c90a1 (ISel: introduce vector ISD::LRINT, ISD::LLRINT; custom RISCV lowering), it is now possible for SLPVectorizer, LoopVectorize, and Scalarizer to operate on llvm.lrint and llvm.llrint, with vector codegen for the RISC-V target. Make a trivial change to VectorUtils, and update the corresponding tests. A couple of important fixes have been landed since the original patch was landed and reverted, and it is now safe to re-land the patch: 5e1d81a (LegalizeIntegerTypes: implement PromoteIntRes for xrint) and fd887a3 (LegalizeVectorTypes: fix bug in widening of vec result in xrint). See also llvm#71399 which proves that lrint and llrint will indeed produce vector codegen on RISC-V. Fixes llvm#55208.

With the recent change 98c90a1 (ISel: introduce vector ISD::LRINT, ISD::LLRINT; custom RISCV lowering), it is now possible for SLPVectorizer, LoopVectorize, and Scalarizer to operate on llvm.lrint and llvm.llrint, with vector codegen for the RISC-V target. Make a trivial change to VectorUtils, and update the corresponding tests. A couple of important fixes have been landed since the original patch was landed and reverted, and it is now safe to re-land the patch: 5e1d81a (LegalizeIntegerTypes: implement PromoteIntRes for xrint) and fd887a3 (LegalizeVectorTypes: fix bug in widening of vec result in xrint). See also #71399, which proves that lrint and llrint will indeed produce vector codegen on RISC-V. Fixes #55208.

RKSimon added llvm:codegen vectorizers llvm:SLPVectorizer labels May 1, 2022

artagnon mentioned this issue Sep 7, 2023

SLP/RISCV: add negative test for lrint (#55208) #65611

Merged

michaelrj-google mentioned this issue Sep 12, 2023

[libc] Move long double table option to new config #66151

Merged

artagnon mentioned this issue Sep 20, 2023

ISel: introduce vector ISD::LRINT, ISD::LLRINT; custom RISCV lowering #66924

Merged

artagnon mentioned this issue Oct 23, 2023

VectorUtils: mark lrint, llrint as trivially vectorizable #69945

Merged

artagnon mentioned this issue Nov 6, 2023

Reland "VectorUtils: mark xrint as trivially vectorizable" #71416

Merged

artagnon closed this as completed in #71416 Nov 6, 2023

EugeneZelenko added llvm:analysis and removed llvm:codegen vectorizers llvm:SLPVectorizer labels Nov 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Failure to vectorize @llvm.lrint.i64.f32 #55208

Failure to vectorize @llvm.lrint.i64.f32 #55208

RKSimon commented May 1, 2022 •

edited by VoltrexKeyva

Loading

Failure to vectorize @llvm.lrint.i64.f32 #55208

Failure to vectorize @llvm.lrint.i64.f32 #55208

Comments

RKSimon commented May 1, 2022 • edited by VoltrexKeyva Loading

RKSimon commented May 1, 2022 •

edited by VoltrexKeyva

Loading