Bad-quality code passing union as argument on x86_64 #3971

efriedma-quic · 2009-02-17T06:36:00Z


Bugzilla Link	3599
Resolution	FIXED
Resolved on	Feb 17, 2009 14:48
Version	unspecified
OS	Linux

Extended Description

union UGeckoInstruction
{
unsigned hex;
};
unsigned a(union UGeckoInstruction a) {return a.hex;}

Running this through clang -emit-llvm -O2 -arch=x86_64 gives the following:
; ModuleID = '-'
target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-f80:128:128"
target triple = "x86_64-pc-linux-gnu"

define i32 @a(i64) nounwind readnone {
entry:
%tmp5 = trunc i64 %0 to i8 ; [#uses=1]
%tmp7 = lshr i64 %0, 8 ; [#uses=1]
%tmp8 = trunc i64 %tmp7 to i8 ; [#uses=1]
%tmp10 = lshr i64 %0, 16 ; [#uses=1]
%tmp11 = trunc i64 %tmp10 to i8 ; [#uses=1]
%tmp13 = lshr i64 %0, 24 ; [#uses=1]
%tmp14 = trunc i64 %tmp13 to i8 ; [#uses=1]
%1 = zext i8 %tmp5 to i32 ; [#uses=1]
%2 = zext i8 %tmp8 to i32 ; [#uses=1]
%3 = shl i32 %2, 8 ; [#uses=1]
%4 = zext i8 %tmp11 to i32 ; [#uses=1]
%5 = shl i32 %4, 16 ; [#uses=1]
%6 = zext i8 %tmp14 to i32 ; [#uses=1]
%7 = shl i32 %6, 24 ; [#uses=1]
%8 = or i32 %7, %1 ; [#uses=1]
%9 = or i32 %8, %5 ; [#uses=1]
%10 = or i32 %9, %3 ; [#uses=1]
ret i32 %10
}

This is rather silly. I'm not sure whether to blame clang or the LLVM optimizers for this result, though.
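
To make the redundancy concrete, here is a rough C sketch of what the IR above computes. It is not taken from the thread, and the function names `reassemble_bytes` and `truncate_arg` are made up for illustration. The first function follows the IR step by step (extract the four low bytes of the 64-bit argument, then OR them back together); the second is the single truncation it is equivalent to.

```c
#include <stdint.h>

/* Hypothetical helper mirroring the unoptimized IR: pull the low four
   bytes out of the 64-bit coerced argument (trunc / lshr+trunc), widen
   each to 32 bits (zext), shift it back into place (shl) and OR the
   pieces together in the same order as the IR. */
uint32_t reassemble_bytes(uint64_t arg) {
    uint8_t b0 = (uint8_t)arg;
    uint8_t b1 = (uint8_t)(arg >> 8);
    uint8_t b2 = (uint8_t)(arg >> 16);
    uint8_t b3 = (uint8_t)(arg >> 24);

    uint32_t r = ((uint32_t)b3 << 24) | (uint32_t)b0;
    r |= (uint32_t)b2 << 16;
    r |= (uint32_t)b1 << 8;
    return r;
}

/* Hypothetical helper: the one-instruction form the optimizer should
   reach, a plain truncation of the 64-bit argument to 32 bits. */
uint32_t truncate_arg(uint64_t arg) {
    return (uint32_t)arg;
}
```

Both functions return the low 32 bits of their argument, so for any input `x`, `reassemble_bytes(x) == truncate_arg(x)`.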

lattner · 2009-02-17T06:57:09Z

llvm-gcc compiles this into a nice and simple:
target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128"
target triple = "x86_64-apple-darwin7"

define i32 @a(i32 %a.0) nounwind readnone {
entry:
ret i32 %a.0
}

:)

llvmbot · 2009-02-17T08:29:55Z

Ugh.

clang could certainly try harder to match to better types when coercing, but I'm not sure how far we should really go down this road.

My feeling is that in this case we should treat this as a missed IR optimization. The cases where I think clang is clearly at fault are those where the mid-level IR passes do not have enough information to know that there is a nicer way to handle the argument (for example, choosing <4 x float> over <2 x double> in some cases).

lattner · 2009-02-17T09:00:56Z

instcombine can help with this.

llvmbot · 2009-02-17T09:20:00Z

With the attached patch, I get this:

ddunbar@ozzy:CodeGen$ xcc -emit-llvm -S -o - t4.c | llvm-as | opt -std-compile-opts | llvm-dis
; ModuleID = ''
target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-f80:128:128"
target triple = "x86_64-apple-darwin10.0"

define i32 @a(i64) nounwind readnone {
entry:
%1 = trunc i64 %0 to i32 ; [#uses=1]
ret i32 %1
}

llvmbot · 2009-02-17T09:20:43Z

instcombine patch

lattner · 2009-02-17T22:47:57Z

Simplified version of the patch committed here:
http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20090216/073864.html

We're now down to:

define i32 @a(i64 %tmp) nounwind readnone {
entry:
%tmp28 = trunc i64 %tmp to i32 ; [#uses=1]
%tmp1 = and i32 %tmp28, 255 ; [#uses=1]
%tmp2 = trunc i64 %tmp to i32 ; [#uses=1]
%tmp3 = and i32 %tmp2, 65280 ; [#uses=1]
%tmp4 = trunc i64 %tmp to i32 ; [#uses=1]
%tmp6 = and i32 %tmp4, 16711680 ; [#uses=1]
%tmp21 = trunc i64 %tmp to i32 ; [#uses=1]
%tmp12 = and i32 %tmp21, -16777216 ; [#uses=1]
%tmp15 = or i32 %tmp12, %tmp1 ; [#uses=1]
%tmp16 = or i32 %tmp15, %tmp6 ; [#uses=1]
%tmp17 = or i32 %tmp16, %tmp3 ; [#uses=1]
ret i32 %tmp17
}

lattner · 2009-02-17T22:48:56Z

Ah, with -std-compile-opts and that patch, this is completely simplified. Instcombine needs CSE to see the equivalence of the truncates.
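
A small C sketch of why CSE matters for the intermediate IR above; this is an illustration written for this note, not the committed patch, and the names `masked_or` and `combined_mask` are hypothetical. Once the four identical truncates are merged into one value, the four masks are all applied to that same value, and since the masks together cover every bit of an i32 the OR chain reduces to the truncate itself.

```c
#include <stdint.h>

/* Hypothetical helper mirroring the intermediate IR after the four
   identical truncs have been merged into one value `t`. */
uint32_t masked_or(uint64_t tmp) {
    uint32_t t = (uint32_t)tmp;          /* trunc i64 %tmp to i32 */
    uint32_t byte0 = t & 255u;           /* and i32 ..., 255 */
    uint32_t byte1 = t & 65280u;         /* and i32 ..., 65280 */
    uint32_t byte2 = t & 16711680u;      /* and i32 ..., 16711680 */
    uint32_t byte3 = t & 0xFF000000u;    /* and i32 ..., -16777216 */
    return byte3 | byte0 | byte2 | byte1;
}

/* The four masks are disjoint and together select all 32 bits, so
   (t & m0) | (t & m1) | (t & m2) | (t & m3) is just t. */
uint32_t combined_mask(void) {
    return 255u | 65280u | 16711680u | 0xFF000000u;
}
```

Here `combined_mask()` evaluates to `0xFFFFFFFF`, which is why the whole expression can fold to a single `trunc` once the optimizer knows all four operands are the same value.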

llvmbot · 2021-11-27T04:43:38Z

mentioned in issue llvm/llvm-bugzilla-archive#5167

llvmbot transferred this issue from llvm/llvm-bugzilla-archive Dec 3, 2021

This issue was closed.
