Improve optimizations for boolean values #15464

dotdash · 2014-07-05T22:44:13Z

LLVM doesn't handle i1 values in allocas/memory very well and skips a number of optimizations when it encounters them. So we have to do the same thing that Clang does: use i1 for SSA values, but store i8 in memory.

Fixes #15203.

lilyball · 2014-07-05T22:51:48Z

I am not qualified to evaluate this patch, but do you have any benchmarks that demonstrate an improvement?

dotdash · 2014-07-05T22:55:01Z

See the test case given in #15203. (I only had the "Fixes" on a commit; I've now added it to the PR description.)

Before:

running 1 test
test bench_p3 ... bench:    319188 ns/iter (+/- 37902)

test result: ok. 0 passed; 0 failed; 0 ignored; 1 measured

After:

running 1 test
test bench_p3 ... bench:    138881 ns/iter (+/- 13047)

test result: ok. 0 passed; 0 failed; 0 ignored; 1 measured

The relevant optimization here is turning the store loop into a memset. It isn't triggered for types whose bit width is not a multiple of 8.
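For illustration only, here is a minimal sketch of the loop shape in question. It is not the benchmark from #15203; `fill_flags` is a hypothetical name, and the code uses current Rust syntax rather than the 2014 dialect. The loop stores the same boolean into every element of a slice, which is the pattern an optimizer can collapse into a single memset when each element occupies a whole byte in memory.

```rust
/// Hypothetical helper: store `value` into every element of `flags`.
/// Each iteration is a plain store of the same value, the pattern
/// that can be recognized and replaced by one memset call.
pub fn fill_flags(flags: &mut [bool], value: bool) {
    for flag in flags.iter_mut() {
        *flag = value;
    }
}
```

Whether the memset transformation actually fires depends on the compiler version and optimization level, so treat this as a shape to look for in the generated IR rather than a guarantee.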

huonw · 2014-07-06T04:49:06Z

src/librustc/middle/trans/foreign.rs

-                let lltemp = builder.alloca(val_ty(llforeign_arg), "");
-                builder.store(llforeign_arg, lltemp);
-                llforeign_arg = lltemp;
+                llforeign_arg = if ty::type_is_bool(rust_ty) {


Is there a particular reason this isn't using the store_ty abstraction?

dotdash · 2014-07-06

I can't, because this function doesn't use the regular building blocks like Block; it uses a Builder directly.

LLVM doesn't really like types with a bit-width that isn't a multiple of 8 and disables various optimizations if it encounters such types in loads/stores. OTOH, booleans must be represented as i1 when used as SSA values. To get the best results, we must use i1 for SSA values, and i8 when storing the value to memory. With range asserts on loads, LLVM can eliminate the required zero-extend and truncate operations. Fixes rust-lang#15203
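The convention described in this commit message can be modeled in plain Rust as a rough analogy. This is a sketch under stated assumptions, not the code in the patch: `store_bool` and `load_bool` are hypothetical names, `u8` stands in for the i8 memory representation, and `bool` stands in for the i1 SSA value.

```rust
/// Hypothetical model of a store: widen the 1-bit value to a full
/// byte before writing it to memory (the zero-extend, i1 -> i8).
pub fn store_bool(slot: &mut u8, value: bool) {
    *slot = value as u8; // false -> 0, true -> 1
}

/// Hypothetical model of a load: read the byte, then narrow it back
/// to a 1-bit value (the truncate, i8 -> i1).
pub fn load_bool(slot: &u8) -> bool {
    // Stands in for the range assert on the load: the byte is
    // promised to be 0 or 1, i.e. in the half-open range [0, 2).
    debug_assert!(*slot < 2);
    (*slot & 1) == 1 // keep only the low bit
}
```

Because the loaded byte is known to be 0 or 1, a store followed by a load round-trips the value exactly, which is the property that lets the optimizer drop the extend/truncate pair.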


dotdash changed the title ~~Improve optimizations for booleans value~~ Improve optimizations for boolean values Jul 5, 2014

huonw reviewed Jul 6, 2014

dotdash added 2 commits July 6, 2014 22:12

Remove remainders from when booleans were i8

d2a22f5

bors closed this Jul 7, 2014

bors merged commit dd4112b into rust-lang:master Jul 7, 2014

dotdash deleted the bool_stores branch February 4, 2015 12:41
