search for: dst_columns

Displaying 2 results from an estimated 2 matches for "dst_columns".

2012 Dec 03
2
[LLVMdev] Which transform passes to apply?
...fine void @jitcv_sum_64sf1001(%Matrix* %src, %Matrix* %dst, i32 %len) { entry: br label %loop_i loop_i: ; preds = %loop_i_end, %entry %i = phi i32 [ 0, %entry ], [ %increment_i, %loop_i_end ] %0 = getelementptr inbounds %Matrix* %dst, i32 0, i32 2 %dst_columns = load i32* %0 *%dst_yRem = urem i32 %i, %dst_columns %dst_y = urem i32 %i, %dst_columns %1 = sub i32 %i, %dst_y %2 = add i32 %1, %dst_yRem* %3 = getelementptr inbounds %Matrix* %src, i32 0, i32 0 %4 = load i8** %3 %src_data = bitcast i8* %4 to double* %5 = getelementptr double* %sr...
2012 Dec 04
0
[LLVMdev] Which transform passes to apply?
...osh, On 03/12/12 02:58, Josh Klontz wrote: > Hello, I'm a new LLVM user working on a C++ EDSL for image processing. I have > a function, which after applying createInstructionCombiningPass() and > createDeadCodeEliminationPass() looks like: > ... > *%dst_yRem = urem i32 %i, %dst_columns > %dst_y = urem i32 %i, %dst_columns > %1 = sub i32 %i, %dst_y > %2 = add i32 %1, %dst_yRem* ... > My question is which optimization pass(es) are needed to simplify the > instructions in bold. I've tried running the same passes again and also > tried createInstructio...