thr3ads.net - similar to: "Replicate Individual O3 optimizations"

Displaying 20 results from an estimated 5000 matches similar to: "Replicate Individual O3 optimizations"

2019 Oct 19

Replicate Individual O3 optimizations

On Thu, Oct 17, 2019 at 11:22 AM David Greene via llvm-dev <llvm-dev at lists.llvm.org> wrote:
> hameeza ahmed via llvm-dev <llvm-dev at lists.llvm.org> writes:
>
> > Hello,
> > I want to study the individual O3 optimizations. For this I am using
> > following commands, but unable to replicate O3 behavior.
> >
> > 1.

2019 Oct 24

Replicate Individual O3 optimizations

I ran matrix multiplication code with both approaches: O3 at the clang level and O3 at the opt level. clang O3 is about 2.97x faster than opt O3.
On Mon, Oct 21, 2019 at 8:24 AM Neil Nelson <nnelson at infowest.com> wrote:
> is_sorted.cpp
> bool is_sorted(int *a, int n) {
>
>   for (int i = 0; i < n - 1; i++)
>
>     if (a[i] > a[i + 1])
>       return false;
>   return

2019 Jan 16

Issues with using scalar evolution with newer versions of LLVM IR

Thank you. I used the following commands to generate the .bc or .ll files:
/Documents/clang+llvm-4.0.0-x86_64-linux-gnu-ubuntu-16.04/bin/clang -O0 -emit-llvm -S -o vec4.ll vecsum.c
/Documents/clang+llvm-7.0.0-x86_64-linux-gnu-ubuntu-16.04/bin/clang -O0 -emit-llvm -S -o vec7.ll vecsum.c
On Wed, Jan 16, 2019 at 6:49 AM Sanjoy Das <sanjoy at playingwithpointers.com> wrote:
> It is hard to tell

2013 Jul 18

[LLVMdev] [RFC] add Function Attribute to disable optimization

So.. I have investigated more on how a new function attribute to disable optimization on a per-function basis could be implemented. At the current state, with the lack of specific support from the pass managers I found two big problems when trying to implement a prototype implementation of the new attribute. Here are the problems found: 1) It is not safe to disable some transform passes in the

2013 Jul 18

[LLVMdev] [cfe-dev] [RFC] add Function Attribute to disable optimization

On Thu, Jul 18, 2013 at 8:23 AM, <Andrea_DiBiagio at sn.scee.net> wrote:
> So..
> I have investigated more on how a new function attribute to disable
> optimization on a per-function basis could be implemented.
> At the current state, with the lack of specific support from the pass
> managers I found two big problems when trying to implement a prototype
> implementation of

2009 Aug 31

[LLVMdev] spec tests + PWD=

Hi David,
It looks like you added the PWD= magic to Makefile.spec. It is preventing me from running bugpoint on the desktop, with errors like this:
/Users/sabre/llvm/projects/llvm-test/External/SPEC/Sandbox.sh bugpoint-train Output/176.gcc.bugpoint-opt /Users/sabre/cvs/benchmarks/speccpu2000/benchspec/CINT2000/176.gcc/data/train/input/ \

2018 Jan 29

Polly loop offloading to Accelerator

Thank you. I used -polly-ast-detect-parallel, but no coincident info is generated. My C code is a simple vec-sum, as follows:
#include <stdio.h>
int a[2048], b[2048], c[2048];
foo () {
  int i;
  for (i=0; i<2048; i++) {
    a[i]=b[5] + c[i];
  }
}
I executed the following commands:
$clang -S -emit-llvm vec-sum.cpp -march=native -O3 -mllvm -disable-llvm-optzns -o vec-sum.s
$opt -S

2016 Nov 24

llvm optimizer turning musttail into tail

I've got some calls like:
musttail call void bitcast (i32 (i32, i8*, %Type*)* @MyMethod to void (i32, i8*)*)(i32 %0, i8* %1)
ret void
which the optimizer turns into something like:
%8 = tail call i32 @MyMethod(i32 %0, i8* %1, %Type* null)
ret void
I realize I'm losing a parameter there, but this is an interface jump trick I use, and it relies on the end code being a 'jmp' (x86). I realize I can probably

2009 Sep 01

[LLVMdev] spec tests + PWD=

I did make that change to support the following code in ToolRunner.cpp. I found that if I did not explicitly set PWD when invoking bugpoint, then it would not be in the environment. I'm not sure why PWD is not being inherited... perhaps some make weirdness... or bash??? And I don't know how it ever worked before my change.
// Full path to the binary. We need to cd to the exec

2017 Jul 10

disable optimizarions

Hello,
I have already asked how to obtain unoptimized IR. I do it this way:
clang -S -emit-llvm -O3 -mllvm -disable-llvm-optzns memset.c -o memset.s
I have noticed that whenever I use O1, O2, or O3 and disable optimizations, I still get LLVM intrinsic functions inserted. When I use O0, I get no intrinsics, but I do get the noopt attribute. Is there a way to obtain purely unoptimized IR without any

2013 Jun 17

[LLVMdev] [RFC] add Function Attribute to disable optimization

Andrea_DiBiagio at sn.scee.net wrote:
> Hi,
>
> I previously made a proposal for adding a pragma for per-function
> optimization level control due to a number of requests from our customers
> (See http://comments.gmane.org/gmane.comp.compilers.clang.devel/28958 for
> the previous discussion), however the discussion was inconclusive. Some
> of my colleagues recently had the

2013 Jun 17

[LLVMdev] [RFC] add Function Attribute to disable optimization

Hi, I previously made a proposal for adding a pragma for per-function optimization level control due to a number of requests from our customers (See http://comments.gmane.org/gmane.comp.compilers.clang.devel/28958 for the previous discussion), however the discussion was inconclusive. Some of my colleagues recently had the opportunity to discuss the proposal with a number of people at and

2015 Mar 13

[RFC PATCH v3] Intrinsics/RTCD related fixes. Mostly x86.

From: Jonathan Lennox <jonathan at vidyo.com>
* Makes --enable-intrinsics work with clang and other non-GCC compilers
* Enables RTCD for the floating-point-mode SSE code in Celt.
* Disables use of RTCD in cases where the compiler targets an instruction set by default.
* Enables the SSE4.1 Silk optimizations that apply to the common parts of Silk when Opus is built in floating-point mode, not

2015 Mar 12

[RFC PATCHv2] Intrinsics/RTCD related fixes. Mostly x86.

2013 Jul 18

[LLVMdev] [RFC] add Function Attribute to disable optimization

Andrea_DiBiagio at sn.scee.net wrote:
> So..
> I have investigated more on how a new function attribute to disable
> optimization on a per-function basis could be implemented.
> At the current state, with the lack of specific support from the pass
> managers I found two big problems when trying to implement a prototype
> implementation of the new attribute.
>
> Here are the

2016 Nov 27

llvm optimizer turning musttail into tail

r287955 seems like it might be related.
-- Sean Silva
On Sat, Nov 26, 2016 at 4:06 PM, Sean Silva <chisophugis at gmail.com> wrote:
> This sounds buggy to me. What pass is doing this?
>
> -- Sean Silva
>
> On Thu, Nov 24, 2016 at 5:39 AM, Carlo Kok via llvm-dev <llvm-dev at lists.llvm.org> wrote:
>
>> I've got some calls like:
>>

2008 Nov 19

[LLVMdev] poolallocation error

Hi,
I am trying to use the poolallocator. More specifically, I am trying to play around with the pointer compression pass. However, I get assertion failures for the pass dependencies. This happens when PointerCompress::getAnalysisUsage tries to register the BU pass as required, i.e. when AU.addRequired<CompleteBUDataStructures>(); is called.
$ opt -f -load

2015 Apr 22

[LLVMdev] Use clang to generate LLVM IR with -O3, how to visit these load operations after -reg2mem

Hi, all
I want to instrument load/store operations in LLVM IR. I find that the LLVM IR generated with the -O3 flag is much more efficient than with -O0, so I try to instrument this optimized LLVM IR, but some load operations are changed to Phi loads. Then I try the -reg2mem flag to remove these phi nodes. The problems are: 1) I use visitLoadInst to collect these load operations, and it seems I

2015 Apr 22

[LLVMdev] Use clang to generate LLVM IR with -O3, how to visit these load operations after -reg2mem

Best Regards!
Eric Lew
On Wed, Apr 22, 2015 at 12:11 PM, David Blaikie <dblaikie at gmail.com> wrote:
On Tue, Apr 21, 2015 at 8:34 PM, Eric Lu <eirc.lew at gmail.com> wrote:
> Hi, all
>
> I want to instrument load/store operations in LLVM IR. And I find the LLVM
> IR generated with flag -O3 is much more efficient than -O0, so I try to
>

2018 Aug 17

Replication -O3 optimizations manually

Hi Zide, I think I found the right way to reach my goal. I used the following command: clang -O3 -Xclang -disable-llvm-optzns main.c -S -emit-llvm -o main.ll to generate an IR file enriched by all the metadata that otherwise wouldn't be generated with -O0. Moreover, -disable-llvm-optzns flag ensures that none of the optimization passes has been applied yet to the IR. In this way, I can
