Displaying 20 results from an estimated 5000 matches similar to: "Replicate Individual O3 optimizations"
2019 Oct 19
3
Replicate Individual O3 optimizations
On Thu, Oct 17, 2019 at 11:22 AM David Greene via llvm-dev <
llvm-dev at lists.llvm.org> wrote:
> hameeza ahmed via llvm-dev <llvm-dev at lists.llvm.org> writes:
>
> > Hello,
> > I want to study the individual O3 optimizations. For this I am using
> > following commands, but unable to replicate O3 behavior.
> >
> > 1.
2019 Oct 24
2
Replicate Individual O3 optimizations
I run matrix multiplication code with both the approaches o3 at clang and
o3 at opt. clang o3 is about 2.97x faster than opt o3.
On Mon, Oct 21, 2019 at 8:24 AM Neil Nelson <nnelson at infowest.com> wrote:
> is_sorted.cpp
> bool is_sorted(int *a, int n) {
>
> for (int i = 0; i < n - 1; i++)
>
> if (a[i] > a[i + 1])
> return false;
> return
2019 Jan 16
3
Issues with using scalar evolution with newer versions of LLVM IR
Thank You..
I used following command to generate .bc or .ll
/Documents/clang+llvm-4.0.0-x86_64-linux-gnu-ubuntu-16.04/bin/clang -O0
-emit-llvm -S -o vec4.ll vecsum.c
/Documents/clang+llvm-7.0.0-x86_64-linux-gnu-ubuntu-16.04/bin/clang -O0
-emit-llvm -S -o vec7.ll vecsum.c
On Wed, Jan 16, 2019 at 6:49 AM Sanjoy Das <sanjoy at playingwithpointers.com>
wrote:
> It is hard to tell
2013 Jul 18
0
[LLVMdev] [RFC] add Function Attribute to disable optimization
So..
I have investigated more on how a new function attribute to disable
optimization on a per-function basis could be implemented.
At the current state, with the lack of specific support from the pass
managers I found two big problems when trying to implement a prototype
implementation of the new attribute.
Here are the problems found:
1) It is not safe to disable some transform passes in the
2013 Jul 18
1
[LLVMdev] [cfe-dev] [RFC] add Function Attribute to disable optimization
On Thu, Jul 18, 2013 at 8:23 AM, <Andrea_DiBiagio at sn.scee.net> wrote:
> So..
> I have investigated more on how a new function attribute to disable
> optimization on a per-function basis could be implemented.
> At the current state, with the lack of specific support from the pass
> managers I found two big problems when trying to implement a prototype
> implementation of
2009 Aug 31
2
[LLVMdev] spec tests + PWD=
Hi David,
It looks like you added the PWD= magic to Makefile.spec. It is
preventing me from running bugpoint on the desktop, with errors like
this:
/Users/sabre/llvm/projects/llvm-test/External/SPEC/Sandbox.sh bugpoint-
train Output/176.gcc.bugpoint-opt /Users/sabre/cvs/benchmarks/
speccpu2000/benchspec/CINT2000/176.gcc/data/train/input/ \
2018 Jan 29
1
Polly loop offloading to Accelerator
Thank You.
i used -polly-ast-detect-parallel but there is no coincident info generated;
my c code is simple vec-sum as follows;
#include <stdio.h>
int a[2048], b[2048], c[2048];
foo () {
int i;
for (i=0; i<2048; i++) {
a[i]=b[5] + c[i];
}
}
i executed following commands;
$clang -S -emit-llvm vec-sum.cpp -march=native -O3 -mllvm
-disable-llvm-optzns -o vec-sum.s
$opt -S
2016 Nov 24
3
llvm optimizer turning musttail into tail
I've got some calls like:
musttail call void bitcast (i32 (i32, i8*, %Type*)* @MyMethod to void
(i32, i8*)*)(i32 %0, i8* %1)
ret void
Into something like:
%8 = tail call i32 @MyMethod(i32 %0, i8* %1, %Type* null)
ret void
I realize I'm losing a parameter there, but this is an interface jump
trick I use and relies on the end code being a 'jmp' (x86). I realize i
can probably
2009 Sep 01
0
[LLVMdev] spec tests + PWD=
I did make that change to support the following code in
ToolRunner.cpp. I found that if I did not explicitly set PWD when
invoking bugpoint, then it would not be in the environment. I'm not
sure why PWD is not being inherited... perhaps some make weirdness...
or bash??? and I don't know how it ever worked before my change.
// Full path to the binary. We need to cd to the exec
2017 Jul 10
2
disable optimizarions
Hello,
I have already asked how to receive unoptimized IR. This way:
clang -S -emit-llvm -O3 -mllvm -disable-llvm-optzns memset.c -o memset.s
I have noticed, that in any case, when I use O1, O2, O3 and disable
optimizations, I still get llvm intrinsics functions inserted.
when I put O0, I have no intrinsics, but have noopt attribute. Is there a
way to receive purely unoptimizaed IR without any
2013 Jun 17
0
[LLVMdev] [RFC] add Function Attribute to disable optimization
Andrea_DiBiagio at sn.scee.net wrote:
> Hi,
>
> I previously made a proposal for adding a pragma for per-function
> optimization level control due to a number of requests from our customers
> (See http://comments.gmane.org/gmane.comp.compilers.clang.devel/28958 for
> the previous discussion), however the discussion was inconclusive. Some
> of my colleagues recently had the
2013 Jun 17
11
[LLVMdev] [RFC] add Function Attribute to disable optimization
Hi,
I previously made a proposal for adding a pragma for per-function
optimization level control due to a number of requests from our customers
(See http://comments.gmane.org/gmane.comp.compilers.clang.devel/28958 for
the previous discussion), however the discussion was inconclusive. Some
of my colleagues recently had the opportunity to discuss the proposal with
a number of people at and
2015 Mar 13
1
[RFC PATCH v3] Intrinsics/RTCD related fixes. Mostly x86.
From: Jonathan Lennox <jonathan at vidyo.com>
* Makes ?enable-intrinsics work with clang and other non-GCC compilers
* Enables RTCD for the floating-point-mode SSE code in Celt.
* Disables use of RTCD in cases where the compiler targets an instruction set by default.
* Enables the SSE4.1 Silk optimizations that apply to the common parts of Silk when Opus is built in floating-point mode, not
2015 Mar 12
1
[RFC PATCHv2] Intrinsics/RTCD related fixes. Mostly x86.
From: Jonathan Lennox <jonathan at vidyo.com>
* Makes ?enable-intrinsics work with clang and other non-GCC compilers
* Enables RTCD for the floating-point-mode SSE code in Celt.
* Disables use of RTCD in cases where the compiler targets an instruction set by default.
* Enables the SSE4.1 Silk optimizations that apply to the common parts of Silk when Opus is built in floating-point mode, not
2013 Jul 18
1
[LLVMdev] [RFC] add Function Attribute to disable optimization
Andrea_DiBiagio at sn.scee.net wrote:
> So..
> I have investigated more on how a new function attribute to disable
> optimization on a per-function basis could be implemented.
> At the current state, with the lack of specific support from the pass
> managers I found two big problems when trying to implement a prototype
> implementation of the new attribute.
>
> Here are the
2016 Nov 27
3
llvm optimizer turning musttail into tail
r287955 seems like it might be related.
-- Sean Silva
On Sat, Nov 26, 2016 at 4:06 PM, Sean Silva <chisophugis at gmail.com> wrote:
> This sounds buggy to me. What pass is doing this?
>
> -- Sean Silva
>
> On Thu, Nov 24, 2016 at 5:39 AM, Carlo Kok via llvm-dev <
> llvm-dev at lists.llvm.org> wrote:
>
>>
>> I've got some calls like:
>>
2008 Nov 19
2
[LLVMdev] poolallocation error
Hi,
I am trying to use the poolallocator. More specific, I am trying to
play around with the pointer compression pass. Though, I get assertion
failures for the pass dependencies.
This is when it in PointerCompress::getAnalysisUsage tries to register
the the BU pass as required. I.e. when
AU.addRequired<CompleteBUDataStructures>(); is called.
$ opt -f -load
2015 Apr 22
2
[LLVMdev] Use clang to generate LLVM IR with -O3, how to visit these load operations after -reg2mem
Hi, all
I want to instrument load/store operations in LLVM IR. And I find the LLVM
IR generated with flag -o3 is much more efficient than -o0, so I try to
instrument on these optimized LLVM IR, but, some load operations are
changed to Phi loads. Then, I try the -reg2mem flag to remove these phi
nodes.
The problems are:
1) I use visitLoadInst to collect these load operations, and it seems I
2015 Apr 22
2
[LLVMdev] Use clang to generate LLVM IR with -O3, how to visit these load operations after -reg2mem
Best Regards!
Eric Lew
On 周三, 4月 22, 2015 at 12:11 下午, David Blaikie < dblaikie at gmail.com [dblaikie at gmail.com] > wrote:
On Tue, Apr 21, 2015 at 8:34 PM, Eric Lu <eirc.lew at gmail.com> wrote:
> Hi, all
>
> I want to instrument load/store operations in LLVM IR. And I find the LLVM
> IR generated with flag -o3 is much more efficient than -o0, so I try to
>
2018 Aug 17
2
Replication -O3 optimizations manually
Hi Zide,
I think I found the right way to reach my goal.
I used the following command:
clang -O3 -Xclang -disable-llvm-optzns main.c -S -emit-llvm -o main.ll
to generate an IR file enriched by all the metadata that otherwise wouldn't be generated with -O0. Moreover, -disable-llvm-optzns flag ensures that none of the optimization passes has been applied yet to the IR.
In this way, I can