thr3ads.net - search: "speedups"

Displaying 20 results from an estimated 1824 matches for "speedups".

[LLVMdev] libiomp, not libgomp as default library linked with -fopenmp

2015 May 03

A couple more data points. Current llvm 3.7svn with the two outstanding OPENMP patches can build the openmp support in gdl 0.9.5 (which completely passes its test suite) and apbs 1.4.1's limited openmp support. On Sat, May 2, 2015 at 11:11 PM, Jack Howarth < howarth.mailing.lists at gmail.com> wrote: > On a positive note, current llvm 3.7svn with the two outstanding > OPENMP

[LLVMdev] RFC: Callee speedup estimation in inline cost analysis

2015 Jul 30

TLDR - The proposal below is intended to allow inlining of larger callees when such inlining is expected to reduce the dynamic instructions count. Proposal ------------- LLVM inlines a function if the size growth (in the given context) is less than a threshold. The threshold is increased based on certain characteristics of the called function (inline keyword and the fraction of vector

[LLVMdev] RFC: Callee speedup estimation in inline cost analysis

2015 Jul 31

Just nitpicking: 1) DI(F) should include a component that estimate the epi/prologue cost (frameSetupCost) which InlinedDF does not have 2) The speedup should include callsite cost associated with 'C' (call instr, argument passing): Speedup(F,C) = (DI(F) + CallCost(C) - InlinedDF(F,C))/DI(F). Otherwise the proposal looks reasonable to me. David On Thu, Jul 30, 2015 at 2:25 PM,
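The adjusted formula in this reply can be sketched as a small function. This is only an illustration: the parameter names `di_callee`, `call_cost`, and `inlined_di` are hypothetical stand-ins for DI(F), CallCost(C), and InlinedDF(F,C), and the sample numbers are made up rather than taken from the thread.

```python
def speedup(di_callee: float, call_cost: float, inlined_di: float) -> float:
    """Speedup(F,C) = (DI(F) + CallCost(C) - InlinedDF(F,C)) / DI(F).

    di_callee  -- dynamic instruction estimate of the callee F when not inlined
                  (assumed to already include prologue/epilogue cost)
    call_cost  -- cost of the call site C (call instruction, argument passing)
    inlined_di -- dynamic instruction estimate of F once inlined at C
    """
    if di_callee <= 0:
        raise ValueError("di_callee must be positive")
    return (di_callee + call_cost - inlined_di) / di_callee


# Made-up numbers: 100 instructions standalone, 10 for the call, 60 once inlined.
example = speedup(100, 10, 60)
```

With these illustrative inputs, half of the callee's dynamic instructions would be saved by inlining at this call site.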

Rsync shouldn't display a meaningless speedup on a dry run

2007 Nov 05

On a dry run, rsync displays a speedup value calculated from the total size of the source file data and the amount of data sent over the connection, but this value is meaningless and grossly misleading because the file data is not sent over the connection. Example: [matt@mattlaptop2 test]$ rsync -avi -n ~/eclipse/releases/eclipse-SDK-3.3-linux-gtk.tar.gz . sending incremental file list
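A minimal sketch of why the dry-run figure is misleading, assuming the reported speedup is the total file size divided by the bytes actually sent plus received. The function name and the byte counts below are hypothetical, chosen only for illustration.

```python
def rsync_speedup(total_size: int, bytes_sent: int, bytes_received: int) -> float:
    """Assumed definition: total size of the files / bytes moved over the connection."""
    transferred = bytes_sent + bytes_received
    if transferred <= 0:
        raise ValueError("no bytes were transferred")
    return total_size / transferred


# Dry run: only the file list crosses the connection, so the ratio is huge
# even though no file data was transferred.
dry_run = rsync_speedup(total_size=150_000_000, bytes_sent=100, bytes_received=20)

# Full copy: every byte plus a little protocol overhead is sent,
# so the ratio lands just under 1.
full_copy = rsync_speedup(total_size=1000, bytes_sent=1010, bytes_received=0)
```

Under this assumption, a value just below 1 indicates that essentially all file data was sent, while a very large value on a dry run reflects only that file data was never sent at all.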

[LLVMdev] Polyhedron 2005 results for dragonegg 3.3svn

2013 Jun 02

Hi Jack, thanks for splitting out what the effects of LLVM's / GCC's vectorizers is. On 01/06/13 21:34, Jack Howarth wrote: > On Sat, Jun 01, 2013 at 06:45:48AM +0200, Duncan Sands wrote: >> >> These results are very disappointing, I was hoping to see a big improvement >> somewhere instead of no real improvement anywhere (except for gas_dyn) or a >> regression

RFC: speedups with instruction side-data (ADCE, perhaps others?)

2015 Sep 16

You mean the input test data? I was testing performance using our offline perf suite (which is a ton of out of tree code), so it’s not something I can share, but I imagine any similar test suite put through a typical compiler pipeline will exercise ADCE in similar ways. ADCE’s cost is pretty much the sum of (cost of managing the set) + (cost of eraseinstruction), which in our case turns out to be

[LLVMdev] Polyhedron 2005 results for dragonegg 3.3svn

2013 Jun 02

Jack, Can you please file a bug report and attach the BC files for the major loops that we miss ? Thanks, Nadav On Jun 2, 2013, at 1:27, Duncan Sands <duncan.sands at gmail.com> wrote: > Hi Jack, thanks for splitting out what the effects of LLVM's / GCC's vectorizers > is. > > On 01/06/13 21:34, Jack Howarth wrote: >> On Sat, Jun 01, 2013 at 06:45:48AM +0200,

[LLVMdev] [llvm-commits] [PATCH] BasicBlock Autovectorization Pass

2011 Nov 08

...'clang -O3' against 'clang -O3 -mllvm -vectorize'? Yes. [I've tested the current patch directly using opt -vectorize -unroll-allow-partial; for running the test suite I recompiled llvm/clang to hardcode the options as I wanted them]. > > > The largest three performance speedups are: > > SingleSource/Benchmarks/BenchmarkGame/puzzle - 59.2% speedup > > SingleSource/UnitTests/Vector/multiplies - 57.7% speedup > > SingleSource/Benchmarks/Misc/flops-7 - 50.75% speedup > > > > The largest three performance slowdowns are: > > MultiSource/Bench...

[patch] making rsync less verbose

2004 Jan 06

Unless you suppress all output with the -q option, rsync will always print an initial "building/receiving file list ... done" line. In my opinion, this is a bit superfluous. When I want to see a progress indication, I can use the --progress option. Another issue is the 3-line transfer speed and speedup factor report at the end. So every rsync invocation produces at least four lines of

[LLVMdev] [llvm-commits] [PATCH] BasicBlock Autovectorization Pass

2011 Nov 08

...w much of the runtime overhead is due to the unrolling (produces more code that needs to be optimized) and which part is due to vectorization. The same counts for the speedup. How much is caused by unrolling and how much is actually caused by your pass. >>> The largest three performance speedups are: >>> SingleSource/Benchmarks/BenchmarkGame/puzzle - 59.2% speedup >>> SingleSource/UnitTests/Vector/multiplies - 57.7% speedup >>> SingleSource/Benchmarks/Misc/flops-7 - 50.75% speedup >>> >>> The largest three performance slowdowns are: >>>...

Patch

2001 Sep 08

Hello, a short question: what is the syntax for applying interactivity.patch and ext3-dir-speedup.patch? patch -p0 ext3-dir-speedup.patch doesn't work. -- Frank

[LLVMdev] selection dag speedups / llc speedups

2010 May 17

On May 14, 2010, at 11:24 AM, Jan Voung wrote: > I'm sure this has been asked many times, but is there current work on decreasing the time taken by the DAG-based instruction selector, or the other phases of llc? I am just beginning to dive into LLVM, and I am interested in compile-time reductions that do not reduce code quality dramatically. For example, simply switching on

[LLVMdev] selection dag speedups / llc speedups

2010 May 19

On May 18, 2010, at 12:07 PM, Jan Voung wrote: > Here are some recent stats of the fast vs local vs linear scan at O0 on "opt -std-compile-opts" processed bitcode files. The fast regalloc is still certainly faster at codegen than local with such bitcode files. Let me know if the link doesn't work: > >

speedup is always 0.99

2013 Feb 22

I'm syncing from a USB disk to my hard disk like this: rsync -vr /path/to/usb/disk/dir/ /path/to/hard/disk/dir/ But the speedup is always 0.99 which I think means it is just copying the files each time instead of syncing them. What could be wrong? - Grant

[LLVMdev] Polyhedron 2005 results for dragonegg 3.3svn

2013 Jun 03

Actually, this kind of opportunity, as outlined below, was one of my contrived motivating examples for fast-math. But over the last year we haven't seen such opportunities in the real applications we care about. t1 = x1/y ... t2 = x2/y. I think this is better handled by GVN/PRE -- blindly converting x/y => x * 1/y is not necessarily beneficial. Or maybe we can blindly perform such
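The transformation under discussion can be illustrated with a small sketch. It is not LLVM code: the function names are hypothetical, and Python floats merely stand in for the floating-point values an optimizer would see. When several values are divided by the same y, the reciprocal can be computed once and reused, but the result may differ from a true division in the last bits, which is why this rewrite is usually tied to fast-math.

```python
def divide_each(xs, y):
    """Baseline: one division per value (t1 = x1/y, t2 = x2/y, ...)."""
    return [x / y for x in xs]


def divide_via_reciprocal(xs, y):
    """Rewritten form: compute 1/y once, then multiply (x/y => x * 1/y).

    This trades divisions for multiplications, but x * (1/y) is rounded twice,
    so it is not guaranteed to match x / y bit for bit.
    """
    reciprocal = 1.0 / y
    return [x * reciprocal for x in xs]


xs = [1.0, 3.0, 10.0]
exact = divide_each(xs, 3.0)
approx = divide_via_reciprocal(xs, 3.0)
# Largest absolute difference between the two forms for this input.
max_diff = max(abs(a - b) for a, b in zip(exact, approx))
```

The rewrite only pays off when the reciprocal is shared by more than one division; for a single x/y it replaces one division with a division plus a multiplication.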

Is this correct?

2003 Dec 30

I am trying to copy an 8 gig file. After first copying it over I try and rsync it again. There are no changes so why is it copying the file over again? Here is the output of what I am trying to do. And what does speedup mean? [root@VMWARE Storage]# rsync --verbose --recursive -t --stats -- progress ntapps.dsk vmware2::home/ [root@VMWARE Storage]# rsync --verbose --recursive -t --stats --

RFC: speedups with instruction side-data (ADCE, perhaps others?)

2015 Sep 14

I did something similar for dominators, for GVN, etc. All see significant speedups. However, the answer i got back when i mentioned this was "things like ptrset and densemap should only have a small performance difference from side data when used and sized right", and i've found this to mostly be true after looking harder. In the case you are looking at, i see: -...

[LLVMdev] [llvm-commits] [PATCH] BasicBlock Autovectorization Pass

2011 Nov 08

...ch will probably > work for you. Hey Hal, that is great news, especially as the numbers seem to show that vectorization has a significant performance impact. What exactly did you compare? 'clang -O3' against 'clang -O3 -mllvm -vectorize'? > The largest three performance speedups are: > SingleSource/Benchmarks/BenchmarkGame/puzzle - 59.2% speedup > SingleSource/UnitTests/Vector/multiplies - 57.7% speedup > SingleSource/Benchmarks/Misc/flops-7 - 50.75% speedup > > The largest three performance slowdowns are: > MultiSource/Benchmarks/MiBench/security-rijndae...

[LLVMdev] selection dag speedups / llc speedups

2010 May 18

On May 17, 2010, at 9:09 PM, Rafael Espindola wrote: >> The fast and local register allocators are meant to be used on unoptimized code, a 'Debug build'. While they do work on optimized code, they do not give good results. Their primary goal is compile time, not code quality. > > Yes, we have a somewhat uncommon use case. It is fine to spend time > optimizing bitcode (LTO

--keep-dirlinks --delete erroneously deletes symlinks to directories

2009 Oct 03

Hi, the combination of --keep-dirlinks and --delete (--delete-during, --delete-delay) erroneously deletes symlinks to directories on the receiver. With --delete-before and --delete-after it works as expected. (rsync version 3.0.3 protocol version, on Debian Lenny) I don't think that this behaviour is intended. Details/example: - Host 1 directory-structure: drwx------ ./1
