Displaying 20 results from an estimated 113 matches for "geomeans".
Did you mean:
geomean
2017 Nov 02
13
[RFC] Enable Partial Inliner by default
Forgot to add that all experiments were done with '-O3 -m64
-fexperimental-new-pass-manager'.
Graham Yiu
LLVM Compiler Development
IBM Toronto Software Lab
Office: (905) 413-4077 C2-707/8200/Markham
Email: gyiu at ca.ibm.com
From: Graham Yiu/Toronto/IBM
To: llvm-dev at lists.llvm.org
Cc: junbuml at codeaurora.org, xinliangli at gmail.com
Date: 11/02/2017 05:26 PM
Subject: [RFC]
2013 Apr 30
3
[LLVMdev] Improving the usability of LNT
Hi Daniel,
I made some changes to the LNT perf reporting tool to make it more user friendly by adding some features:
1. Make the sidebar and the navigation bar stationary, so that it is easy to navigate the site
2. Have the pop-down menu for the items in the navigation bar, activate upon hovering the mouse, rather than clicking the item
3. Add a nav-link in the sidebar for the
2017 Apr 03
5
[GlobalISel][AArch64] Toward flipping the switch for O0: Please give it a try!
I've kicked off a run to compare "-O0 -g" versus "-O0 -g -mllvm -global-isel -mllvm -global-isel-abort=2".
I've selected the test-suite (albeit a version which is a couple of months old now) and a few short-running proprietary benchmarks to get data back quickly for an initial feel of where things are.
This was running on Cortex-A57 AArch64 Linux.
I saw one assertion
2017 Jul 20
8
[RFC] Add IR level interprocedural outliner for code size.
I’m River and I’m a compiler engineer at PlayStation. Recently, I’ve been
working on an interprocedural outlining (code folding) pass for code size
improvement at the IR level. We hit a couple of use cases that the current
code size solutions didn’t handle well enough. Outlining is one of the
avenues that seemed potentially beneficial.
-- Algorithmic Approach --
The general implementation can be
2017 Jul 21
2
[RFC] Add IR level interprocedural outliner for code size.
Hi Evgeny,
I know of the current machine outliner in LLVM. If you look in the "More
detailed performance data" in the end section it includes performance
comparisons to the machine outliner.
As for the algorithmic approach they are kind of similar.
Machine Outliner:
- Builds a suffix tree based on identical equivalence between machine
instrs.
- Uses target specific cost model for
2014 Apr 08
2
[LLVMdev] Proposal: AArch64/ARM64 merge from EuroLLVM
Hi folks,
As Tim pointed out, we recently had the opportunity to collect 64-bit benchmark performance data for GCC 4.9, AArch64 and ARM64 compilers on a real hardware. It is a cortex-a53 device. Due to proprietary reasons we cannot share the full hardware configuration.
The preliminary results were shared at the hackers lab at EuroLLVM yesterday. For those who could not make it, below is
2012 Jun 05
2
[LLVMdev] [PATCH] add x32 psABI support
If you are interesting to play around X32, you may refer to http://sourceware.org/glibc/wiki/x32 to bootstrap a local environment on Linux.
Yours
- Michael
-----Original Message-----
From: cfe-commits-bounces at cs.uiuc.edu [mailto:cfe-commits-bounces at cs.uiuc.edu] On Behalf Of Liao, Michael
Sent: Monday, June 04, 2012 5:09 PM
To: llvm-commits at cs.uiuc.edu; cfe-commits at cs.uiuc.edu
2015 Feb 26
4
[LLVMdev] [RFC] AArch64: Should we disable GlobalMerge?
Hi Ahmed,
Did you run these experiments on a platform with a linker that makes
use of the AArch64CollectLOH-pass-produced information?
I'm guessing that the AArch64CollectLOH-pass information and a linker
that makes use of that information could affect the profitability of
the GlobalMerge pass?
Thanks,
Kristof
> -----Original Message-----
> From: llvmdev-bounces at cs.uiuc.edu
2013 May 02
0
[LLVMdev] Improving the usability of LNT
Wow, that sounds great! Thanks for working on this, and yes, please, send
the patches!
--renato
On 30 April 2013 16:23, Murali, Sriram <sriram.murali at intel.com> wrote:
> Hi Daniel,****
>
> I made some changes to the LNT perf reporting tool to make it more user
> friendly by adding some features:****
>
> **1. **Make the sidebar and the navigation bar stationary,
2017 Nov 10
0
[RFC] Enable Partial Inliner by default
Hi Evgeny,
I just realized that if these are compile-time errors I can help
investigate on my end. Do you have something I can use to reproduce?
Cheers,
Graham Yiu
LLVM Compiler Development
IBM Toronto Software Lab
Office: (905) 413-4077 C2-707/8200/Markham
Email: gyiu at ca.ibm.com
From: Graham Yiu/Toronto/IBM
To: Evgeny Astigeevich <Evgeny.Astigeevich at arm.com>
Cc:
2014 Apr 23
2
[LLVMdev] Proposal: AArch64/ARM64 merge from EuroLLVM
Hi Gerolf,
Sorry for the delayed response. I had to get permission to share more
details.
I am allowed to share relative numbers but not absolute numbers.
Any missing test is due to runtime failures (e.g., gcc failure due to the
fused multiply pattern bug which Tim fixed later on).
Thanks,
Ana.
Benchmarks
ARM64 vs GCC 4.9 %
ARM64 vs AArch64 %
ARM64 vs AArch64 patched %
2018 Apr 25
5
[RFC] Turn the MachineOutliner on by default in AArch64 under -Oz
Hello
A 4.4% geomean codesize improvement is really impressive. That stuff is hard to come by, you usually have to nibble away at it bit at a time. I ran some codesize benchmarks we have and they were in the same ballpark. Some of these are quite small so had less opportunity for outlining, but the average was still over 3% with some as high as 9-10%.
All the tests I ran were fine, although we
2020 Aug 19
2
[RFC] Switching to MemorySSA-backed Dead Store Elimination (aka cross-bb DSE)
Hi Florian,
First, thank you for working on this. I'm really glad to see this work so
close to being enabled.
I think the numbers look good for run time, and the benefits of switching
for all configurations are clear.
For compile time, the current regressions are noticeable, but not a deal
breaker in my opinion. I'm very much in favor of switching in all
configurations.
To address some
2012 Sep 29
7
[LLVMdev] LLVM's Pre-allocation Scheduler Tested against a Branch-and-Bound Scheduler
Hi,
We are currently working on revising a journal article that describes
our work on pre-allocation scheduling using LLVM and have some questions about LLVM's pre-allocation scheduler. The answers to these question will help us better document and analyze the results of our benchmark tests that compare our algorithm with LLVM's pre-allocation scheduling algorithm.
First, here is a
2017 Nov 10
5
[RFC] Enable Partial Inliner by default
Hi Graham,
Thank you for offering help. I am trying to create a reproducer. The problem is that the crashes happen whilst LTO is used. One thing I am sure about IR is broken at compile time.
Thanks,
Evgeny
From: Graham Yiu <gyiu at ca.ibm.com>
Date: Friday, 10 November 2017 at 16:09
To: Evgeny Astigeevich <Evgeny.Astigeevich at arm.com>
Cc: "junbuml at codeaurora.org"
2017 Apr 06
2
[GlobalISel][AArch64] Toward flipping the switch for O0: Please give it a try!
On Thu, Apr 6, 2017 at 6:53 AM, Kristof Beyls via llvm-dev
<llvm-dev at lists.llvm.org> wrote:
> I've been digging a little bit deeper into the biggest performance
> regressions I've observed.
>
> What I've observed so far is:
> * A lot of the biggest regressions are caused by unnecessarily moving
> floating point values through general purpose registers.
2017 May 24
2
[GlobalISel][AArch64] Toward flipping the switch for O0: Please give it a try!
Hi Kristof,
Thanks for the measurements.
> On May 24, 2017, at 6:00 AM, Kristof Beyls <kristof.beyls at arm.com> wrote:
>
>>
>> On 23 May 2017, at 21:48, Quentin Colombet <qcolombet at apple.com <mailto:qcolombet at apple.com>> wrote:
>>
>> Great!
>> I thought I had to look at our pipeline at O0 to make sure optimized regalloc was
2012 Jun 07
0
[LLVMdev] [PATCH] add x32 psABI support
Hi Folks,
Anyone got chance to review the patch adding X32 psABI support?
Yours
- Michael
-----Original Message-----
From: llvm-commits-bounces at cs.uiuc.edu [mailto:llvm-commits-bounces at cs.uiuc.edu] On Behalf Of Liao, Michael
Sent: Tuesday, June 05, 2012 11:18 AM
To: llvm-commits at cs.uiuc.edu; cfe-commits at cs.uiuc.edu; llvmdev at cs.uiuc.edu; cfe-dev at cs.uiuc.edu
Subject: Re:
2012 Sep 29
0
[LLVMdev] LLVM's Pre-allocation Scheduler Tested against a Branch-and-Bound Scheduler
Hi Ghassan, this is very interesting, however...
> We are currently working on revising a journal article that describes our work
> on pre-allocation scheduling using LLVM and have some questions about LLVM's
> pre-allocation scheduler. The answers to these question will help us better
> document and analyze the results of our benchmark tests that compare our
> algorithm with
2015 Aug 08
3
RFC: PGO Late instrumentation for LLVM
Instrumentation based Profile Guided Optimization (PGO) is a compiler
technique that leverages important program runtime information, such as
precise edge counts and frequent value information, to make frequently
executed code run faster. It's proven to be one of the most effective ways
to improve program performance.
An important design point of PGO is to decide where to place the