thr3ads.net - search: "gerolf"

Displaying 20 results from an estimated 113 matches for "gerolf".

[LLVMdev] Testing the new CFL alias analysis

2014 Sep 16

----- Original Message ----- > From: "Gerolf Hoflehner" <ghoflehner at apple.com> > To: "Hal Finkel" <hfinkel at anl.gov> > Cc: "LLVM Dev" <llvmdev at cs.uiuc.edu>, "Jiangning Liu" <liujiangning1 at gmail.com>, "George Burgess IV" > <george.burgess.iv at gmail.co...

[LLVMdev] Dev Meeting BOF: Performance Tracking

2014 Aug 20

On 20 August 2014 00:24, Gerolf Hoflehner <ghoflehner at apple.com> wrote: > My experience from leading BOFs at other conferences is more talk than action. So I suggest a different setup for this topic: how about having a working group meeting with participants who can commit time to work on this topic? Mine too, but in...

Representations of IR in the output of opt

2019 Jun 26

...ere order is irrelevant, like in data flow analysis etc. I pushed back one fix for sccp, and will post one for adce later. Hopefully they will help in your case, but I doubt they are exhaustive. FWIW, there is one bright spot here: I have no (not yet…) example where incorrect code is generated. -Gerolf > On May 31, 2019, at 2:08 AM, Gerolf Hoflehner via llvm-dev <llvm-dev at lists.llvm.org> wrote: > > > There is a non-deterministic problem with the uselists. The code causing this is almost identical in the IR and the bc writer. In some invocations of opt the uselists are sh...

CTMark - regular LLVM and CLANG compile-time tracking

2016 Nov 17

> On Nov 17, 2016, at 2:55 PM, Mehdi Amini <mehdi.amini at apple.com> wrote: > > Hi Gerolf, > > This is really cool! > I’m very excited about this initiative and I hope we’ll be able to get to a stage where compile time regressions are handled like other regressions: if they are not expected / justified by the commit author promptly, the commit should be reverted in the meantime!...

[LLVMdev] Testing the new CFL alias analysis

2014 Sep 15

...ance difference for the 7 benchmarks that compile. This includes bzip2 (although different source base than in CINT2000), mcf, hmmer, sjeng, h264ref, astar, xalancbmk On Sep 15, 2014, at 11:59 AM, Hal Finkel <hfinkel at anl.gov> wrote: > ----- Original Message ----- >> From: "Gerolf Hoflehner" <ghoflehner at apple.com> >> To: "Jiangning Liu" <liujiangning1 at gmail.com>, "George Burgess IV" <george.burgess.iv at gmail.com>, "Hal Finkel" >> <hfinkel at anl.gov> >> Cc: "LLVM Dev" <llvmdev a...

[LLVMdev] Supporting heterogeneous computing in llvm.

2015 Jun 05

Hello All, The last two months I have been working on the design and implementation of a heterogeneous execution engine for LLVM. I started this project as an intern at the Qualcomm Innovation Center and I believe it can be useful to different people and use cases. I am planning to share more details and a set of patches in the next days. However, I would first like to see if there is an interest

[LLVMdev] Proposal: AArch64/ARM64 merge from EuroLLVM

2014 Apr 23

Hi Gerolf, Sorry for the delayed response. I had to get permission to share more details. I am allowed to share relative numbers but not absolute numbers. Any missing test is due to runtime failures (e.g., gcc failure due to the fused multiply pattern bug which Tim fixed later on). Thanks,...

CTMark - regular LLVM and CLANG compile-time tracking

2016 Nov 15

...tests. Internally we also track a set of benchmarks and check for correlations of compile-time regressions to CTMark. And most importantly I think the shared interest and commitment of the community to compile-time will carry this forward. Special thanks to Chris and Michael to get this started! -Gerolf -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20161115/e25a0e7f/attachment.html> -------------- next part -------------- A non-text attachment was scrubbed... Name: PastedGraphic-3.png Type: image/png Size...

Enabling EarlyCSE w/ MemorySSA by default

2017 Jun 28

Can you share your compile-time and memory footprint measurements at least for CTMark? For a new pass/feature it would be great to share this with the community before you commit. Or did I miss them? Thanks Gerolf > On Jun 27, 2017, at 3:26 PM, Geoff Berry via llvm-dev <llvm-dev at lists.llvm.org> wrote: > > EarlyCSE w/ MemorySSA has been enabled by default as of r306477 > > > On 6/19/2017 2:42 PM, Geoff Berry via llvm-dev wrote: >> Sounds good to me. >> >> >...

[LLVMdev] GlobalsModRef (and thus LTO) is completely broken

2015 Jul 17

Can you say what Benchmark or give a test case so we understand the nature of the regression? As Gerolf said, that will be important to understand what is best to do. On Fri, Jul 17, 2015, 06:43 Evgeny Astigeevich <Evgeny.Astigeevich at arm.com> wrote: > Yes, the regression is stable. I double checked this. A full benchmark > run consists of at least 10 sub-runs to validate the score....

Representations of IR in the output of opt

2019 May 30

Hello again, > It may be desirable to sort the table before writing the bitcode out, > adding Peter to the thread for his opinion. Thanks for this! Now it seems I've been optimistic about this result. I have instrumented the test suite to check it on a wider amount of files and quickly discovered that it fails for larger optimization sequences. In particular, the default -O3 set

2015 Oct 02

...metadata how to undo an optimization? Optimizations that attempt to reduce pressure like splitting or remat could be hooked up and call an undo routine based on a cost model. I think there is time to do something longer term. This particular instance can only be an issue under -fast-math. Cheers Gerolf > On Oct 1, 2015, at 9:27 AM, Sanjay Patel via llvm-dev <llvm-dev at lists.llvm.org> wrote: > > Hi Haicheng, > > We need to prevent the transform if it causes spilling, but I'm not sure yet what mechanism/heuristic we can use to do that. > Can you file a bug report wi...

[LLVMdev] GlobalsModRef (and thus LTO) is completely broken

2015 Jul 17

...chmark. > > > > Kind regards, > > Evgeny Astigeevich > > > > > > *From:* llvmdev-bounces at cs.uiuc.edu [mailto:llvmdev-bounces at cs.uiuc.edu] *On > Behalf Of *Evgeny Astigeevich > *Sent:* 15 July 2015 15:12 > > > *To:* 'Chandler Carruth'; Gerolf Hoflehner > *Cc:* LLVM Developers Mailing List > *Subject:* Re: [LLVMdev] GlobalsModRef (and thus LTO) is completely broken > > > > Hi Chandler, > > > > I would like to run some benchmarks on ARM hardware and to look at impact > of your patches on LTO. > > >...

Moderators needed for LLVM Developers' Meeting

2015 Oct 22

...s High-Level IR: A Case Study of Complementing LLVM IR with Language-Specific Optimization, Joseph Groff - Apple Inc. Chris Lattner - Apple Inc. Typeless Pointers in LLVM IR, David Blaikie - Google Inc Session #5: October 30 10:00-10:45: Salon I & II LLVM Performance Improvements and Headroom, Gerolf Hoflehner - Apple Session #6: October 30 11:15-12:45 Salon III & IV Optimizing LLVM for GPGPU, Jingyue Wu - Google Inc. OpenMP GPU/Accelerator support Coming of Age in Clang, Michael Wong - IBM, Alexey Bataev - Intel Session #7: October 30 11:15-12:45: Salon I & II Exception handling in...

Heroic LLVM optimizations

2017 Aug 15

I am a professor at UC Santa Cruz, but I also do consulting at Huawei. Chris Lattner told me that I should post this in the llvm-dev. HiSilicon (Santa Clara office) is looking for some developer capable of implementing the "heroic optimizations" (http://llvm.org/devmtg/2015-10/slides/Gerolf-PerformanceImprovementsAndHeadroom.pdf) in LLVM. Focus on SPEC2006 but also looking at the new SPEC2017. The goal is to match, or get closer, to the Intel compiler with SPEC2006. ICC has a significant advantage. As the talk shows, there is over 10x diff in libquantum, and other benchmarks hav...

[LLVMdev] GlobalsModRef (and thus LTO) is completely broken

2015 Jul 15

..., I would like to run some benchmarks on ARM hardware and to look at impact of your patches on LTO. Kind regards, Evgeny Astigeevich From: llvmdev-bounces at cs.uiuc.edu [mailto:llvmdev-bounces at cs.uiuc.edu] On Behalf Of Chandler Carruth Sent: 15 July 2015 10:45 To: Chandler Carruth; Gerolf Hoflehner Cc: LLVM Developers Mailing List Subject: Re: [LLVMdev] GlobalsModRef (and thus LTO) is completely broken I've fixed the obvious bugs I spotted in r242281. These should be pure correctness improvements. I've sent the two patches I'm imagining to address the core issue...

(RFC) Adjusting default loop fully unroll threshold

2017 Feb 13

On Mon, Feb 13, 2017 at 2:06 PM Gerolf Hoflehner via llvm-dev < llvm-dev at lists.llvm.org> wrote: > For unrolling specifically I agree with Hal that the hooks should be > target specific. Actually, I go further and think they should be uArch > specific. > They already are, it is just that no one has contributed a pa...

[RFC] Generalize llvm.memcpy / llvm.memmove intrinsics.

2015 Aug 21

...struct copy code should probably carry a warning: Do what we say, not what we do. Cheers, Lang. On Thu, Aug 20, 2015 at 4:17 PM, Hal Finkel <hfinkel at anl.gov> wrote: > ----- Original Message ----- > > From: "Lang Hames" <lhames at gmail.com> > > To: "Gerolf Hoflehner" <ghoflehner at apple.com> > > Cc: "Mehdi Amini" <mehdi.amini at apple.com>, "LLVM Developers Mailing > List" <llvm-dev at lists.llvm.org>, "Hal Finkel" > > <hfinkel at anl.gov>, "Philip Reames" <listma...

[LLVMdev] Loop Vectorization and Store-Load Forwarding issue

2015 Jun 12

I have been looking into this small test case (Part A) where loop vectorization is disabled due to possible store-load forwarding conflict (Part B). As you can see, due to the presence of dependence distance 2 the loop is vectorizable only for a width of 2. However, the presence of dependence distance 15 (due to y[j-15]) results in store-load forwarding issue as store packet of y[16:17] (iteration
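
The excerpt above is cut off before its test case (Part A), so the following is only a hypothetical sketch of a loop with the same two dependence distances, 2 and 15. It is not the original code. It models a vector loop in Python as "load every lane of a chunk, then store every lane", and shows why distance 2 limits the legal width to 2. The names scalar_loop and chunked_loop, the loop body, and the array size are assumptions made for illustration. The sketch does not model the store-load forwarding cost that the thread goes on to discuss.

```python
def scalar_loop(y, n):
    """Reference semantics: iterations run one at a time, in order."""
    y = list(y)
    for j in range(15, n):
        y[j] = y[j - 2] + y[j - 15]  # dependence distances 2 and 15
    return y


def chunked_loop(y, n, width):
    """Model a vector loop: each chunk does all its loads before any store."""
    y = list(y)
    j = 15
    while j + width <= n:
        # All lanes read memory as it was before this chunk's stores.
        lanes = [y[j + k - 2] + y[j + k - 15] for k in range(width)]
        y[j:j + width] = lanes
        j += width
    for r in range(j, n):  # scalar remainder iterations
        y[r] = y[r - 2] + y[r - 15]
    return y


init = list(range(1, 41))
same_at_2 = chunked_loop(init, 40, 2) == scalar_loop(init, 40)
same_at_4 = chunked_loop(init, 40, 4) == scalar_loop(init, 40)
print(same_at_2, same_at_4)
```

With width 2, no lane reads an element written in the same chunk, so the result matches the scalar loop. With width 4, the third lane reads y[j] before the first lane has stored it, and the results diverge; the script prints True False.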

[LLVMdev] GlobalsModRef (and thus LTO) is completely broken

2015 Jul 17

...> > *To:* Evgeny Astigeevich; Chandler Carruth > *Cc:* LLVM Developers Mailing List > > *Subject:* Re: [LLVMdev] GlobalsModRef (and thus LTO) is completely broken > > > > Can you say what Benchmark or give a test case so we understand the nature > of the regression? As Gerolf said, that will be important to understand > what is best to do. > > > > On Fri, Jul 17, 2015, 06:43 Evgeny Astigeevich <Evgeny.Astigeevich at arm.com> > wrote: > > Yes, the regression is stable. I double checked this. A full benchmark run > consists of at least 10...
