search for: divergence

Displaying 20 results from an estimated 1052 matches for "divergence".

2018 May 28
0
[RFC] A New Divergence Analysis for LLVM
TL;DR This RFC is a joint effort by Intel and Saarland University to bring the divergence analysis of the Region Vectorizer [1,2,3,4,5] (dubbed the vectorization analysis of RV) to LLVM. The implementation is available on GitHub for feedback [0]. The existing divergence analysis infrastructure in LLVM has conceptual limitations (structured control, SCEV-based). The new analysis reso...
2017 Jul 21
2
[SPIR/PTX] Divergence analysis for BasicBlocks
...alf Karrenberg used in his thesis. > > > > *From:* llvm-dev [mailto:llvm-dev-bounces at lists.llvm.org] *On Behalf Of *Alexandre > Isoard via llvm-dev > *Sent:* Friday, July 14, 2017 20:53 > *To:* llvm-dev <llvm-dev at lists.llvm.org> > *Subject:* [llvm-dev] [SPIR/PTX] Divergence analysis for BasicBlocks > > > > Hello, > > > > It seems to me that our current DivergenceAnalysis does not save which > BasicBlocks may suffer from divergent control. Am I correct? > > > > I want to modify our DivergenceAnalysis to add a "bool > isCon...
2017 Jul 14
2
[SPIR/PTX] Divergence analysis for BasicBlocks
Hello, It seems to me that our current DivergenceAnalysis does not save which BasicBlocks may suffer from divergent control. Am I correct? I want to modify our DivergenceAnalysis to add a "bool isControlDivergent(BasicBlock*) const" method and save in the divergence propagator the basic blocks that are divergent. I am not sure that is ent...
2016 Oct 26
3
RFC: (Co-)Convergent functions and uniform function parameters
On 25.10.2016 16:28, Nicolai Hähnle wrote: > But I fear that this path leads to eternal fuzziness. Let me try a > completely different approach to define what we need by augmenting the > semantics of IR with "divergence tokens". In addition to its usual > value, every IR value carries a "divergence set" of divergence tokens. > > The basic rule is: the divergence set of a value is (at least) the union > of the divergence sets of its operands. > > Every function input carries a uniqu...
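The basic rule quoted in this snippet (the divergence set of a value is at least the union of the divergence sets of its operands) can be illustrated with a small sketch. This is only an illustration under stated assumptions, not the proposal's implementation: the mini-IR of `(result, operands)` pairs, the function name `propagate_divergence`, and the token names `t_a` and `t_b` are all hypothetical, and how inputs receive their tokens is chosen here purely for the example.

```python
from typing import Dict, FrozenSet, List, Tuple

# Hypothetical mini-IR: each instruction is (result_name, [operand_names]).
Instr = Tuple[str, List[str]]


def propagate_divergence(
    inputs: Dict[str, FrozenSet[str]], instrs: List[Instr]
) -> Dict[str, FrozenSet[str]]:
    """Give every result the union of its operands' divergence sets.

    Assumes instructions are listed in definition order, so every operand
    already has a divergence set when it is used.
    """
    sets = dict(inputs)
    for result, operands in instrs:
        acc: FrozenSet[str] = frozenset()
        for op in operands:
            acc |= sets[op]
        sets[result] = acc
    return sets


# Assumption for illustration only: "a" and "b" carry one token each,
# while "c" carries none.
inputs = {
    "a": frozenset({"t_a"}),
    "b": frozenset({"t_b"}),
    "c": frozenset(),
}
program = [
    ("x", ["a", "c"]),  # x picks up the token of a
    ("y", ["x", "b"]),  # y picks up the tokens of both a and b
    ("z", ["c", "c"]),  # z stays token-free
]
result = propagate_divergence(inputs, program)
```

The sketch computes exactly the union, which is the smallest set the "at least" in the quoted rule allows; a real analysis could conservatively add more tokens.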
2010 Apr 21
1
Cross-checking a custom function for separability indices
Hi list! I have prepared a custom function (below) in order to calculate separability indices (Divergence, Bhattacharyya, Jeffries-Matusita, Transformed divergence) between two samples of (spectral land cover) classes. I need help to cross-compare results to verify that it works as expected (since I don't know of any other FOSS tool that will quickly give me some results). Does anybody use anot...
2017 Dec 06
2
[AMDGPU] Strange results with different address spaces
...taking the intrinsics into account? > > Cheers, > Michael > The intrinsics certainly are correctly treated as divergent. Nothing would work otherwise. If I run the annotate pass or analysis on the examples it does the right thing and sees the load as divergent. $ opt -S -analyze -divergence -o - as1.ll Printing analysis 'Divergence Analysis' for function '_ZN5pacxx2v213genericKernelIZL12test_barrieriPPcE3$_0EEvT_': DIVERGENT: %6 = tail call i32 @llvm.amdgcn.workitem.id.x() #0, !range !11 DIVERGENT: %add.i.i.i.i.i = add nsw i32 %mul.i.i.i.i.i, %6 DIVERGENT: %idxprom....
2016 Oct 24
2
RFC: (Co-)Convergent functions and uniform function parameters
> On Oct 24, 2016, at 4:15 PM, Nicolai Hähnle <nhaehnle at gmail.com> wrote: > > On 25.10.2016 01:11, Nicolai Hähnle wrote: >> On 24.10.2016 21:54, Mehdi Amini wrote: >>>> On Oct 24, 2016, at 12:38 PM, Nicolai Hähnle via llvm-dev >>>> <llvm-dev at lists.llvm.org> wrote: >>>> Some brain-storming on an issue with SPMD/SIMT backend
2015 Jan 24
2
[LLVMdev] Proposal: pragma for branch divergence
...ignificant performance gain by leveraging this proposed extension, so I want to discuss it here to see how the community likes/dislikes the idea. I will focus on the CUDA language and the PTX backend for now, but I believe this addition will benefit OpenCL and its backends too. Background on branch divergence: CUDA programs have a very different execution model for code with branches. A CUDA program is executed by an array of threads broken into groups called warps. A warp typically contains 32 threads, and all the threads in a warp execute instructions in lock-step, i.e., executing the same instruction a...
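To make the warp model in this snippet concrete, here is a minimal sketch that groups thread IDs into warps of 32 and reports which warps contain threads that disagree on a branch condition. It is a plain-Python illustration, not CUDA code and not part of the proposal; the function name `divergent_warps` and the two example conditions are assumptions made for the example.

```python
from typing import Callable, List

WARP_SIZE = 32  # "typically 32" per the text above; not guaranteed everywhere


def divergent_warps(
    num_threads: int,
    cond: Callable[[int], bool],
    warp_size: int = WARP_SIZE,
) -> List[int]:
    """Return the indices of warps whose threads disagree on cond(tid)."""
    diverged = []
    for start in range(0, num_threads, warp_size):
        end = min(start + warp_size, num_threads)
        outcomes = {cond(tid) for tid in range(start, end)}
        if len(outcomes) > 1:  # both branch directions taken inside one warp
            diverged.append(start // warp_size)
    return diverged


# Condition that varies thread by thread: every warp diverges.
per_thread = divergent_warps(128, lambda tid: tid % 2 == 0)

# Condition that is constant within each warp: no warp diverges.
per_warp = divergent_warps(128, lambda tid: (tid // WARP_SIZE) % 2 == 0)
```

With 128 threads, the first condition splits every one of the four warps, while the second is uniform inside each warp, so no warp takes both branch directions.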
2015 Jan 24
2
[LLVMdev] [cfe-dev] Proposal: pragma for branch divergence
...ce gain by > leveraging this proposed extension, so I want to discuss it here to see how > the community likes/dislikes the idea. I will focus on the CUDA language > and the PTX backend for now, but I believe this addition will benefit > OpenCL and its backends too. Background on branch divergence: CUDA programs > have a very different execution model for code with branches. A CUDA > program is executed by an array of threads broken into groups called warps. > A warp typically contains 32 threads, and all the threads in a warp execute > instructions in lock-step, i.e., executing th...
2015 Jan 25
2
[LLVMdev] [cfe-dev] Proposal: pragma for branch divergence
...> leveraging this proposed extension, so I want to discuss it here to see how >> the community likes/dislikes the idea. I will focus on the CUDA language >> and the PTX backend for now, but I believe this addition will benefit >> OpenCL and its backends too. Background on branch divergence: CUDA programs >> have a very different execution model for code with branches. A CUDA >> program is executed by an array of threads broken into groups called warps. >> A warp typically contains 32 threads, and all the threads in a warp execute >> instructions in lock-step, i....
2016 Oct 31
0
RFC: (Co-)Convergent functions and uniform function parameters
...llvm-dev <llvm-dev at lists.llvm.org> wrote: > On 25.10.2016 16:28, Nicolai Hähnle wrote: >> >> But I fear that this path leads to eternal fuzziness. Let me try a >> completely different approach to define what we need by augmenting the >> semantics of IR with "divergence tokens". In addition to its usual >> value, every IR value carries a "divergence set" of divergence tokens. >> >> The basic rule is: the divergence set of a value is (at least) the union >> of the divergence sets of its operands. >> >> Every functi...
2008 Sep 08
1
Vorticity and Divergence
Hi all, I have some wind data (U and V components) and I would like to compute Vorticity and Divergence of these fields. Is there any R function that can easily do that? Thanks in advance for any help Igor Oliveira CSAG, Dept. Environmental & Geographical Science, University of Cape Town, Private Bag X3, Rondebosch 7701. Tel.: +27 (0)21 650 5774 South Africa Fax: +27 (0)21 650 577...
2008 Jan 25
5
Ferret+Lucene Index
We use Nutch and Lucene for our heavy-duty text analysis jobs, but I'm trying to use Ferret to do some experiments. I understood that Ferret used the same index format as Lucene, but I cannot look into a Lucene index with Ferret and cannot read a Ferret index with Luke (the Lucene index browser). Am I doing something wrong, or have the formats diverged?
2012 Aug 19
1
moving distance between two sets of data
On the surface this seems pretty simple, but I'm flummoxed. I have two sets of numbers that bounce around zero, between positive one and negative one. There is a relationship between them, where one diverges away from the other. I want to create a second set of numbers that tracks that divergence.
#Let's make some data like mine, kinda
Firstset <- runif(100, min = -1, max = 1)
Secondset <- runif(100, min = -1, max = 1)
#So something like:
Divergence <- abs(Firstset - Secondset)
#but this doesn't work because when Firstset is at .5 and Secondset is at -.25 it returns .25 in...
2013 Apr 09
1
Solving an integral in R gives the error “The integral is probably divergent”
I am trying to solve an integral in R. However, I am getting an error when I am trying to solve for that integral. The equation that I am trying to solve is as follows: $$ C_m = \frac{|x| e^{2x}}{\pi^{1/2}} \int_0^t t^{-3/2} e^{-x^2/t - t} \, dt $$ The code that I am using is as follows: a <- seq(from=-10, by=0.5,length=100) ## Create a function to compute
2017 Dec 05
2
[AMDGPU] Strange results with different address spaces
> On Dec 5, 2017, at 13:53, Matt Arsenault <arsenm2 at gmail.com> wrote: > > > >> On Dec 5, 2017, at 02:51, Haidl, Michael via llvm-dev <llvm-dev at lists.llvm.org <mailto:llvm-dev at lists.llvm.org>> wrote: >> >> Hi dev list, >> >> I am currently exploring the integration of AMDGPU/ROCm into the PACXX project and observing some
2010 Nov 15
1
Version 2.12.0 exe file
I have R version 2.9.1 on my computer, and the analysis is not working because I need to update to R version 2.12.0, the latest release. The person in charge of IT tried to download R version 2.12.0, but the .exe file referenced in the install isn't there. What might we be doing wrong? We downloaded it as a tar.gz, uncompressed it, then looked for the .exe file, but it is not present. Many thanks, Morris
2006 May 10
2
Speex echo canceller on TI C55 DSP
> misc.c provides the ability to override some functions, including the > allocation and printing. fftwrap.c uses speex_alloc, then calls > kiss_fftr_alloc, which calls kiss_fft_alloc, which calls KISS_FFT_MALLOC, > which is defined as malloc in kiss_fft.h. It would make it more consistent > to define KISS_FFT_MALLOC as speex_alloc. That is the only change that I > would
2019 Jul 22
3
Fwd: bugpoint can't automatically select a safe interpreter!
I tried to reduce the test case in https://bugs.llvm.org/show_bug.cgi?id=42706. Here it is crashing opt: $ ~/llvm-debug/bin/opt -use-gpu-divergence-analysis -divergence stripped.ll WARNING: You're attempting to print out a bitcode file. This is inadvisable as it may cause display problems. If you REALLY want to taste LLVM bitcode first-hand, you can force output with the `-f' option. opt: /home/jayfoad2/git/llvm-project/llvm/lib/Analy...
2015 Aug 21
2
[CUDA/NVPTX] is inlining __syncthreads allowed?
I'm using 7.0. I am attaching the reduced example. nvcc sync.cu -arch=sm_35 -ptx gives // .globl _Z3foov .visible .entry _Z3foov( ) { .reg .pred %p<2>; .reg .s32 %r<3>; mov.u32 %r1, %tid.x; and.b32 %r2, %r1, 1; setp.eq.b32 %p1, %r2, 1; @!%p1 bra BB7_2; bra.uni