search for: divergent

Displaying 20 results from an estimated 1059 matches for "divergent".

2018 May 28
0
[RFC] A New Divergence Analysis for LLVM
...sed by VPlan [6] in the mid term. The implementation is based on RV, the Region Vectorizer. -- RV -- The Region Vectorizer (github [1]) is an analysis and transformation framework for outer-loop and whole-function vectorization. RV vectorizes arbitrary, reducible control flow including nested divergent loops through partial control-flow linearization [2]. RV is being used by the Impala [7] and the PACXX [3,8] high performance programming frameworks and implements OpenMP #pragma omp simd and #pragma omp declare simd. -- The Divergence Analysis -- The Divergence Analysis determines how instru...
2017 Jul 21
2
[SPIR/PTX] Divergence analysis for BasicBlocks
Hello, Yes? Where is allActive defined, I couldn't find it. Basically, a BB is control divergent if its execution depends on a branch that itself depends on a divergent SSA value. On Fri, Jul 21, 2017 at 4:13 PM, Zaks, Ayal <ayal.zaks at intel.com> wrote: > What would be the definition of “isControlDivergent(BasicBlock*)”; the > complementary of “allActive(BasicBlock*)” – bl...
2017 Jul 14
2
[SPIR/PTX] Divergence analysis for BasicBlocks
Hello, It seems to me that our current DivergenceAnalysis does not save which BasicBlocks may suffer from divergent control. Am I correct? I want to modify our DivergenceAnalysis to add a "bool isControlDivergent(BasicBlock*) const" method and save the divergent basic blocks in the divergence propagator. I am not sure that is entirely correct; if you have input on that, please let me know. That...
2016 Oct 26
3
RFC: (Co-)Convergent functions and uniform function parameters
On 25.10.2016 16:28, Nicolai Hähnle wrote: > But I fear that this path leads to eternal fuzziness. Let me try a > completely different approach to define what we need by augmenting the > semantics of IR with "divergence tokens". In addition to its usual > value, every IR value carries a "divergence set" of divergence tokens. > > The basic rule is: the
2010 Apr 21
1
Cross-checking a custom function for separability indices
Hi list! I have prepared a custom function (below) in order to calculate separability indices (Divergence, Bhattacharyya, Jeffries-Matusita, Transformed divergence) between two samples of (spectral land cover) classes. I need help to cross-compare results to verify that it works as expected (since I don't know of any other FOSS tool that will quickly give me some results). Does anybody
2017 Dec 06
2
[AMDGPU] Strange results with different address spaces
...analysis does not run in the second pass manager because no metadata is generated at all. > > Could this be a bug in DA reporting the load falsely as uniform by not taking the intrinsics into account? > > Cheers, > Michael > The intrinsics certainly are correctly treated as divergent. Nothing would work otherwise. If I run the annotate pass or analysis on the examples it does the right thing and sees the load as divergent. $ opt -S -analyze -divergence -o - as1.ll Printing analysis 'Divergence Analysis' for function '_ZN5pacxx2v213genericKernelIZL12test_barrieriPPc...
2016 Oct 24
2
RFC: (Co-)Convergent functions and uniform function parameters
...>>>> 1) convergent: a call to a function with this attribute cannot be >>>> moved to have additional control dependencies; i.e., moving it from A >>>> to B is only possible if B dominates or post-dominates A. >>>> >>>> 2) co-convergent (divergent? for lack of a better name...): a call to >>>> a function with this attribute cannot be moved to have _fewer_ >>>> control dependencies; i.e., moving it from A to B is only possible if >>>> A dominates or post-dominates B. >>>> >>>> 3) un...
2015 Jan 24
2
[LLVMdev] Proposal: pragma for branch divergence
Hi, I am considering a language extension to Clang for optimizing GPU programs. This extension will allow the compiler to use different optimization strategies for divergent and non-divergent branches (to be explained below). We have observed significant performance gain by leveraging this proposed extension, so I want to discuss it here to see how the community likes/dislikes the idea. I will focus on the CUDA language and the PTX backend for now, but I believe this a...
2015 Jan 24
2
[LLVMdev] [cfe-dev] Proposal: pragma for branch divergence
...> Hi, I am considering a language extension to Clang for optimizing GPU > programs. This extension will allow the compiler to use different > optimization strategies for divergent and non-divergent branches (to be > explained below). We have observed significant performance gain by > leveraging this proposed extension, so I want to discuss it here to see how > the community likes/dislikes the idea. I will focus on the CUDA language > and the PTX backend for now,...
2015 Jan 25
2
[LLVMdev] [cfe-dev] Proposal: pragma for branch divergence
...lieve you have lots of great insights, but I don't see how cbranch.ifnone instructions directly solve my issue. The issue I am trying to solve is that certain CFG optimizations transform the CFG into a "bad" structure which hurts the performance of the compiled code in the presence of divergent branches. On the other hand, I don't want to disable jump threading altogether because it is still beneficial for non-divergent branches. As far as I can understand, cbranch.ifnone provides a fast path so that a warp can jump over the region that no threads in the warp ever execute. However,...
2016 Oct 31
0
RFC: (Co-)Convergent functions and uniform function parameters
(I work on CUDA / PTX.) For one thing I'm in favor of having fewer annotations rather than more, so if we can do this in a reasonable way without introducing the notion of co-convergent calls, I think that would be a win. The one convergent annotation is difficult enough for the GPU folks to grok and then keep in cache, and everyone who works on llvm has to pay the cost of keeping their
2008 Sep 08
1
Vorticity and Divergence
Hi all, I have some wind data (U and V components) and I would like to compute Vorticity and Divergence of these fields. Is there any R function that can easily do that? Thanks in advance for any help Igor Oliveira CSAG, Dept. Environmental & Geographical Science, University of Cape Town, Private Bag X3, Rondebosch 7701. Tel.: +27 (0)21 650 5774 South Africa Fax: +27 (0)21
2008 Jan 25
5
Ferret+Lucene Index
We use Nutch and Lucene for our heavy-duty text analysis jobs but I'm trying to use Ferret to do some experiments. I understood that Ferret used the same index format as Lucene but I cannot look into a Lucene index with Ferret and cannot read a Ferret index with Luke (the Lucene index browser). Am I doing something wrong or have the formats diverged?
2012 Aug 19
1
moving distance between two sets of data
On the surface this seems pretty simple, but I am flummoxed. I have two sets of numbers that bounce around zero, positive one and negative one. They have a relationship between them, where one diverges away from the other. I want to create a second set of numbers that tracks that divergence. # Let's make some data like mine, kinda Firstset <- runif(100, min = -1 , max =1) Secondset <- runif(100,
2013 Apr 09
1
Solving an integral in R gives the error “The integral is probably divergent”
...XX^2/x)-x) integrated <- integrate(integrand, lower=0, upper=upper)$value (final <- abs(XX)*pi^(-0.5)*exp(2*XX)*integrated) } b<- sapply(a, Cfun, upper=1) The error that I am getting is as follows: Error in integrate(integrand, lower = 0, upper = upper) : the integral is probably divergent Does this mean I cannot solve the integral? Any possible ways to fix this problem will be highly appreciated. The question can be found on http://stackoverflow.com/questions/15892586/solving-an-integral-in-r-gives-error-the-integral-is-probably-divergent also. Thanks.
2017 Dec 05
2
[AMDGPU] Strange results with different address spaces
> On Dec 5, 2017, at 13:53, Matt Arsenault <arsenm2 at gmail.com> wrote: > > > >> On Dec 5, 2017, at 02:51, Haidl, Michael via llvm-dev <llvm-dev at lists.llvm.org <mailto:llvm-dev at lists.llvm.org>> wrote: >> >> Hi dev list, >> >> I am currently exploring the integration of AMDGPU/ROCm into the PACXX project and observing some
2010 Nov 15
1
Version 2.12.0 exe file
...4 (0)07976206103 Email:morris.anglin at ncl.ac.uk Email:morris.anglin at nuth.nhs.uk http://www.ncl.ac.uk/ihs/postgrad/research/studentprofile.htm To me, diversity is more complicated than rituals, food, clothing, etc. ...or race, language, and class, etc. To me, diversity needs to include divergent viewpoints, divergent interpretations, and divergent perspectives. Thus, diversity needs to include giving power and sanctioned space for divergent voices, even if this means disagreeing".
2006 May 10
2
Speex echo canceller on TI C55 DSP
> misc.c provides the ability to override some functions, including the > allocation and printing. fftwrap.c uses speex_alloc, then calls > kiss_fftr_alloc, which calls kiss_fft_alloc, which calls KISS_FFT_MALLOC, > which is defined as malloc in kiss_fft.h. It would make it more consistent > to define KISS_FFT_MALLOC as speex_alloc. That is the only change that I > would
2019 Jul 22
3
Fwd: bugpoint can't automatically select a safe interpreter!
I tried to reduce the test case in https://bugs.llvm.org/show_bug.cgi?id=42706. Here it is crashing opt: $ ~/llvm-debug/bin/opt -use-gpu-divergence-analysis -divergence stripped.ll WARNING: You're attempting to print out a bitcode file. This is inadvisable as it may cause display problems. If you REALLY want to taste LLVM bitcode first-hand, you can force output with the `-f' option.
2015 Aug 21
2
[CUDA/NVPTX] is inlining __syncthreads allowed?
I'm using 7.0. I am attaching the reduced example. nvcc sync.cu -arch=sm_35 -ptx gives // .globl _Z3foov .visible .entry _Z3foov( ) { .reg .pred %p<2>; .reg .s32 %r<3>; mov.u32 %r1, %tid.x; and.b32 %r2, %r1, 1; setp.eq.b32 %p1, %r2, 1; @!%p1 bra BB7_2; bra.uni