Displaying 7 results from an estimated 7 matches for "sgemm".

2003 Feb 04
1
Bug in configure script for Solaris v9 arch with libsunperf (PR#2530)
...tails: -------------- Sun Blade 1000, Solaris 8, Sun ONE Compiler Suite (rebranded Forte 7), R versions 1.6.0 and 1.6.2 - suspect others are affected, too Problem description: ------------------- Using the recommended env var setting from p.15 of the Admin Guide, the configure script fails the sgemm sunperf test, even though the sunperf libraries are available Not linking with libsunperf causes drastic performance hits on some simple benchmarks I've tried (e.g. matrix multiplication can take up to 100 times as long for a sufficiently large matrix (n=5000)) Solution: -------- Configure s...
2013 Apr 25
1
[LLVMdev] getNodePriority()
...ock of operations is bounded at either end by an OpenCL barrier (an AMDIL fence instruction). The loads and multiply/adds are ordinarily interleaved... that is, the IR going in to code generation looks like: %39 = load float addrspace(3)* getelementptr inbounds ([16 x [17 x float]] addrspace(3)* @sgemm.b, i32 0, i32 0, i32 0), align 4 %40 = call float @llvm.fmuladd.f32(float %37, float %39, float %c0.037) nounwind %41 = load float addrspace(3)* getelementptr inbounds ([16 x [17 x float]] addrspace(3)* @sgemm.b, i32 0, i32 0, i32 1), align 4 %42 = call float @llvm.fmuladd.f32(float %37, floa...
2014 Aug 11
2
[LLVMdev] [RFC] OpenMP offload infrastructure
...ng llvm-ir in the fat binary may have the same performance issues mentioned below. The fat binary discussed in the proposal has provision for storing the isa/llvm-ir. My point is instead of llvm-ir it should be something like spir. Ok - so let's see some data. #1 Benchmarks showing at least SPIR dgemm/sgemm performance #2 Some logical explanation why all the extra work for SPIR when LLVM IR is native Basically besides an opinion or because it's "shiny" some solid technical reason. I hate to repeat myself, but again.. why on earth would a solution which is closed source be preferred...
2005 Feb 23
1
BLAS or ATLAS?
I believe that actually means configure was able to use the BLAS you specified (MKL in this case). You can scan through config.log to make sure. Andy > From: > > Fedora Core 3 > > I installed the Intel MKL, > > and tried : --with-blas="-lmkl -lguide -lpthread" > > and got : External libraries: readline, BLAS(generic) > > thus I'm
2014 Aug 11
2
[LLVMdev] [RFC] OpenMP offload infrastructure
On 08/11/14 01:03 PM, Das, Dibyendu wrote: > I didn’t see SPIR discussed anywhere. This isn't OpenCL and depending on OpenCL for OpenMP may not really make sense. While I have my own opinions - If you feel strongly that it will help enable higher performance somewhere please list those reasons. ---------- More specifically LLVM has a native AMD dGPU backend that is tightly coupled to the
2005 May 11
5
Implementing R on IBM p690 cluster
...ne required checking for __setfpucw... no checking for working calloc... no checking for working finite... yes checking for working log... yes checking for working log1p... yes checking for working strptime... yes checking whether ftell works correctly on files opened for append... yes checking for sgemm... no checking for ATL_xerbla in -latlas... no checking for sgemm in -lblas... yes checking for dgemm in -ldgemm... no checking for sgemm in -lblas... (cached) yes checking for sgemm in -lessl... yes checking wchar.h usability... yes checking wchar.h presence... yes checking for wchar.h... yes chec...
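The configure output in the snippet above is easier to read once the individual checks are separated. The following is a minimal sketch, not part of the original thread: it assumes autoconf-style lines of the form "checking ... yes/no", and the names `CHECK_RE` and `parse_checks` are hypothetical helpers introduced only for illustration. The sample text is copied from the snippet.

```python
import re

# Matches autoconf-style lines such as "checking for sgemm in -lblas... yes".
# The optional "(cached) " marker is skipped so only yes/no is captured.
CHECK_RE = re.compile(r"checking (.+?)\.\.\. (\(cached\) )?(yes|no)\b")


def parse_checks(text):
    """Return (description, result) pairs for each configure check in text."""
    return [(m.group(1), m.group(3)) for m in CHECK_RE.finditer(text)]


# Sample taken from the configure output quoted in the snippet above.
sample = (
    "checking for sgemm... no "
    "checking for ATL_xerbla in -latlas... no "
    "checking for sgemm in -lblas... yes "
    "checking for dgemm in -ldgemm... no "
    "checking for sgemm in -lblas... (cached) yes "
    "checking for sgemm in -lessl... yes"
)

# Keep only the BLAS-related checks that succeeded.
blas_hits = [desc for desc, result in parse_checks(sample)
             if "gemm" in desc and result == "yes"]
print(blas_hits)
```

On this sample, the successful checks are the ones that look for sgemm in -lblas and -lessl, which suggests those are the libraries configure could link against.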
2019 Mar 19
3
Scalable Vector Types in IR - Next Steps?
On Tue, Mar 19, 2019 at 4:11 AM Graham Hunter <Graham.Hunter at arm.com> wrote: > Hi Eric and Chandler, > > I appreciate your concerns; I don't think the impact will be that great, > but then it's > rather easy for me to keep SVE in mind when working on other parts of the > codebase > given how long I've spent working on it. > > Are there any