thr3ads.net - search: "vecmathlib"

Displaying 4 results from an estimated 4 matches for "vecmathlib".

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

2013 Feb 09

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

The lack of an open-source vector math library (which is what you suggest here) prompted me to start a project "vecmathlib", available at < https://bitbucket.org/eschnett/vecmathlib>. This library provides almost all math functions available in libm, implemented in a vectorised manner, i.e. suitable for SSE2/AVX/MIC/PTX etc. In its current state the library has rough edges, e.g. the precision of many math f...

[LLVMdev] SIMD trigonometry/logarithms?

2013 Feb 05

[LLVMdev] SIMD trigonometry/logarithms?

...AM > Subject: [LLVMdev] SIMD trigonometry/logarithms? > > I was looking at loop vectorizer code and wondered if there was any > current or planned effort to introduce SIMD implementations of > sin/cos/exp/log intrinsics (in particular for x86-64 backend)? I am currently implementing vecmathlib < https://bitbucket.org/eschnett/vecmathlib> for just this purpose. There are implementations of most libm functions for various vector sizes, optimised for particular hardware architectures where e.g. machine instructions are available. Support for x86_64 should be most complete (SSE2, AVX)....

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

2013 Feb 07

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

Hi Justin, gentlemen, I'm afraid I have to escalate this issue at this point. Since it was discussed for the first time last summer, it was sufficient for us for a while to have lowering of math calls into intrinsics disabled at DragonEgg level, and link them against CUDA math functions at LLVM IR level. Now I can say: this is not sufficient any longer, and we need NVPTX backend to deal with

[LLVMdev] Portable Computing Language (pocl) v0.8 released

2013 Aug 12

[LLVMdev] Portable Computing Language (pocl) v0.8 released

...work-group functions with multiple work-items, even in the presence of work-group barriers. These functions are suitable for parallelization in multiple ways (SIMD, VLIW, superscalar,...). This release adds support for LLVM/Clang 3.3, employs inner loop parallelization in the kernel compiler, uses Vecmathlib for inlineable efficient math library implementations, contains plenty of bug fixes, and provides several new OpenCL API implementations. We consider pocl ready for wider scale testing, although the OpenCL 1.2 standard is not yet fully implemented, and it contains known bugs. The pocl 0.8 test sui...

search for: vecmathlib