search for: saxpy

Displaying 6 results from an estimated 6 matches for "saxpy".

Did you mean: axpy
2016 Dec 31
0
Automatic Insertion of OpenACC/OpenMP directives
...gt; to infer the sizes of memory chunks, and to find dependences within > loops. After that, we use debug information to translate the low-level > information back into annotations that we insert into C/C++ programs. > For instance, if we take a program like this one below: > > void saxpy(float a, float *x, float *y, int n) { > for (int i = 0; i < n; ++i) > y[i] = a*x[i] + y[i]; > } > > Then dawn-cc produces the code below: > > void saxpy(float a, float *x, float *y, int n) { > long long int AI1[6]; > AI1[0] = n - 1; > AI1[1] = 4 * AI1[0]; &...
2016 Dec 31
3
Automatic Insertion of OpenACC/OpenMP directives
...awn-cc, analyzes the LLVM IR to infer the sizes of memory chunks, and to find dependences within loops. After that, we use debug information to translate the low-level information back into annotations that we insert into C/C++ programs. For instance, if we take a program like this one below: void saxpy(float a, float *x, float *y, int n) { for (int i = 0; i < n; ++i) y[i] = a*x[i] + y[i]; } Then dawn-cc produces the code below: void saxpy(float a, float *x, float *y, int n) { long long int AI1[6]; AI1[0] = n - 1; AI1[1] = 4 * AI1[0]; AI1[2] = AI1[1] + 4; AI1[3] = AI1[2] / 4;...
2016 Dec 31
2
Automatic Insertion of OpenACC/OpenMP directives
Dear Mehdi, I've changed your example a little bit: float saxpy(float a, float *x, float *y, int n) { int j = 0; for (int i = 0; i < n; ++i) { y[j] = a*x[i] + y[I]; // Change 'I' into 'j'? ++j; } } I get this code below, once I replace 'I' with 'j'. We are copying n positions of both arrays, 'x' and 'y...
2017 Jan 03
2
Automatic Insertion of OpenACC/OpenMP directives
...t; On Jan 3, 2017, at 7:17 AM, Jonathan Roelofs <jonathan at codesourcery.com> wrote: > > > > On 12/31/16 12:37 PM, Fernando Magno Quintao Pereira via llvm-dev wrote: >> Dear Mehdi, >> >> I've changed your example a little bit: >> >> float saxpy(float a, float *x, float *y, int n) { >> int j = 0; >> for (int i = 0; i < n; ++i) { >> y[j] = a*x[i] + y[I]; // Change 'I' into 'j'? >> ++j; >> } >> } >> >> I get this code below, once I replace 'I' with 'j'. W...
2002 Dec 26
3
R-1.6.2beta_2002-12-20: build problems (PR#2395)
...6.2beta_2002-12-20 were successful on several systems, there were a few failures: ------------------------------------------------------------------------ Apple Darwin 6.2 (MacOS 10.2.2) with gcc-3.2.1: gcc -dynamiclib -flat_namespace -undefined suppress -o .libs/libgslcblas.0.0.0.dylib sasum.lo saxpy.lo ... isamax.lo izamax.lo xerbla.lo -lc -install_name /usr/local/lib/libgslcblas.0.dylib -compatibility_version 1 -current_version 1.0 gcc: suppress: No such file or directory gcc: /usr/local/lib/libgslcblas.0.dylib: No such file or directory gcc: 1: No such file or directory gcc: 1.0: No such f...
2004 May 31
1
Question about building library and BLAS
...not use subroutines isamax.f, sgefa.f sgesl.f in BLAS, as expected. In fact, there are problems to use these subroutines in BLAS, for some reason. Now what I want is to build an R library. The Makefile is the following: LIBNAME=car PKG_LIBS = $(LAPACK_LIBS) $(BLAS_LIBS) $(FLIBS) OBJS=isamax.o saxpy.o sscal.o foo1.o ... foo20.o -llapack -lblas $(LIBNAME)$(SHLIB_EXT): $(OBJS) $(SHLIB_LD) $(SHLIB_LDFLAGS) -o $@ $(OBJS) $(FLIBS) clean: @rm -f *.o *.$(SHLIB_EXT) realclean: clean Some compiling outputs are the following: g77 -mieee-fp -fPIC -O2 -g -pipe -march=i386 -mcpu=i6...