search for: gemm

Displaying 20 results from an estimated 51 matches for "gemm".

Did you mean: gem
2016 May 17
4
Determination of statements that contain only matrix multiplication
...m>: >> Hi Tobias, >> >> could we use information about memory accesses of a SCoP statement and >> def-use chains to determine statements, which don’t contain matrix >> multiplication of the following form? > > Assuming s/don't/do you want to pattern-match gemm kernels inside larger scops. > > >> for (int i = 0; i < Upper Bound1; i++) >> for (int j = 0; j < Upper Bound2; j++) >> for (int k = 0; k < Upper Bound3; j++) >> C[i][j] += A[i][k] * B[k][j] >> >> We could probably check that memory a...
2016 May 16
2
Determination of statements that contain only matrix multiplication
Hi Tobias, could we use information about memory accesses of a SCoP statement and def-use chains to determine statements, which don’t contain matrix multiplication of the following form? for (int i = 0; i < Upper Bound1; i++) for (int j = 0; j < Upper Bound2; j++) for (int k = 0; k < Upper Bound3; j++) C[i][j] += A[i][k] * B[k][j] We could probably check that memory access
2007 Nov 10
9
problème de téléchargement
salut Alex, je ne peut pas télécharger wxruby-1.9.2 avec rubygems. J''avait wxruby-1.9.0, mais j''ai fait une restauration complète de mon pc hier. J''ai la version 0.9.4 de ruby gems. J''ai un message d''erreur <openURI::HTTPerror> quand je veut installer wxruby-1.9.2. merci _______________________________________________ wxruby-users mailing list
2008 Sep 26
3
rails -1.2.3 to 2.1.1 ? how ?
now i am working in a rails project version 1.2.3 but i like to work in rails 2.1.1 i thing by freexe the 1.2.3 gemm into vendor will solve the problem but i have 35 plugin in my project(which is version 1.2.3) so i worried about freezeing old gem into vendeor any help appreciated? thanks -- Posted via http://www.ruby-forum.com/. --~--~---------~--~----~------------~-------~--~----~ You received this message...
2018 Jan 08
2
Fwd: R/MKL Intel 2018 Compatibility
...libraries Intel 2018, we are facing to an issue with *all INTERNAL command* that are executing in R. The R console is freezing executing a process at 100% and never stop!!! It?s really an issue for us. As example, we can reproduce the error with *crossprod. Crossprod *which is a wrapper of BLAS GEMM (optimized with MKL libraries), in this function it seems that variables are not protected ( PROTECT(); UNPROTECT() ), see the screenshot below, which is a recommendation for external commands: Picture1 *RECOMMANDATION* *Picture2* *Code of CROSSPROD* Picture 3 If we are recoding the CRO...
2013 Aug 12
1
[LLVMdev] [FastPolly]: Update of Polly's performance on LLVM test-suite
..., execution_time(0%) SingleSource/Benchmarks/Polybench/linear-algebra/kernels/3mm/3mm, compile_time(+270.73%), execution_time(0%) SingleSource/Benchmarks/Polybench/linear-algebra/kernels/syrk/syrk, compile_time(+208.57%), execution_time(0%) SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gemm/gemm, compile_time(+202.63%), execution_time(0%) SingleSource/Regression/C/test_indvars, compile_time(+200.00%), execution_time(0%) Results show that some Polly leads to significant compile-time overhead without any execution performance improvement. I have reported a bug for nestedloop (http://...
2013 Sep 02
2
[LLVMdev] [Polly] Comionpile-time of Polly's code generation
Hi all, It seems that Polly's code generation can leads to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis for Polly's code generation can be referred to http://llvm.org/bugs/show_bug.cgi?id=16898. Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generation, but both of them lead to almost double comp...
2013 Sep 25
0
[LLVMdev] [Polly] Performance comparison between Cloog and ISL code generation
...nce Improvements - Compile Time (ISL over Cloog) MultiSource/Benchmarks/ASC_Sequoia/AMGmk/AMGmk -69.11% MultiSource/Benchmarks/Trimaran/netbench-crc/netbench-crc -44.39% SingleSource/Benchmarks/Polybench/linear-algebra/kernels/3mm/3mm -12.74% SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gemm/gemm -11.21% MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt -11.14% SingleSource/Benchmarks/Polybench/linear-algebra/kernels/syr2k/syr2k -11.11% MultiSource/Benchmarks/TSVC/Packing-flt/Packing-flt -10.87% MultiSource/Benchmarks/TSVC/Searching-dbl/Searching-dbl -10.87% SingleSource/...
2018 Jan 23
0
Inclusion of Polly and isl into core LLVM
...e future > * Polly still shows up at the top of the middle-end, but larger > compile time regressions are often due to increased code size (and > the LLVM backend) > * We see non-trivial speedups for hmmer, libquantum, and various > linear-algebra kernels (we use gemm-specific optimizations). The > first two require additional flags to be enabled. On the topic of performance, this paper might be of interest: Fast Polyhedral Abstract Domain Gagandeep Singh, Markus Püschel, Martin Vechev http://www.srl.inf.ethz.ch/papers/POPL17-Polyhedra.pdf > Our e...
2007 Nov 11
0
Recent French communications
...I want to install wxruby-1.9.2. Please Reply (Me): [French:] Salut Sebastien, Il semble que RubyGems est d''avoir avec les erreurs d''indexation et le téléchargement de wxRuby. Elle sera mieux pour vous de télécharger wxRuby - 1.9.2 de Rubyforge page, et l''installer si gemme. Ceci devrait vous permettre d''installer wxRuby 1.9.2. Laters, Mario Steele [English:] Hello Sebastien, It seems that RubyGems is to have mistakes with the indexing and downloading wxRuby. It will be better for you to download wxRuby - 1.9.2 of Rubyforge page, and install it if gem....
2016 May 28
1
Determination of statements that contain only matrix multiplication
...it. Too many negations :-) I'm not sure I followed exactly what you wanted to say, but I understand that this is not the priority since you can get 90% of the performance without worrying about prefetching. > I started to consider prefetching, because it’s used in > implementations of gemm micro-kernels of BLIS framework [3]. If I’m > not mistaken, it’s applied to try to make sure that micro-panel Br is > loaded after micro-panel Ar (as required in [1] p. 11). For example, > its using helps to reduce the execution time of the attached > implementation. Interesting. The B...
2016 May 20
0
Determination of statements that contain only matrix multiplication
...he project, if Tobias doesn’t disagree with it. My understanding is that prefetching isn’t used explicitly in [1] and, according to [2], in some cases 90% of the turbo boost peak of the processor can be attained without it. I started to consider prefetching, because it’s used in implementations of gemm micro-kernels of BLIS framework [3]. If I’m not mistaken, it’s applied to try to make sure that micro-panel Br is loaded after micro-panel Ar (as required in [1] p. 11). For example, its using helps to reduce the execution time of the attached implementation. Refs: [1] - http://www.cs.utexas.edu/...
2018 Jan 15
3
Inclusion of Polly and isl into core LLVM
...y very generic and must be sped up in the future * Polly still shows up at the top of the middle-end, but larger compile time regressions are often due to increased code size (and the LLVM backend) * We see non-trivial speedups for hmmer, libquantum, and various linear-algebra kernels (we use gemm-specific optimizations). The first two require additional flags to be enabled. The precise inclusion agenda has been presented here: http://lists.llvm.org/pipermail/llvm-dev/2017-September/117698.html After having merged communities, I suggest to form a loop optimization working group which join...
2013 Sep 02
0
[LLVMdev] [Polly] Comionpile-time of Polly's code generation
On 09/01/2013 08:02 PM, Star Tan wrote: > Hi all, > > > It seems that Polly's code generation can leads to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis for Polly's code generation can be referred to http://llvm.org/bugs/show_bug.cgi?id=16898. > > > Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generation, but both of them lead to almost...
2013 Sep 08
2
[LLVMdev] [Polly] Compile-time of Polly's code generation
...17:05:52,"Tobias Grosser" <tobias at grosser.es> wrote: >On 09/01/2013 08:02 PM, Star Tan wrote: >> Hi all, >> >> >> It seems that Polly's code generation can leads to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis for Polly's code generation can be referred to http://llvm.org/bugs/show_bug.cgi?id=16898. >> >> >> Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generation, but both of them le...
2018 Jan 15
2
(no subject)
...y very generic and must be sped up in the future * Polly still shows up at the top of the middle-end, but larger compile time regressions are often due to increased code size (and the LLVM backend) * We see non-trivial speedups for hmmer, libquantum, and various linear-algebra kernels (we use gemm-specific optimizations). The first two require additional flags to be enabled. The precise inclusion agenda has been presented here: http://lists.llvm.org/pipermail/llvm-dev/2017-September/117698.html After having merged communities, I suggest to form a loop optimization working group which join...
2013 Aug 11
0
[LLVMdev] [FastPolly]: Update of Polly's performance on LLVM test-suite
On 08/10/2013 06:59 PM, Star Tan wrote: > Hi all, > > I have evaluated Polly's performance on LLVM test-suite with latest LLVM (r188054) and Polly (r187981). Results can be viewed on: http://188.40.87.11:8000. Hi Star Tan, thanks for the update. > There are mainly five new tests and each test is run with 10 samples: > clang (run id = 27): clang -O3 > pollyBasic (run id =
2013 Sep 08
0
[LLVMdev] [Polly] Compile-time of Polly's code generation
...sser" <tobias at grosser.es> wrote: > >> On 09/01/2013 08:02 PM, Star Tan wrote: >>> Hi all, >>> >>> >>> It seems that Polly's code generation can leads to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis for Polly's code generation can be referred to http://llvm.org/bugs/show_bug.cgi?id=16898. >>> >>> >>> Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generation, but bot...
2013 Sep 09
1
[LLVMdev] [Polly] Compile-time of Polly's code generation
...grosser.es> wrote: >> >>> On 09/01/2013 08:02 PM, Star Tan wrote: >>>> Hi all, >>>> >>>> >>>> It seems that Polly's code generation can leads to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis for Polly's code generation can be referred to http://llvm.org/bugs/show_bug.cgi?id=16898. >>>> >>>> >>>> Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generat...
2013 May 03
2
[LLVMdev] [Polly] GSoC Proposal: Reducing LLVM-Polly Compiling overhead
...I get the following output: alias polly-clang alias polly-clang='~/Projekte/polly/build/bin/clang -Xclang -load -Xclang ~/Projekte/polly/build/lib/LLVMPolly.so' grosser at tobilaptop:~/Projekte/polybench$ polly-clang -O3 -mllvm -polly -mllvm -debug-only=polly-cloog linear-algebra/kernels/gemm/gemm.c -I utilities/ utilities/polybench.c -mllvm -polly-ignore-aliasing -DPOLYBENCH_USE_SCALAR_LB :: init_array : entry.split => for.end56 if ((nj >= 1) && (nk >= 1) && (p_1 >= 1) && (p_4 >= 1)) { for (c2=0;c2<=p_4-1;c2+=32) { for (c3=max(-32*f...