Displaying 20 results from an estimated 51 matches for "gemm".
Did you mean:
gem
2016 May 17
4
Determination of statements that contain only matrix multiplication
...m>:
>> Hi Tobias,
>>
>> could we use information about memory accesses of a SCoP statement and
>> def-use chains to determine statements, which don’t contain matrix
>> multiplication of the following form?
>
> Assuming s/don't/do you want to pattern-match gemm kernels inside larger scops.
>
>
>> for (int i = 0; i < Upper Bound1; i++)
>> for (int j = 0; j < Upper Bound2; j++)
>> for (int k = 0; k < Upper Bound3; j++)
>> C[i][j] += A[i][k] * B[k][j]
>>
>> We could probably check that memory a...
2016 May 16
2
Determination of statements that contain only matrix multiplication
Hi Tobias,
could we use information about memory accesses of a SCoP statement and
def-use chains to determine statements, which don’t contain matrix
multiplication of the following form?
for (int i = 0; i < Upper Bound1; i++)
for (int j = 0; j < Upper Bound2; j++)
for (int k = 0; k < Upper Bound3; j++)
C[i][j] += A[i][k] * B[k][j]
We could probably check that memory access
2007 Nov 10
9
problème de téléchargement
salut Alex, je ne peut pas télécharger wxruby-1.9.2 avec rubygems. J''avait wxruby-1.9.0, mais j''ai fait une restauration complète de mon pc hier.
J''ai la version 0.9.4 de ruby gems.
J''ai un message d''erreur <openURI::HTTPerror> quand je veut installer wxruby-1.9.2.
merci
_______________________________________________
wxruby-users mailing list
2008 Sep 26
3
rails -1.2.3 to 2.1.1 ? how ?
now i am working in a rails project version 1.2.3
but i like to work in rails 2.1.1
i thing by freexe the 1.2.3 gemm into vendor will solve the problem
but i have 35 plugin in my project(which is version 1.2.3)
so i worried about freezeing old gem into vendeor
any help appreciated?
thanks
--
Posted via http://www.ruby-forum.com/.
--~--~---------~--~----~------------~-------~--~----~
You received this message...
2018 Jan 08
2
Fwd: R/MKL Intel 2018 Compatibility
...libraries Intel 2018, we are facing to an
issue with *all INTERNAL command* that are executing in R. The R console is
freezing executing a process at 100% and never stop!!! It?s really an issue
for us.
As example, we can reproduce the error with *crossprod. Crossprod *which is
a wrapper of BLAS GEMM (optimized with MKL libraries), in this function it
seems that variables are not protected ( PROTECT(); UNPROTECT() ), see the
screenshot below, which is a recommendation for external commands:
Picture1
*RECOMMANDATION*
*Picture2*
*Code of CROSSPROD*
Picture 3
If we are recoding the CRO...
2013 Aug 12
1
[LLVMdev] [FastPolly]: Update of Polly's performance on LLVM test-suite
..., execution_time(0%)
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/3mm/3mm,
compile_time(+270.73%), execution_time(0%)
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/syrk/syrk,
compile_time(+208.57%), execution_time(0%)
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gemm/gemm,
compile_time(+202.63%), execution_time(0%)
SingleSource/Regression/C/test_indvars,
compile_time(+200.00%), execution_time(0%)
Results show that some Polly leads to significant compile-time overhead without any execution performance improvement.
I have reported a bug for nestedloop (http://...
2013 Sep 02
2
[LLVMdev] [Polly] Comionpile-time of Polly's code generation
Hi all,
It seems that Polly's code generation can leads to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis for Polly's code generation can be referred to http://llvm.org/bugs/show_bug.cgi?id=16898.
Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generation, but both of them lead to almost double comp...
2013 Sep 25
0
[LLVMdev] [Polly] Performance comparison between Cloog and ISL code generation
...nce Improvements - Compile Time (ISL over Cloog)
MultiSource/Benchmarks/ASC_Sequoia/AMGmk/AMGmk -69.11%
MultiSource/Benchmarks/Trimaran/netbench-crc/netbench-crc -44.39%
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/3mm/3mm -12.74%
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gemm/gemm -11.21%
MultiSource/Benchmarks/TSVC/LoopRerolling-flt/LoopRerolling-flt -11.14%
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/syr2k/syr2k -11.11%
MultiSource/Benchmarks/TSVC/Packing-flt/Packing-flt -10.87%
MultiSource/Benchmarks/TSVC/Searching-dbl/Searching-dbl -10.87%
SingleSource/...
2018 Jan 23
0
Inclusion of Polly and isl into core LLVM
...e future
> * Polly still shows up at the top of the middle-end, but larger
> compile time regressions are often due to increased code size (and
> the LLVM backend)
> * We see non-trivial speedups for hmmer, libquantum, and various
> linear-algebra kernels (we use gemm-specific optimizations). The
> first two require additional flags to be enabled.
On the topic of performance, this paper might be of interest:
Fast Polyhedral Abstract Domain
Gagandeep Singh, Markus Püschel, Martin Vechev
http://www.srl.inf.ethz.ch/papers/POPL17-Polyhedra.pdf
> Our e...
2007 Nov 11
0
Recent French communications
...I want to install
wxruby-1.9.2.
Please
Reply (Me):
[French:]
Salut Sebastien,
Il semble que RubyGems est d''avoir avec les erreurs d''indexation et le
téléchargement de wxRuby. Elle sera mieux pour vous de télécharger
wxRuby - 1.9.2 de Rubyforge page, et l''installer si gemme. Ceci devrait
vous permettre d''installer wxRuby 1.9.2.
Laters,
Mario Steele
[English:]
Hello Sebastien,
It seems that RubyGems is to have mistakes with the indexing and
downloading wxRuby. It will be better for you to download wxRuby - 1.9.2
of Rubyforge page, and install it if gem....
2016 May 28
1
Determination of statements that contain only matrix multiplication
...it.
Too many negations :-) I'm not sure I followed exactly what you wanted
to say, but I understand that this is not the priority since you can get
90% of the performance without worrying about prefetching.
> I started to consider prefetching, because it’s used in
> implementations of gemm micro-kernels of BLIS framework [3]. If I’m
> not mistaken, it’s applied to try to make sure that micro-panel Br is
> loaded after micro-panel Ar (as required in [1] p. 11). For example,
> its using helps to reduce the execution time of the attached
> implementation.
Interesting. The B...
2016 May 20
0
Determination of statements that contain only matrix multiplication
...he project, if Tobias doesn’t
disagree with it. My understanding is that prefetching isn’t used
explicitly in [1] and, according to [2], in some cases 90% of the
turbo boost peak of the processor can be attained without it.
I started to consider prefetching, because it’s used in
implementations of gemm micro-kernels of BLIS framework [3]. If I’m
not mistaken, it’s applied to try to make sure that micro-panel Br is
loaded after micro-panel Ar (as required in [1] p. 11). For example,
its using helps to reduce the execution time of the attached
implementation.
Refs:
[1] - http://www.cs.utexas.edu/...
2018 Jan 15
3
Inclusion of Polly and isl into core LLVM
...y very generic and must be sped up in the future
* Polly still shows up at the top of the middle-end, but larger compile time regressions are often due to increased code size (and the LLVM backend)
* We see non-trivial speedups for hmmer, libquantum, and various linear-algebra kernels (we use gemm-specific optimizations). The first two require additional flags to be enabled.
The precise inclusion agenda has been presented here:
http://lists.llvm.org/pipermail/llvm-dev/2017-September/117698.html
After having merged communities, I suggest to form a loop optimization working group which join...
2013 Sep 02
0
[LLVMdev] [Polly] Comionpile-time of Polly's code generation
On 09/01/2013 08:02 PM, Star Tan wrote:
> Hi all,
>
>
> It seems that Polly's code generation can leads to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis for Polly's code generation can be referred to http://llvm.org/bugs/show_bug.cgi?id=16898.
>
>
> Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generation, but both of them lead to almost...
2013 Sep 08
2
[LLVMdev] [Polly] Compile-time of Polly's code generation
...17:05:52,"Tobias Grosser" <tobias at grosser.es> wrote:
>On 09/01/2013 08:02 PM, Star Tan wrote:
>> Hi all,
>>
>>
>> It seems that Polly's code generation can leads to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis for Polly's code generation can be referred to http://llvm.org/bugs/show_bug.cgi?id=16898.
>>
>>
>> Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generation, but both of them le...
2018 Jan 15
2
(no subject)
...y very generic and must be sped up in the future
* Polly still shows up at the top of the middle-end, but larger compile time regressions are often due to increased code size (and the LLVM backend)
* We see non-trivial speedups for hmmer, libquantum, and various linear-algebra kernels (we use gemm-specific optimizations). The first two require additional flags to be enabled.
The precise inclusion agenda has been presented here:
http://lists.llvm.org/pipermail/llvm-dev/2017-September/117698.html
After having merged communities, I suggest to form a loop optimization working group which join...
2013 Aug 11
0
[LLVMdev] [FastPolly]: Update of Polly's performance on LLVM test-suite
On 08/10/2013 06:59 PM, Star Tan wrote:
> Hi all,
>
> I have evaluated Polly's performance on LLVM test-suite with latest LLVM (r188054) and Polly (r187981). Results can be viewed on: http://188.40.87.11:8000.
Hi Star Tan,
thanks for the update.
> There are mainly five new tests and each test is run with 10 samples:
> clang (run id = 27): clang -O3
> pollyBasic (run id =
2013 Sep 08
0
[LLVMdev] [Polly] Compile-time of Polly's code generation
...sser" <tobias at grosser.es> wrote:
>
>> On 09/01/2013 08:02 PM, Star Tan wrote:
>>> Hi all,
>>>
>>>
>>> It seems that Polly's code generation can leads to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis for Polly's code generation can be referred to http://llvm.org/bugs/show_bug.cgi?id=16898.
>>>
>>>
>>> Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generation, but bot...
2013 Sep 09
1
[LLVMdev] [Polly] Compile-time of Polly's code generation
...grosser.es> wrote:
>>
>>> On 09/01/2013 08:02 PM, Star Tan wrote:
>>>> Hi all,
>>>>
>>>>
>>>> It seems that Polly's code generation can leads to high compile-time overhead, especially for PolyBench applications such as 2mm, 3mm, gemm, syrk, etc. Some basic evaluation and analysis for Polly's code generation can be referred to http://llvm.org/bugs/show_bug.cgi?id=16898.
>>>>
>>>>
>>>> Currently, we can choose to run -polly-code-generator=cloog or -polly-code-generator=isl for code generat...
2013 May 03
2
[LLVMdev] [Polly] GSoC Proposal: Reducing LLVM-Polly Compiling overhead
...I get the following output:
alias polly-clang
alias polly-clang='~/Projekte/polly/build/bin/clang -Xclang -load
-Xclang ~/Projekte/polly/build/lib/LLVMPolly.so'
grosser at tobilaptop:~/Projekte/polybench$ polly-clang -O3 -mllvm -polly
-mllvm -debug-only=polly-cloog linear-algebra/kernels/gemm/gemm.c -I
utilities/ utilities/polybench.c -mllvm -polly-ignore-aliasing
-DPOLYBENCH_USE_SCALAR_LB
:: init_array : entry.split => for.end56
if ((nj >= 1) && (nk >= 1) && (p_1 >= 1) && (p_4 >= 1)) {
for (c2=0;c2<=p_4-1;c2+=32) {
for
(c3=max(-32*f...