Displaying 7 results from an estimated 7 matches for "100351".
2017 Jul 01
2
Jacobi 5 Point Stencil Code not Vectorizing
I am able to vectorize it with the following code:
#include <stdio.h>
#define N 100351
// This function computes 2D-5 point Jacobi stencil
void stencil(int a[][N], int b[][N])
{
int i, j, k;
for (k = 0; k < N; k++) {
for (i = 1; i <= N-2; i++)
for (j = 1; j <= N-2; j++)
b[i][j] = 0.25 * (a[i][j] + a[i-1][j] + a[i+1][j] + a[i][j-1] +
a[i][j+1]...
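
The snippet above writes its results into a second array `b` instead of updating `a` in place. With an in-place update, the read of `a[i][j-1]` depends on the value written one iteration earlier, which is a loop-carried dependence on the inner loop. A minimal sketch of the two-buffer pattern follows. It is not the poster's code: `DIM`, `row_t`, `jacobi_sweep` and `jacobi_run` are hypothetical names, the grid is small and uses `float`, and the 0.2 coefficient (a plain five-point average) is an assumption, since the original expression is truncated.

```c
#include <assert.h>
#include <stddef.h>

#define DIM 8                 /* hypothetical small grid; the thread uses N = 100351 */
typedef float row_t[DIM];     /* one grid row */

/* One sweep: reads only from src, writes only to dst.
   restrict promises the compiler that the two grids do not overlap. */
void jacobi_sweep(row_t *restrict src, row_t *restrict dst)
{
    for (int i = 1; i <= DIM - 2; i++)
        for (int j = 1; j <= DIM - 2; j++)
            dst[i][j] = 0.2f * (src[i][j] + src[i - 1][j] + src[i + 1][j] +
                                src[i][j - 1] + src[i][j + 1]);
}

/* Runs `sweeps` sweeps, swapping the roles of the two grids each time.
   Border cells are never written, so both grids must start with the
   same border values. Returns the grid that holds the final result. */
row_t *jacobi_run(row_t *a, row_t *b, int sweeps)
{
    assert(sweeps >= 0);
    row_t *src = a;
    row_t *dst = b;
    for (int s = 0; s < sweeps; s++) {
        jacobi_sweep(src, dst);
        row_t *tmp = src;     /* the grid just written becomes the next input */
        src = dst;
        dst = tmp;
    }
    return src;
}
```

Whether a given compiler actually vectorizes the inner loop still depends on its version and flags; with Clang, `-Rpass=loop-vectorize` and `-Rpass-missed=loop-vectorize` report what the loop vectorizer did.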
2017 Jul 01
3
Jacobi 5 Point Stencil Code not Vectorizing
...; Please specify what is wrong with my code?
>
>
> On Sat, Jul 1, 2017 at 4:08 PM, hameeza ahmed <hahmed2305 at gmail.com>
> wrote:
>
>> Hello,
>>
>> I am trying to vectorize the following stencil code:
>>
>> #include <stdio.h>
>> #define N 100351
>>
>> // This function computes 2D-5 point Jacobi stencil
>> void stencil(int a[restrict][N])
>> {
>> int i, j, k;
>> for (k = 0; k < 100; k++)
>> { for (i = 1; i <= N-2; i++)
>> { for (j = 1; j <= N-2; j++)
>>...
2017 Jun 24
4
AVX Scheduling and Parallelism
Hello,
After generating AVX code for a large number of iterations, I realized that
it still uses only 2 registers, zmm0 and zmm1, when the loop unroll
factor=1024.
I wonder whether this register allocation allows operations to run in parallel?
Also, I know all the elements within a single vector instruction are
computed in parallel, but are the elements of multiple instructions
computed in parallel? like are
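
As a general illustration of the question above (not taken from the thread): separate instructions can overlap only when they do not depend on each other's results. The sketch below contrasts a single dependency chain with four independent accumulators. The names `sum_chain` and `sum_split` are hypothetical, and whether a particular CPU actually overlaps the independent adds depends on its microarchitecture.

```c
#include <assert.h>
#include <stddef.h>

/* Single accumulator: each add needs the result of the previous add,
   so the adds form one serial dependency chain. */
long sum_chain(const int *x, size_t n)
{
    assert(x != NULL || n == 0);
    long s = 0;
    for (size_t i = 0; i < n; i++)
        s += x[i];
    return s;
}

/* Four accumulators: the four adds in one iteration are independent of
   each other, which gives the hardware a chance to overlap them. */
long sum_split(const int *x, size_t n)
{
    assert(x != NULL || n == 0);
    long s0 = 0, s1 = 0, s2 = 0, s3 = 0;
    size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        s0 += x[i];
        s1 += x[i + 1];
        s2 += x[i + 2];
        s3 += x[i + 3];
    }
    for (; i < n; i++)        /* leftover elements when n is not a multiple of 4 */
        s0 += x[i];
    return (s0 + s1) + (s2 + s3);
}
```

Both functions return the same sum; only the dependency structure differs. Seeing only two architectural register names in the generated assembly does not by itself settle the question, because out-of-order cores rename registers internally.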
2017 Jul 01
2
Jacobi 5 Point Stencil Code not Vectorizing
Hello,
I am trying to vectorize the following stencil code:
#include <stdio.h>
#define N 100351
// This function computes 2D-5 point Jacobi stencil
void stencil(int a[restrict][N])
{
int i, j, k;
for (k = 0; k < 100; k++)
{ for (i = 1; i <= N-2; i++)
{ for (j = 1; j <= N-2; j++)
{ a[i][j] = 0.25 * (a[i][j] + a[i-1][j] + a[i+1][j] + a[i][j-1] +
a[i][j+1]...
2017 Oct 23
3
Jacobi 5 Point Stencil Code not Vectorizing
...5:11, "hameeza ahmed via llvm-dev" <llvm-dev@lists.llvm.org>:
> further i modified the code to the following;
>
> #include <stdio.h>
> #define N 100351
>
> // This function computes 2D-5 point Jacobi stencil
> void stencil(int a[restrict][N], int b[restrict][N])
> {
> int i, j, k;
> for (k = 0; k < N; k++) {
> for (i = 1; i <= N-2;...
2017 Oct 24
3
Jacobi 5 Point Stencil Code not Vectorizing
...> suggest?
>>
>> Serge Preis
>>
>>
>> 02.07.2017, 05:11, "hameeza ahmed via llvm-dev" <llvm-dev at lists.llvm.org>:
>> > further i modified the code to the following;
>> >
>> > #include <stdio.h>
>> > #define N 100351
>> >
>> > // This function computes 2D-5 point Jacobi stencil
>> > void stencil(int a[restrict][N], int b[restrict][N])
>> > {
>> > int i, j, k;
>> > for (k = 0; k < N; k++) {
>> > for (i = 1; i <= N-2; i++)
>>...
2010 Jan 21
4
dlmglue fixes
David,
So here are the two patches. Remove all patches that you have and apply
these.
The first one is straightforward.
The second one will hopefully fix the livelock issue you have been
encountering.
People reviewing the patches should note that the second one is slightly
different than the one I posted earlier. It removes the BUG_ON in the if
condition where we jump to update_holders. The