search for: f543

Displaying 2 results from an estimated 2 matches for "f543".

Did you mean: 543
2018 Jun 21
2
NVPTX - Reordering load instructions
...[j][i]; > peri_col[idx][i] /= dia[i][i]; > } NVCC emits PTX instructions where all loads from shared memory are packed together: > ... > ld.shared.f32 %f546, [kernel_dia+440]; > ld.shared.f32 %f545, [%r4+-996]; > ld.shared.f32 %f544, [kernel_dia+56]; > ld.shared.f32 %f543, [kernel_dia+88]; > ld.shared.f32 %f542, [kernel_dia+500]; > ld.shared.f32 %f541, [kernel_dia+84]; > ld.shared.f32 %f540, [%r4+-972]; > ld.shared.f32 %f539, [%r4+-1008]; > ld.shared.f32 %f538, [kernel_dia+496]; > ld.shared.f32 %f537, [kernel_dia+136]; > ld.shared.f3...
2018 Jun 21
2
NVPTX - Reordering load instructions
...PTX instructions where all loads from shared memory are > > packed together: > > > >> ... > >> ld.shared.f32 %f546, [kernel_dia+440]; > >> ld.shared.f32 %f545, [%r4+-996]; > >> ld.shared.f32 %f544, [kernel_dia+56]; > >> ld.shared.f32 %f543, [kernel_dia+88]; > >> ld.shared.f32 %f542, [kernel_dia+500]; > >> ld.shared.f32 %f541, [kernel_dia+84]; > >> ld.shared.f32 %f540, [%r4+-972]; > >> ld.shared.f32 %f539, [%r4+-1008]; > >> ld.shared.f32 %f538, [kernel_dia+496]; > >> ld.s...