search for: f542

Displaying 2 results from an estimated 2 matches for "f542".

2018 Jun 21
2
NVPTX - Reordering load instructions
...> }
NVCC emits PTX instructions where all loads from shared memory are packed together:
> ...
> ld.shared.f32 %f546, [kernel_dia+440];
> ld.shared.f32 %f545, [%r4+-996];
> ld.shared.f32 %f544, [kernel_dia+56];
> ld.shared.f32 %f543, [kernel_dia+88];
> ld.shared.f32 %f542, [kernel_dia+500];
> ld.shared.f32 %f541, [kernel_dia+84];
> ld.shared.f32 %f540, [%r4+-972];
> ld.shared.f32 %f539, [%r4+-1008];
> ld.shared.f32 %f538, [kernel_dia+496];
> ld.shared.f32 %f537, [kernel_dia+136];
> ld.shared.f32 %f536, [%r4+-976];
> ld.shared.f32 %...
2018 Jun 21
2
NVPTX - Reordering load instructions
...e
> > packed together:
> >
> >> ...
> >> ld.shared.f32 %f546, [kernel_dia+440];
> >> ld.shared.f32 %f545, [%r4+-996];
> >> ld.shared.f32 %f544, [kernel_dia+56];
> >> ld.shared.f32 %f543, [kernel_dia+88];
> >> ld.shared.f32 %f542, [kernel_dia+500];
> >> ld.shared.f32 %f541, [kernel_dia+84];
> >> ld.shared.f32 %f540, [%r4+-972];
> >> ld.shared.f32 %f539, [%r4+-1008];
> >> ld.shared.f32 %f538, [kernel_dia+496];
> >> ld.shared.f32 %f537, [kernel_dia+136];
> >> ld....