search for: sm_

Displaying 4 results from an estimated 4 matches for "sm_".

Did you mean: s_
2008 Jun 30
3
Trying to install an app from floppy
...n.in_ -rwxr-xr-x 1 root wheel 3418 Aug 10 1994 setupw.in_ -rwxr-xr-x 1 root wheel 212659 Aug 10 1994 states.vs_ -rwxr-xr-x 1 root wheel 4888 Aug 10 1994 uninstal.ex_ -rwxr-xr-x 1 root wheel 131568 Aug 10 1994 visio.ex_ -rwxr-xr-x 1 root wheel 183 Aug 10 1994 visio123.sm_ -rwxr-xr-x 1 root wheel 190 Aug 10 1994 visioami.sm_ -rwxr-xr-x 1 root wheel 458 Aug 10 1994 visioami.sx_ -rwxr-xr-x 1 root wheel 222 Aug 10 1994 visioflw.sm_ -rwxr-xr-x 1 root wheel 263 Aug 10 1994 visiolss.bm_ -rwxr-xr-x 1 root wheel 83 Aug 10 1994 visiols...
2017 Feb 07
0
Clang option to provide list of target-subarchs.
...old crufty subarchs you would get with an exclusion flag. We expect that the runtime will match the most appropriate subarch. As is currently done with --cuda-gpu-arch, we expect that the triple for the arch will be implied from the context. That is, if one specifies --target-subarchs="sm_50,gfx702", the software will generate the triples "nvptx64-nvidia-cuda" and "amdgcn--cuda" from the subarchs. Collisions (different archs) for the same subarch are unlikely and indicate a poor choice of subarch names. For example, AMD should never choose sm_ prefix for...
2016 Mar 05
2
instrumenting device code with gpucc
...M 3.9, and I've > written a pass to insert hook functions for certain function calls and > memory accesses. For example, given a CUDA program, say, axpy.cu, I > first compile it with > > clang++ -emit-llvm -c axpy.cu, > > which gives me two bitcode files, axpy.bc and axpy-sm_20.bc. Then I use > opt to load my pass and insert the hook functions to axpy.bc, which works > fine. After inspecting the instrumented axpy.bc, I noticed that the kernel > code was not there; rather, it lived inside axpy-sm_20.bc, so I also load > my pass to instrument axpy-sm_20.bc....
2016 Mar 10
4
instrumenting device code with gpucc
...t hook functions for certain function calls and >>> memory accesses. For example, given a CUDA program, say, axpy.cu, I >>> first compile it with >>> >>> clang++ -emit-llvm -c axpy.cu, >>> >>> which gives me two bitcode files, axpy.bc and axpy-sm_20.bc. Then I use >>> opt to load my pass and insert the hook functions to axpy.bc, which works >>> fine. After inspecting the instrumented axpy.bc, I noticed that the kernel >>> code was not there; rather, it lived inside axpy-sm_20.bc, so I also load >>> my pa...