search for: gpu4

Displaying 3 results from an estimated 3 matches for "gpu4".

2012 Nov 21
0
[LLVMdev] LLVM Archive Format Extension Proposal
...ler emits references to support routine "foo" as "foo$gpu1". Then you construct the support libraries with aliases for each support function. So a particular implementation of foo in an archive may show up in the symbol table through its aliases "foo$gpu3", "foo$gpu4", "foo$gpu7". When the linker is only looking for "foo$gpu1", it will ignore all other foo implementations and just pick the one aliased to "foo$gpu1". BTW, Apple/Darwin has a similar issue with supporting multiple CPUs. Our solution is "fat" archi...
2012 Nov 21
7
[LLVMdev] LLVM Archive Format Extension Proposal
AMD would like to add new functionality to ranlib (and later ar and nm) and to the bits of LLVM Core that read (and later write) archives. Herewith a terse summary of the change, which we want in order to improve support for OpenCL on multiple GPUs in a single run-time. Conceptually, a serialized archive is really 2 pieces: a few header members and a set of normal file members. There are no constraints on
2012 Nov 22
2
[LLVMdev] LLVM Archive Format Extension Proposal
...ler emits references to support routine "foo" as "foo$gpu1". Then you construct the support libraries with aliases for each support function. So a particular implementation of foo in an archive may show up in the symbol table through its aliases "foo$gpu3", "foo$gpu4", "foo$gpu7". When the linker is only looking for "foo$gpu1", it will ignore all other foo implementations and just pick the one aliased to "foo$gpu1". I'll have to think about this… We try to treat the IR emitted by the front-end as pretty family/device ne...
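
The alias scheme quoted in the snippets above can be illustrated with a small sketch. This is only a toy model, not how ar, ranlib, or any real linker is implemented: the member names (foo_a.o, foo_b.o), the helper functions, and the "first definition wins" rule are all assumptions made for illustration. Only the symbol names "foo$gpu1", "foo$gpu3", "foo$gpu4", and "foo$gpu7" come from the quoted message.

```python
# Toy model of an archive symbol table in which one implementation of "foo"
# is published under several device-specific aliases.
# Member names and helpers are hypothetical, for illustration only.

def build_symbol_table(members):
    """Map each alias symbol to the archive member that defines it.

    members: dict of member name -> list of alias symbols it exports.
    """
    table = {}
    for member, aliases in members.items():
        for alias in aliases:
            # Assumption of this sketch: the first definition seen wins.
            table.setdefault(alias, member)
    return table


def resolve(table, symbol):
    """Return the member defining `symbol`, or None if no alias matches."""
    return table.get(symbol)


# Two hypothetical implementations of foo, each aliased for different GPUs.
members = {
    "foo_a.o": ["foo$gpu1"],
    "foo_b.o": ["foo$gpu3", "foo$gpu4", "foo$gpu7"],
}
table = build_symbol_table(members)

# A link step that only references "foo$gpu1" pulls in foo_a.o and never
# looks at the implementation aliased to gpu3, gpu4, and gpu7.
selected = resolve(table, "foo$gpu1")
```

In this model the compiler's only job is to emit the device-suffixed name; selecting the right implementation falls out of an ordinary symbol lookup, which is the point the quoted reply makes.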