thr3ads.net - search: "cudamalloc"

Displaying 6 results from an estimated 6 matches for "cudamalloc".

2012 Jul 21

Use GPU in R with .Call

...= a[idx] + b[idx]; } } void vecAdd_kernel(double *ain,double *bin,double *cout,int len){ int alloc_size; alloc_size=len*sizeof(double); /*Step 0a) Make a device copies of ain,bin,and cout.*/ double *a_copy,*b_copy,*cout_copy; /*Step 0b) Allocate memory for device copies.*/ cudaMalloc(&a_copy,alloc_size); cudaMalloc(&b_copy,alloc_size); cudaMalloc(&cout_copy,alloc_size); /*Step 0c) Copy arguments to device.*/ cudaMemcpy(a_copy,ain,alloc_size,cudaMemcpyHostToDevice); cudaMemcpy(b_copy,bin,alloc_size,cudaMemcpyHostToDevice); cudaMemcpy(co...

R external pointer and GPU memory leak problem

2016 May 14

R external pointer and GPU memory leak problem

...is a R external pointer pointing to GPU vector(device) */ SEXP createGPU(SEXP input, SEXP n) { ? int *lenth = INTEGER(n); ? ? ? ?PROTECT (input = AS_NUMERIC (input)); ? ? ? ?double * temp;? ? ? ? ?temp = REAL(input); double *x; ? ? ? ? ? ? ? ##here is the step which causes the memory leak cudacall(cudaMalloc((void**)&x, *lenth * sizeof(double))); //protect the R external pointer from finalizer SEXP ext = PROTECT(R_MakeExternalPtr(x, R_NilValue, R_NilValue)); R_RegisterCFinalizerEx(ext, _finalizer, TRUE); ? //copying CPU to GPU cublascall(cublasSetVector(*lenth, sizeof(double), temp, 1,? R_ExternalP...

[LLVMdev] Clang support for CUDA

2012 Feb 23

[LLVMdev] Clang support for CUDA

...rt a simple CUDA program to LLVM IR using clang 3.0. The program is as follows, #include<stdio.h> #nclude<clang/test/SemaCUDA/cuda.h> __global__ void kernfunc(int *a) { *a=threadIdx.x+blockIdx.x*blockDim.x; } int main() { int *h_a,*d_a,n; n=sizeof(int); h_a=(int*)malloc(n); *h_a=5; cudaMalloc((void*)&d_a,n); cudaMemcpy(d_a,h_a,n,cudaMemcpyHostToDevice); kernelfunc<<<1,1>>>(d_a); cudaMemcpy(h_a,d_a,n,cudaMemcpyDeviceToHost); printf("%d",*h_a); return 0; } What additional header files should be included? What part of the code is currently not supported b...

Segmentation Fault when using CUDA

2010 Nov 08

Segmentation Fault when using CUDA

...The simplest case of allocating a char and then deallocating causes a segmentation fault when R closes. Not on garbage collection but only on exit. Is there anything in the R internals that explain why this is happening? The relevant C++ code is --- void gpualloctest(){ char * _a; cudaMalloc(&_a,sizeof(char)); cudaFree(_a); _a=NULL; } --- from gpu.cu gputest<-function(){ cat("testing allocation on gpu\n") Module('test','gputest')$gpualloctest() cat("Test successful\n") cat("Collecting Garbage\n&qu...

instrumenting device code with gpucc

2016 Mar 05

instrumenting device code with gpucc

On Fri, Mar 4, 2016 at 5:50 PM, Yuanfeng Peng <yuanfeng.jack.peng at gmail.com> wrote: > Hi Jingyue, > > My name is Yuanfeng Peng, I'm a PhD student at UPenn. I'm sorry to bother > you, but I'm having trouble with gpucc in my project, and I would be really > grateful for your help! > > Currently we're trying to instrument CUDA code using LLVM 3.9, and

instrumenting device code with gpucc

2016 Mar 10

instrumenting device code with gpucc

...ore the IR to PTX. On Wed, Mar 9, 2016 at 4:31 PM, Yuanfeng Peng <yuanfeng.jack.peng at gmail.com> wrote: > Hi Jingyue, > > Thanks for the instructions! I instrumented the device code and got a > binary of axpy.cu; however, the resulting executable always fails on the > first cudaMalloc call in host code (the kernel had not even been launched > yet), with the error code being 30 (cudaErrorUnknown). In my > instrumentation pass, I only inserted a hook function upon each access to > device memory, with their signatures being: "__device__ void > _Cool_MemRead_Hook(...

search for: cudamalloc