Questions tagged [cudafy.net]

CUDAfy.NET allows easy development of high performance GPGPU applications completely from the Microsoft .NET framework. It's developed in C#.

Modern graphics cards provide the potential of massive speed increase over CPUs for non-graphics related intensive numeric operations. Many large data set operations such as matrices can see a 100x or more speed up.

CUDAfy allows .NET developers to easily create complex applications that split processing cleanly between host and GPU.

The project website: http://cudafy.codeplex.com/

71 questions
0
votes
1 answer

Avoiding nvcc compilation when using Cudafy

I'm using Cudafy and would like my users to be able to use CUDA without installing the CUDA SDK, but they can use the Cudafy DLL. To avoid nvcc compilation done automatically in CudafyTranslator.Cudafy(types), I'm using the following…
Ahmed
  • 1
  • 1
0
votes
1 answer

Creating register variable arrays with CudaFy

My CUDA kernel would run much faster if instead of a shared memory array (L1) I was able to use thread register memory. I can do this in CUDA-C with the following declaration: unsigned short window[15]; but in C# when I try: ushort[] window = new…
Dr.YSG
  • 7,171
  • 22
  • 81
  • 139
0
votes
1 answer

Cudafy kernel does not compile

Making my first steps with Cudafy and trying to write a function which will take its thread's location and based on that save some int value into an array element. My code: [Cudafy] public static void GenerateRipples(GThread thread, int[]…
Val
  • 1,548
  • 1
  • 20
  • 36
0
votes
3 answers

Calculate intersecting coordinates from vector

Given a vector (or two points) how can i get the discrete coordinates that this vector intersects in some given interval? I am using this such that given a ray(vector) i can calculate the pixels in an image that this ray intersects and use these as…
smok
  • 355
  • 3
  • 16
0
votes
2 answers

Compress "sparse data" with CUDA (CCL: connected component labeling reduction)

I have a 5 million list of 32 bit integers (actually a 2048 x 2560 image) that is 90% zeros. The non-zero cells are labels (e.g. 2049, 8195, 1334300, 34320923, 4320932) that completely not sequential or consecutive in any way (it is the output of…
Dr.YSG
  • 7,171
  • 22
  • 81
  • 139
0
votes
0 answers

Cudafy.NET compilation error: illegal implicit conversion between two pointers with different address spaces

I have a problem compiling the method below. It compiles fine if the CUDA is selected as target, however if the OpenCL is selected that it throws error. [Cudafy] unsafe static void MyKernelMethod( GThread thread, …
dajuric
  • 2,373
  • 2
  • 20
  • 43
0
votes
1 answer

CUDAfy.Net / OpenCL, struct containing byte array results in non-blittable exception

Ok, so I'm using CUDAfy.Net, and I have the following 3 structs: [Cudafy] public struct Collider { public int Index; public int Type; public Sphere Sphere; public Plane Plane; public Material Material; } [Cudafy] public struct…
WolfCode
  • 1
  • 2
0
votes
1 answer

cudafy throws an exception while testing

I'm using VS 2010 on a Windows 7 64x. I've created a static class Cuda with the following code namespace Network { public static class Cuda { static GPGPU gpu= CudafyHost.GetDevice(); static CudafyHost host = new…
0
votes
1 answer

Declaring a private, thread specific, variable in a kernel and then returning that variable to the host

I have a method which i want to run on several threads but each thread will return a different number of results. Is it possible to declare a private, thread specific, variable ie a list which i can then pass back to the Host and merge all the…
Hans Rudel
  • 3,433
  • 5
  • 39
  • 62
-1
votes
1 answer

Is this Sobel filtering on the GPU performance ok?

I have a CUDA - related question for you :). Since I am relatively new to using CUDA I would like to know if this "performance" is ok, or not. I am using C# and Cudafy.Net! I have a grayscale image (represented as float[]) that I calculated from a…
Eru Iluvatar
  • 353
  • 1
  • 5
  • 17
-1
votes
1 answer

Is there a performance penalty for CUDA method not running in sync?

If i have a kernel which looks back the last Xmins and calculates the average of all the values in a float[], would i experience a performance drop if all the threads are not executing the same line of code at the same time? eg: Say @ x=1500, there…
Hans Rudel
  • 3,433
  • 5
  • 39
  • 62
1 2 3 4
5