Questions tagged [opencl]

OpenCL (Open Computing Language) is a framework for writing programs that execute across heterogeneous platforms consisting of CPUs, GPUs, and other processors.

This tag refers to the OpenCL (Open Computing Language) by Khronos Group. It is the first open, royalty-free standard for cross-platform, parallel programming of modern processors found in personal computers, servers and handheld/embedded devices. Using OpenCL, one can affect execution of parallel computations greatly improving speed and responsiveness of a wide spectrum of applications: From gaming and entertainment to scientific and medical software.

OpenCL is an API and a C99-like language; for each device, implementations are provider-specific. Some of the OpenCL implementation providers are:

Questions about OpenCL can be asked here along with the vendor/provider and architecture details. Bug reports should be discussed in the respective forums of the vendors NVIDIA Forums, Intel Forums, AMD Forums

Books

5705 questions

votes

2 answers

Installing additional files with CMake

I am attempting to supply some "source" files with some executables. I was wondering if there was a way to copy these source files to the build directory (From the source directory) then to the install directory using CMake. My more specific goal…

c cmake opencl

asked Mar 29 '13 at 00:10

Constantin

16,812
9
34
52

votes

5 answers

OpenCL vs. DirectCompute?

I'm looking for comparisons between OpenCL and DirectCompute, but I haven't found anything. OpenCL's advantages of being cross-platform and having a wider range of supported GPUs don't matter to me. I'm fine with coding on Windows against DX11…

opencl directcompute

asked Jul 03 '10 at 17:16

royco

5,409
13
60
84

votes

1 answer

How to use pinned memory / mapped memory in OpenCL

In order to reduce the transfer time from host to device for my application, I want to use pinned memory. NVIDIA's best practices guide proposes mapping buffers and writing the data using the following code: cDataIn = (unsigned…

memory opencl gpu gpgpu data-transfer

asked Jun 11 '14 at 09:10

krisg

votes

3 answers

How to declare local memory in OpenCL?

I'm running the OpenCL kernel below with a two-dimensional global work size of 1000000 x 100 and a local work size of 1 x 100. __kernel void myKernel( const int length, const int height, and a bunch of other parameters) { …

memory opencl

asked Jan 17 '12 at 01:41

user1111929

6,050
9
43
73

votes

1 answer

How many threads (or work-item) can run at the same time?

I'm new in GPGPU programming and I'm working with NVIDIA implementation of OpenCL. My question was how to compute the limit of a GPU device (in number of threads). From what I understood a there are a number of work-group (equivalent of blocks in…

opencl gpgpu

asked Apr 15 '11 at 16:31

Laure Jonchery

votes

4 answers

Why aren't there bank conflicts in global memory for Cuda/OpenCL?

One thing I haven't figured out and google isn't helping me, is why is it possible to have bank conflicts with shared memory, but not in global memory? Can there be bank conflicts with registers? UPDATE Wow I really appreciate the two answers from…

cuda opencl nvidia bank-conflict

asked Oct 01 '10 at 21:02

smuggledPancakes

9,881
20
74
113

votes

5 answers

What is the difference between creating a buffer object with clCreateBuffer + CL_MEM_COPY_HOST_PTR vs. clCreateBuffer + clEnqueueWriteBuffer?

I have seen both versions in tutorials, but I could not find out, what their advantages and disadvantages are. Which one is the proper one? cl_mem input = clCreateBuffer(context,CL_MEM_READ_ONLY,sizeof(float) * DATA_SIZE, NULL,…

memory-management opencl

asked Sep 30 '10 at 16:58

Framester

33,341
51
130
192

votes

3 answers

Is it possible to access hard disk directly from gpu?

Is it possible to access hard disk/ flash disk directly from GPU (CUDA/openCL) and load/store content directly from the GPU's memory ? I am trying to avoid copying stuff from disk to memory and then copying it over to GPU's memory. I read about…

cuda parallel-processing opencl gpu

asked Dec 03 '14 at 21:44

L Lawliet

2,565
4
26
35

votes

3 answers

Convenient way to show OpenCL error codes?

As per title, is there a convenient way to show readable OpenCL error codes? Being able to convert codes like '-1000' to a name would save a lot of time browsing through error codes.

opencl error-code

asked Jun 20 '14 at 11:40

Selmar

votes

1 answer

The variation of cache misses in GPU

I have been toying an OpenCL kernel that access 7 global memory buffers, do something on the values and store the result back to a 8th global memory buffer. As I observed, as the input size increases, the L1 cache miss ratio (=misses(misses + hits))…

opencl gpu gpgpu

asked Jul 19 '11 at 14:41

Zk1001

2,033
4
19
36

votes

3 answers

"Unrolling" a recursive function?

I'm writing a path tracer in C++ and I'd like to try and implement the most resource-intensive code into CUDA or OpenCL (I'm not sure which one to pick). I've heard that my graphics card's version of CUDA doesn't support recursion, which is…

python recursion cuda opencl

asked Jun 10 '11 at 00:14

Blender

289,723
53
439
496

votes

5 answers

When to use OpenCL?

Having stumbled over this forum thread, dot product faster on cpu than on gpu using OpenCL, I was reminded again, that there are instances, which look like they're made for OpenCL*, but where they're used, OpenCL does not provided us with a gain.…

opencl

asked Apr 20 '11 at 12:13

Framester

33,341
51
130
192

votes

5 answers

List of OpenCL compliant CPU/GPU

How can I know which CPU can be programmed by OpenCL? For example, the Pentium E5200. Is there a way to know w/o running and querying it?

cpu opencl

asked Mar 25 '11 at 22:45

Lior Dagan

votes

4 answers

Is it fair to compare SSE/AVX units to GPU cores?

I have a presentation to make to people who have (almost) no clue of how a GPU works. I think saying that a GPU has a thousand cores where a CPU only has four to eight of them is a non-sense. But I want to give my audience an element of…

cuda hardware opencl gpu sse

asked Jul 02 '13 at 13:25

Simon

votes

2 answers

OpenCL - is it possible to invoke another function from within a kernel?

I am following along with a tutorial located here: http://opencl.codeplex.com/wikipage?title=OpenCL%20Tutorials%20-%201 The kernel they have listed is this, which computes the sum of two numbers and stores it in the output variable: __kernel void…

opencl

asked Aug 25 '11 at 20:07

Adam S

8,945
17
67
103

Prev 1 2 3

…

99 100 Next