Questions tagged [offloading]

This tag is for questions about software that uses mechanisms for reducing the workload on the CPU. This can be done by aggregating work before further processing and/or by processing part of the workload in dedicated hardware.

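Common offloads are network stack offloads such as LRO (Large Receive Offload), GRO (Generic Receive Offload), and TSO (TCP Segmentation Offload). Others are CPU instruction-set offloads, such as Intel's AES-NI, which accelerates the AES encryption used by IPsec.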
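
To make the "aggregating work" idea concrete, here is a minimal Python sketch of receive-side coalescing in the spirit of GRO. It is an illustration only, not how any kernel or NIC implements it: the `Segment` type, its fields, and the `coalesce` function are hypothetical names, and the sketch only merges segments that arrive back to back, whereas real implementations track several flows at once.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    flow: tuple     # hypothetical flow key, e.g. (src, dst, sport, dport)
    seq: int        # byte offset of the first payload byte
    payload: bytes


def coalesce(segments):
    """Merge adjacent, contiguous segments of the same flow into larger ones."""
    merged = []
    for seg in segments:
        last = merged[-1] if merged else None
        if (last is not None
                and last.flow == seg.flow
                and last.seq + len(last.payload) == seg.seq):
            # Contiguous continuation of the previous aggregate: extend it.
            last.payload += seg.payload
        else:
            # Different flow or a gap in the byte stream: start a new aggregate.
            merged.append(Segment(seg.flow, seg.seq, seg.payload))
    return merged
```

The upper layers then handle one large segment instead of many small ones, which is where the CPU saving would come from.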

Offloads can be multi-layered. For example, OVS (Open vSwitch) has a user-space service that identifies packets and creates steering rules for them. The user-space service can offload the steering to the kernel datapath, and where the hardware can process the steering itself, the kernel may in turn offload it to the hardware.
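
The layering described above can be sketched as a chain in which each layer first tries to hand a rule to the layer below it and keeps the rule only if nothing lower can take it. This is a toy model under stated assumptions: the `Layer` class, the rule dictionaries, and the action names are hypothetical and do not correspond to the OVS or kernel APIs.

```python
class Layer:
    """One level of a hypothetical offload chain (user space, kernel, hardware)."""

    def __init__(self, name, supported_actions, lower=None):
        self.name = name
        self.supported = set(supported_actions)
        self.lower = lower      # next layer down, or None for the lowest one
        self.rules = []         # rules this layer ended up handling itself

    def install(self, rule):
        """Push the rule as far down as possible; return the name of the
        layer that accepted it, or None if no layer supports the action."""
        if self.lower is not None:
            handled_by = self.lower.install(rule)
            if handled_by is not None:
                return handled_by
        if rule["action"] in self.supported:
            self.rules.append(rule)
            return self.name
        return None


# Assumed capabilities, chosen only for illustration.
hardware = Layer("hardware", {"forward"})
kernel = Layer("kernel", {"forward", "drop"}, lower=hardware)
userspace = Layer("userspace", {"forward", "drop", "mirror"}, lower=kernel)
```

With these assumed capabilities, a `forward` rule installed through `userspace` lands in `hardware`, a `drop` rule stops at `kernel`, and a `mirror` rule stays in `userspace`.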

Common questions about offloads

  • How do specific offloads work?
  • How can offloads be enabled for specific cases?
  • What is the benefit of using a specific offload?
111 questions
2
votes
1 answer

map, double> used in this offload region is not bitwise copyable

I am using Intel C++ Compiler v14.0.3. This following code troubles me: #include #include #include #include "offload.h" using namespace std; typedef map, double, …
2
votes
2 answers

Offloading script function to post-response: methods and best-practices?

First, the setup: I have a script that executes several tasks after a user hits the "upload" button that sends the script the data it needs. Now, this part is currently mandatory, we don't have the option at this point to cut out the upload and…
Anthony
1
vote
2 answers

Android: determining the current context to display an alert

I am calling the ZXing scanner from Screen-A using Intents. Once the scan is done, control returns, of course, to the code behind Screen-A and I do some other work before calling Screen-B. Problem is: the screen is black during this work period and…
1
vote
0 answers

HAProxy's TLS offload feature not working as I expect

I am in the process of setting up HAProxy 2.8 via Docker to leverage its ability to provide TLS offload in my network to avoid having certificates attached to every VM instance I deploy. My problem is that when I am binding the frontend (HAProxy)…
1
vote
0 answers

GPU offloading of triangular loop

I have a triangular loop to calculate forces, which I ported to GPU via OpenMP offloading: !$omp target teams distribute parallel do simd & !$omp private(i,j,ii,jj,rinvdenom,tp,index,fix0) & !$omp …
realnewton
1
vote
1 answer

OpenMP Offload Error during linking with gcc with nvptx-none: unresolved symbol _fputwc_r

I am trying to compile a simple test problem using OpenMP offloading for an Nvidia GPU. I am using gcc with the nvptx-none target. I have installed the gcc+nvptx package with spack (or compiled gcc-13 with nvptx-tools myself, the results are the…
ZehDeckel
1
vote
1 answer

OpenMP offloading target map alloc - how does it work

I have always been confused and never understood how the alloc map-type of the map clause of the target (or target data) construct works. What is my application - I would like to have a temporary array on a device, which is used only on the device,…
1
vote
0 answers

OpenMP offload doesn't recognize my Nvidia GPU

I'm trying to test OpenMP's GPU offloading on my machine with a GTX 1660ti card. Running this code gives me a 0. #include #include int main() { printf("%d\n", omp_get_num_devices()); return 0; } I'm compiling with ./gcc…
1
vote
2 answers

OpenMP offloading with Intel oneAPI DPC++ compiler to NVIDIA GPU

I'm on a mission to write a program with OpenMP offloading to a GPU. At the moment I compile my code with Intel oneAPI DPC++ compiler icpx v2022.1.0 and aim to utilise an NVIDIA Tesla V100 at the backend. Please find below the relevant parts of my…
mabalenk
1
vote
1 answer

OpenMP Target Task reduction

I'm using OpenMP target offloading to offload some nested loops to the GPU. I'm using nowait to run it asynchronously, which makes it a task. With the same input values the result differs from the one when not offloading (e.g. cpu: sum=0.99,…
ThiloOS
1
vote
0 answers

GPU number threads not showing in OpenMP runtime

I am trying out offloading an array calculation with GPU (GTX 1080Ti) using OpenMP and C++ on this dummy code that I have written: #include #include using namespace std; int main(){ //int totalSum, ompSum; int…
OMEGOSH01
1
vote
1 answer

Errors when trying to offload to GTX-1050 with GCC9.3 and OpenMP

The build log: -------------- Clean: Release in OffloadTest (compiler: GNU GCC Compiler)--------------- Cleaned "OffloadTest - Release" -------------- Build: Release in OffloadTest (compiler: GNU GCC Compiler)--------------- g++ -Wall -m64…
1
vote
1 answer

OpenMP offloading in a dynamic library fails to compile

I want to compile C code with OpenMP offloading and create a dynamic library libtest.so. When I use the following command: gcc -fPIC -shared -fopenmp -foffload=nvptx-none="-fPIC" test.c -o libtest.so I get this error: /usr/bin/ld:…
Benjamin Maier
1
vote
2 answers

DPDK - RSS Offloads (Hash Calculations): ETH_RSS_IPV6_EX - Explanation

So I am currently working on understanding how Receive Side Scaling (RSS) offloads work inside DPDK (Data Plane Development Kit). From my understanding, depending on which offload you have selected, it calculates a hash value and uses…
Ameer Usman
1
vote
0 answers

Linux network acceleration, hardware offload

I have been learning about network hardware acceleration in the Linux kernel for quite a long time, but several problems still confuse me. After inspecting the kernel source code in depth, what keeps confusing me is: 1. I can't…