Questions tagged [offloading]

This tag is for questions about software which utilize mechanisms for reducing workload from the CPU. This can be done by aggregating work before further processing is done and/or processing some of the workload in a dedicated hardware.

This tag is for questions about software which utilize mechanisms for reducing workload from the CPU. This can be done by aggregating work before further processing is done and/or processing some of the workload in a dedicated hardware.

Common offloads are network stack offloads such as LRO, GRO, TSO, etc. Other offloads are CPU offloads such as Intel's AES-NI for IPSec.

Offloads can be multi-layered, for example OVS (Open Virtual Switch) has a service for identifying and creating steering rules for packets. The user space service will offload the steering to the kernel software. Specific hardware can process the steering itself, so the kernel might offload to the hardware.

Common questions about offloads

  • How does specific offloads work?
  • How to enable offloads for specific cases?
  • What is the benefit of using specific offload?
111 questions
1
vote
1 answer

force haproxy to https

I can't get haproxy to force https. It works with http and https. I want it to force port 443. When I try to force it using .htaccess, it says "To many redirects" Here is my haproxy.cfg: global log 127.0.0.1 local2 maxconn 2048 user…
Jason L
  • 94
  • 2
  • 5
1
vote
2 answers

Can you safely rely upon Yahoo Pipes to offload ETL for your application?

Yahoo Pipes are a very intriguing choice for sort of a poor-man's server-free ETL solution, but would it be a good idea to build an application around one or many Pipes? I've really only used them for toy things here and there, with the only thing…
Daniel DiPaolo
  • 55,313
  • 14
  • 116
  • 115
1
vote
0 answers

Best way to update a large set (50K+ keys) of cached (Redis) data during peak load?

Environment: multiple web (30), cache (8) servers and one dedicated database server. To offload the database, a central Redis cache and local memory cache are in place. Large common event data is cached and updated based on schedules User specific…
baskabas
  • 333
  • 3
  • 5
  • 15
0
votes
0 answers

Audio decode in Android Automotive

In Android Automotive OS, Question#1) where compressed audio (ex. mp3) get decoded in generally ? I think there is two ways. Decode in Android Framework layer in MediaCodec Decode in HW DSP (I mean, audio dedicated firmware) As I guess, in mobile,…
0
votes
1 answer

OpenMP task reduction with target offloading segfaults when running single threaded

I was using OpenMP with target offload and found that my application segfaults when limiting OpenMP to a single thread. I could boil it down to the following snippet: #include int main(){ int res = 0; #pragma omp parallel…
lulle2007200
  • 888
  • 9
  • 20
0
votes
1 answer

Receive side scaling (RSS) based on just one field inside UDP payload

I have an issue with HW-offloaded balancing of network packets between CPU cores on DPDK. In my case I have only several endpoints whose number is less than the number of CPU cores I have. All the incoming traffic is local (not transit/forward), so…
0
votes
1 answer

HAProxy SSL termination: how to resolve certificate error only experienced by WAN clients?

My HAProxy SSL termination setup works exactly as expected for LAN clients, but if I use a WAN client (e.g. Tor Browser) I get a certificate error. Yet, they can bypass my 80->443 redirect and access my server on port 80 if I provide a firewall rule…
0
votes
0 answers

OpenMP Vs. OpenMP Offloading

I am trying to code vector addition code using OpenMP in host and OpenMP Offloading. But time taken for OpenMP offloading is more than OpenMP in host. Why is that? openmp-host.c #include #include #include #include…
0
votes
0 answers

Tx checksum offloading - Linux network device driver

I'm a newbie in the Linux network drivers. I'm trying to enable offload features in an ethernet IP. This ethernet IP supports TX checksum offloading for only 2 Tx HW queues among the total 8 Tx HW queues. Whether the Linux network stack supports…
Thomas07
  • 82
  • 1
  • 9
0
votes
1 answer

Intel VTune - Estimate data offload to GPU

I'm interested in estimate the data transfer, in terms of bytes, of an algorithm or function to be executed on a GPU using Intel VTune. For example, if my algorithm computes a multiplication between two vectors of 10 floats elements each, the result…
0
votes
0 answers

how to fold the pseudo header checksum calculation into the csum field when using partial checksum offloading

I'm trying to understand something related to checksum offloading. when using partial checksum offloading, the checksum on the pseudo header is calculated in the host, and put into the udp checksum field, to be use in the full calculation inside the…
sun amar
  • 1
  • 1
0
votes
0 answers

Does OpenMP 5.x support classes/type bound procedures in Fortran?

It looks like OpenACC supports Fortran Type Bound Procedures. So I took that code and translated the directives to OpenMP Offloading directives and that code bombs out (memory error, using nvhpc 22.x). So does the current version of OpenMP support…
rosenbe2
  • 19
  • 1
0
votes
0 answers

OpenMP: Marking functions to be included in the offloaded code

I'm trying to compile a simple vector addition example daxpy with OpenMP offloading to GPU. When I compile the code with g++ v10.2.0 I see the following error: daxpy_loop_target.cpp:37:6: error: function ‘plasma_core_omp_daxpy_loop_target’ \ has…
mabalenk
  • 887
  • 1
  • 8
  • 17
0
votes
0 answers

OpenMP target offloading matrix multiplication compilation error

I am currently trying to implement a simple matrix multiplication of 2 nxn matrices using OpenMP target offloading. The code is taken from here: template void multiplyJIK(T *A, T *B, T *C, uint64_t size) { #pragma omp target data…
Dogyman
  • 57
  • 8
0
votes
1 answer

Dpdk - Segmentation/fragmentation offload for application-level packet reassembly

I'm looking into the capabilities of fragment/packet reassembly hardware and wondering whether one could use them to perform application-level packet reassembly. I have to receive and reassemble a sequence of 65kiB packets (hundreds of streams…