Questions tagged [intel]

For issues related to Intel semiconductor chips and assemblies, Intel architectural features and ISA extensions, and Intel chips micro-architecture.

Intel Corporation is an American multinational semiconductor chip maker corporation headquartered in Santa Clara, California, United States. Intel is the inventor of the x86 processor architecture and makes central processing units, motherboard chipsets, graphic processing units, network interface controllers and much more devices related to communications and computing.

In addition to their hardware offerings Intel also produces a variety of software including compilers, libraries for mathematical computation(Intel MKL), threading(OpenMP, Intel Performance Primatives, Threading Building Blocks), parallel communication(MPI,OFED/True Scale Infiniband Stack) and several other products included in the Intel Parallel Studio toolkit. In addition to these offerings which are widely used in HPC Intel also produces software for datacenter management and is one of the most prolific contributors to the Linux kernel.

This tag should be used for questions about Intel hardware and software.

The x86 and/or x86-64 tags are better choices for questions about assembly programming for the architecture, rather than things like performance tuning specifically for Intel's implementation of x86.

Useful links

Related tags

3529 questions

votes

1 answer

Why does this loop take 1.32 cycles per iteration

Consider this simple C++ function to calculate the prefix sum of an array: void prefix_sum(const uint32_t* input, uint32_t* output, size_t size) { uint32_t total = 0; for (size_t i = 0; i < size; i++) { total += input[i]; …

c++ optimization x86 intel micro-optimization

asked Aug 12 '19 at 20:31

BeeOnRope

60,350
16
207
386

votes

1 answer

What are "non-virtualizable" instructions in x86 architecture?

Before the advent of hardware assisted virtualization there were instructions that could not be virtualized due to various reasons. Can somebody please explain what those instructions are and why they cannot be virtualized?

x86 intel virtualization cpu-architecture xen

asked Sep 26 '15 at 06:32

Yadgar Woderf

votes

4 answers

How can I find the micro-ops which instructions on Intel's x86 CPUs decode to?

The Intel Optimization Reference, under Section 3.5.1, advises: "Favor single-micro-operation instructions." "Avoid using complex instructions (for example, enter, leave, or loop) that have more than 4 micro-ops and require multiple cycles to…

x86 intel compiler-optimization

asked Nov 04 '14 at 12:32

Alex D

29,755
7
80
126

votes

2 answers

VirtualBox - Kernel requires an x86-64 cpu but only detected an i686 cpu

Intel i5-2410M CPU running at 2.30 GHz running a Windows 7 64-bit operating system. I have VirtualBox 4.13 installed. I am trying to run ubuntu-14.04-desktop-amd64.iso but I get an error this kernel requires an x86-64 cpu but only detected an i686…

virtualbox x86-64 intel ubuntu-14.04

asked Jul 21 '14 at 19:11

user3861960

votes

1 answer

Dynamically determining where a rogue AVX-512 instruction is executing

I have a process running on an Intel machine that supports AVX-512, but this process doesn't directly use any AVX-512 instructions (asm or intrinsics) and is compiled with -mno-avx512f so that the compiler doesn't insert any AVX-512…

linux performance x86 intel avx512

asked Aug 24 '18 at 16:53

BeeOnRope

60,350
16
207
386

votes

2 answers

What is the stack engine in the Sandybridge microarchitecture?

I am reading http://www.realworldtech.com/sandy-bridge/ and I'm facing some problems in understanding some issues: The dedicated stack pointer tracker is also present in Sandy Bridge and renames the stack pointer, eliminating serial dependencies…

assembly x86 intel cpu-architecture

asked Apr 14 '16 at 18:50

Gilgamesz

4,727
3
28
63

votes

3 answers

x86-64 canonical address?

During reading of an Intel manual book I came across the following: On processors that support Intel 64 architecture, the IA32_SYSENTER_ESP field and the IA32_SYSENTER_EIP field must each contain a canonical address. What is a 'canonical address'?

assembly x86-64 intel memory-address virtual-address-space

asked Sep 15 '14 at 16:11

Rouki

2,239
1
24
41

votes

2 answers

Half-precision floating-point arithmetic on Intel chips

Is it possible to perform half-precision floating-point arithmetic on Intel chips? I know how to load/store/convert half-precision floating-point numbers [1] but I do not know how to add/multiply them without converting to single-precision…

x86 intel avx floating-point-conversion half-precision-float

asked Apr 24 '18 at 07:19

Kadir

1,345
3
15
25

votes

1 answer

Why is XCHG reg, reg a 3 micro-op instruction on modern Intel architectures?

I'm doing micro-optimization on a performance critical part of my code and came across the sequence of instructions (in AT&T syntax): add %rax, %rbx mov %rdx, %rax mov %rbx, %rdx I thought I finally had a use case for xchg which would allow me to…

performance assembly x86 intel

asked Aug 19 '17 at 00:05

jeteon

3,471
27
40

votes

2 answers

Enabling floating point interrupts on Mac OS X Intel

On Linux, feenableexcept and fedisableexcept can be used to control the generation of SIGFPE interrupts on floating point exceptions. How can I do this on Mac OS X Intel? Inline assembly for enabling floating point interrupts is provided in…

macos intel interrupt floating-point-exceptions

asked Oct 29 '08 at 14:41

Geoffrey Irving

6,483
4
32
40

votes

4 answers

Android emulator system images and AMD processor

I use AMD FX X6 6300 type processor. (it supports virtualization and my BIOS setting is ON) I installed "Intel x86 Emulator Accelerator". When I tried to run Intel accelerator setup, I get that setup is unable to install. (error was…

android android-emulator arm mips intel

asked Jul 23 '13 at 17:32

trante

33,518
47
192
272

votes

1 answer

Size of store buffers on Intel hardware? What exactly is a store buffer?

The Intel optimization manual talks about the number of store buffers that exist in many parts of the processor, but do not seem to talk about the size of the store buffers. Is this public information or is the size of a store buffer kept as a…

performance assembly x86 intel cpu-architecture

asked Feb 25 '19 at 23:21

Curious

20,870
8
61
146

votes

1 answer

Which cache mapping technique is used in intel core i7 processor?

I have learned about different cache mapping techniques like direct mapping and fully associative or set associative mapping, and the trade-offs between those. (Wikipedia) But I am curious which one is used in Intel core i7 or AMD processors…

x86 intel cpu-architecture cpu-cache amd-processor

asked Mar 04 '18 at 06:11

Subhadip

votes

2 answers

Why REP LODS AL instruction exists?

In other words, is there any case I might need this instruction? From Intel Instructions Manual, this is what the instruction do: Load (E)CX bytes from DS:[(E)SI] to AL. Take the following example in NASM: section .data src: db 0, 1, 2,…

assembly x86 intel amd

asked Jun 11 '17 at 17:02

Bite Bytes

1,455
8
24

votes

2 answers

Why is Skylake so much better than Broadwell-E for single-threaded memory throughput?

We've got a simple memory throughput benchmark. All it does is memcpy repeatedly for a large block of memory. Looking at the results (compiled for 64-bit) on a few different machines, Skylake machines do significantly better than Broadwell-E,…

performance x86 benchmarking intel cpu-architecture

asked Aug 31 '16 at 22:32

aggieNick02

2,557
2
23
36

Prev 1 2 3

…

99 100 Next