Questions tagged [infiniband]

InfiniBand is a high-speed switched fabric communications link technology used in high-performance computing and enterprise data centers.

InfiniBand is a switched fabric communications link used in high-performance computing and enterprise data centers. Its features include scalability, high throughput, low latency, quality of service and failover. The InfiniBand architecture specification defines a connection between processor nodes and high performance I/O nodes such as storage devices. Infiniband host bus adapters and network switches are commonly manufactured by both Mellanox and Intel.

178 questions
1
vote
1 answer

Infiniband Protocol

What is the use of the below structure and function in designing drivers for IB? struct ib_sa_client { atomic_t users; struct completion comp; }; /** * ib_sa_register_client - Register an SA client. */ void ib_sa_register_client(struct…
1
vote
2 answers

Has someone tried SoftiWARP?

I'm trying to use SoftiWARP for a project. I installed it using instructions from here, which I followed to the word, except I'm using Ubuntu 10.04 server. To test it out, I used the client-server programs from the-geek-in-the-corner…
Anuj Kalia
  • 803
  • 8
  • 16
1
vote
4 answers

Use infiniband with Akka in Scala?

Is it possible to use infiniband instead of TCP/IP with Akka? I want to use Akka for HPC on the local cluster and the network bottleneck is significant using TCP/IP...
Felix
  • 8,385
  • 10
  • 40
  • 59
1
vote
1 answer

find maximum allowed ibv_reg_mr

I'm trying to diagnose a memory allocation error thrown by ibv_reg_mr() in software that I use, and my suspicion is that it's related to known problems with some Mellanox Infiniband cards where the default maximum memory that can be registered is…
jason_r
  • 33
  • 6
1
vote
1 answer

Socket Direct Protocol error: "Address family not supported by protocol"

I thought I would try out SDP on our infiniband hardware. However, when I try to add AF_INET_SDP as the first argument to socket() I get the following error: "Address family not supported by protocol". Originally I had: #define AF_INET_SDP 26 But…
hookenz
  • 36,432
  • 45
  • 177
  • 286
1
vote
1 answer

How to find out InfiniBand installation path

I want to compile MVAPICH2 myself, but not sure where to find psm.h file, it cant be found in default places. Anyone knows that which command I can use to find InfiniBand?
Daniel
  • 2,576
  • 7
  • 37
  • 51
1
vote
1 answer

Port status is always down and cant open UMAD port

I have newly installed ofed-1.5-3 on following machine : SUSE Linux Enterprise Server 11 (x86_64) VERSION = 11 PATCHLEVEL = 1 Our admin has installed CX354A card (MCX354A-FCBT) card on the machine. $ lspci | grep -i mel 04:00.0 Network controller:…
Smash
  • 839
  • 1
  • 9
  • 10
0
votes
1 answer

I am planning to connect 18 nodes together with infiniband what parts do I need

I am planning to connect 18 nodes together with infiniband what do I need 1 36 port switch ? or do I need 2 switches ? 1 HCA for each node ? dual port ? 1 cable for each node ? or do I need 2 can the switch handle the subnet management or will it…
0
votes
2 answers

Programmatically retrieve infiniband device ip address

I'm trying to find programmatically the inet address of an Infiniband interface whose name is not know a priori. I'm on Linux, and I would like to avoid the parsing of ifconfig (8) output. I've read the second comment on this answer, that suggests…
Luca Martini
  • 1,434
  • 1
  • 15
  • 35
0
votes
1 answer

User-mode application that performs RDMA directly to nvme drive on Linux

We've been developing HPC applications that take advantage of the infiniband infrastructure. One of our applications exchange data that is stored in an nvme cache with other nodes, and for that it uses RDMA posts. We think we can increase the…
Caian
  • 440
  • 4
  • 15
0
votes
0 answers

Issues with using KafkaDirect for Kafka RDMA communication

KafkaDirect I'm attempting to install KafkaDirect from the GitHub repository to enable RDMA communication in Kafka. My environment is as follows: Ubuntu 20.04 Cluster : Node1, Node2, Node3 Mellanox ConnectX-3 InfiniBand KafkaDirect is an adaptation…
0
votes
0 answers

yum install kernel-devel-6.4.3-1.el8.elrepo.x86_64 Failed

I am trying to install MLNX_OFED_LINUX-23.04-1.1.3.0-rhel8.5-x86_64 on centos-8.5, it requires kernel-devel-6.4.3-1.el8.elrepo.x86_64 to be installated to continue further. But the yum install kernel-devel-6.4.3-1.el8.elrepo.x86_64 Failed with…
0
votes
0 answers

RDMA chelsio T6225-CR Register Memory Problem

I am an engineer working on the RDMA IWARP protocol. I use chelsio t6225-cr NIC Adapter and connect 2 pc back to back. Development environment is Window10, and i use NDSPI( NDSPI Interfaces (Windows) ). i want to register 1TB Memory to my adapter by…
holee
  • 1
  • 1
0
votes
0 answers

When tracing shared library functions with funclatency, no function names were displayed

When using the bcc tool funclatency, there are unknown function names. It would be helpful if I could track the entry and return values of a number of functions contained in the library ibverbs (Infiniband). I use funclatency to print a histogram of…
Bahamas
  • 345
  • 1
  • 2
  • 13
0
votes
0 answers

How to use RDMA between two RDMA capable instances on Azure

I have created 2 Standard HB120rs v2(RDMA capable) Virtual Machines on Azure with centos-based HPC, On "ifconfig" I am getting below ips for 2 machines I tried testing rping using the below commands On instance 1 executed rping -v -s -a…
JibinNajeeb
  • 784
  • 1
  • 10
  • 31