Questions tagged [supercomputers]

Supercomputers belong to a class of highly specialised hardware infrastructures, where high number of machines are typically pre-organised and smart-linked together with specialised high-speed low-latency interconnects, so as to allow new forms of concurrent processing cooperations to be orchestrated. Having any such supercomputing infrastructure is not enough, it is important to also use system tools capable to harness the most of the available CPU-powers

Supercomputers first began to appear in the 1960's.

These early supercomputers had only a single, high-speed processor. Control Data Corporation's CDC-6600, designed by Seymour Cray, was about ten times faster than all other computers of its day, and was dubbed a supercomputer -- the first appearance of the term.

Later, as processing speed, cooling ability, and physical size hit limits, Cray pioneered the method of linking multiple processors together in order to get more speed out of the same machine. This is the same method used in today's supercomputers, which can range in size from thousands of processing cores to hundreds of thousands of processing cores.

*  Seymour CRAY (                           yes, the supercomputer guy )
*  said:
*  --------------------------------------------------------------------
*  A supercomputer turns compute-bound problems into I/O bound problems
*  --------------------------------------------------------------------
*  and:
*  --------------------------------------------------------------------
*  It is not hard to build a fast processor or a fast memory,
*  but the challenge is to build a fast system.
*  --------------------------------------------------------------------

Interconnect latency is an additional [TIME]-domain penalty, each process has to pay for using a supercomputer's remote resource under a distributed computation-graph schedule.

Minimising interconnect's latency-costs is thus one natural direction, using a smarter, overhead-aware computation-graph design is the other direction to achieve the indeed I/O-bounds' bleeding edge of the ultimate performance from any supercomputing system's infrastructure.

enter image description here

91 questions
3
votes
2 answers

When using qsub to submit jobs, how can I include my locally installed python packages?

I have an account on a supercomputing cluster where I've installed some packages using e.g. "pip install --user keras". When using qsub to submit jobs to the queue, I try to make sure the system can see my local packages by setting "export…
user1634426
  • 563
  • 2
  • 5
  • 12
2
votes
1 answer

Slurm: how to use all cores available to the node?

I'm working with a large computing cluster with a SLURM workload manager that has four different subsections: we'll call them C1, C2, C3, and C4. Nodes in C1 and C2 have 28 cores, whereas those in C3 and C4 have 40 and 52 cores, respectively. I…
Heatherosa
  • 23
  • 2
2
votes
2 answers

How to setup amazon to use Grid computing with maple software that is in existing EC2?

I have one maple license in EC2 How to setup amazon to use Grid computing with maple software that is in existing EC2? I am not sure Grid computing whether is supercomputer. I just have a for loop program to run grid computing that would like to run…
Ho Yeung Lee
  • 453
  • 1
  • 3
  • 14
2
votes
1 answer

Master SLURM Script that calls other SLURMS

I'm a cancer researcher and we were wondering if any of you knew how to make a SLURM script that calls other slurms. If the previous script finished successfully, continue to the next call. I.E something like: #!/bin/bash #SBATCH --nodes=1 #SBATCH…
Bcheda
  • 21
  • 3
2
votes
0 answers

What addressing paradigms are being used for high performance computing?

What's the difference between "Active Global Address Space ( AGAS )" and "Partitioned Global Address Space ( PGAS )" used for high performance computing? It's not a specific question about implementation, so I think this information will be…
2
votes
0 answers

Building a Roofline Model

I'm trying to build a roofline model for a node in a supercomputer that I'm running simulations on. The node has 2x Intel Xeon E5-2650 v2 (Ivy Bridge) 8 core 2.6 GHz processors (16 cores per node), with 64GB RAM total (4GB each). The maximum memory…
2
votes
1 answer

Issues with supercomputer submission scripts to run NetLogo

I'm having some difficulty getting the supercomputer to run my netlogo model. I'm attempting to follow the advice given here and here. But no luck. I think it's not finding some of the needed jars? The entirety of the v5.3.1 folder is located…
Andrew Yoak
  • 643
  • 4
  • 13
2
votes
1 answer

What is Rmax/RPeak (Ratio) in terms of Supercomputer

I am working on top500 supercomputer database.(http://www.top500.org/) Rmax is maximum performance RPeak is theorotical maximum performance. Does Ratio of Rmax to RPeak results to something? Like say efficiency? or anything which could say…
Ojas Kale
  • 2,067
  • 2
  • 24
  • 39
2
votes
0 answers

Max size of set linear equations to solve? (X=AX+B)

This is a very general question regarding the maximum size of a set of linear equations to be solved by today's fastest hardware, in the form: X = AX + B A: NxN matrix of floats, it is sparse. B: N-vector of floats. solve for X. This becomes X(I-A)…
2
votes
1 answer

How to tell JRE to use more than one CPU node

I have a .jar file that I want to run on a supercomputer. There are some 40 CPU nodes available but Java uses only one of them when running my program. Is there any way to tell Java to use all the available nodes to run a given program (preferably…
Reza
  • 388
  • 2
  • 14
2
votes
2 answers

MPI speedup with trivially parallelizable DO-loop (F90)

I have a simple DO-loop (Fortran 90) in which the individual iterations are independent of each other and only input/output data from/to the hard drive (the processes do not exchange messages/MPI between each other) which I have parallelized using…
Boki
  • 193
  • 1
  • 3
  • 15
2
votes
1 answer

Submit multiple mpirun programs in sun grid engine

I would like to know how to submit several mpirun programs in one job script file in Sun Grid Engine. Suppose I have compiled program1, program2, program3 .. program100 and want to run them simultaneously such that each use 16 cpus in one job…
user2775514
  • 191
  • 1
  • 2
  • 7
2
votes
1 answer

Does Intel Xeon Phi co-processor support graphic processing on hardware level?

I am going to do some rendering experiments on a large scale computer system with massive number of processors. This system uses some Intel Xeon E5 processors and Intel Xeon Phi co-processors. I've read documents and developer guide of Xeon Phi…
cxcfan
  • 185
  • 9
2
votes
1 answer

Data locality relevance with The Machine and memristors?

Preliminary remark: I do not know whether this is the best stack exchange website for this question. If not, I apologize and it should be moved to the correct website. Recently, HP has spoken about a research project called The Machine based on…
2
votes
3 answers

Could WebWorkers be used for supercomputer power?

This a general question really, not sure if this is the place for it (it might be deleted as quite general) so please don't heckle (I am just curious). I have been reading up on WebWorkers API and had a thought. WebWorkers can be limited to using…
Craig Taub
  • 4,169
  • 1
  • 19
  • 25