Questions tagged [distributed]

Multiple computers working together, using a network to communicate

A distributed system consists of multiple autonomous computers that communicate through a . The computers interact with each other in order to achieve a common goal. A computer program that runs in a distributed system is called a distributed program, and distributed programming is the process of writing such programs.

2221 questions
0
votes
1 answer

Essential tools for a Distributed development team

I want to setup a website where a team of developers (distributed all over the world) can get together and develop Software. So far (after a ton of reading) im coming close to the conclusion that the tools I will need are: a Version control system…
iScotts
  • 9
  • 4
0
votes
1 answer

Get more computation power for developing java application

I'm developing Java application using Eclipse IDE. problem is my PC is every low performance with XP. My company has many PCs free. So is there way to distribute free computer power and get that computer power for my Android application development?
user881703
  • 1,111
  • 3
  • 19
  • 38
0
votes
1 answer

Running a C++ program (containing TCP IP code) inside a linux container.

I am in the process of simulating a distributed system using LXC (LinuX Containers). I understand that there is a config file where i specify all the things like hostname etc. then i can start a container. But how do I run my code in that container.…
Chani
  • 5,055
  • 15
  • 57
  • 92
0
votes
1 answer

State Naming Convention

I am working on a project, it has multiple parts in C# and Java(interoperable). I m working with files and it would be nice to keep states of the files. ie: which part did fail and where if there is a failure. also, if everything is processed mark…
DarthVader
  • 52,984
  • 76
  • 209
  • 300
0
votes
1 answer

Server returns 403 during secondary namenode docheckpoint with namenode

I am configuring hadoop on clusters. All node started successfully, but secondary node failed doCheckpoint with following log: 2011-10-25 11:09:07,207 ERROR org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode: Exception in doCheckpoint:…
Chris Zheng
  • 1,509
  • 1
  • 15
  • 20
0
votes
1 answer

I need suggestions about an implementation of Distributed Graph using MPI

Do you happen to know any robust, well-recongnised software library which implements distributed graphs using MPI? What I mean by distributed graph is a graph whose edges and vertices can be distributed over a cluster of processors in an abstract…
cpp_noname
  • 2,031
  • 3
  • 17
  • 30
0
votes
1 answer

Excel import to SQL Server using distributed queries

I was able to set this up successfully on my local and on production, as long as the user credentials used on running the script is a sysad. This url (http://www.excel-sql-server.com/excel-import-to-sql-server-using-distributed-queries.htm) By…
ace.spades
  • 33
  • 5
0
votes
1 answer

How should an WCF receiver service monitor an MSMQ queue and service requests in a real-world environment?

I have a service that queues requests in MSMQ. On the other side I have a WCF service that can handle those requests. How can I have the WCF receiver service respond when messages are available on the queue?
RBZ
  • 2,034
  • 17
  • 34
0
votes
0 answers

Distributed Locust testing gets CPU to 100% while other CPU's are almost idle

I am hoping someone can point me in the right direction. I am running locust to stress load one of our system and am running it distributed on my local machine. One master, four workers, spawning 10 users. (This is the minimal testing, I will ramp…
0
votes
0 answers

Are DHT algorithms good for handling large amount of counting states?

I wonder if a DHT algorithm like Kademlia is good at handling for my specific use case. I want to have a service that can maintain a large amounts of kv (string, int) pairs, within these kv pairs I also want the ability to do computation like how…
Baiqing
  • 1,223
  • 2
  • 9
  • 21
0
votes
0 answers

data shuffling issues with pyspark on multi-node cluster

I have time-series data, which I need to interpolate. There are several devices, that send the data for various submodules, that are connected to this device. I now want to interpolate the data per device and per submodule on a common…
bluhub
  • 129
  • 1
  • 2
  • 10
0
votes
0 answers

Using torch.distributed.all_reduce to eval

enter image description here enter image description here I add an online eval function in my training process, and this bug caused by dist.all_reduce only occurs after the eval function has been called many times. and after I delete this eval…
Castor
  • 1
0
votes
0 answers

Distributed worker load isn't reflected in master UI

In short, the exact same problem posted here 3 years ago. Running with 20 workers or 4 workers, the results are peculiar in the UI. They seem to reflect per-server metrics, and not the overall metrics of all workers. I request 500 users, the number…
0
votes
0 answers

What is the purpose of micrometer in opentelemetry? or can opentelementry add the traceid and spanid like sleuth, micrometer automatically for logs?

I am recently looking for the distributed tracing solution for my microservice architecture. my microservice architecture involved multiple different types of java application will be running. I have to provide the observability feature (logs,…
Vinoth
  • 7
  • 2
  • 10
0
votes
1 answer

JMeter distributed testing throws Connection timed out

Though I have tried the below steps, when I tried to start the Slave machine from the Master Machine GUI (Run-->Remote Start) getting the below Connection Timed Out Error Connection refused to host: 172.xx.x.xx; nested exception is: …
rpagadala
  • 796
  • 2
  • 15
  • 31