Questions tagged [distributed]

Multiple computers working together, using a network to communicate

A distributed system consists of multiple autonomous computers that communicate through a . The computers interact with each other in order to achieve a common goal. A computer program that runs in a distributed system is called a distributed program, and distributed programming is the process of writing such programs.

2221 questions
24
votes
7 answers

Search/Find a file and file content in Hadoop

I am currently working on a project using Hadoop DFS. I notice there is no search or find command in Hadoop Shell. Is there a way to search and find a file (e.g. testfile.doc) in Hadoop DFS? Does Hadoop support file content search? If so, how to…
leon
  • 10,085
  • 19
  • 60
  • 77
24
votes
1 answer

What is the dIfference between a distributed system and a clustered system?

Both are defined to be a set of computers that work together and give the end users a perception of a single computer running behind it. So what is the difference here?
batman
  • 5,022
  • 11
  • 52
  • 82
24
votes
5 answers

How do you keep two related, but separate, systems in sync with each other?

My current development project has two aspects to it. First, there is a public website where external users can submit and update information for various purposes. This information is then saved to a local SQL Server at the colo facility. The second…
jeremcc
  • 8,633
  • 11
  • 45
  • 55
23
votes
1 answer

Distributed tensorflow: the difference between In-graph replication and Between-graph replication

I got confused about the two concepts: In-graph replication and Between-graph replication when reading the Replicated training in tensorflow's official How-to. It's said in above link that In-graph replication. In this approach, the client…
ROBOT AI
  • 1,217
  • 3
  • 16
  • 27
23
votes
3 answers

How to decide Kafka Cluster size

I am planning to decide on how many nodes should be present on Kafka Cluster. I am not sure about the parameters to take into consideration. I am sure it has to be >=3 (with replication factor of 2 and failure tolerance of 1 node). Can someone tell…
puneet
  • 530
  • 1
  • 3
  • 8
23
votes
8 answers

Distributed Computing Framework (.NET) - Specifically for CPU Intensive operations

I am currently researching the options that are available (both Open Source and Commercial) for developing a distributed application. "A distributed system consists of multiple autonomous computers that communicate through a computer network."…
holsee
  • 1,974
  • 2
  • 27
  • 43
21
votes
4 answers

Keeping distributed databases synchronized in a unstable network

I'm facing the following challenge: I have a bunch of databases in different geographical locations where the network may fail a lot (I'm using cellular network). I need to keep all the databases synchronized but there is no need to be in real time.…
jassuncao
  • 4,695
  • 3
  • 30
  • 35
20
votes
5 answers

Distributed ProjectManagement/Bug Tracking

Now that we have DSCMs, are there any Project Management / Bug Tracking tools that are distributed?
BubbaT
  • 1,810
  • 4
  • 21
  • 24
20
votes
7 answers

GUI recommendations for eventual consistency?

When using distributed and scalable architecture, eventual consistency is often a requirement. Graphically, how to deal with this eventual consistency? Users are used to click save, and see the result instantaneously... with eventual consistency…
Steve B
  • 36,818
  • 21
  • 101
  • 174
20
votes
9 answers

Horizontal vs Vertical Fragmentation in Distributed Database Management Systems (DDBMS)

In the context in DDBMS, what is the difference between Vertical Fragmentation and Horizontal Fragmentation? Is it such that the relation's extension is fragmented for vertical fragmentation and intension fragmentation is horizontal fragmentation?
DreX
  • 333
  • 1
  • 5
  • 17
20
votes
2 answers

What's the difference between ZooKeeper and any distributed Key-Value stores?

I am new to zookeeper and distributed systems, and am learning it myself. From what I understand for now, it seems that ZooKeeper is simply a key-value store whose keys are paths and values are strings, which is nothing different from, say, Redis.…
OneZero
  • 11,556
  • 15
  • 55
  • 92
19
votes
5 answers

What algorithms there are for failover in a distributed system?

I'm planning on making a distributed database system using a shared-nothing architecture and multiversion concurrency control. Redundancy will be achieved through asynchronous replication (it's allowed to lose some recent changes in case of a…
Esko Luontola
  • 73,184
  • 17
  • 117
  • 128
19
votes
9 answers

Job queue with job affinity

I am currently facing a problem for which I am pretty sure there is an official name, but I don't know what to search the web for. I hope that if I describe the problem and the solution I have in mind, somebody is able to tell me the name of the…
Golo Roden
  • 140,679
  • 96
  • 298
  • 425
19
votes
5 answers

Distributed Java Compiler

Is there a distributed compiler for Java, analogous to distcc for C/C++?
Boolean
  • 14,266
  • 30
  • 88
  • 129
19
votes
10 answers

What are the faster Paxos-related algorithms for consensus in distributed systems?

I've read Lamport's paper on Paxos. I've also heard that it isn't used much in practice, for reasons of performance. What algorithms are commonly used for consensus in distributed systems?
Rob Lachlan
  • 14,289
  • 5
  • 49
  • 99