Highest Voted 'fault-tolerance' Questions

3

votes

2 answers

How to design : Avoid resource leaking when randomly accessing files

I have client/server application where the client app will open files. Those files get split in chunks, and sent to the server. Not only does the client send file chunks, but it sends other data as well. Each message (data or filechunk) has a…

java fault-tolerance

asked Mar 24 '12 at 13:20

Charles V.G.

169
4
11

3

votes

2 answers

What it the real benefit from Erlang's fault tolerance for a web project?

Let's assume we have a web project in which we want to have ~10000 web clients connected to the server simultaneously. Let's also assume that one client session lasts about 25 minutes. If we compare LAMP stack or any other popular web…

erlang lamp fault-tolerance

asked Oct 13 '11 at 10:27

skanatek

5,133
3
47
75

3

votes

1 answer

If a node of a DHT fails, will the values become unavailable?

I'm reading up about DHTs, but struggle to find information on what the consequences are for DHT values when a node fails. As far as I understand, without redundancy of data (hash table values) the failure of a single node would simply make the…

protocols distributed-computing theory fault-tolerance dht

asked Sep 10 '20 at 10:39

creativecoding

247
2
9

3

votes

3 answers

Fault Tolerance in MapReduce

I was reading about Hadoop and how fault tolerant it is. I read the HDFS and read how failure of master and slave nodes can be handled. However, i couldnt find any document that mentions how the mapreduce performs fault tolerance. Particularly, what…

mapreduce distributed-computing fault-tolerance

asked Apr 28 '11 at 04:35

Chander Shivdasani

9,878
20
76
107

3

votes

2 answers

How to make reliable, scalable redis on Kubernetes

I have been searching alot on how to deploy redis with high availability on kubernetes. I have some problems using redis cluster mode and when using the master-slave mode we need to also deploy sentinel to be able to handle master failures I have…

kubernetes redis high-availability fault-tolerance

asked Sep 17 '19 at 03:27

ElGenius

135
1
1
11

3

votes

0 answers

Bulk Unload from Redshift to S3 Interrupted

I wrote a python script that will do a bulk unload of all tables within a schema to s3, which scales to petabytes of data. While my script was running perfectly okay, my python script got interrupted due to a network disconnection. Now, I'm in the…

python database amazon-redshift database-migration fault-tolerance

asked Nov 15 '18 at 00:04

Praneeth Turlapati

56
4

3

votes

1 answer

Achieve Fault Tolerance with Consul Cluster

I have created consul server cluster using different ports in localhost. I used below commands for that. server 1: consul agent -server -bootstrap-expect=3 -data-dir=consul-data -ui -bind=127.0.0.1 -dns-port=8601 -http-port=8501 -serf-lan-port=8303…

spring-boot high-availability consul service-discovery fault-tolerance

asked Sep 27 '18 at 11:14

Ishara Madhawa

3,549
5
24
42

3

votes

1 answer

Python ZeroMQ broadcasting messages

I am going to implement a Practical Byzantine Fault Tolerance ( PBFT ). Hence, I am going to have multiple processes, P0 is going to initialize a round, by sending a first message. Is it possible to broadcast a message to all other processes using…

python zeromq fault-tolerance

asked Apr 28 '18 at 22:03

dilot

67
6

3

votes

1 answer

Fault Tolerance of FlinkKafkaConsumer in HiBench

I am running some experiments to test the fault tolerance capabilities of Apache Flink. I am currently using the HiBench framework with the WordCount micro benchmark implemented for Flink. I noticed that if I kill a TaskManager during an execution,…

apache-kafka intel apache-flink flink-streaming fault-tolerance

asked Apr 06 '18 at 16:48

Valerio

105
1
6

3

votes

1 answer

Do we need PBFT algorithm support in permissioned Block chain networks?

I am new to BCT. My question is why do we need a consensus algorithm such as PBFT in a permission based Block chain network where the nodes are trusted nodes. Is it only to find a way when nodes fail or is there any other use case. Can anyone…

blockchain fault-tolerance consensus

asked Apr 03 '18 at 09:46

Satya Narayana

454
6
20

3

votes

4 answers

How does HP/Tandem NonStop achieve single failure FT without spares?

As far as I could gather from Wikipedia and the mindboggling HPE website, the claim to fame of the NonStop system architecture is that it can achieve a single-failure FT without having to allocate excessive amounts of spare capacity (i.e. in…

fault-tolerance hp-nonstop tandem

asked Feb 01 '18 at 02:26

ddimitrov

3,293
3
31
46

3

votes

0 answers

Why should an HDFS cluster not be stretched across DCs?

It's easy to find well regarded references stating that HDFS should not be stretched across data centers [1], while Kafka should be stretched [2]. What specific issues make HDFS ill-suited to being stretched? I'm considering stretching HDFS across…

hadoop apache-kafka hdfs fault-tolerance disaster-recovery

asked Jul 22 '17 at 00:10

Paul Carey

1,768
1
17
19

3

votes

3 answers

What is the purpose of stopping actors in Akka?

I have read the Akka docs on fault tolerance & supervision, and I think I totally get them, with one big exception (no pun intended). Why would you ever want/need to stop a child actor??? The only clue in the docs is: Closer to the Erlang way is…

akka fault-tolerance

asked Jan 13 '16 at 15:51

smeeb

27,777
57
250
447

3

votes

2 answers

Akka + WithinTimeRange

I've testing the fault tolerant system of akka and so far it's been good when talking about retrying to send a msg according the maxNrOfRetries specified. However, it does not restart the actor within the given time range, it restarts all at once,…

scala akka fault-tolerance

asked Aug 07 '15 at 12:39

Thiago Pereira

1,724
1
17
31

3

votes

2 answers

Mitigating Hadoop's Achilles tendons

I just gave this Hadoop tuorial a read which state that Hadoop has an Achilles' tendon (a single point of failure) in JobTracker: The JobTracker is a single point of failure for the Hadoop MapReduce service which means if JobTracker goes down, all…

java hadoop fault-tolerance resiliency

asked Jun 25 '15 at 14:21

smeeb

27,777
57
250
447

Questions tagged [fault-tolerance]