Resiliency, also known as self-repair, is a Windows Installer concept triggering automatic repair of broken installations.
Questions tagged [resiliency]
60 questions
0
votes
1 answer
Chaos Toolkit support for CRI-O based runtimes? for instance will it work on OpenShift Container Platform v4.3 please?
Please can someone confirm if Chaos Toolkit (https://github.com/chaostoolkit) will work on OpenShift Container Platform version 4.3 please? which is based on CRI-O runtime. As previously had tried it with Gremlin, Ops-Monkey, Pumba and Litmus but…

Vijay
- 1
0
votes
0 answers
How to debug OOMKilled issues with Kubernetes / Why is traefik pod down
I'm testing a local app that use a service that wrap an external soap service.
When I make a request to get data from 1 week, everything works fine.
But when I make a request to get data from 1 month, my web server / reverse proxy falls ( traefik…

Juliatzin
- 18,455
- 40
- 166
- 325
0
votes
0 answers
Are there some common techniques for Java Web Servers to detect when approaching overload?
In a period where microservices become more and more predominant, I was wondering if there are some common techniques by which Java Web Services detect overload before start deteriorating.
Unfortunately, it doesn't look like there is One golden rule…

M4rk
- 2,172
- 5
- 36
- 70
0
votes
1 answer
Store and forward design pattern with multiple instances of a microservice
I am designing a service that will be receiving requests which must only be executed exactly once (can be received more than once due to unique id presence).
The flow is:
outside world -> HTTP -> [A set of instance of Myservice] -> HTTP ->
…

ren
- 115
- 3
- 14
0
votes
1 answer
How to Handle Resiliency in Sharded Mongo Cluster when updating more than 1 Document
Mongo Currently does not the transaction in the sharded cluster when we are updating more than one document. In such cases how we should handle partial failures. It there any recommended pattern for these kinds of problems.

user3814933
- 21
- 4
0
votes
1 answer
Kafka Resiliency (Clustering) setup on single and multiple machines
I have setup 1 Zookeeper and 3 Kafka Broker (for Redundancy) on a single machine.
I want to know what is the best practice for Kafka Setup on a single machine and multiple machines in a network.
for e.g. if I set up on a single machine how many…

Jason Foster
- 117
- 10
0
votes
0 answers
How to test the performance of a database during crash?
I am trying to test my database against crash, i.e. to see how does the DB behave during crash.
I realise that it is not an easy task, because it is not easy to crash the database at a right point of time. For instance, I want to crash my program…

Yang Mei Lian
- 75
- 1
- 11
0
votes
0 answers
Should servlet running under tomcat in a container exit when it is not able to reach DB container?
In our setup, we have two containers
Tomcat running on Linux Container
DB running on another Linux container
If the servlet running under tomcat is not able to reach DB container via hibernate. After retries,Should we call System.exit in Servlet ?

yogishaj
- 705
- 5
- 17
0
votes
1 answer
Is there any way to fetch Hystrix Stream using java SSE clients
Is there any way I can fetch Hystrix streams (which produces metrices), since it is a server side event stream, thus it is not possible to fetch it using normal http client.
I was trying to using Jersey based SSE, however it is not able to do…

Lovey
- 880
- 3
- 15
- 31
0
votes
1 answer
Implementing retry pattern for WCF call transient exceptions
I have a very simple WCF SOAP service that does some changes in DB upon every call and returns some response. I'm also using ServiceChannel to call it from my client side code.
I want to make sure that my client-side code does not just fail when…

Arsen Magomedov
- 480
- 8
- 21
0
votes
1 answer
EntityFramework 6 Database.SqlQuery connection resiliency
I know that from version 6 EntityFramework supports connection resiliency feature when working with SQL Azure database. I am pretty sure it applies to DbSet context members.
But what if I want to execute SQL query against the database like…

Andrei
- 42,814
- 35
- 154
- 218
-1
votes
1 answer
Implement Resiliency at protocol level
We are developing Azure function in .Net6 which is interacting with multiple 3rd party application over HTTP protocol.
We have implemented Polly to handle transient errors.
The only issue with Polly is that you have to wrap the code with the policy…

Sunil
- 17
- 3
-1
votes
1 answer
Why use AWS ELB over Route53 considering cost?
Imagine I have only two EC2 virtual machines running the same stateless workload and my priority is very high resilience, followed by cost.
Why would I choose a load balancer to share traffic between the two VMs, each one in a different AZ, instead…

ogre
- 21
- 3
-1
votes
1 answer
How to use Chaos Monkey on local cluster
I have a cluster back in my office for testing purposes. I have there a database and i would like to make all kind of "monekybusiness" to those test machines, long before i want to go to production.
I zipped 2-3 coffees all this morning trying to…

Mr'Black
- 274
- 1
- 6
- 19