Questions tagged [reliability]

Reliability is the ability of a system or component to perform its required functions under stated conditions for a specified period of time.

305 questions
7
votes
5 answers

How to improve email sending and delivery reliability?

The current application uses Simple Java Mail to send a couple of emails a day, but some of the emails never make it to the client. Based on the application server logs, there have been a couple of mail server timeouts, but that does not explain all the cases of…
Petteri H
  • 11,779
  • 12
  • 64
  • 94
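
For the timeout case mentioned in this question, one common mitigation is to retry the send with exponential backoff. The following is only a minimal sketch in Python rather than Java, and it does not use Simple Java Mail; `send_with_retry` and its parameters are hypothetical names, and it assumes that transient failures surface as `TimeoutError` or `ConnectionError`. It would not help with messages that are accepted by the mail server and lost afterwards.

```python
import random
import time


def send_with_retry(send, attempts=4, base_delay_s=1.0, sleep=time.sleep):
    """Call send() until it succeeds or attempts run out.

    `send` is any zero-argument callable (hypothetical stand-in for the
    real mail call). The last error is re-raised if every attempt fails.
    """
    for attempt in range(1, attempts + 1):
        try:
            return send()
        except (TimeoutError, ConnectionError):
            if attempt == attempts:
                raise  # out of attempts: let the caller log or queue the mail
            # Exponential backoff with jitter: base, 2*base, 4*base, ...
            # each scaled by a random factor between 0.5 and 1.0.
            delay = base_delay_s * 2 ** (attempt - 1) * random.uniform(0.5, 1.0)
            sleep(delay)
```

The `sleep` parameter exists only so the waiting can be replaced in tests; in production the default `time.sleep` would be used.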
6
votes
1 answer

Why does PodDisruptionBudget have both minAvailable and maxUnavailable options?

While I understand the use of minAvailable and maxUnavailable to an extent, I wonder why both options are exposed when one appears to be the complement of the other. Given that PDBs are immutable, changes to these parameters are rarely made (by…
Ela
  • 313
  • 3
  • 14
6
votes
1 answer

WCF reliable session without transport security does not raise the faulted event in time

I have encountered a very interesting behavior of reliable sessions. I am using netTcp binding + duplex channel + reliable session. When I try to listen on channel.faulted, if the security mode is set to transport, the faulted event would…
Yuan
  • 2,690
  • 4
  • 26
  • 38
6
votes
1 answer

Do CPUs make mistakes?

Imagine that a regular computer intensively works for 5 years non-stop. The CPU always works at 100% and is constantly reading and writing to memory. Is it true that the computer will not make a single mistake?
mosceo
  • 1,222
  • 11
  • 26
6
votes
1 answer

How can I terminate myself if I run too long?

I have an application that runs periodically (it's a scheduled task). The task is launched once a minute, and normally only takes a few seconds to do its business, then exits. But there's a ~1 in 80,000 chance (every two or three months) that the…
Ian Boyd
  • 246,734
  • 253
  • 869
  • 1,219
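
One general way to approach the question above is a watchdog timer started at launch that kills the process if the task has not finished in time. The sketch below is in Python and makes no claim about the asker's platform or language; `start_watchdog`, `run_task`, and the 30-second limit are hypothetical, and the default action assumes that an abrupt exit without cleanup is acceptable.

```python
import os
import threading
import time


def start_watchdog(timeout_s, on_timeout=None):
    """Start a daemon timer that fires if it is not cancelled within timeout_s."""
    if on_timeout is None:
        # Assumed default: exit immediately with a nonzero status,
        # skipping cleanup handlers that might themselves hang.
        on_timeout = lambda: os._exit(1)
    timer = threading.Timer(timeout_s, on_timeout)
    timer.daemon = True  # the timer never keeps a finished process alive
    timer.start()
    return timer


def run_task():
    # Placeholder for the real periodic work.
    time.sleep(0.01)
    return "done"


if __name__ == "__main__":
    watchdog = start_watchdog(30.0)  # hypothetical limit for a task that takes seconds
    run_task()
    watchdog.cancel()  # finished in time, so disarm the watchdog
```

Because the timer runs on a separate thread, it can still fire when the main thread is stuck in a blocking call, which an in-line elapsed-time check cannot do.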
6
votes
1 answer

McDonald's omega: warnings in R

I'm computing omega for several different scales and get different warning messages for different scales with different omega functions in R. My questions are regarding how to interpret these warnings and if it is safe to report the retrieved omega…
Oscar Kjell
  • 1,599
  • 10
  • 32
6
votes
5 answers

How to test reliability of my own (small) embedded operating system?

I've written a small operating system for an embedded project running on small to medium targets. I added some automated unit tests with high code coverage (>95%), but the scope is only the static part. I got some code metrics such as complexity and…
TridenT
  • 4,879
  • 1
  • 32
  • 56
5
votes
3 answers

How to deal with ^C in JVM console applications?

When a JVM-run (written in Scala actually, but I tend to believe that the solution is going to be pretty much the same for Groovy, Clojure or pure Java) console program of mine gets terminated by the user pressing Ctrl+C (or by the system shut-down…
Ivan
  • 63,011
  • 101
  • 250
  • 382
5
votes
1 answer

Reliable UDP in C#

There are several articles on the internet about how to make UDP reliable. I have not been able to find one for C#, so maybe I can implement my own algorithm. From researching on the internet, I believe UDP has two problems: it does not ensure that all…
Tono Nam
  • 34,064
  • 78
  • 298
  • 470
5
votes
2 answers

How to calculate one-sided tolerance interval with scipy

I would like to calculate a one-sided tolerance bound based on the normal distribution given a data set with known N (sample size), standard deviation, and mean. If the interval were two-sided I would do the following: conf_int =…
Gigi
  • 347
  • 4
  • 11
5
votes
1 answer

Reliable fire-n-forget Kafka producer implementation strategy

I'm in the middle of a first-mile problem with Kafka. Everybody deals with partitioning, etc., but how to handle the first mile? My system consists of many applications producing events distributed on nodes. I need to deliver these events to a set of…
Yuri
  • 4,254
  • 1
  • 29
  • 46
5
votes
1 answer

C++ - Zlib - Header and No Header support. How reliable?

I am currently working on a huge project that may compress/decompress thousands of documents a day using zlib in C++. (Our implementation uses zlib 1.2.8.) Our current implementation supports compressed files both with and without headers,…
James
  • 53
  • 3
5
votes
3 answers

Redundancy, reliability and fault tolerance in C# - where to look for examples?

I want to learn how to create truly robust applications in .NET: ones that are fault tolerant and capable of withstanding unexpected situations. Where can I find literature/guidance on this subject? So far, I am not having much luck.
Dmitri Nesteruk
  • 23,067
  • 22
  • 97
  • 166
5
votes
6 answers

How best can I isolate my application from an unreliable database?

I have a Java SOAP data service which sits on top of a Sybase database which, for reasons out of my control, has unreliable performance. The database is part of a vendor package which has been modified by an internal team and most of the issues are…
John Channing
  • 6,501
  • 7
  • 45
  • 56
5
votes
2 answers

Having error queues in celery

Is there any way in Celery to automatically put a task into another queue if its execution fails? For example, if the task is running in a queue x, on exception enqueue it to another queue named error_x. Edit: Currently I am using…
Vignesh
  • 315
  • 4
  • 14