Questions tagged [sre]

Site Reliability Engineering (SRE), a reliability focused implementation of DevOps.

Highest level concern is to design, build and support software with "ever-watchful eye on system availability, latency, performance, and capacity".

SRE has started at Google but has now been adopted by several other companies.

49 questions
-1
votes
1 answer

What is meant by symptom based monitoring and cause based monitoring?

In SRE context, what is meant by symptom and cause-based monitoring? why it is so important? And which tools are used for these kinds of monitoring?
Navin prasad
  • 558
  • 1
  • 8
  • 18
-2
votes
1 answer

how and where to learn site reliability engineering course?

can any one help me where and how to learn site reliability engineering course, i am not getting proper links any where. i went through this links 1.…
kumar
  • 1
  • 1
-3
votes
1 answer

information for debugging issues

Question 1 ____________ are a treasure trove of information for debugging issues. a) Databases b) Infrastructure as a Code tools c) Networks d) Logs **please anyone answer me**
-3
votes
1 answer

How google runs production systems - what's really the "50% time for project work" for SRE?

Quote: "SREs at 50% of their time. Their remaining time should be spent using their coding skills on project work." (page 7)" I'm reading this book, and realy can't understand. What is "project work"? It is production code or ansible yaml?
1 2 3
4