SRE

Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SRE

Here are 657 public repositories matching this topic...

rundeck / rundeck

isno / theByteBook

unixorn / git-extra-commands

nobl9 / sloctl

jaegertracing / jaeger-ui

nobl9 / terraform-provider-nobl9

robusta-dev / holmesgpt

DataDog / chaos-controller

wmariuss / awesome-devops

antonputra / tutorials

bregman-arie / devops-exercises

fkie-cad / Logprep

kaytu-io / kaytu

runatlantis / atlantis

seveas / herd

christiangalsterer / pg-promise-prometheus-exporter

christiangalsterer / node-postgres-prometheus-exporter

christiangalsterer / kafkajs-prometheus-exporter

k8sgpt-ai / k8sgpt

cloudprober / cloudprober