Site Reliability Engineering

Jump to Content

  • Home
  • Books
  • Resources
    • Latest resources

      Creating a Production Launch Plan

      Training site reliability engineers

      Anatomy of an Incident

      Enterprise Roadmap to SRE

      Efficient Machine Learning Inference

      Incident Metrics in SRE

      Practical Guide to Cloud Migration

      SRE Best Practices for Capacity Management

      Supplementary Materials

      SRE Classroom: Distributed PubSub

    • Books

      Building Secure & Reliable Systems

      The Site Reliability Workbook

      Site Reliability Engineering

    • Mobaa

      2022 Gallery

      2020 Gallery

      Vector Methods

    • Classroom

      Distributed PubSub

      Distributed Image Server

      The Art of SLO

    • Latest resources
      • Resources overview
      • Creating a Production Launch Plan
      • Training site reliability engineers
      • Anatomy of an Incident
      • Enterprise Roadmap to SRE
      • Efficient Machine Learning Inference
      • Incident Metrics in SRE
      • Practical Guide to Cloud Migration
      • SRE Best Practices for Capacity Management
      • Supplementary Materials
      • SRE Classroom: Distributed PubSub
    • Books
      • Books overview
      • Building Secure & Reliable Systems
      • The Site Reliability Workbook
      • Site Reliability Engineering
    • Mobaa
      • Mobaa overview
      • 2022 Gallery
      • 2020 Gallery
      • Vector Methods
    • Classroom
      • Classroom overview
      • Distributed PubSub
      • Distributed Image Server
      • The Art of SLO
  • Careers
  • SRE in Cloud
  • Prodcast

Site Reliability Engineering

Jump to Content

SRE Books

Building Secure & Reliable Systems
Read online
The Site Reliability Workbook
Read online
Site Reliability Engineering
Book updates
Read online

Building Secure & Reliable Systems

Building Secure & Reliable Systems

By:
Heather Adkins, Betsy Beyer, Paul Blankinship, Ana Oprea, Piotr Lewandowski, Adam Stubblefield

Can a system be considered truly reliable if it isn't fundamentally secure? Or can it be considered secure if it's unreliable? Security is crucial to the design and operation of scalable systems in production, as it plays an important part in product quality, performance, and availability. In this book, experts from Google share best practices to help your organization design scalable and reliable systems that are fundamentally secure.

Read online

The Site Reliability Workbook

The Site Reliability Workbook

Edited by:
Betsy Beyer, Niall Richard Murphy, David K. Rensin, Kent Kawahara and Stephen Thorne

The Site Reliability Workbook is the hands-on companion to the bestselling Site Reliability Engineering book and uses concrete examples to show how to put SRE principles and practices to work. This book contains practical examples from Google’s experiences and case studies from Google’s Cloud Platform customers. Evernote, The Home Depot, The New York Times, and other companies outline hard-won experiences of what worked for them and what didn’t.

Read online

Site Reliability Engineering

Site Reliability Engineering

Edited by:
Betsy Beyer, Chris Jones, Jennifer Petoff and Niall Richard Murphy

Members of the SRE team explain how their engagement with the entire software lifecycle has enabled Google to build, deploy, monitor, and maintain some of the largest software systems in the world.

Read online

Interested in joining SRE?

Google strives to cultivate an inclusive workplace. We believe diversity of perspectives and ideas leads to better discussions, decisions, and outcomes for everyone.

Follow us

  • About Google
  • Google products
  • Privacy
  • Terms
  • Help