Site Reliability Engineering (SRE) Fundamentals

Join us on September 22nd to learn Site Reliability Engineering (SRE) principles and practices that you can apply in your organization to make your systems more scalable, reliable, and efficient.

Technical Account Manager Pamella Canova will lead the session, which covers:

- The core problems SRE solves and the organizational structures that facilitate the practice of SRE
- Key principles SREs use to keep systems reliable
- Areas of responsibility and expertise amongst SREs
- How to adopt SRE best practices in your organization

Join, learn, and engage with the Community → https://goo.gle/google-cloud-community

07:06 The SRE approach to operations
09:04 What do SRE teams do?
10:10 SRE and DevOps
11:03 Error budgets: The key principle of SRE

23:57 Practice areas of SRE
24:17 Monitoring and alerting
26:57 Demand forecasting and capacity planning
29:04 Efficiency and performance
30:55 Change management
34:00 Pursuing maximum change velocity
39:55 Provisioning
41:50 Emergency response
44:09 Incident and postmortem thresholds
48:31 Culture of blamelessness
49:55 Toil management / operational work

52:55 Getting started in 4 steps
55:14 Resources and certification information

56:55 Q&A
