The mission critical Google platforms are maintained teams consisting of Google Site Reliability Engineers (SREs). SREs distinguish themselves from common system administrators by applying engineering and scientific methods for solving operational problems. Fundamentally, SRE is what you get when you treat operations as if it’s a software problem. In this blog post I distilled several key points that proved most relevant for my team which has the mission of designing and operating private database cloud for Oracle and Microsoft SQL Server databases. Operating on these principles over the last years not only has helped us achieve a high level of customer satisfaction with our service, but it also facilitated the creation of a very attractive working environment. Continue Reading →