Although site reliability engineering has been around for a while, it has only recently gained fame in general software circles. But there are still a lot of questions as to what a site reliability engineer (SRE) does. Much of what we know comes from the book Site Reliability Engineering from Google. And we’ll refer to that book a few times in this post.
SREs have been compared to operations groups, system admins, and more. But the comparison falls short in encompassing their role in today’s modern software environment. They cover more responsibilities than operations. And though they usually have a background in system administration, they also bring software development skills to the role. SREs combine all these skills and ensure that complex distributed systems run smoothly.
So how do they do all this? Read further to find out how SREs accomplish this through the responsibilities they fulfill.Read More