Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
27 views1 page

Sre JD

The Site Reliability Engineer (SRE) role focuses on ensuring scalable, observable, and reliable SaaS environments while enhancing user experience. Responsibilities include provisioning infrastructure as code, advocating for best practices, increasing automation, and managing incidents. Candidates should have extensive IT experience, particularly with Kubernetes, SaaS offerings, and cloud infrastructure, along with strong collaboration and learning abilities.

Uploaded by

Ashwani singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
27 views1 page

Sre JD

The Site Reliability Engineer (SRE) role focuses on ensuring scalable, observable, and reliable SaaS environments while enhancing user experience. Responsibilities include provisioning infrastructure as code, advocating for best practices, increasing automation, and managing incidents. Candidates should have extensive IT experience, particularly with Kubernetes, SaaS offerings, and cloud infrastructure, along with strong collaboration and learning abilities.

Uploaded by

Ashwani singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 1

JD - Site Reliability Engineer

The Site Reliability Engineering team (SRE) is committed to ensuring that our SaaS environments and
Platform solutions are scalable, observable, and reliable while providing an optimal user experience.
Bring your passion and software engineering best practices to scale the SaaS architecture and improve
reliability, increase automation, and remove toil.

Responsibilities:
 Be curious about new technology, infrastructure, and practices to scale our architecture and
prepare for future growth
 Provision of secure, reliable and scalable SaaS infrastructure via code (Infrastructure as Code)
 Through collaboration, advocate for reliability and scalability best practices throughout the
development lifecycle. Lead by example
 Increase automation and tooling to reduce toil and manual intervention
 Leverage metrics and traces to effectively observe systems and provide data and insights
 Automate data-driven alerts to proactively escalate issues. Work with development teams to
establish SLOs and improve reliability
 Apply software engineering best practices to develop SRE-managed services and tools
 Incident ownership and management

Skills:
 9 – 12 years of total IT experience
 4+ years of experience deploying and supporting Kubernetes clusters in a public, scalable SaaS
offering
 4+ years of experience in java/python/c#
 3+ years of experience with deploying and supporting a SaaS offering.
 3+ years of experience in IAC (Terraform/Helm)
 2+ years of experience as a Site Reliability Engineer
 2+ years of experience with each of the following:
◆ Cloud Infrastructure (Amazon Web Services (AWS))
◆ Continuous Deployment (GitHub Actions)
◆ Source control (github / gitlab / etc.)
 Self-starter with the ability to work independently on projects
 Proactive and strong ability to learn new things with limited guidance
 Demonstrated ability to work effectively within a team and with cross-functional teams
 A curious attitude that is interested in knowing why things work the way they do and using that
information to improve and enhance

You might also like