Image Loading

Senior Site Reliability Engineer SaaS

Job Description

  • Chennai

About CloudBees

CloudBees is the leading software delivery platform that enables enterprises to deliver scalable, compliant, and secure software, empowering developers to do their best work.

Seamlessly integrating into any hybrid and heterogeneous environment, CloudBees is more than a tool—it's a strategic partner in your cloud transformation journey, ensuring security, compliance, and operational efficiency while enhancing the developer experience across your entire software development lifecycle. It allows developers to bring and execute their code anywhere, providing greater flexibility and freedom through fast, self-serve, and secure workflows.

CloudBees supports organizations at every step of their DevSecOps journey, whether using Jenkins on-premise or transitioning software delivery to the cloud and wanting to accelerate their cloud transformation by years. CloudBees is helping customers build the future, today.

About the Role

As a Senior SRE at CloudBees, you will be an essential contributor to the development of our industry-leading software products. You'll work within the SAAS Platform team to design, develop, and deliver high-quality solutions to achieve high availability and performance of our systems.

What You'll Do

  • Design, develop, and maintain infrastructure that will enable your team to deliver world class products that will deliver the Sec element of DevSecOps.
  • Design, develop, and maintain infrastructure using popular IaC tools and technologies like Terraform, Helm, others.
  • Implement and participate exercising best practices for CI/CD
  • Analyze and address complex technical challenges and issues that arise during the software development & run lifecycle. Debug, troubleshoot, and resolve technical problems efficiently.
  • Create and maintain technical documentation, including design specifications, user guides, and best practice guidelines. 
  • Share knowledge and contribute to internal and external technical communities.
  • Participate in Agile ceremonies, such as sprint planning, stand-up meetings, and retrospectives.
  • Collaborate with product managers, designers, and other engineers to ensure alignment and efficient project execution.
  • Share your expertise and mentor engineers, helping them grow and develop their skills. Foster a culture of continuous learning and improvement within the team.
  • Stay updated with the latest technologies, tools, and cloud computing. Proactively learn and adapt to new technologies to drive innovation.
  • Collaborate with customers to understand their needs, gather feedback, and provide technical support and guidance as needed.

Requirements

  • Bachelor’s or master’s degree in computer science or a related technical field
  • 5+ years of experience as an SRE
  • Must have proven experienced and strong skills in
  • IAC tooling – Terraform, Ansible, AWS Cloud Formation, Google Deployment Manager, Azure devops. 
  • GCP or AWS devops experience is mandatory.
  • CI/CD tools – Jenkins, bamboo, others 
  • Monitoring tools – Splunk, Prometheus, Grafana, DataDog, etc
  • Alerting tools – OpsGenie, PagerDuty, etc
  • Docker containerization
  • Orchestration and clustering using Kubernetes GKE / AWS EKS
  • Container networking concepts
  • Experienced working in 24x7 distributed team with
  • Responsibility of maintaining high availability/low latency systems 
  • Ability to work in rota. 
  • Experience in system health monitoring & recovery 
  • Automate manual tasks and reduce toil. 
  • Desirable experienced and strong skills in
  • AWS, GCP or Azure certifications 
  • Strong understanding of 
  • Networking concepts - firewalls, VPC, VPN, Subnetting, IDS
  • Infrastructure Security concepts – Identity access, security groups, ACL
  • Operating systems administration
  • Release management concepts - A/B, canary, pipelines
  • Highly analytical mindset, logical approach to find solutions and perform root cause analysis.
  • Excellent communication skills with ability to communicate test results to stakeholders in the functional aspect of the system and its impact. 
  • Experienced of working in an Agile environment with grasp of 
  • Scrum /Agile 
  • Ticket management
  • Requirement traceability
  • Continuous integration / continuous delivery 
  • Dependency management 
  • Proven ability to lead and guide technical projects and initiatives.

Skills

  • SRE
  • AWS
  • Terraform
  • CI/CD
  • Azure DevOps
  • GCP
  • Docker

Education

  • Master's Degree
  • Bachelor's Degree

Job Information

Job Posted Date

Jun 07, 2024

Experience

5-10 Years

Compensation (Annual in Lacs)

Best in the Industry

Work Type

Permanent

Type Of Work

8 hour shift

Category

Information Technology

Copyright © 2022 All Rights Reserved. Saas Talent