Image Loading

Site Reliability Engineer

Job Description

  • Bengaluru, Karnataka, India

About NetApp

NetApp is the intelligent data infrastructure company, turning a world of disruption into opportunity for every customer. No matter the data type, workload or environment, we help our customers identify and realize new business possibilities. And it all starts with our people.

If this sounds like something you want to be part of, NetApp is the place for you. You can help bring new ideas to life, approaching each challenge with fresh eyes. We embrace diversity and openness because it's in our DNA. Of course, you won't be doing it alone. At NetApp, we're all about asking for help when we need it, collaborating with others, and partnering across the organization - and beyond.

"At NetApp, we fully embrace and advance a diverse, inclusive global workforce with a culture of belonging that leverages the backgrounds and perspectives of all employees, customers, partners, and communities to foster a higher performing organization."-George Kurian, CEO

Job Summary

As a Keystone Site Reliability Engineer, you will be operating at the intersection of development and operations. Your role will involve engaging in and enhancing the lifecycle of Keystone services - from monitoring, end of month reporting, working on operation critical issues and refinement. You will be responsible for responding to SLA adherence and play a crucial role in sustainably scaling systems through automation and driving changes that improve reliability and velocity.

Due to the critical nature of the services, this position requires you to be a motivated self-starter and self-learner, possess strong problem-solving skills, and be someone who embraces challenges This role offers the opportunity to work in a dynamic, global environment, ensuring the smooth operation of services.

Job Requirements

  • Must have prior work experience on any other NetApp product and should be able to lead other engineers technically on customer issues
  • Directly influence the decisions and outcomes related to solution implementation. Influence other SREs on new improvements to ensure scalability
  • Issue Tracking and Resolution: Use SNOW, Jira to track and resolve issues based on their priority. Implement SRE best practices for effective resolution will be needed.
  • Document system knowledge as you acquire it through confluence pages, creating KB articles and ensure stakeholders are updated
  • Security Management: Stay updated with security protocols and proactively identify, diagnose, and resolve complex security issues.
  • Deep working knowledge of Containers, Kubernetes, and python.
  • Team Collaboration and Influence: Work in tandem with other SRE Engineers to ensure issues are tracked till closure and blockers are removed quickly and efficiently. Additionally, consult and influence developers on new feature development and software architecture to ensure scalability.
  • Experience with cloud platforms such as AWS, Azure, or Google Cloud.
  • Strong oral & written communication skills are essential

Education

  • A minimum of 5 to 8 years of experience is required.
  • A Bachelor of Science Degree in Computer Science, a master’s degree; or equivalent experience is required.

Skills

  • GCP
  • Python
  • Kubernetes
  • Reliability
  • Site Reliability Engineer
  • System Performance
  • Troubleshooting

Education

  • Master's Degree
  • Bachelor's Degree

Job Information

Job Posted Date

Jul 12, 2024

Experience

5 to 9 Years

Compensation (Annual in Lacs)

₹ Market Standard

Work Type

Permanent

Type Of Work

8 hour shift

Category

Information Technology

Copyright © 2022 All Rights Reserved. Saas Talent