Image Loading

Cloud Reliability Engineer - FinOps

Job Description

We are looking for a Cloud Reliability Engineer to join our team, focusing on maintaining the reliability and availability of our cloud-based infrastructure with a specific focus on cloud cost operations (Fin Ops). In this role, you will be responsible for analyzing and optimizing cloud costs across various cloud platforms (AWS, GCP, Azure). You'll work to build structured processes for managing cloud cost operations, monitoring usage, bringing visibility to cost trends, and automating cost-optimization efforts.

Responsibilities

  • Monitor and manage cloud-based infrastructure to ensure high availability, performance, and security.
  • Respond to alerts and incidents, troubleshooting and resolving issues swiftly to minimize downtime.
  • Monitor cloud usage and set up automated alerts and reports to ensure cost visibility and avoid overspending.
  • Collaborate with Engineering and Finance teams to bring transparency into cloud cost operations and ensure alignment with budgets.
  • Build and maintain dashboards that provide real-time insights into cloud cost metrics and trends.
  • Automate cloud cost optimization tasks, such as rightsizing resources, enforcing policies, and scaling infrastructure based on cost efficiency.
  • Participate in on-call rotations to respond to critical cloud infrastructure issues.
  • Assist in the optimization and maintenance of monitoring and alerting systems for cloud environments.

Required Skills/Qualifications

  • Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent work experience.
  • 3-5 years of experience in cloud operations, system administration, or related fields.
  • Familiarity with cloud platforms such as AWS, GCP, or Azure.
  • Experience in programming and automation (e.g., Python, Golang, Bash, etc.).
  • Strong problem-solving skills, particularly in managing incidents under pressure.
  • Knowledge of cloud platforms (AWS, GCP, Azure) and their pricing models.
  • Experience with cost monitoring tools and cloud management platforms (e.g., AWS Cost Explorer, GCP Cost Management, or third-party tools).
  • Ability to analyze and interpret cloud billing data and create actionable insights.
  • Familiarity with scripting (Python, Bash, etc.) for automating tasks related to cloud cost management.
  • Strong problem-solving and analytical skills, especially in managing cloud budgets and identifying savings opportunities.

Preferred Skills

  • Experience with cloud-native tools (e.g., CloudWatch, Stackdriver) and automation frameworks.
  • Knowledge of containers and Kubernetes (EKS/GKE/AKS).
  • Experience with FinOps tools, cloud tagging strategies, and cost allocation frameworks.
  • Familiarity with cloud-native cost management and optimization tools (e.g., AWS Trusted Advisor, GCP Recommender).
  • Experience with CI/CD pipelines and DevOps tools for automation and infrastructure as code.

Working Conditions

  • On-call responsibilities for critical cloud infrastructure issues.
  • Fast-paced, collaborative environment with an emphasis on cloud operations, cost management, and automation.

Skills

  • Cloud Operations
  • Operating Systems
  • Cloud platform
  • Python
  • Bash
  • CI/CD
  • Devops

Education

  • Master's Degree
  • Bachelor's Degree

Job Information

Job Posted Date

Sep 21, 2024

Experience

3 to 5 Years

Compensation (Annual in Lacs)

Best in the Industry

Work Type

Permanent

Type Of Work

8 hour shift

Category

Information Technology

Copyright © 2022 All Rights Reserved. Saas Talent