Cloud Operations Engineer

Job Description

Why You Should Join Us

Provenir is a global fintech company with offices across North America, the UK, Latin America, India and Singapore. Recognized by the Global BankTech Awards as the 2023 “Best Credit and Risk Solution by a Vendor”, we help fintechs, financial institutions, and payment providers make smarter decisions, faster. We are passionate about technology and empowering businesses to become industry leaders. As a leading provider of decisioning and analytics products for financial services and other industries, we empower businesses to create digital-first decisioning solutions that drive business growth. If you’d like to work at an innovative fintech with a global footprint that is redefining the industry, then we want you!

What You'll Do

As a Cloud Operations Engineer, you will play a crucial role in managing and supporting the infrastructure necessary for hosting our products on the AWS cloud. Your expertise in resolving product-related issues, coupled with your ability to automate tasks and optimize performance, will ensure our hosting operations are seamless and efficient.

Your responsibilities will include, but are not limited to, the following:

Product Issue Resolution: Tackle technical challenges identified through monitoring tools or reported by customers, ensuring timely resolution of issues related to product hosting, infrastructure setup, networking, security, and more. Work closely with cross-functional teams, product development, and clients to maintain high availability and performance.

Collaboration: Engage with cross-functional teams, product development, DevOps, and clients to comprehend their requirements, offer technical guidance, and address product hosting and infrastructure concerns. Participate actively in incident, change, and problem management, adhering to ITIL best practices.

Infrastructure Setup and Automation: Establish and maintain essential infrastructure components and services for product hosting on multi-vendor cloud platforms. Utilize tools such as Git, Jenkins, and Terraform, along with scripting, to automate setup processes and improve deployment and configuration efficiency and overall operational effectiveness.

Cloud Optimization: Utilize cloud observability tools such as Datadog, New Relic, Splunk, and Prometheus to monitor and enhance the hosting environment's performance and health. Identify and resolve performance bottlenecks, ensuring optimal performance and availability.

Task Automation: Leverage scripting and automation tools to automate routine tasks, including backups, scaling, monitoring, and maintenance, thereby boosting operational efficiency and minimizing manual effort.

Documentation and Reporting: Keep comprehensive documentation of infrastructure setups, configurations, and best practices. Regularly report on infrastructure performance, issue resolutions, and automation efforts to stakeholders.

Qualifications, Strengths, and Skills

  • Experience: 6+ years
  • Strong Linux Skills: Experience overseeing the day-to-day operations of cloud-based applications running on production Linux environments, ensuring their stability, performance, and security. This includes patch management, performance tuning, and system monitoring, as well as familiarity with JVMs, heap dumps, system performance analysis, installations, configurations, upgrades, and proficient command-line usage.
  • Cloud Support Operations: Proven background in service operations roles, especially in daily customer interactions, technical issue resolution through ticket triaging, and independent RCA drafting.
  • SaaS and AWS Experience: Demonstrated experience with SaaS solutions and services, particularly managing enterprise applications on AWS cloud, focusing on availability and performance. In-depth knowledge of AWS services like Storage, Databases, IAM, ECS, EKS, and CloudWatch.
  • Troubleshooting and Problem-Solving: Exceptional skills in diagnosing and resolving technical issues related to product hosting and infrastructure.
  • Cloud Observability and Monitoring: Experience with tools like Datadog, Splunk, Grafana, and Prometheus for cloud observability, monitoring, and alerting.
  • Infrastructure Management: Experience managing cloud infrastructure upgrades and change management. Proficiency in using tools like Jenkins, Terraform, and scripting/automation for infrastructure setup, automation, release, and task management.
  • Kubernetes and Containerization: Experience with Kubernetes and familiarity with containerization technologies is highly desirable.

Skills

  • SaaS
  • Linux
  • Cloud support
  • Infrastructure Management
  • Kubernetes
  • Test Automation

Education

  • Master's Degree
  • Bachelor's Degree

Job Information

Job Posted Date

Nov 30, 2024

Experience

6 to 8 Years

Compensation (Annual in Lacs)

₹ Market Standard

Work Type

Permanent

Type Of Work

8 hour shift

Category

Information Technology

Copyright © 2022 All Rights Reserved. Saas Talent