Image Loading

Senior Site Reliability Engineer

Job Description

  • Bengaluru, Karnataka, India

Why You Should Join Us

Provenir is a global fintech company with offices across North America, the UK, Latin America, India and Singapore. Recognized by the Global BankTech Awards as the the 2023 “Best Credit and Risk Solution by a Vendor”, we help fintechs, financial institutions, and payment providers make smarter decisions, faster. We are passionate about technology and empowering businesses to become industry leaders. As a leading provider of decisioning, and analytics products for financial services and other industries, we empower businesses to create digital-first decisioning solutions that drive business growth. If you’d like to work at an innovative fintech with a global footprint that is redefining the industry, then we want you!

Your Role:

As a Senior SRE, you will be integral to maintaining and enhancing our product hosting infrastructure on AWS, ensuring high performance, availability, and security. Your responsibilities will encompass resolving product-related issues, automating tasks, optimizing system performance, and collaborating closely with various teams to meet and exceed our operational goals.

Key Responsibilities:

  • Product Issue Resolution: Address and resolve technical challenges, collaborating with cross-functional teams to ensure high product availability and performance.
  • Collaboration: Work alongside development, DevOps, and clients to understand requirements, offering technical insights and solutions for product hosting and infrastructure.
  • Infrastructure Setup and Automation: Utilize tools like GIT, Jenkins, Terraform, and scripting to automate and enhance deployment and operational efficiency on multi-vendor cloud platforms.
  • Cloud Optimization: Employ observability tools (e.g., DataDog, NewRelic) for monitoring and optimizing hosting environment performance.
  • Task Automation: Implement scripting and automation for routine tasks to improve operational efficiency.
  • Documentation and Reporting: Maintain detailed documentation and provide regular reports on infrastructure performance and automation initiatives.

Qualifications and Skills

  • Experience: 6-10 years in a relevant role, with a solid background in software engineering and system administration.
  • Technical Proficiency: Skilled in Python, Go, or Java, with experience in Linux/Unix environments, Ansible, Terraform, Kubernetes, cloud platforms (AWS), and monitoring tools (Datadog, Splunk, Prometheus, Grafana).
  • Incident Management: Strong capabilities in incident management, performance optimization, and disaster recovery planning.
  • CI/CD and Version Control: Proficiency in CI/CD practices, tools Jenkins, Ansible, Terraform and Git.
  • Communication: Excellent communication and collaboration skills.
  • Leadership: Experience in leading projects, influencing strategy, and establishing partnerships.
  • On-Call Rotation: Willingness to participate in on-call duties as part of our Cloud Platform Operations team.
  • Certifications in intermediate linux system administration, Kubernetes, and experience in developing monitoring mechanisms for APM purposes would be a distinct plus.

What You’ll Love about Us

Our employees are empowered to be curious, forward-thinking leaders. We ask them to explore the uncharted and invent the unimagined. That’s what makes Provenir unique.

We offer comprehensive health and wellness plans. You will enjoy paid time off and company holidays, flexible and remote-friendly options, along with benefits to plan for your future.

 

Skills

  • Python
  • GO
  • Java
  • CI/CD
  • AWS
  • Jenkins
  • Kubernetes

Education

  • Master's Degree
  • Bachelor's Degree

Job Information

Job Posted Date

May 13, 2024

Experience

6 to 8 Years

Compensation (Annual in Lacs)

Best in the Industry

Work Type

Permanent

Type Of Work

8 hour shift

Category

Information Technology

Copyright © 2022 All Rights Reserved. Saas Talent