Image Loading

Site Reliability Engineer

Job Description

About PhonePe Group: 

PhonePe is India’s leading digital payments company with 50 crore (500 Million) registered users and 3.7 crore (37 Million) merchants covering over 99% of the postal codes across India. On the back of its leadership in digital payments, PhonePe has expanded into financial services (Insurance, Mutual Funds, Stock Broking, and Lending) as well as adjacent tech-enabled businesses such as Pincode for hyperlocal shopping and Indus App Store which is India's first localized App Store. The PhonePe Group is a portfolio of businesses aligned with the company's vision to offer every Indian an equal opportunity to accelerate their progress by unlocking the flow of money and access to services.

Culture

At PhonePe, we take extra care to make sure you give your best at work, Everyday! And creating the right  environment for you is just one of the things we do. We empower people and trust them to do the right  thing. Here, you own your work from start to finish, right from day one. Being enthusiastic about tech is a  big part of being at PhonePe. If you like building technology that impacts millions, ideating with some of  the best minds in the country and executing on your dreams with purpose and speed, join us!

Roles and Responsibilities

  • Troubleshoot issues across the entire stack - hardware, software, application, and network
  • Work to improve the reliability and performance of the next generation of distributed systems
    and containerized deployments
  • Work to improve the reliability and performance of the next generation of distributed systems
    and containerized deployments
  • Diagnose and troubleshoot complex distributed systems handling millions of queries per second
  • Day-to-day work is heavily command-line driven, which requires a strong understanding of Linux.
  • Participate in on call rotation Design build and maintain core infrastructure that enables Phonepe scaling to support hundreds of thousands of concurrent users
  • Actively take part in the Analysis and System improvement plan.
  • Drive performance testing, capacity planning and high availability practices.
  • Own implementations of new technologies while ensuring proper testing and documentation.
  • Proactively monitor/identify/solve issues which could have a potential impact to our Infrastructure.
  • Natural team player and also have a resourceful attitude.
  • Buddy new team members, and get them production ready.

Skills Required

  • Minimum of 5-7 years of strong hands-on experience in Linux / Unix System Administration, including TCP/IP, DNS, and load balancers.
  • Expertise in managing and scaling proxy infrastructure, including configuring and optimizing
    proxies (e.g. Nginx, HAProxy). 
  • Knowledge in Database technologies, specifically in MySQL/NoSQL. Good to have exposure on Aerospike NoSQL.
  • In-depth knowledge in Python to automate tasks with minimal intervention.
  • Knowledge of Linux cloud services using kvm/qemu/lvm.

Skills

  • Linux System Administration
  • NoSQL
  • Python
  • TCP/IP
  • Cloud services

Education

  • Master's Degree
  • Bachelor's Degree

Job Information

Job Posted Date

May 02, 2025

Experience

5 to 7 Years

Compensation (Annual in Lacs)

₹ Market Standard

Work Type

Permanent

Type Of Work

8 hour shift

Category

Information Technology

Copyright © 2022 All Rights Reserved. Saas Talent