Lead Site Reliability Engineer

Job Description

Here at UKG, our purpose is people™. Our HR, payroll, and workforce management solutions help organizations unlock happier outcomes for all. And our U Krewers, who build those solutions and support our business, are talented, collaborative, and innovative problem-solvers. We strive to create a culture of belonging and an employee experience that empowers our people – both at work and at home. Our benefits show that we care about the whole you, from adoption and surrogacy assistance to tuition reimbursement and wellness programs. Our employee resource groups provide a welcoming place to land, learn, and connect with those who share your passions and interests. What are you waiting for? Learn more at

Description
Site Reliability Engineers at UKG are critical team members who have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden, and support our service delivery processes. This can include automated testing, performance analysis, observability, and auto-remediation.

Site Reliability Engineers must be passionate about learning and evolving with current technology trends. They strive to innovate and are relentless in pursuing a flawless customer experience. They have an “automate everything” mindset, helping us bring value to our customers by deploying services with incredible speed, consistency, and availability.

In support of our culture, we have adopted a hybrid working model of 3 days per week in the office and the rest of the week from home. The UKG office that this position can be connected to is Noida, India.

Job Responsibilities

  • Engage in and improve the lifecycle of services from conception to EOL, including system design consulting and capacity planning
  • Define and implement standards and best practices related to system architecture, service delivery, metrics, and the automation of operational tasks
  • Support services, product & engineering teams by providing common tooling and frameworks to deliver increased availability and improved incident response
  • Improve system performance, application delivery and efficiency through automation, process refinement, postmortem reviews, and in-depth configuration analysis
  • Collaborate closely with engineering professionals within the organization to deliver reliable services
  • Increase operational efficiency, effectiveness, and quality of services by treating operational challenges as software engineering problems (reducing toil)
  • Actively participate in incident response, including on-call responsibilities
  • Partner with stakeholders to influence and help drive the best possible technical and business outcomes

Qualifications
Basic Qualifications:

  • Engineering degree, or a related technical discipline, or equivalent work experience
  • Experience coding in higher-level languages (e.g., Python, JavaScript, C++, or Java)
  • Demonstrated understanding of best practices in metric generation and collection, log aggregation pipelines, time-series databases, and distributed tracing
  • Demonstrable fundamentals in two of the following: Computer Science, Cloud Architecture, Security, or Network Design
  • Working experience with industry-standard tools such as Terraform and Ansible
  • 3+ years of hands-on experience working in Engineering or Cloud
  • 3+ years of experience with public cloud platforms (e.g., GCP, AWS, Azure)
  • Experience working with automation

Preferred Qualifications

  • Experience with distributed system design and architecture
  • Experience with containerization technologies
  • Experience in configuration and maintenance of applications and/or systems infrastructure for a large-scale, customer-facing company
  • Experience building and managing CI/CD pipelines

Skills

  • C++
  • Python
  • JavaScript
  • Cloud Architecture
  • Network Design
  • CI/CD
  • Security

Education

  • Master's Degree
  • Bachelor's Degree

Job Information

Job Posted Date

Sep 20, 2024

Experience

3 to 7 Years

Compensation (Annual in Lacs)

₹ Market Standard

Work Type

Permanent

Type Of Work

8 hour shift

Category

Information Technology

Copyright © 2022 All Rights Reserved. Saas Talent