
Site Reliability Engineer (Rotation US Shift & Hybrid Work) for Asimily

Asimily, Pune, Maharashtra

By Asimily

Job Description

Company Name: Asimily

Website: https://asimily.com/

About Company

Asimily is an IoT analytics startup focused on solving security and operational use cases for connected devices in specific verticals (healthcare, buildings, industrial control systems, etc.). We are funded by a top-tier investor, have built the product, and are working with customers. The founders have deep experience in the product, the market, and the technology, alongside a strong engineering leader. One of the founders has run the connected device business unit at a Fortune 500 company.

We are looking for a Site Reliability Engineer to ensure the reliability, scalability, and performance of our systems and applications. You will design, implement, and maintain robust infrastructure and monitoring solutions, and collaborate cross-functionally to identify and resolve performance issues and drive system improvements.

Role: Site Reliability Engineer 

Experience: 4 to 7 Years

Location: Pune - Hadapsar (Hybrid)

Work Time: Rotation US Shift & Hybrid Work

Role & Responsibilities of the Site Reliability Engineer

  • As a Senior Site Reliability Engineer (SRE) at Asimily, you will be responsible for ensuring the reliability, scalability, and performance of our globally distributed hybrid cloud and on-premises infrastructure and applications.
  • You will lead efforts to operate and maintain high availability of software for multiple products running in a multi-region cloud environment, analyzing operational performance to deliver improvements in critical maintenance-related metrics.
  • You will drive focused initiatives to improve the operational efficiency and scalability of the platform and applications, including building tools and automation to eliminate manual work and reduce issue resolution time.
  • You will also set up and maintain cloud operations automation runbooks and tooling to monitor and maintain enterprise cloud-based applications, respond promptly to system outages, and drive war rooms to mitigate them.
  • This role offers the flexibility of remote work and involves regular on-call responsibilities, providing an opportunity to contribute to our dynamic startup environment across global time zones.

Key Responsibilities:

  • Architect, build, and maintain highly available, scalable infrastructure on both cloud platforms such as GCP and AWS, as well as on-premises environments, ensuring seamless integration and consistent performance.
  • Develop and implement monitoring and alerting solutions to proactively detect and respond to potential issues across our hybrid infrastructure, spanning cloud and on-premises environments.
  • Utilize Ansible for configuration management to automate deployment, configuration, and scaling processes across all infrastructure components, maintaining consistency and reliability.
  • Collaborate with cross-functional teams to establish and enforce best practices for system reliability, performance optimization, and incident management across hybrid cloud and on-premises environments.
  • Participate in bi-weekly on-call rotations, providing timely response and resolution to critical incidents during daytime shifts in both India and USA time zones, across all infrastructure layers.

Qualifications and Skills:

  • Bachelor's degree in Computer Science, Engineering, or related field, or equivalent practical experience.
  • Proven experience as a Site Reliability Engineer or similar role, focusing on designing and maintaining hybrid cloud and on-premises infrastructure.
  • Strong proficiency in at least one programming language (e.g., Python, Java) and expertise in scripting and automation.
  • Deep understanding of cloud computing platforms, particularly GCP, including hands-on experience with services such as Compute Engine, Cloud Storage, and Pub/Sub.
  • Experience with containerization technologies like Docker and container orchestration solutions, though Kubernetes is not required.
  • Proficiency in configuration management tools like Ansible for automating deployment and configuration tasks across hybrid environments.
  • Familiarity with monitoring and logging solutions such as Prometheus, Grafana, the ELK stack, or similar tools; hands-on experience is desirable.
  • Understanding of, or experience with, networking principles and protocols, including TCP/IP, DNS, HTTP, load balancing, and VPNs.
  • Experience working with Unix-based operating systems, particularly Linux distributions, in a production environment.
  • Working experience with source code version control systems such as Git.
  • Strong knowledge of Google Cloud Infrastructure, with experience in developing Google Cloud CLI scripts being an added advantage.
  • Working experience with Postgres database management.
  • Excellent problem-solving skills and the ability to troubleshoot complex issues in hybrid cloud and on-premises environments.
  • Effective communication and collaboration skills, with experience working in remote or distributed teams.

Preferred Qualifications:

  • Relevant certifications such as Google Cloud Professional Cloud Architect.
  • Previous experience working in a globally distributed on-call environment with hybrid cloud and on-premises infrastructure.
  • Knowledge of infrastructure security best practices and tools, including IAM and security groups specific to GCP.
  • Experience with CI/CD toolsets such as Jenkins or GitLab.
  • Experience with project management tools such as Asana is an added advantage.

Soft Skills:

  • Comfortable working in a fast-paced and dynamic environment, willing to collaborate diligently in a cross-functional, multi-geo team setup to meet project deadlines.
  • Demonstrates patience and tolerance when troubleshooting issues.
  • Exhibits excellent communication skills (written, verbal, & virtual) and has a strong drive, self-motivation, logical thinking, and attention to detail.
  • Passionate about adopting new technologies, software, and processes, with the ability to multitask effectively in a fast-paced environment with multiple deadlines.

Skills

  • Cloud Infrastructure
  • Linux Administration
  • Ansible
  • TCP/IP
  • Grafana
  • Load Balancing
  • GitLab

Education

  • Bachelor's Degree

Job Information

Job Posted Date: Apr 24, 2024

Experience: 4 to 7 Years

Work Type: Permanent

Type Of Work: Other

Job Location: Pune

Category: Information Technology

Application Ends: Jul 03, 2026
