Site Reliability Engineer

Job Description

At Instabase, we're passionate about democratizing access to cutting-edge AI innovation so that any organization can solve previously unsolvable unstructured data problems in its industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index Ventures, our market opportunity is near infinite.

Instabase offers a consumption-based pricing model where customers can pay only for what they use, aligning directly with the value our products deliver. It empowers our clients to explore our AI Hub platform features extensively, enabling them to uncover crucial business insights. This customer-centric model allows Instabase to glean insights from diverse use cases and behaviors, ensuring we deliver top-tier solutions that provide unmatched advantages for everyday business operations.

With offices in San Francisco, New York, London, and Bengaluru, Instabase is a truly global company. We are people-first, and we've built a fearlessly experimental, endlessly curious, customer-obsessed team that works together to help organizations around the world turn their unstructured data into insights instantly.

Our Site Reliability Engineering team combines software engineering and systems engineering to build scalable, distributed, fault-tolerant systems. The team keeps a watchful eye on system performance, capacity, and failure modes to ensure high availability and the ability to grow.

What you will do: 

  • Work with all engineering teams throughout the software development lifecycle to ensure that we incorporate the tenets of building scalable, distributed, fault-tolerant systems
  • Perform production readiness checks for new and existing services and features by advising on design reviews, capacity management, system monitoring, alerting, and related areas
  • Maintain Instabase Platform and Applications by actively monitoring the health of the system
  • Build infrastructure that supports scaling up systems, removing manual processes, and increasing the speed of product development
  • Practice thorough Incident Response and Reviews

Our infrastructure is written in Go, Python, Java, and C++ and follows the microservices model. We use Docker and Kubernetes for our deployments.

About You:

  • BE or BS (or higher, e.g., MS, or Ph.D.) in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent technical experience
  • 3+ years of professional experience working in Production Engineering, Site Reliability Engineering (SRE), Production DevOps, or equivalent positions
  • Proficiency in programming languages or scripting languages
  • Experience working with Infrastructure as Code / Automation tools (Ansible, Terraform)
  • Experience with container orchestration systems
  • Proven track record of technical leadership
  • Strong knowledge of shipping impactful and complex software projects
  • Ability to set technical and cultural standards for engineers

Skills

  • SRE
  • DevOps
  • Programming Languages
  • Scripting Languages
  • IaC
  • Terraform

Education

  • Master's Degree
  • Bachelor's Degree

Job Information

Job Posted Date

Sep 20, 2024

Experience

3 to 7 Years

Compensation (Annual in Lacs)

Best in the Industry

Work Type

Permanent

Type Of Work

8 hour shift

Category

Information Technology
