Job Description
We are looking for a Senior Cloud Reliability engineer to run our Multi-Cloud ( AWS, GCP, Azure-TBD) SaaS environment reliably, securely, and efficiently using best-of-the-breed Enterprise-grade SaaS tools and automation.
Responsibilities
- Use your experience in software development, systems engineering, networking and solution design to proactively prevent repeat but avoidable issues.
- Define, architect, and implement tools to enhance Observability via monitoring/alerting, Availability, Scalability, Security,and cost efficiency of our SaaS Platform
- Drive a culture of intolerance to repeat manual activity which results in a highly automated environment delivering scalable SaaS solutions.
- Participate in on-call rotation for critical Cloud Infra systems, lead the incident review, and root cause analysis to Provide relief and sustainable resolution to issues within our infrastructure application stack.
- Drive Integration/automation initiatives using appropriate build/Buy trade-offs across the spectrum of DevSecFinOps domain for the Thoughtspot SaaS ecosystem.
- Achieve quantifiable SaaS operational Excellence measured by related SLI/SLO/SLA
- Define Architecture and drive development of a multi cloud SaaS operations and status portal
Required Skills/qualifications
- B.Tech. degree in Computer Science or equivalent.
- At least 5 years of Enterprise SaaS DevSecFinOps experience
- Proficient in programming in Cloud Infrastructure specific languages like. Go, Terraform, Python, Ruby, Bash
- Expertise in Cloud Security and/or Cloud networking
- Expertise in K8s administration based on cloud offered technologies EKS/GKE ecosystem.
- Expertise in implementing operating an enterprise grade observability ( metrics, logs, tracing), alerting stack in a Cloud SaaS environment
- Experience in Linux Internals ( troubleshooting, tuning, monitoring, performance), virtualization, DevOps tools ( CI and CD), and cloud technologies.
- Strong debugging and problem-solving skills (network, systems, database, and application).
- Team-first attitude and uncompromising attention to detail.
- Good collaboration and communication skills.
- Experience/ Knowledge in Cloud Services, Kubernetes, Cloud Databases like Postgres/RDS/mysql/DynamoDB, Elastic, Kafka and Microservice architecture is a bonus.
- Advanced professional certifications from Cloud Providers ( AWS, Azure, GCP) in domains like K8s, Solution architecture, networking, databases is a bonus.
- Full Stack Architecture/Development Experience of a SaaS Operational Portal for an enterprise application is a bonus.
- Experience with AI/ML, MLOps using genAI to auto detect, remediate SaaS issues and achieve Operations Autopilot is a bonus.
What makes ThoughtSpot a great place to work?
ThoughtSpot is the experience layer of the modern data stack, leading the industry with our AI-powered analytics and natural language search. We hire people with unique identities, backgrounds, and perspectives—this balance-for-the-better philosophy is key to our success. When paired with our culture of Selfless Excellence and our drive for continuous improvement (2% done), ThoughtSpot cultivates a respectful culture that pushes norms to create world-class products. If you’re excited by the opportunity to work with some of the brightest minds in the business and make your mark on a truly innovative company, we invite you to read more about our mission, and apply to the role that’s right for you.