Image Loading

Senior Observability Engineer, Pingdom

Job Description

 

Get to know Okta

Okta is The World’s Identity Company. We free everyone to safely use any technology—anywhere, on any device or app. Our Workforce and Customer Identity Clouds enable secure yet flexible access, authentication, and automation that transforms how people move through the digital world, putting Identity at the heart of business security and growth.

At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we’re looking for lifelong learners and people who can make us better with their unique experiences.

Join our team! We’re building a world where Identity belongs to you.

Senior Web Observability Engineer

We are looking for an experienced BT Site Observability Engineer to join our Business Technology team and help build a new function within the BT SRE team. The Site Reliability Engineering team is looking to expand its scope and provide observability capabilities for critical okta.com and auth0.com properties, in addition to critical applications within the Okta corporate environment.

We are looking for a smart, innovative, and passionate engineer for this role, someone who is interested in best practices around observability, incident management, and security. The ideal candidate welcomes the challenge of building in a dynamic and ever changing environment, and is interested in bringing a culture of operational excellence to a new team. They enjoy seeing their designs run at scale with automation, testing, and an excellent operational mindset. If you exemplify the ethics of, "know about a problem before your users," we want to hear from you!

Responsibilities

  • Build out observability program and process, recommending and implementing tooling and services
  • Managed the security of critical Okta properties and manage security issues such as DDoS attacks and rate limiting
  • Ensure our critical infrastructure is meeting uptime and availability standards, and is stable for our Okta customers
  • Drive initiatives to evolve our observability platforms to increase efficiency in line with current standards and best practices, especially around incident management
  • Build data pipelines into Splunk and use your expertise to build queries and dashboards for a variety of stakeholders
  • Recommend, develop, implement, and manage appropriate policy, standards, process, and procedural updates
  • Discover and execute on opportunities to automate and increase our automation

Qualifications

  • Proficient with observability tools including Pingdom, New Relic, Cloudwatch and Prometheus/Grafana
  • Proficient with logging and SIEM tools, especially Splunk
  • Proficient with web security and web security tooling
  • Experience working in SOC/NOC teams and handling outage escalations and remediation
  • Experience with monitoring of hosted platforms, such as Adobe Experience Manager
  • Experience with automating systems and infrastructure via Terraform
  • Proficient with Git and building deployment pipeline using commercial tools, especially Gitlab
  • Demonstrated ability to develop complex applications for cloud infrastructure at scale and deliver projects on schedule and within budget
  • Experience with reliability engineering concepts and security best practices on public cloud platforms and web applications
  • Experience with developing tooling and automation in Bash, Python, Go, etc.
  • Familiar with Linux system administration skills
  • Good communication skills, with the ability to influence others and communicate complex technical concepts to different audiences
     

Skills

  • SIEM
  • Terraform
  • Python
  • Bash
  • Linux System Administration
  • Grafana

Education

  • Master's Degree
  • Bachelor's Degree

Job Information

Job Posted Date

Feb 21, 2025

Experience

4 to 8 Years

Compensation (Annual in Lacs)

Best in the Industry

Work Type

Permanent

Type Of Work

8 hour shift

Category

Information Technology

Copyright © 2022 All Rights Reserved. Saas Talent