About the Role:
As a Staff Engineer - DevOps, you will maintain, manage, and improve the day-to-day operations that keep our platform secure and available.
What will you do?
- Execute standard operating procedures designed to ensure platform security and availability.
- Recommend and implement improvements to standard operating procedures.
- Execute planned stories where the implementation is straightforward.
- Participate in platform architecture development discussions.
- Work with members of the rest of the engineering team and customer success.
- Work with vendor support teams to solve issues.
What are we looking for?
Must have:
- BS/MS degree in Computer Science or a related field from a premier institute with top grades, or equivalent practical experience.
- 5+ years of hands-on experience in DevOps, with a focus on Azure cloud infrastructure and tools.
- Infrastructure as Code (IaC): Proven experience in provisioning and managing infrastructure using Terraform or Bicep in production environments.
- On-Call Support: Hands-on experience with production incident handling and being part of on-call rotations.
- Containerization: Expertise in packaging and deploying applications using Docker. Familiarity with orchestrating containers using Azure Kubernetes Service (AKS) or Kubernetes.
- CI/CD Pipelines: Experience in automating build and deployment pipelines using tools like Azure DevOps, Jenkins, Bamboo, or TeamCity.
- Monitoring & Observability: Experience in implementing and managing monitoring systems like Azure Monitor, Prometheus, Grafana, or equivalent tools to ensure service health.
- Web Servers: Experience with HTTP web servers such as Apache, Nginx, Tomcat, or Jetty.
- Networking Concepts: Solid understanding of at least four of the following:
  - DNS mechanics and configuration.
  - TCP handshake process.
  - iptables filtering.
  - TCP states and their significance.
  - Analyzing TCP traffic.
  - Routing, switching, subnets, VPN, and NAT.
- Configuration Management: Hands-on experience with at least one configuration management tool such as Ansible, Puppet, Chef, or Salt.
- Azure-Specific Expertise: Practical experience in provisioning, maintaining, and managing at least five of the following Azure resources:
  - Virtual Networks (VNets), Gateways, Subnets, Route Tables, and Network Security Groups.
  - Azure Scale Sets, Virtual Machines, and Auto Scaling configurations.
  - Azure Batch and job management (Pools, Jobs, and Applications).
  - Azure Blob Storage and policies.
  - Azure DevOps Pipelines (CI/CD).
  - Azure Role-Based Access Control (RBAC) and Identity Management.
  - Application Gateway and Load Balancers.
- Linux Systems Administration: Proficiency in managing packages and configurations on GNU/Linux distributions: installing packages, configuring services, managing service states, debugging startup issues, and analyzing system logs. Familiarity with disk management and debugging. Understanding of at least one init system (e.g., systemd, Upstart).
- HTTP Protocols: Knowledge of HTTP headers, request/response lifecycle, and troubleshooting.
- Soft Skills: Strong communication and collaboration abilities. Effective articulation of issues with team members and external vendors/support personnel.
This role requires a proactive individual with a deep understanding of Azure DevOps practices, strong technical foundations, and excellent problem-solving skills.
Good to Have:
- Proficiency in one programming language, preferably Golang or Java.
- Microsoft Certified: DevOps Engineer Expert.