Saas Talent

Lead Site Reliability Engineer

Job Description

Cvent is a global meeting, event, travel, and hospitality technology leader, with more than 4000+ employees worldwide. As a leading cloud-based technology company, we have over 28,000+ customers, including 80% of the Fortune 100 companies, in more than 100 countries. Cvent’s software solutions optimize the entire event management value chain and have enabled clients around the world to manage hundreds of thousands of meetings and events. In addition to helping event planners navigate every aspect of the event process, we also provide an integrated platform to hoteliers to help create qualified demand for their hotels, manage that demand more efficiently, and measure their business performance in real-time.

• As a Lead - Site Reliability Engineer, you'll use your advanced development and operations knowledge to identify and prioritize issues. Find universal solutions to common problems and mentor and support junior staff. Additionally, you will:
• Enlighten, Enable and Empower a fast-growing set of multi-disciplinary teams, across multiple applications and locations.
• Tackle complex development, automation and business process problems.
• Champion Cvent standards and best practices.
• Ensure the scalability, performance, and resilience of our suite of products.
• Work with the development and product team of a new application to establish the right
monitoring and alerting strategy.
• Work with a new acquisition's DevOps team to cross -pollinate best practices, educate and close gaps in Cvent standards.
• Develop build, test and deployment automation that seamlessly targets multiple on-premises and AWS regions.
• Help a dev team working on a legacy code base to realize zero-down-time deployments• Give back by working on and contributing to Open Source projects
• Automate all the things!

Must Have:

• Experience with SDLC methodologies (preferably Agile software development methodology).

• Experience with software development - Knowledge of Java/Python/Ruby is a must. Preferably good understanding of Object-Oriented Programming concepts. • Exposure to managing AWS services / operational knowledge of managing applications in AWS

• Experience with configuration management tools such as Chef, Puppet, Ansible or equivalent

• Solid Windows and Linux administration skills.

• Working with APM, monitoring, and logging tools (New Relic, DataDog, Splunk)

• Experience in managing 3 tier application stacks / Incident response

• Experience with build tools such as Jenkins, CircleCI, Harness etc

• Exposure to containerization concepts - docker, ECS, EKS, Kubernetes

• Working experience with NoSQL databases such as MongoDB, couchbase, postgres etc

• Self-motivation and the ability to work under minimal supervision is must.

Good to Have:

• F5 load balancing concepts

• Basic understanding of observability & SLIs/SLOs

• Message Queues (RabbitMQ).

• Understanding of basic networking concepts

• Experience with package managers such as nexus, artifactory or equivalent

• Good communication skills

• People management experience

Skills

Linux System Administration
Configuration Management
Programming Concepts
Site Reliability Engineering
Puppet (Software)

Education

Master's Degree
Bachelor's Degree

Job Information

Job Posted Date

Oct 15, 2024

Experience

5-10 Years

Compensation (Annual in Lacs)

Best in the Industry

Work Type

Permanent

Type Of Work

8 hour shift

Lead Site Reliability Engineer

Job Description

Skills

Education

Job Information

Job Posted Date

Experience

Compensation (Annual in Lacs)

Work Type

Type Of Work

Category

Related Jobs

DevOps Engineer II

DevOps Engineer

Principal Engineer - Systems

Site Reliability Engineer II

Senior Solutions Engineer

Senior Observability Engineer, Pingdom