Job Description
KodeKloud is a fast-growing EdTech SaaS company dedicated to transforming the way businesses upskill their teams in DevOps, Cloud Computing, and IT by offering hands-on, practical learning experiences.
Trusted by over 1 million users worldwide, KodeKloud ranked #4 on the Straits Times’ List of Fastest Growing Companies in 2024. Our mission to revolutionize DevOps education is championed by our team of over 100 exceptional individuals across the globe.
As trailblazers in the remote-first workforce, we seamlessly blend work from cozy cafes, sunny beaches, mountain retreats, and vibrant cities, or from our company-sponsored home office setups. Meanwhile, our HR team is meticulously planning our next meetup in a unique corner of the globe, where our globally dispersed teams can reconnect, recharge, and rebond.
Wondering if you will fit in? Read on!
Role Overview
We are seeking a skilled DevOps/SRE Engineer with coding experience to join our engineering team. This role will be responsible for designing, implementing, and maintaining our infrastructure and deployment pipelines primarily on GCP, as well as contributing to the development of internal tools and automation scripts.
Key Responsibilities
- Develop and manage containerized applications using Docker and orchestrate them with Kubernetes
- Develop and maintain CI/CD pipelines using tools such as GitLab CI or GitHub Actions
- Automate operational tasks and create self-service tools for the development team
- Monitor system performance and reliability, implementing improvements where necessary
- Participate in on-call rotations and incident response
- Collaborate with development teams to improve application performance and scalability
- Implement and maintain logging, monitoring, and alerting solutions
- Contribute to the development of internal tools and services using languages such as Python or Go
- Continuously evaluate and implement new technologies to improve our infrastructure and processes
Required Skills and Experience
- 3-5 years of experience in DevOps, Site Reliability Engineering, or a similar role
- Strong coding skills in at least one programming language (e.g., Python, Go)
- Proficiency with Docker and Kubernetes for containerization and orchestration
- Hands-on experience with GCP
- Experience with CI/CD tools and practices
- Knowledge of monitoring and logging tools (e.g., Prometheus, Grafana)
- Understanding of network protocols and security best practices
- Experience with version control systems, preferably Git
Desired Competencies
- Problem-solving: Ability to troubleshoot complex issues and implement effective solutions
- Automation mindset: Constantly seeking opportunities to automate manual processes
- Continuous learning: Keeps up-to-date with the latest trends and technologies in DevOps and cloud computing
- Communication: Excellent verbal and written communication skills, able to explain technical concepts to non-technical stakeholders
- Teamwork: Collaborates effectively with cross-functional teams
- Adaptability: Comfortable working in a fast-paced, ever-changing environment
- Security-focused: Understands and implements security best practices in all aspects of work
- Performance optimization: Able to identify and resolve performance bottlenecks in infrastructure and applications
Qualifications
- Bachelor's degree in Computer Science, Software Engineering, or a related field (or equivalent work experience)
- Relevant certifications (CKA/CKS or GCP ACE) are a plus