
Senior DevOps Engineer
Experience Level: Senior Level
Full Job Description
About the Role
We are seeking an experienced Senior DevOps Engineer to lead and elevate our cloud infrastructure, DevOps practices, and site reliability engineering initiatives. The ideal candidate will have a proven track record of managing cloud platforms, container orchestration, modern CI/CD workflows, and automation solutions. This role requires a balance of technical depth, leadership skills, and collaboration across teams. The position is designed for those who thrive in a hybrid work environment and are excited to tackle complex infrastructure challenges.
Key Responsibilities
- Cloud Infrastructure: Architect, deploy, and maintain scalable cloud infrastructure on Azure.
- Kubernetes & Containers: Design, deploy, and manage Kubernetes clusters and containerized environments with an emphasis on high availability, fault tolerance, and scalability.
- CI/CD Pipeline Management: Lead the design and implementation of modern CI/CD pipelines using platforms like GitHub Actions and Azure DevOps.
- Infrastructure as Code (IaC): Develop and manage infrastructure automation using Terraform, Ansible, and other IaC tools to ensure consistency, scalability, and agility.
- DevOps Optimization: Develop and extend solutions that optimize cloud infrastructure and DevOps workflows, driving continuous improvement.
- Performance & Cost Optimization: Continuously optimize cloud infrastructure for performance, cost efficiency, and security, ensuring sustainable scalability.
- Troubleshooting: Troubleshoot and resolve complex infrastructure and application issues, minimizing downtime and ensuring high system reliability.
- Scripting & Automation: Develop and maintain scripts and automation tools in Bash/Shell, Python, or Go to streamline operations.
- Networking & Security: Apply networking fundamentals (DNS, firewalls, load balancing, virtual networks, private endpoints) to improve system architecture and ensure security compliance.
- Collaboration & Mentorship: Collaborate with cross-functional teams to enhance developer productivity, system reliability, and operational efficiency. Mentor and guide team members, promoting best practices in DevOps and cloud operations.
- Compliance & Monitoring: Ensure infrastructure and systems adhere to security, compliance, and monitoring standards, with a focus on proactive management.
Qualifications
We require 6 to 8 years of experience in DevOps, Cloud Engineering, or Site Reliability Engineering (SRE).
Skills
Proficiency in Azure DevOps and DevSecOps is essential. We are looking for candidates with excellent communication skills and the ability to work effectively with diverse global teams. Key technical skills include:
- Proven experience in deploying and managing large-scale cloud infrastructure on Azure.
- Expertise in Kubernetes and containerization technologies (Docker, Helm, etc.).
- Strong experience with modern CI/CD tools like GitHub Actions and Azure DevOps.
- Proficiency in scripting languages such as Bash/Shell and programming languages like Python or Go.
- Hands-on experience with Terraform and Ansible for infrastructure automation and configuration management.
- Solid understanding of Linux administration and networking principles (DNS, firewalls, load balancing, VNet, etc.).
- Expertise in cloud performance optimization, cost control, and security best practices.
- Ability to drive optimizations across cloud infrastructure, performance, cost, and DevOps workflows.
- Excellent problem-solving and troubleshooting abilities, particularly in large-scale and distributed systems.
- Exceptional communication and collaboration skills, able to work across teams and geographies.
- Strong leadership abilities with a focus on mentoring and fostering a culture of continuous improvement.
- Adaptability to a hybrid work environment, balancing remote and office-based work.
- Passion for learning and staying up to date with emerging technologies in the DevOps and cloud ecosystem.
- Experience with observability tools like Dynatrace, Prometheus, Grafana, ELK stack, or Datadog.
- Familiarity with microservices and service mesh technologies.
- Proven experience in leading large-scale migration or cloud transformation projects.
An Azure Cloud certification is considered an added advantage.