
DevOps Engineer
Experience Level: Mid Level
Full Job Description
Job Title: Senior DevOps Engineer
Location: India (Remote)
About the Role: Platform9 is seeking a highly motivated and experienced DevOps Engineer to join our dynamic team. In this critical role, you will be instrumental in the design, implementation, and ongoing maintenance of our cloud infrastructure, ensuring high levels of availability, scalability, and security. You will collaborate closely with our engineering teams to automate deployments, manage infrastructure as code, and proactively troubleshoot production issues.
This position offers a unique opportunity to work with cutting-edge technologies and significantly contribute to the success of a rapidly expanding company. We provide a fast-paced, collaborative work environment that encourages continuous learning and skill development.
Key Responsibilities:
- Design, implement, and maintain our cloud infrastructure on AWS, including Kubernetes clusters, OpenStack environments, and associated supporting services.
- Automate infrastructure provisioning, configuration management, and application deployments utilizing tools such as Terraform.
- Implement and manage robust monitoring and logging solutions using Prometheus, Grafana, and other relevant technologies.
- Develop and maintain internal tooling and scripts to enhance operational efficiency.
- Effectively troubleshoot and resolve production issues related to infrastructure, applications, and performance.
- Collaborate with cross-functional engineering teams to implement and sustain CI/CD pipelines.
- Participate in an on-call rotation to guarantee 24/7 availability of critical services.
- Stay abreast of the latest advancements and trends in cloud computing and DevOps practices.
Qualifications:
- 2-4 years of experience in a DevOps or SRE role, with a deep understanding of cloud infrastructure and operations.
- Extensive experience with Kubernetes, including cluster administration, deployment strategies, and troubleshooting.
- Experience with OpenStack is highly desirable, though not mandatory.
- Proficiency in infrastructure-as-code tools such as Terraform or Ansible.
- Strong scripting skills in Python or similar languages.
- Strong programming skills in Golang or similar languages.
- Strong configuration management skills with Salt, Chef, or similar tools.
- Experience with observability tools including Prometheus, Cortex, Grafana, and Loki.
- Familiarity with CI/CD tools and best practices.
- Experience with administering and debugging Linux-based operating systems.
- Excellent problem-solving and troubleshooting capabilities.
- Strong communication and collaboration skills.
- Proven incident management experience.
Bonus Points:
- Experience with EKS (Elastic Kubernetes Service).
- Experience with Cluster API and Cluster API Provider for AWS.
- Experience managing on-premises infrastructure.
- Familiarity with OpenTelemetry and AI-powered observability tools.
- Experience working in a fast-paced startup environment.
Company
Platform9
Platform9 is a leading innovator in simplifying enterprise private cloud solutions. Founded by pioneers from the VMware cloud space, Platform9 is dedicated to transforming IT operations. Our core pro...