
DevOps Engineer
Infra360 is seeking a skilled DevOps Engineer to join our team in Gurugram, India. In this role, you will build, automate, and manage robust cloud infrastructure, ensuring high availability, security, and scalability. The ideal candidate has hands-on experience in production environments, particularly with AWS and infrastructure automation tools such as Terraform. Proficiency in CI/CD pipelines and container orchestration technologies such as Kubernetes, ECS, or EKS is essential. A strong understanding of cloud networking, monitoring, and security best practices is required. You should be adept at using languages such as Python, Bash, or Go to automate deployments and operational tasks.
Your responsibilities will include troubleshooting and optimizing cloud environments, enhancing observability with tools like Prometheus, Grafana, ELK, or Datadog, and ensuring seamless incident management and system reliability. We value strong problem-solving skills, a proactive approach, and the ability to collaborate effectively with development and SRE teams.
Preferred experience includes exposure to multi-cloud environments (AWS, GCP, Azure), configuration management and packaging tools (Ansible, Helm), and security hardening techniques.
Tech Stack Expertise:
- Kubernetes: In-depth knowledge of Kubernetes clusters, container orchestration, and architecture.
- Terraform: Extensive practical experience with Infrastructure as Code (IaC) for managing cloud resources.
- ArgoCD: Experience with continuous deployment and GitOps workflows.
- Helm: Expertise in managing Kubernetes applications.
- Cloud Platforms: Proficiency in AWS, GCP, or Azure is advantageous.
- Debugging and Troubleshooting: Ability to identify and resolve complex issues in distributed environments, including networking and infrastructure misconfigurations.
Key Responsibilities:
- Manage CI/CD pipelines and configuration management.
- Perform root cause analysis (RCA) for production issues and implement resolutions.
- Configure failover, disaster recovery (DR), backups, logging, monitoring, and alerting systems.
- Containerize diverse applications and deploy them on Kubernetes.
- Conduct capacity planning for environment infrastructure.
- Ensure zero downtime for critical services.
- Perform database administration for SQL and NoSQL databases.
- Implement and manage Infrastructure as Code (IaC).
- Optimize infrastructure to minimize costs.
- Implement appropriate security measures.
Ideal Candidate Profile:
- Bachelor's or Master's degree in Computer Science or a related field.
- 2-4 years of solid DevOps experience in a Linux environment.
- Enthusiasm for working with our specified tech stack.
- Excellent communication skills.
- Self-starter with the ability to work independently.
- Hands-on experience with scripting languages (Bash, Python, Go, etc.).
- Proficiency with version control systems like Git.
- Strong experience with Amazon Web Services (EC2, RDS, VPC, S3, Route 53, IAM, etc.).
- Proven experience managing production systems.
- Ability to diagnose and fix issues across various architectural layers in production environments.
- Knowledge of SQL and NoSQL databases and search engines such as Elasticsearch and Solr.
- Understanding of networking concepts, firewalls, load balancers, and web servers such as Nginx and Apache.
- Experience with automation tools such as Ansible/SaltStack and Jenkins.
- Experience with Docker/Kubernetes platforms and OpenStack management is desirable.
- Familiarity with HashiCorp tools (Vault, Vagrant, Terraform, Consul, etc.) and VirtualBox is desirable.
- Experience managing or mentoring small teams (2-3 people) is desirable.
- Experience with monitoring tools like Prometheus/Grafana/Elastic APM.
- Experience with logging tools like ELK/Loki.
Company
Infra360
Infra360 is a prominent global cloud consulting firm dedicated to delivering innovative, cloud-native solutions and outstanding customer service. We collaborate with clients to modernize and optimize ...