Staff DevOps Engineer
Experience Level: Senior Level
Full Job Description
Staff DevOps Engineer in Delhi NCR at Balbix
Balbix is seeking a Staff DevOps Engineer to join our expanding team. This role is essential to empowering our engineers and ensuring that the company delivers a best-in-class, scalable platform. The position involves working across diverse hosting environments and technology stacks, with a strong emphasis on AWS, Infrastructure as Code (IaC), and continuous deployment. You will also be responsible for configuring and deploying IT and engineering infrastructure, both on-premises and in the cloud.
The ideal candidate possesses a proven history of massively scaling applications and infrastructure, excels at resolving complex challenges in AWS, IaC, and CI/CD automation, and demonstrates an aptitude for rapidly learning and adapting to evolving technologies within AWS and other cloud platforms.
Responsibilities:
- Collaborate with the existing DevOps team to architect and build IaC components for the Balbix solution and internal engineering tools hosted on AWS.
- Develop and deploy a cutting-edge security SaaS platform utilizing the latest automated, repeatable, and secure CI/CD methodologies.
- Administer Linux systems at scale through automation.
- Implement robust infrastructure security measures, including TLS, bastion hosts, certificate management, authentication and authorization, and network segmentation.
- Work alongside the DevOps team to design, develop, and manage deployments across multiple Kubernetes clusters.
- Oversee the management, maintenance, and monitoring of our infrastructure.
- Partner with the DevOps team to establish and implement a comprehensive system for logging, monitoring, and diagnostics for Balbix solutions.
Qualifications:
- Over 8 years of experience in DevOps/Platform Engineering.
- More than 4 years of experience setting up AWS infrastructure for SaaS-based product development.
- In-depth knowledge of AWS infrastructure and services such as load balancers (NLB/ALB/ELB), IAM, KMS, Networking, EC2, CloudWatch, CloudTrail, and Lambda.
- Over 4 years of experience building infrastructure using Terraform.
- At least 3 years of solid experience with Kubernetes and Helm.
- Excellent command of configuration management systems like Ansible.
- Familiarity with CI/CD code management and deployment tools such as GitLab and Docker.
- Experience with Nginx, HAProxy, Kafka, and other common public cloud components.
- Experience with Grafana/Prometheus or the LGTM (Loki, Grafana, Tempo, Mimir) stack is advantageous.
- MLOps experience is a plus.
- Experience deploying and managing Aurora/RDS PostgreSQL, ElastiCache, Cassandra, OpenSearch/Elasticsearch, and ClickHouse is beneficial.
- Experience building or working with the latest AI technologies for infrastructure management is a plus.
- Exceptional time management skills, with the ability to remain focused and calm under pressure when facing competing deadlines and to adapt to changing priorities.
- Clear and effective written and verbal communication skills.
- Willingness to participate in on-call duties.