Sr Staff/Principal Devops Engineer
Responsibilities
Qualifications & Requirements
Experience Level: Senior Level
Full Job Description
Senior Staff/Principal DevOps Engineer - Balbix - Delhi NCR
Balbix is seeking a highly skilled Senior Staff/Principal DevOps Engineer to join our expanding team in Delhi NCR. This pivotal role is instrumental in empowering our engineering team and ensuring the delivery of a best-in-class, scalable platform. The position demands expertise across diverse hosting environments and technology stacks, with a strong emphasis on AWS, Infrastructure as Code (IaC), continuous deployment, and observability.
The ideal candidate will demonstrate a proven history of architecting sophisticated DevOps systems, scaling applications and infrastructure on a massive scale, and adeptly resolving complex challenges within AWS, IaC, and CI/CD automation. A capacity for rapid learning and adaptation to evolving technologies, particularly within AWS and other cloud platforms, is essential.
Responsibilities:
- Lead the development of key DevOps initiatives, define technical strategy, and influence the organization's overall technical direction.
- Address complex technical problems, mentor senior engineers, and foster collaboration with cross-functional teams to deliver impactful DevOps solutions.
- Design and develop Infrastructure as Code (IaC) components for Balbix's solutions and internal engineering tools deployed on AWS.
- Construct and deploy a cutting-edge security SaaS platform leveraging the latest CI/CD methodologies, ensuring full automation, repeatability, and robust security.
- Implement robust infrastructure security using best practices, including TLS, bastion hosts, certificate management, authentication and authorization, and network segmentation.
- Design and build a scalable and cost-effective deployment infrastructure on Kubernetes.
- Establish and implement consistent observability systems for Balbix solutions.
- Participate in an on-call rotation to ensure platform availability.
About You:
- You are a collaborative individual, adept at working across teams, taking ownership of cross-organizational projects, and leading operational initiatives.
- You possess strong communication skills, maintain excellent documentation practices, and can articulate technical concepts effectively within a team environment.
- You are comfortable working within an agile development framework.
- You exhibit a strong curiosity for technology, industry trends, and DevOps best practices, demonstrating a commitment to continuous learning.
Qualifications:
- Bachelor's degree in Computer Science or a related technical field.
- 10+ years of experience in DevOps for a Senior Staff role, or 12-15 years for a Principal role.
- Minimum of 4 years of experience setting up and managing AWS infrastructure for product development organizations.
- Demonstrated ability to independently architect, design, document, and implement complex platforms and DevOps systems.
- In-depth knowledge of AWS infrastructure and services, including load balancers (ALB/ELB), IAM, KMS, Networking, EC2, CloudWatch, CloudTrail, CloudFormation, Lambda, and others.
- 4+ years of experience building infrastructure with Terraform.
- 3+ years of solid experience with Kubernetes and Helm.
- Expert-level programming proficiency in Python for scripting and automation.
- Excellent knowledge of configuration management systems such as Ansible.
- Hands-on experience with CI/CD code management and deployment tools like GitLab, Jenkins, or similar platforms.
- Familiarity with components commonly used in public cloud environments, such as Nginx, HAProxy, Kafka.
- Experience managing Observability systems using Grafana, Prometheus, or the LGTM stack is a plus.
- Experience deploying and managing databases such as Aurora RDS (Postgres), ElasticCache, Cassandra, OpenSearch/Elasticsearch, and ClickHouse is a plus.
- Relevant cloud certifications (e.g., Cloud Solutions Architect, DevOps, Developer, Database—AWS certifications preferred) are advantageous.
- MLOps experience is a plus.
- Experience building or working with the latest AI technologies for infrastructure management is a plus.
- Exceptional time management skills, maintaining focus and composure under pressure, meeting deadlines, and adapting to rapidly changing priorities.
- Clear and effective written and verbal communication abilities.
- Availability for on-call duties.