
Infra Tech Support Practitioner
Qualifications
Experience Level: Mid Level
- Expertise in Python or JavaScript for automation and tooling.
- Hands-on with cloud environments (AWS, GCP) and orchestration tools like Kubernetes and Terraform.
- Deep understanding of Linux systems, networking, and distributed architectures.
- Experience with observability solutions: Prometheus, Grafana, Datadog, CloudWatch, or New Relic.
- Familiarity with incident management and alerting platforms (PagerDuty, xMatters).
- Proficiency in CI/CD frameworks such as Jenkins.
Full Job Description
Infra Tech Support Practitioner - Site Reliability Engineering at Accenture, Kolkata
Accenture is seeking an Infra Tech Support Practitioner with a focus on Site Reliability Engineering (SRE) to join their team in Kolkata. This role is crucial for ensuring the stability, scalability, and high availability of production and development systems. You will be responsible for providing ongoing technical support, maintenance, and technology implementation across various platforms, encompassing both remote and onsite duties.
This position involves L1 and L2 (basic to intermediate) level troubleshooting. The ideal candidate will bridge the gap between business application development and IT operations, leveraging automation, observability, incident response, and performance engineering to maintain continuous service reliability while accelerating delivery velocity.
Key Responsibilities:
- Monitor and optimize system uptime, latency, and throughput to meet Service Level Objectives (SLOs), tracked through Service Level Indicators (SLIs).
- Lead incident response, manage escalations, perform root cause analysis (RCA), and drive postmortem reviews.
- Develop CI/CD pipelines, automate infrastructure management, and eliminate manual toil through scripting and orchestration.
- Implement metrics, logging, and tracing frameworks (e.g., Prometheus, Grafana, ELK, Datadog) for real-time visibility into distributed systems.
- Conduct resource forecasting, design scalable infrastructure, and manage performance during surge conditions.
- Partner with developers to ensure safe, reliable rollout of new features with automated testing and rollback mechanisms.
- Implement multi-region resilience strategies, chaos tests, and failover automation for business continuity.
- Utilize post-incident analytics to refine operational practices and drive data-driven improvements.
- Collaborate with product, design, ML, and DevOps teams to build intelligent workflows.
- Implement Infrastructure as Code (IaC) using tools like Terraform, CloudFormation, Azure DevOps, or Pulumi.
- Provide expert support for Cloud IaaS and PaaS services.
Required Technical Skills:
- Expertise in scripting languages such as Python, Go, Bash, or JavaScript for automation.
- Hands-on experience with cloud environments (AWS, Azure, GCP) and orchestration tools like Kubernetes and Terraform.
- Deep understanding of Linux systems, networking, and distributed architectures.
- Experience with observability solutions (e.g., Prometheus, Grafana, Datadog, CloudWatch, New Relic).
- Familiarity with incident management and alerting platforms (e.g., PagerDuty, xMatters).
- Proficiency in CI/CD frameworks (e.g., Jenkins, GitHub Actions, GitLab CI).
- Working knowledge of security, compliance, and performance optimization for highly available systems.
Qualifications:
- Minimum of 2 years of experience in Site Reliability Engineering.
- 15 years of full-time education.
Preferred Certifications:
- AWS Certified Solutions Architect Professional
- Microsoft Certified: Azure Solutions Architect Expert
- Google Professional Cloud Architect
- Certified Kubernetes Administrator (CKA)
- HashiCorp Certified: Terraform Associate
- DevOps Engineer certifications (AWS, Azure, or Google)
This position is based at Accenture's Kolkata office.