Driffle
Driffle6h ago
Foundit

DevOps Engineer

Gurugram, Gurgaon / Gurugram, India
Full Time
Mid Level

Auto Apply to 50+ AI Matched DevOps Engineer Jobs

Use Auto Apply Agents to Bulk Apply jobs with ATS Optimised Resumes, find verified Insider Connections for jobs at Driffle

Full Job Description

About Driffle:

Driffle is a global digital goods marketplace specializing in digital gaming products, including games, gift cards, DLCs and more across 140+ countries. We are a team of gamers with the aim of making gaming accessible and affordable to everyone. Operating across multiple jurisdictions, Driffle facilitates high-volume cross-border transactions and works with global payment service providers to deliver seamless digital commerce experiences.

About the Role:

We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) with 2+ years of experience to join our dynamic team in Gurugram, India. As an SRE, you will play a crucial role in designing, implementing, and maintaining the infrastructure and systems that power our organization's technology platforms. Your primary focus will be ensuring the reliability, scalability, and performance of our applications and services.

Key Responsibilities:

  • System Design and Architecture: Collaborate with cross-functional teams to design and implement scalable, reliable, and efficient systems. Participate in system architecture discussions, provide recommendations, and drive improvements to meet business objectives.
  • System Monitoring and Performance: Develop and implement robust monitoring systems to proactively identify and resolve performance bottlenecks, service disruptions, and other issues affecting system reliability. Continuously monitor system performance metrics and optimize resource utilization.
  • Incident Response and Troubleshooting: Respond to and resolve production incidents in a timely manner, utilizing strong troubleshooting skills and collaborating with other teams. Conduct root cause analysis to prevent future incidents and implement corrective actions.
  • Automation and Tooling: Develop automation tools and scripts to streamline deployment, configuration, and monitoring processes. Implement and maintain CI/CD pipelines to ensure efficient and reliable software delivery.
  • Capacity Planning and Scalability: Work closely with development teams to forecast system capacity requirements and plan for scalability. Conduct performance testing and capacity analysis to ensure systems can handle increased loads and peak traffic.
  • Security and Compliance: Implement and maintain security measures and best practices to protect our infrastructure and data. Stay up to date with the latest security vulnerabilities and apply necessary patches and upgrades.
  • Collaboration and Documentation: Foster strong collaboration with cross-functional teams, including developers, operations, and QA. Document system configurations, processes, and procedures to facilitate knowledge sharing and ensure a smooth handover of responsibilities.

Key Requirements:

  • Bachelor's degree in computer science, Engineering, or a related field (or equivalent practical experience).
  • Strong experience in a Site Reliability Engineering role or a similar capacity, managing large-scale, highly available production systems.
  • Proficiency in programming and scripting languages (e.g., Python, Bash, Ruby).
  • Deep understanding of Linux/Unix systems and networking concepts.
  • Experience with cloud platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Docker, Kubernetes).
  • Familiarity with infrastructure-as-code tools (e.g., Terraform, Ansible) and configuration management tools (e.g., Chef, Puppet).
  • Knowledge of monitoring and logging tools (e.g., Prometheus, ELK stack) and incident management systems (e.g., PagerDuty).
  • Strong problem-solving and analytical skills, with the ability to quickly identify and resolve complex technical issues.
  • Excellent communication and collaboration skills, with the ability to work effectively in a team-oriented environment.
  • Previous experience in a product-based company or startup environment is preferred.

Join our team as an SRE in Gurugram and contribute to the stability and performance of our technology infrastructure. Help us ensure an exceptional user experience for our customers while driving continuous improvements in system reliability and scalability.

Company

Driffle

Driffle

Driffle is a leading global digital goods marketplace focused on gaming products. We offer a wide selection of games, gift cards, and DLCs to customers in over 140 countries. Our mission is to make ga...

Gurugram, Gurgaon / Gurugram, India
Posted on Foundit