Lotusflare
Lotusflare5h ago
Foundit

DevOps Site Reliability Engineer

Pune, India
Full Time
Mid Level

Auto Apply to 50+ AI Matched DevOps Site Reliability Engineer Jobs

Use Auto Apply Agents to Bulk Apply jobs with ATS Optimised Resumes, find verified Insider Connections for jobs at Lotusflare

Full Job Description

DevOps Site Reliability Engineer - Pune, India

LotusFlare is seeking a highly motivated DevOps Site Reliability Engineer to join our team in Pune, India. This role is crucial in guaranteeing the quality and reliability of LotusFlare's software solutions through proactive monitoring, swift incident response, and rigorous testing.

As a DevOps Support Engineer, you will share accountability and code ownership, participating in on-call rotations and incident management. Your efforts will directly contribute to writing code that seamlessly integrates into our applications and infrastructure, enhancing the reliability of services deployed to millions of users worldwide. You will be instrumental in testing code functionality, identifying flaws, and optimizing underperforming features to ensure customer satisfaction.

You will collaborate with experienced engineers on cutting-edge projects, working on systems that are in production and utilized by a global user base. While relevant industry experience (SRE, Systems Engineer, Software Engineer, DevOps Engineer, Network Engineer, Systems Administrator, Linux Administrator, Database Administrator, or similar) is valued, demonstrated abilities and a strong attitude are paramount. This is an opportunity to learn from top engineers and contribute to innovative solutions.

Key Responsibilities:

  • Monitor backend services and cloud-based infrastructure.
  • Provide support, troubleshoot, and investigate issues and incidents, assisting developers and the infrastructure team with system metrics analysis, logs, traffic, configurations, and deployment changes.
  • Support and enhance monitoring and alerting systems by searching, testing, and deploying new functionalities for existing tools.
  • Develop new features to automate troubleshooting and investigation processes.
  • Create new tools to improve the support process.
  • Draft reports and summarize findings after investigations and incidents.

Required Qualifications:

  • Minimum 1 year of work experience in similar responsibilities.
  • Strong knowledge and practical experience with Linux (Ubuntu) command-line/administration.
  • Understanding of network protocols and troubleshooting (TCP/IP, UDP).
  • Proficient scripting skills in Bash and Python.
  • Excellent critical thinking and problem-solving abilities.
  • Understanding of containerization technologies (Docker).
  • Experience troubleshooting API-driven services.
  • Experience with Kubernetes.
  • Experience with Git.
  • Familiarity with release management processes.
  • Professional written and verbal English communication skills.

Desirable Qualifications:

  • Experience with Prometheus, Grafana, Kibana (including query language).
  • Experience with Nginx/OpenResty.
  • Experience with telco protocols (Camel, Map, Diameter) is advantageous.
  • Software development/scripting skills.
  • Basic knowledge of Cassandra and PostgreSQL.
  • Experience with AWS cloud services (EC2, Redshift, S3, RDS, ELB/ALB, ElastiCache, Direct Connect, Route 53, Elastic IPs, etc.).
  • Experience with CI/CD tools like Jenkins.
  • Experience with Terraform.

Benefits:

  • Competitive salary package.
  • Paid lunch (in the office).
  • Private healthcare.
  • Yearly bonus.
  • Training and workshops.
  • Truly flexible working hours.
  • Opportunity to learn from and work with top-tier engineers.

Company

Lotusflare

Lotusflare

About LotusFlareLotusFlare is a Silicon Valley-based provider of cloud-native SaaS products dedicated to making affordable mobile communications accessible globally. Founded by a team instrumental in ...

Pune, India
Posted on Foundit