
Senior Site Reliability Engineer
Responsibilities
Qualifications & Requirements
Experience Level: Senior Level
Full Job Description
As a Senior Site Reliability Engineer based in Bengaluru/Bangalore, India, you will play a crucial role in supporting existing customer deployments on managed or private cloud environments and initiating new deployments on major cloud platforms like Azure, AWS, and GCP. Your primary responsibilities include ensuring the operational excellence, scalability, and security of WSO2 cloud services, alongside driving automation initiatives to enhance efficiency and reliability. This permanent role requires a strong technical background and a proactive approach to problem-solving.
Key Responsibilities:
Deployment Strategy and Implementation: Design and implement robust cloud deployments on Azure, AWS, GCP, and Kubernetes, aligning with specific stakeholder needs. Optimize cloud architectures for scalability and cost-efficiency, ensuring adherence to best practices in networking, security, and access control. Develop deep expertise in cloud infrastructure to build resilient solutions. Drive continuous improvements and cost-effective strategies for infrastructure adaptability and streamlined deployment.
Automation and CI/CD Excellence: Develop and manage automation scripts and Infrastructure as Code (IaC) using tools such as Terraform, Ansible, or CloudFormation. Implement and maintain CI/CD pipelines for efficient software delivery, testing, and deployment, with a focus on version control and configuration management.
Managed Cloud Operations: Ensure service availability through the configuration of comprehensive monitoring and alerting systems. Respond promptly to critical alerts and provide ongoing support and maintenance for existing deployments, monitoring performance, and resolving issues to maintain high availability. Implement performance optimization strategies and conduct thorough root cause analyses (RCAs) to prevent future incidents. Demonstrate strong ownership and deliver timely resolutions during critical incident scenarios.
Security and Monitoring: Establish robust monitoring and alerting systems for customer deployments, defining clear incident response thresholds. Conduct regular security assessments and stay updated on emerging threats to enhance cloud environment security.
Collaboration and Knowledge Dissemination: Foster collaborative relationships with product development, operations, and QA teams to improve workflows and product quality. Share expertise and best practices through documentation, training, and mentorship to elevate team capabilities.
Required Skills and Qualifications:
A Bachelor's degree in Computer Science, Engineering, or a related discipline, or equivalent practical experience. Minimum of 2 years of hands-on experience as a Site Reliability Engineer managing and optimizing large-scale production systems. Demonstrated strong collaboration and leadership abilities, with a track record of leading teams and driving cross-functional initiatives. Expertise in major cloud platforms including Azure, AWS, and GCP. Profound knowledge of Linux, virtualization, and containerization technologies such as Docker and Kubernetes. A solid understanding of networking, security principles, and compliance frameworks. Proficiency in IaC tools (Terraform, CloudFormation), configuration management (Puppet, Chef, Helm), and scripting languages (Python, Bash, PowerShell). Experience with CI/CD tools (Github Actions, Jenkins) and monitoring/logging solutions (Prometheus, ELK stack, Splunk). Excellent problem-solving, analytical, and troubleshooting skills, coupled with a customer-centric and proactive approach. Strong communication skills essential for effective teamwork.
Why Join WSO2?
WSO2 fosters a culture that values hard work and flexibility, offering a sensible vacation/leave plan. We provide comprehensive health insurance for you and your family, a competitive compensation package, and significant opportunities for professional growth and development.
Diversity and Inclusion:
At WSO2, diversity and inclusion are central to our business. We cultivate an environment that respects and values individual strengths, perspectives, and ideas, driving innovation and superior customer experiences. We are committed to fostering a diverse team regardless of race, ethnicity, religion, gender, age, national origin, disability, sexual orientation, or veteran or marital status, and we strictly prohibit any form of discrimination.
Company
WSO2
WSO2, founded in 2005, is a leading independent software vendor offering open-source API management, integration, and identity and access management (IAM) solutions. Serving thousands of enterprises g...