
Cloud Site Reliability Engineer
Experience Level: Mid Level
Full Job Description
Join NICE Public Safety in Pune, India, as a Cloud Site Reliability Engineer and play a crucial role in maintaining and enhancing our state-of-the-art Software as a Service (SaaS) platforms for the Public Safety & Justice market. This hands-on role is essential for ensuring our cloud platforms are observable, measurable, reliable, scalable, and maintainable. You will be part of a dedicated SRE team acting as the guardians of production, driving reliability improvements, and leading investigations into critical incidents, performance bottlenecks, and cost optimizations. Your contributions will involve automating low-value tasks, providing technical leadership to Cloud Operations and Support teams, and developing robust monitoring solutions using tools like Grafana and Azure Monitor. You will also be responsible for installing and configuring observability platforms, including Prometheus and OpenTelemetry, and for developing Bicep modules for infrastructure deployment.
We are seeking candidates with a strong foundation in DevOps, SRE, or Cloud Engineering and at least 2 years of experience in Site Reliability Engineering. Essential skills include excellent technical, analytical, and troubleshooting capabilities, in-depth knowledge of databases (MS-SQL, Elasticsearch) and data formats (YAML, JSON, XML), and proficiency in programming or advanced scripting (C#, PowerShell). Experience with Infrastructure as Code (ARM, Bicep), version control (Git), and managing monitoring platforms (Azure Monitor, Prometheus, Grafana, Elasticsearch) is vital. You should have demonstrable experience supporting live cloud services, production experience with Kubernetes and containerization, and a solid understanding of Service Level Objectives (SLOs). While Azure experience is preferred, exposure to other commercial cloud providers is also valued. Familiarity with Azure DevOps pipelines (CI/CD) and test frameworks (NUnit, Jasmine, Selenium) is a plus. Exceptional communication skills, including active listening and effective questioning, are required, along with methodical troubleshooting and strong time management.
This is a permanent, full-time individual contributor role reporting to a Manager. Successful candidates must be flexible with working hours, including occasional on-call duties. NICE offers a vibrant, collaborative, and creative work environment with ample opportunities for learning and career growth through our NiCE-FLEX hybrid model, allowing for 3 days of remote work and 2 days in the office each week.
Company
NICE
NICE Ltd. is a global market leader in AI, cloud, and digital solutions, empowering over 25,000 businesses, including 85 of the Fortune 100, to deliver exceptional customer experiences, combat financia...