
Site Reliability Engineer
Experience Level: Mid Level
Full Job Description
Site Reliability Engineer - Associate - Infrastructure Production Management & Reliability Engineering
Morgan Stanley is seeking a Site Reliability Engineer (Associate) to join our Infrastructure Production Management & Reliability Engineering team in Bengaluru, India.
In this role you will provide L3 support during large-scale outages, conduct post-mortems and pre-mortems, and engage in problem management using data-driven strategies and a code-first approach. You will prepare and execute change management activities, automate processes, and create tools as needed. You will also collaborate with partner enterprise technology teams and support L2 operations and stakeholders. You will ensure system scalability, monitor application and infrastructure performance using Service Level Objectives (SLOs) and Service Level Indicators (SLIs), and participate in on-call rotations (weekday and weekend).
The ideal candidate will have at least 2 years of relevant experience and a working knowledge of SAML, OIDC, OAuth, and Identity and Access Management (IAM). Proficiency in Linux system administration and in at least one scripting language (PowerShell, Python, Bash/Shell) is required, as is familiarity with the Software Development Life Cycle (SDLC) and development tooling (GitHub, Jenkins, Visual Studio Code). A strong interest in automation, zero-downtime deployments, and using code to resolve operational issues is essential. An understanding of general enterprise infrastructure concepts (network, storage, web infrastructure, middleware) and enterprise security standards is necessary.
Experience working within large enterprise architectures and foundational knowledge of authentication protocols (OpenID Connect, SAML, Kerberos, RADIUS) and multifactor authentication solutions (RSA SecurID, Cisco Duo Security, FIDO) are a plus. Familiarity with visualization and incident management tools such as Splunk, Grafana, ServiceNow, Jira, Bitbucket, PagerDuty, and Power BI is beneficial. Experience with SaaS onboarding on Azure and troubleshooting Azure authentication issues is also desired.
Excellent written and oral English communication skills are required for documentation, presentations, and interaction with colleagues and customers. We are looking for an independent, highly motivated, and self-directed problem-solver who is comfortable working in an operations and support team with end-user interaction and periodic on-call responsibilities. Advocacy of SRE principles and good organizational skills are highly valued.
Morgan Stanley is committed to diversity and inclusion, providing a supportive and inclusive environment where individuals can maximize their potential. Our values guide our daily decisions to do what's best for our clients, communities, and over 80,000 employees across 1,200 offices in 42 countries. We offer attractive and comprehensive employee benefits and perks, with ample opportunity for career growth for those demonstrating passion and grit.