
Senior Infrastructure Site Reliabil...
Experience Level: Senior Level
Full Job Description
We are seeking a Senior Site Reliability Engineer with a focus on Windows environments to join our team in Hyderabad, India. This role is crucial for the continuous improvement and support of our Windows server infrastructure, encompassing versions from Windows Server 2012 through Windows Server 2022.
Key responsibilities include managing the entire lifecycle of the server environment, from deployment to retirement. You will be instrumental in ensuring swift resolution of incidents and problems, adhering to Service Level Agreements (SLAs) and company policies, and maintaining the environment to high standards and industry best practices. A strong understanding of VMware virtualization and HPE hardware products is essential. Experience with automation tools such as Ansible or BladeLogic is also required.
As a Senior SRE, you will serve as a primary point of contact for technical issues, providing mentorship to the team and delivering on-site support for changes and incidents. The goal is to manage the server estate effectively, ensuring its supportability while driving continuous improvement and enhancing performance and security.
Responsibilities include:
- Developing and executing automation plans.
- Managing incident resolution to isolate problems and develop remediation strategies within the server environment.
- Supporting post-incident and problem management processes to ensure root causes are identified and remediated.
- Implementing permanent resolutions for incidents and problems across the entire environment.
- Providing expert technical input and leadership during complex system problem resolution.
- Mentoring third-party engineers and ensuring team performance.
- Maintaining the server environment to established standards.
- Identifying and remediating gaps in monitoring capabilities.
- Developing and implementing new support services, performance tuning recommendations, and workload adjustments to meet user requirements and ensure resource availability.
- Identifying opportunities for process improvements to reduce technical costs and enhance service delivery.
- Ensuring client queries are handled reliably and efficiently by the team, meeting client expectations.
- Demonstrating a clear understanding of cross-platform infrastructure and its interdependencies.
- Performing administrative and business-as-usual (BAU) tasks for technical and business teams.
- Remediating issues with HPE hardware, software, and firmware, and making system management software changes to improve performance and resolve problems.
- Managing patching and vulnerability remediation for the server environment.
- Reviewing, maintaining, and testing upgrades to vendor software used for server support.
- Adhering to ITIL standards and processes.
- Participating in an out-of-hours on-call rotation and working outside normal business hours as required.
Core Technical Skills:
- Proficiency with Windows (all versions).
- Experience in large shared environments including Server, Converged infrastructure, Network, and VMware.
- Expertise in VMware virtualization and vRealize Suite v*.x.
- Experience with scripting and automation technologies such as Bash, Python, Perl, Ansible, PowerShell, or VBScript.
- Familiarity with HPE Server Hardware (ProLiant range, Synergy/OneView).
Desirable Technical Skills:
- Experience with Rapid7 vulnerability management tools.
- Knowledge of McAfee security products.
- Understanding of SAN Storage and Switches (HDS, Brocade).
- Exposure to Cloud Technologies (AWS, Azure, Oracle, Google).
- Experience with monitoring tools like Dynatrace.
Skills:
- Knowledge of Agile methodologies.
- Familiarity with ITIL processes and experience using ticket tracking software, particularly ServiceNow.
- Advanced technical skills across relevant domains.
- Excellent English verbal and written communication skills.
- Vendor and management interaction skills.
- Ability to implement technology and cost/process improvements.
- A proactive approach to identifying and implementing improvements.
- Capacity for innovation.
- Ability to work independently with minimal supervision.
- Flexibility in working hours, including on-call and out-of-hours support.
Qualifications:
- A minimum of 5 years of experience working in a large Windows environment.
Experian fosters a culture that celebrates individuality and diversity. Our people-centric approach is recognized globally, with accolades such as World's Best Workplaces™ 2024 (Fortune Top 25), Great Place To Work™ in 24 countries, and Glassdoor Best Places to Work 2024. We prioritize DEI, work/life balance, development, authenticity, collaboration, and wellness. Explore Experian Life on social media or our Careers Site to learn more.
Experian is an Equal Opportunity and Affirmative Action employer committed to diversity and inclusion. Our success is driven by our innovative and diverse workforce. We encourage everyone to bring their authentic selves to work, regardless of gender, ethnicity, religion, color, sexuality, physical ability, or age. If you require accommodation due to a disability or special need, please inform us at your earliest convenience.
Company
Experian
Experian is a global data and technology company that empowers opportunities for individuals and businesses worldwide. We specialize in redefining lending, preventing fraud, simplifying healthcare, de...