
Cloudangle It Solutions•2h ago
Naukri
CA0294
Noida
Entry Level
Site Reliability Engineer I - Noida
Cloudangle It Solutions is seeking a motivated Site Reliability Engineer I to join our team in Noida. This role is crucial for ensuring the optimal performance, reliability, and efficiency of our systems and applications. You will work closely with development teams and experienced SREs to maintain and improve our service infrastructure.
Key Responsibilities
- Analyze system and application metrics to drive performance optimization and fault diagnosis.
- Develop and implement observability strategies using tools like New Relic to proactively identify and resolve issues before they impact users.
- Learn and apply the principles of sustainable service and system operation, with a focus on reliability, efficiency, automation, debugging, and an understanding of the underlying technologies.
- Execute assigned tasks and solve problems effectively within a team environment, guided by management and senior SREs.
- Collaborate with team members to prioritize high-impact tasks that deliver quality results to customers and stakeholders.
- Actively participate in core team processes including planning, on-call rotations, incident triage, and metrics reviews.
- Partner with development teams to enhance service quality and contribute to platform management and capacity planning.
- Perform other duties as assigned to support team objectives.
Qualifications
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
- A minimum of 1 year of experience in a Site Reliability Engineering (SRE), DevOps, or infrastructure engineering role.
Required Skills and Abilities
- Experience with cloud platforms, particularly AWS, including container orchestration (Kubernetes, EKS), infrastructure management, and monitoring best practices.
- Solid understanding of reliability and observability concepts such as Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets.
- Proficiency in at least one programming language: Python, Go, or Java.
- Hands-on experience with monitoring tools like New Relic, Datadog, or Prometheus.
- A strong problem-solving aptitude with a dedication to root cause analysis and continuous improvement.
- Excellent communication and collaboration skills, with the ability to work effectively in cross-functional teams.
- Demonstrated knowledge of CI/CD tools and scripting; experience with GitLab or ArgoCD is a plus.
- Familiarity with incident management frameworks like ITIL is desirable.
- AWS certifications are a bonus.
- Prior experience in regulated industries such as healthcare is preferred.