
DevOps Engineer
Full Job Description
We are seeking a DevOps Engineer to join our Fulfillment Optimization support team in Hyderabad/Secunderabad, Telangana, India. In this role, you will be the primary technical expert responsible for keeping our mission-critical, high-volume systems running continuously and smoothly. You will investigate operational issues across many distributed systems, identify root causes, drive resolutions, and build automation and tooling that prevents problems from recurring. This is a hands-on role that requires proactive engagement, not just passive monitoring. You will develop deep subject matter expertise in key components of our fulfillment stack, lead incident response, write runbooks, improve operational processes, and mentor junior engineers. You will also collaborate with software development teams to improve system supportability, availability, and performance. The team operates on a 12x7 on-call rotation.
What You Will Do
- Troubleshoot and resolve technical issues across distributed systems, often with minimal or no existing documentation.
- Act as the technical point of contact within your area of expertise for your team and collaborating engineering groups.
- Lead incident response efforts, including driving root cause analysis, authoring post-incident reviews, and implementing preventive mechanisms.
- Identify operational trends and potential problems before they impact customers, defining and implementing proactive monitoring and alerting strategies.
- Build automation, tooling, and scripts to significantly improve operational efficiency and reduce manual workload.
- Author, maintain, and review technical documentation, such as runbooks, standard operating procedures (SOPs), and troubleshooting guides.
- Mentor junior team members, assisting with their onboarding and contributing to hiring activities.
- Lead internal team projects, ensuring delivery against defined goals and timelines.
- Collaborate across teams on tactical and strategic initiatives aimed at enhancing system reliability and customer experience.
- Use data analysis to identify the need for new support mechanisms, processes, and tools, and drive their development.
Key Job Responsibilities
- Independently maintain and operate products and systems within your team's scope, including performing change management activities.
- Participate in the 12x7 on-call rotation, managing incidents through to resolution or appropriate escalation.
- Contribute to Correction of Error (COE) reviews and support retrospectives.
- Influence issue prioritization, best practices, and operational standards within the team.
Basic Qualifications
- A minimum of 2 years of software development experience or 2 years of technical support experience.
- Proficiency in scripting with modern programming languages.
- Demonstrated experience in troubleshooting and debugging technical systems.
Preferred Qualifications
- Knowledge of web services, distributed systems, and web application development.
- Experience working with REST web services, XML, and JSON.
- Familiarity with AWS Services such as EC2, Lambda, S3, DynamoDB, and SQS.
- Experience with infrastructure as code tools like CloudFormation, Chef, Puppet, Salt, or Ansible in production environments.
Amazon is committed to fostering an inclusive culture that empowers our employees to deliver exceptional results for our customers. If you require workplace accommodations due to a disability during the application or hiring process, including support for interviews or onboarding, please visit https://amazon.jobs/content/en/how-we-hire/accommodations. If your country/region is not listed, please contact your Recruiting Partner.
Company
Amazon Thunder
Amazon's Supply Chain Optimization Technologies (SCOT) organization is at the forefront of developing sophisticated systems that manage the end-to-end process of delivering millions of packages to cus...