
Site Reliability Engineer
Full Job Description
Flexera is seeking an enthusiastic Site Reliability Engineer (SRE) passionate about a career in DevOps. As a fast-growing, category-leading organization with ambitious goals and a positive, inclusive culture, we are looking for passionate professionals eager to grow their talents and achieve great things.
In this role, you will contribute to product design, diagnose issues, and develop automated scripts to resolve problems in our production systems. You will be driven to build fault-tolerant, scalable systems and automate operational toil, aligning with the DevOps movement's principles to enhance collaboration between development and operations.
We are looking for a candidate with extensive experience in SaaS/Cloud products utilizing a microservices architecture.
Responsibilities:
- Automate repetitive operational tasks to eliminate toil.
- Establish and enhance CI/CD pipelines.
- Create dashboards using Grafana/Prometheus to visualize key metrics for product services.
- Collaborate effectively with other teams.
- Investigate, debug, and resolve customer issues.
- Ensure the security and reliability of shared infrastructure within the Flexera cloud.
- Prioritize reliability as a fundamental aspect of our systems.
- Design, develop, and deploy new features for Flexera products and platforms based on SRE organizational goals.
- Engage with product owners and product engineering teams as needed.
- Participate in an on-call rotation to address alerts requiring engineering expertise.
- Conduct root cause analysis for incidents and design solutions to prevent recurrence.
Minimum Qualifications:
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- 2+ years of exposure or hands-on experience with AWS or other cloud services (internships, training, or work experience).
Critical Skills / Competencies:
Required:
- Knowledge of Agile software delivery methodologies.
- Experience managing cloud-based services like AWS or Azure at scale.
- Experience with DevOps practices.
- Familiarity with Docker containers, Kubernetes, EKS, and ECS.
- Experience with Terraform and CloudFormation.
- Proficiency in Linux and its commands.
- Solid understanding of networking fundamentals.
- Proficiency with GitHub for collaboration and change management.
- Knowledge of AWS services including EC2, ECS, EKS, and S3.
- Experience with databases, preferably MySQL, Amazon RDS, and MongoDB.
Good to have:
- Understanding of RESTful APIs and web-based application concepts.
- Experience with a scripting language (Ruby, Java, Python, Perl, etc.).
- Knowledge of Go Lang.
- Knowledge of Helm.
#LI-PS1 #LI-Development #LI-Remote
Flexera is committed to fostering a diverse, equitable, and inclusive workforce. We encourage candidates requiring accommodations to please let us know by emailing.
Company
Flexera
Flexera is a pioneer in Hybrid ITAM and FinOps, offering award-winning, data-oriented SaaS solutions for technology value optimization (TVO). We empower IT, finance, procurement, and cloud teams to ac...