StratusGrid • 4d ago
LinkedIn
Senior Cloud Optimization Engineer
United States
Senior Level
About the Role
StratusGrid delivers a high-trust, high-competence customer experience focused on cloud optimization. In this role, you will productize our expertise into Stratusphere™ by manually assessing, designing, and executing cloud optimization work in real-world environments today, while simultaneously validating and improving the agent-driven workflows that will automate these tasks tomorrow.
Key Responsibilities
- Customer Outcomes & Optimization Delivery: Manually assess customer AWS/Azure (and GCP) environments to identify optimization opportunities. Quantify impact, propose solutions, execute approved changes safely, and deliver measurable savings.
- Stratusphere Output Review & Calibration: Validate cost-savings recommendations generated by our AI platform against real-world constraints. Provide structured feedback to improve agent accuracy, reliability, and safety over time.
- Build Trust & Navigate Stakeholders: Act as a solutions-driven technical partner who builds strong relationships, navigates organizational dynamics confidently, and translates complex technical choices into win-win business outcomes for both customers and the company.
- Big-Picture Decision Support: Connect daily optimization work to broader customer goals. Explain implications regarding cost, risk, reliability, performance, and operational overhead to guide stakeholders toward informed decisions.
- Agent Improvement Feedback Loop: Capture patterns in agent errors (e.g., missing context, risky sequencing). Propose rubric changes and training examples to measurably improve product quality.
Requirements
- Cloud Platform Expertise: Proven ability to operate production AWS/Azure environments (multi-account/subscription), plus familiarity with GCP. Deep knowledge of compute, storage, networking, IAM/RBAC, and identity guardrails.
- Hands-On Execution: Experience implementing real savings via rightsizing, autoscaling, lifecycle management, commitment utilization strategies, and IaC changes (Terraform preferred) with rigorous rollback plans.
- Automation Skills: Proficiency in Python/TypeScript/Go for scripting CLI operations, collecting metrics, and operationalizing remediation at scale.
- Observability & Architecture: Ability to use CloudWatch/Azure Monitor to validate performance impact post-change. Strong understanding of VPC/VNet constructs and how architecture affects cost/security.
- Communication & Ownership: Exceptional ability to translate technical topics for business stakeholders, navigate ambiguity, drive work to resolution without offloading effort, and maintain high standards of reliability.
Company
StratusGrid
StratusGrid is pioneering a multi-agent solution designed to solve complex cloud infrastructure challenges, delivering measurable outcomes in cost effectiveness, security, velocity, and operational re...
Posted on LinkedIn