
Operations Engineer
Qualifications
Experience Level: Senior Level
- </b><li>Systems Operations and Engineering RHEL and SUSE and OLE and/or Wimdows/Hperv and/or AIX/HPUX/Soalris Hyper Converged Infrastructure and Private Cloud Vmware/ Hyper V/ KVM//Pacemaker / AHV /OpenStack / Scale Computing / Nvidia Omniverse Infrastructure as a Code & Scripting Terraform or Ansible - Shell Scripting and Python
- Pwer Shell
- Power CLi Public Cloud IaaS and Associated PaaS AWS / Azure/ GCP - S3
- LBs Cloud Watch
- Stack Driver
- Azure MonCloud Native and Containers PODMAN
- Docker Openshift and AKS and GKE and EKE Database Systems (RDBMS
- No SQl and Cloud DB) Orcle and MS SQL or MySQL and Mongo or Couch - Cloud DB (Redis or Aurora
- Sql online Midleware (WebServers
- Message Qs
Full Job Description
About the Role
We are seeking a visionary Operations Engineer to join our team. This role focuses on managing and supporting production systems and services, ensuring alignment with operational requirements and service level agreements. You will be instrumental in modernizing technology landscapes, driving innovation, and resolving complex technical challenges for our clients.
The ideal candidate will possess expertise in cloud infrastructure, specifically working with cognitive systems on-premise, private cloud environments, and public cloud stacks. This position involves cross-technology migrations, developing modernization solutions, and managing technology crises. A key aspect of this role is to enhance client technology environments through innovation.
Roles and Responsibilities:
- Design, develop, and maintain end-to-end cognitive Hyperconverged Infrastructure (HCI) and private cloud solutions, integrating intelligence into traditional web stacks.
- Develop and manage full-stack infrastructure applications, including backend services (APIs, microservices) and API gateways for frontend and backend services.
- Understand the impact of GPU-based computing and possess experience deploying High-Performance Computing (HPC) environments.
- Obtain and apply knowledge of AWS Outposts, Azure Stack, and Google Cloud VPC for certified implementation.
- Deploy, design, and maintain Tanzu and Red Hat OpenShift clusters on private cloud environments.
- Develop cloud-native backend services using Node.js, Python (FastAPI, Flask), or Java to connect AI models with application logic.
- Integrate AI/ML models (TensorFlow, PyTorch, scikit-learn) into production-ready APIs and microservices.
- Write efficient, maintainable code and manage the integration between front-end interfaces and backend infrastructure services.
- Collaborate with product, design, ML, and DevOps teams to build intelligent workflows and user experiences.
- Implement Infrastructure as Code (IaC) using tools like Terraform, CloudFormation, Azure DevOps, or Pulumi.
- Deploy and manage Platform-as-a-Service (PaaS) offerings.
- Design, implement, and maintain database solutions, including relational databases (e.g., MySQL, PostgreSQL, SQL Server) and NoSQL databases (e.g., MongoDB, DynamoDB).
- Collaborate with DevOps, security, and development teams to ensure seamless integration and delivery.
- Ensure platform observability through metrics, logging, and monitoring frameworks (e.g., Prometheus, ELK, CloudWatch).
- Manage containerization and orchestration using Docker and Kubernetes.
- Ensure compliance with security best practices and organizational policies.
- Continuously evaluate and implement new cloud technologies and tools to improve efficiency.
- Provide technical guidance and support to team members and stakeholders.
- Integrate and support AI-driven tools and frameworks, including Generative AI and Agentic AI technologies, within cloud infrastructure and applications.
Professional and Technical Skills:
- Systems Operations and Engineering: RHEL, SUSE, OLE, Windows/Hyper-V, AIX/HP-UX/Solaris, Hyperconverged Infrastructure (HCI), Private Cloud (VMware, Hyper-V, KVM, Pacemaker, AHV, OpenStack, Scale Computing, Nvidia Omniverse).
- Infrastructure as Code & Scripting: Terraform, Ansible, Shell Scripting, Python, PowerShell, PowerCLI.
- Public Cloud IaaS and Associated PaaS: AWS, Azure, GCP (S3, Blobs, VPCs, vNet, LBs, CloudWatch, Stack Driver, Azure Monitor).
- Cloud Native and Containers: Podman, Docker, OpenShift, AKS, GKE, EKS.
- Database Systems: RDBMS, NoSQL, Cloud DB (Oracle, MS SQL, MySQL, Mongo, Couchbase, Redis, Aurora, SQL Online).
- Middleware: Web Servers (IIS, Apache, JBoss, WebSphere/WebLogic), Message Queues (MQ Series), Managed File Transfer (MFTs), Job Schedulers (Control-M, Autosys, TWS).
- Observability & Environment Health: Observability tools (Nagios, SolarWinds, Netcool, Prometheus, ELK), Environment Health and Capacity Management, Tech Debt, FinOps.
- Enterprise AI: Agentic AI Frameworks (CrewAI, LangGraph, AutoGen), Responsible AI Concepts, AI Guardrails.
Additional Information:
This position requires 15 years of full-time education. While the role is advertised in Noida, this specific opening is based at our Bengaluru office.
Certifications (Highly Desired):
- VMware Certified Professional - VMware Cloud (VCP-VMC) 2022
- Red Hat Certified Engineer (RHCE)
- Nvidia Certified Engineer / Nvidia Certified Associate
- Microsoft Certified: Azure Solutions Architect Expert
- Google Professional Cloud Architect
- Certified Kubernetes Administrator (CKA)
- HashiCorp Certified: Terraform Associate
- Certified DevOps Engineer (AWS, Azure, or Google)