Cloud Platform Engineer
Qualifications & Requirements
Experience Level: Senior Level
Full Job Description
Join Capgemini as a Cloud Platform Engineer in Hyderabad, Telangana, India, and be at the forefront of cloud-native database infrastructure. In this role, you will architect and deploy managed database instances across multi-cloud environments (RDS, Azure DB, Cloud SQL) and self-managed clusters using Infrastructure as Code tools like Terraform and Ansible. Your responsibilities will include automating configuration management, security hardening, and patching, as well as developing internal tools and scripts in Python/Bash to empower production support teams. You will also build scripts for routine operational tasks such as backups and health checks.
We are looking for expertise in integrating advanced observability platforms (Dynatrace, CloudWatch) with AIOps tools to establish SLOs and train models for anomaly detection and proactive forecasting of database performance issues. You will design, deploy, and govern AI-powered agents (using Azure Copilot / AWS Bedrock) for autonomous self-healing and automated resource management. Implementing advanced monitoring for key database metrics (SLIs/SLOs) like latency, throughput, and error rates, and developing predictive ML models to forecast potential system outages are also key aspects of this role.
You will be instrumental in designing and implementing cross-region/multi-AZ replication, automated failover strategies, and disaster recovery plans. This includes executing backup strategies, performing restores, and conducting DR drills. Collaboration with application operations and production support teams to troubleshoot database and platform layer issues, leading incident response, and conducting root cause analysis (RCA) for database outages and performance degradations will be crucial. You'll leverage AI tools for real-time RCA and automate scaling strategies based on predicted load.
Furthermore, you will contribute to cost management by implementing policies for rightsizing instances and managing storage tiers. Proactive analysis and automation of query performance tuning and database configuration optimization are expected. Implementing robust secrets management solutions and ensuring database environments meet regulatory requirements (PCI, HIPAA, GDPR) through encryption, audit logging, and automated compliance checks are also part of the scope. You will define and enforce least-privilege access policies and manage security and compliance using AI agents.
Requirements:
- 8+ years of experience in Oracle / DB2 / MSSQL/Snowflake/PostgreSQL and MySQL administration, with a strong focus on AIOps integration.
- 5+ years of experience in public cloud operations (AWS, Azure, GCP).
- Deep, demonstrable expertise designing and operationalizing solutions leveraging AWS Bedrock/Agent Frameworks and Azure Copilot for DB Operations.
- Expertise in Infrastructure as Code (Terraform, CloudFormation), Ansible, and CI/CD pipelines, including supervising AI-generated infrastructure artifacts.
- Expertise integrating observability platforms into AI/ML platforms for predictive analysis and anomaly detection.
- Advanced (7+ Years) Hands-On experience on Informatica PowerCenter / PowerBI /Cognos /Sapiens /Alteryx/IDMC/ILM/SAS / BusinessObjects / Glue / SPSS /ODI is a plus.
- Advanced (7+ Years) Proficiency in scripting languages (Python, Bash).
Benefits:
Capgemini offers a competitive compensation and benefits package including a competitive salary with performance-based bonuses, comprehensive benefits, career development and training opportunities, flexible work arrangements, and a dynamic, inclusive work culture. Benefits may vary by employee level. These include Private Health Insurance, Pension Plan, Paid Time Off, and Training & Development.
Company
Capgemini
Capgemini is a premier global leader in business and technology transformation, powered by AI. We help organizations envision and realize their future through innovative AI, technology, and human expe...