Sirma Business Consulting India Private Limited

Backend Engineer – Generative AI (Infra)

Sirma Business Consulting India Private Limited

Bengaluru, Karnataka, India
entry_level
Glassdoor
full_time
4000-7000
Featured Job
Added 10/14/2025

Job Description

Backend Engineer – Generative AI (Infra)

We are seeking a talented Backend Engineer with a focus on AI-driven systems, particularly in developing and maintaining scalable infrastructures for Generative AI applications. This role requires a solid background in backend engineering, API design, and AI model integration, with experience in LLMs, diffusion models, and multimodal AI systems.

As a Backend Engineer, you will be responsible for building and managing backend systems that support AI services, ranging from model APIs to data retrieval pipelines, ensuring they are optimized for performance and security in production environments. Experience with enterprise-level systems, particularly in banking or financial services, is a significant advantage.

Key Responsibilities

Backend Architecture & Development

  • Design and develop scalable backend services to support AI and Generative AI workloads.
  • Build and optimize RESTful and GraphQL APIs for model inference, prompt processing, and data access.
  • Create modular backend components for high-performance, scalable AI operations.

Generative AI Pipeline Integration

  • Collaborate with AI engineers to integrate and deploy Generative AI models (LLMs, diffusion models, text-to-image, speech-to-text) in production.
  • Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, Weaviate, FAISS, or Milvus.
  • Optimize AI model serving and inference with frameworks like TensorFlow Serving, TorchServe, FastAPI, or custom endpoints.

API Management & Optimization

  • Develop secure APIs for AI-driven services, ensuring proper version control, authentication (OAuth2, JWT), and scalability.
  • Integrate third-party AI APIs (OpenAI, Anthropic, Google Gemini, Stability AI) into internal systems.
  • Monitor API usage, optimize for latency, and ensure high-throughput AI operations.

System & Environment Setup

  • Set up backend environments using Docker, Kubernetes, and CI/CD pipelines for automated deployments.
  • Manage scalable cloud environments (AWS, Azure, GCP) with load balancing and fault tolerance.
  • Support development environments and backend utilities for cross-functional AI teams.

Performance & Security

  • Optimize server performance for AI workloads through caching, queuing, and load balancing techniques.
  • Ensure compliance with data privacy and security regulations, especially in financial domains.
  • Implement observability tools like Prometheus, Grafana, and ELK Stack for performance monitoring.

Collaboration

  • Collaborate with AI/ML engineers, data scientists, frontend developers, and DevOps teams.
  • Contribute to architecture discussions, backend documentation, and process improvements.
  • Participate in agile ceremonies, code reviews, and design sessions.

Required Technical Skills

Languages:

  • Python (primary), Node.js, Java

Backend Frameworks:

  • FastAPI, Flask, Django, Express.js, Spring Boot

AI & Generative Tools:

  • Familiarity with LLMs (e.g., GPT, Claude, Gemini, LLaMA, Mistral)
  • Experience with diffusion models (e.g., Stable Diffusion, Midjourney APIs)
  • Knowledge of transformer models via Hugging Face or OpenAI APIs
  • Understanding of prompt engineering and model-serving best practices

Databases:

  • PostgreSQL, MongoDB, Redis, Elasticsearch

Vector Databases:

  • Pinecone, Weaviate, Milvus, FAISS (for RAG systems)

Infrastructure & DevOps:

  • Docker, Kubernetes, Jenkins, GitLab CI/CD, AWS Lambda, ECS, or GCP Cloud Run

Monitoring:

  • Prometheus, Grafana, ELK Stack, New Relic

API Technologies:

  • REST, GraphQL, gRPC

Preferred Qualifications

  • Experience with AI product backend systems or enterprise software, especially in banking.
  • Familiarity with AI pipeline orchestration tools (LangChain, LlamaIndex, Haystack).
  • Understanding of model lifecycle management, vector embeddings, and inference scaling.
  • Familiarity with data encryption, tokenization, and compliance standards (PCI DSS, ISO 27001).

Soft Skills

  • Strong problem-solving and debugging abilities.
  • Excellent communication skills for collaborating with cross-functional teams.
  • High attention to detail and performance optimization.
  • Curiosity and enthusiasm for emerging AI technologies and backend architecture.

Education

  • Bachelor's or Master's degree in Computer Science, IT, AI, or a related field.
  • Certifications in AI/ML, Cloud Architecture, or Backend Development are a plus.

Why Join Us

  • Work at the forefront of backend engineering and Generative AI innovation.
  • Build production-ready systems for large-scale, intelligent applications.
  • Collaborate with a team pushing the boundaries of AI applications and infrastructure optimization.

Work Schedule

Working Days: Monday to Saturday (Alternate Saturdays leave)

Timings: 9:00 AM to 6:00 PM

How to Apply

Ready to take the next step in your career? Click the "Apply Now" button to be redirected to the official application portal.

Company Info

Sirma Business Consulting India Private Limited logo

Sirma Business Consulting India Private Limited

Posted on Glassdoor

Bengaluru, Karnataka, India

Job Details

Date Added:10/14/2025
Job Type:
full_time
Platform:
Glassdoor
Backend Engineer – Generative AI (Infra) at Sirma Business Consulting India Private Limited | Bengaluru, Karnataka, India | Apply Now | MindMyJob | MindMyJob - AI Job Search Platform