Forward Deployed ML Engineer
Experience Level: Mid Level
Full Job Description
AION is seeking a Forward Deployed ML Engineer to join our team in Bengaluru, Karnataka, India. In this role, you will act as a hands-on AI engineer, much like the CTO of an AI startup, drawing on 3-5+ years of experience building production-grade multimodal AI systems and LLM applications. You will work within small, agile teams to deliver critical customer projects, embedding directly at client sites. Your responsibilities will include architecting, building, and deploying intelligent agent solutions, translating ambiguous business requirements into impactful technical solutions, and managing the full AI deployment lifecycle from use case discovery to production optimization. You should be comfortable writing production code, presenting to C-level executives, and debugging complex AI systems in real-world environments. Experience with voice agents, video processing systems, conversational AI, RAG systems, and LLM orchestration frameworks is highly desirable. Exceptional communication, customer empathy, and a drive to build transformative AI solutions are essential.
Customer Engagement & Multimodal Agent Development
- Engage directly with customers at their sites to conduct discovery workshops and technical assessments, identifying high-impact AI opportunities.
- Design and architect end-to-end multimodal agent systems (voice, video, text) leveraging AION's distributed GPU infrastructure.
- Build production-grade voice AI systems using STT, TTS APIs, and LLMs.
- Develop vision-enabled agents processing real-time video streams using computer vision pipelines.
- Implement multi-agent orchestration with frameworks like LangChain or LlamaIndex for tool use, memory management, and autonomous task completion.
- Rapidly prototype POCs, validate concepts, and iterate based on feedback.
- Optimize for sub-500ms latency, natural conversation flow, and real-time system responsiveness.
- Integrate agents into customer codebases via REST/GraphQL/WebSocket APIs and custom SDKs.
- Serve as a trusted technical advisor to customers, shaping their AI strategy and guiding roadmap decisions.
Data Strategy & MLOps Infrastructure
- Design data architectures with efficient processing pipelines and ingestion workflows for training and inference on AION's platform.
- Implement RAG systems with vector databases, optimizing embedding strategies, chunk sizes, and retrieval methods.
- Prepare and validate datasets for fine-tuning, evaluation, and synthetic data generation.
- Collaborate with MLEs, MLOps, and SREs for model deployment and productionization.
Observability, Evaluation & Production Operations
- Implement LLM and agent observability and monitoring, tracking key metrics like token usage, latency, costs, and quality.
- Instrument applications to trace LLM calls, retrieval operations, agent actions, and data flows.
- Build evaluation frameworks with offline benchmarks and online monitoring to ensure system performance and identify drift.
Technical Skills & Experience
We encourage you to apply if you meet some of these requirements and are eager to learn the rest:
- 3-5+ years of hands-on experience building production AI/ML systems, with 1-2+ years deploying LLM applications.
- Multimodal AI expertise: practical experience with voice agents, vision systems, or conversational AI.
- Strong LLM foundations: hands-on with foundation models, fine-tuning, prompt engineering, and evaluation.
- Agent framework proficiency: production experience with LangChain, LlamaIndex, or similar.
- Voice AI platform experience: built real-time conversational systems with STT/TTS integration.
- Proficiency in Python (production-grade, async, type hints) and JavaScript/TypeScript (full-stack).
- RAG implementation experience: built retrieval-augmented generation systems with vector databases.
- MLOps & deployment: hands-on with Docker, Kubernetes, CI/CD, and IaC.
- Cloud platforms: experience with AWS, Azure, or GCP for ML workloads.
- Exceptional communication skills for technical and business stakeholders.
- Customer-facing experience (Solutions Architecture, TAM, Pre-Sales) is highly desirable.
- Computer vision experience (video processing, object detection, VLM) is a plus.
- Model fine-tuning experience (LoRA/QLoRA, SFT, RLHF) is a plus.
- Inference optimization experience (vLLM, TensorRT-LLM, Triton, quantization) is desirable.
- Observability tooling experience for LLM monitoring, tracing, and evaluation is a strong plus.
- Familiarity with WebRTC, real-time streaming, and low-latency media processing.
Why Join AION?
- Work directly with founders shaping technical and product strategy.
- Build infrastructure powering the future of AI compute globally.
- Significant ownership and impact with competitive equity.
- Competitive compensation, flexible work options, and wellness benefits.
Apply now by sharing your resume highlighting relevant projects and leadership experience, links to your work (GitHub, demos), and a brief note on why AION's mission excites you.
Company
AION
AION is pioneering a decentralized AI cloud platform focused on high-performance computing (HPC). Our mission is to democratize access to compute power and provide managed services, creating an end-to...