
Senior AI Engineer
Responsibilities
Qualifications & Requirements
Experience Level: Senior Level
Full Job Description
About the Role:
We are looking for a highly skilled Senior AI Engineer with profound expertise in agentic frameworks, Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) systems, MLOps/LLMOps, and end-to-end Generative AI (GenAI) application development. In this pivotal role, you will be instrumental in designing, developing, fine-tuning, deploying, and optimizing cutting-edge AI solutions for various enterprise use cases. These include AI Copilots, sophisticated summarization tools, advanced enterprise search capabilities, and intelligent tool orchestration.
Key Responsibilities:
- Develop and fine-tune LLMs (such as GPT-4, Claude, LLaMA, Mistral, Gemini) utilizing advanced techniques like instruction tuning, prompt engineering, chain-of-thought prompting, and model fine-tuning.
- Construct robust RAG Pipelines: Implement Retrieval-Augmented Generation solutions, effectively employing embeddings, strategic chunking, and various vector databases including FAISS, Pinecone, Weaviate, and Qdrant.
- Implement and Orchestrate Agents: Leverage powerful frameworks like MCP, OpenAI Agent SDK, LangChain, LlamaIndex, Haystack, and DSPy to build dynamic multi-agent systems and serverless GenAI applications.
- Deploy Models at Scale: Manage model deployments efficiently using platforms such as HuggingFace, Azure Web Apps, vLLM, and Ollama, with expertise in handling local models through GGUF, LoRA/QLoRA, PEFT, and quantization methods.
- Integrate APIs Seamlessly: Ensure smooth integration with APIs from leading providers like OpenAI, Anthropic, Cohere, and Azure, as well as other GenAI platforms.
- Ensure Security and Compliance: Implement stringent guardrails, perform PII redaction, guarantee secure deployments, and meticulously monitor model performance using advanced observability tools.
- Optimize and Monitor: Lead LLMOps initiatives, focusing on performance monitoring, cost optimization, and comprehensive model evaluation.
- Utilize AWS Services: Apply hands-on experience with AWS Bedrock, SageMaker, S3, Lambda, API Gateway, IAM, CloudWatch, and serverless computing to deploy and manage scalable AI solutions.
- Contribute to Diverse Use Cases: Develop innovative AI-driven solutions, including AI copilots, enterprise search engines, summarization tools, and intelligent function-calling systems.
- Foster Cross-functional Collaboration: Work closely with product, data, and DevOps teams to ensure the successful delivery of scalable and secure AI products.
Qualifications:
Required Skills and Experience:
- 3-5 years of experience in AI/ML roles, with a strong focus on LLM agent development, data science workflows, and system deployment.
- Proven experience in designing domain-specific AI systems and effectively integrating structured and unstructured data into AI models.
- Proficiency in designing scalable solutions using LangChain and various vector databases.
- Deep understanding of LLMs and foundational models, including GPT-4, Claude, Mistral, LLaMA, and Gemini.
- Strong expertise in Prompt Engineering, Chain-of-Thought reasoning, and various Fine-Tuning methods.
- Demonstrated experience building RAG pipelines and working with modern vector stores such as FAISS, Pinecone, Weaviate, and Qdrant.
- Hands-on proficiency with frameworks like LangChain, LlamaIndex, Haystack, and DSPy.
- Skilled in model deployment using HuggingFace, vLLM, Ollama, and adept at handling LoRA/QLoRA, PEFT, and GGUF models.
- Practical experience with AWS serverless services, including Lambda, S3, API Gateway, IAM, and CloudWatch.
- Strong coding proficiency in Python or comparable programming languages.
- Experience with MLOps/LLMOps practices for monitoring, evaluation, and cost management.
- Familiarity with security standards, including guardrails, PII protection, and secure API interactions.
- Track record of delivering impactful AI solutions such as AI Copilots, Summarization engines, or Enterprise GenAI applications.
Additional Information:
Preferred Skills:
- Experience in Business Process Outsourcing (BPO) or IT Outsourcing environments.
- Knowledge of workforce management tools and CRM integrations.
- Hands-on experience with a variety of AI technologies and their applications in data analytics.
- Familiarity with Agile/Scrum methodologies.
Soft Skills:
- Exceptional analytical and problem-solving capabilities.
- Excellent communication and stakeholder management skills.
- Ability to excel and thrive in a fast-paced, dynamic work environment.
Company
Quantanite
Quantanite is a customer experience (CX) solutions company dedicated to assisting fast-growing companies and leading global brands in their transformation and growth journeys. Through a collaborative ...