
AI/ML Engineer
About the Role
We are seeking a talented AI/ML Engineer with specialized expertise in fine-tuning and deploying advanced Large Language Models (LLMs) and Vision-Language Models (VLMs). The ideal candidate will possess a strong background in model optimization techniques, such as LoRA, and be adept at implementing Retrieval-Augmented Generation (RAG) systems. You will be instrumental in developing and maintaining scalable AI pipelines that power a diverse range of applications, including sophisticated text summarization, precise object detection, and the creation of intelligent agents.
Key Responsibilities
- Fine-tune and optimize LLMs and VLMs with LoRA or other low-rank adaptation methods to meet project requirements, delivering solutions for use cases such as intent recognition, text summarization, multi-turn dialogue, object detection, and image captioning.
- Design and implement robust Retrieval-Augmented Generation (RAG) systems.
- Build and manage the comprehensive toolchain required for fine-tuning and deploying LLMs/VLMs. This includes overseeing training clusters and ensuring efficient model inference across both server-side environments and embedded targets.
- Apply advanced prompt engineering and agent-based methodologies to conceptualize, evaluate, and iterate on AI solutions tailored to specific user scenarios.
- Apply Post-Training Quantization (PTQ) and Quantization-Aware Training (QAT) to optimize models for deployment on edge devices.
Required Skills and Experience
- A solid foundation in Natural Language Processing (NLP) and Computer Vision (CV), with a deep understanding of mainstream AI models, their underlying principles, strengths, and typical applications, and the ability to turn that understanding into effective technical solutions.
- Proficiency in deep-learning frameworks such as PyTorch, coupled with familiarity with the architecture and implementation of models including Transformer, BERT, LLaMA, LLaVA, and their related extensions.
- Hands-on experience in designing production-ready architectures for large-model applications, such as chatbots, RAG pipelines, and intelligent agents.
- Fluency in Python, which is essential for this role.
- Experience with C++ is considered a valuable asset.
Preferred Qualifications
- Proven track record as a core contributor to a high-impact open-source project.
- Published research in leading academic journals or conferences.
- Demonstrated success with top rankings in well-known competitions.
- Awards received in programming or mathematical-modeling contests.
Education
Bachelor's or Master's degree in Computers & Technology.
Company
Mercedes-Benz
Mercedes-Benz is a global automotive giant renowned for its luxury vehicles and pioneering spirit. With a rich history of innovation and a commitment to excellence, Mercedes-Benz is at the forefront o...