
Umanist•2h ago
Foundit
Data Scientist
Pune, India
Permanent
Mid Level
700000-900000
N/A
N/A
N/A
Responsibilities
Qualifications & Requirements
Experience Level: Mid Level
Full Job Description
We are seeking a skilled Data Scientist to join our AI-CoE team in Pune, India, with opportunities also available in Bangalore and Chennai. This role is ideal for individuals with a strong foundation in Machine Learning, Natural Language Processing (NLP), Generative AI, and Retrieval-Augmented Generation (RAG).
As a Data Scientist, you will be instrumental in developing and deploying advanced data-driven solutions, focusing on creating AI-enabled products that drive significant business value and product differentiation. You will work on cutting-edge projects, leveraging your expertise to innovate and contribute to our Telco use cases.
Key Responsibilities:
- Develop, test, and deploy machine learning models for diverse business and Telco applications.
- Execute data preprocessing, feature engineering, and rigorous ML/DL model evaluation.
- Optimize and fine-tune models to enhance performance and scalability.
- Apply your understanding of NLP concepts to projects involving entity recognition, text classification, and language modeling (e.g., GPT, Llama, Claude, Grok).
- Build and refine RAG models to improve information retrieval and response generation.
- Integrate RAG methodologies into existing applications to boost data accessibility and user experience.
- Collaborate effectively with cross-functional teams, including software engineers, product managers, and domain experts.
- Clearly articulate complex technical concepts to non-technical audiences.
- Maintain comprehensive documentation of processes, methodologies, and model development.
Required Skills:
- Solid grasp of probability and statistics.
- Proficiency in machine learning and deep learning techniques.
- Strong programming skills in Python and SQL, along with experience in frameworks and tools such as PyTorch, Sci-kit, NumPy, and Gen AI tools like LangChain/LlamaIndex.
- Familiarity with MLOps principles and experience with Big Data processing in both batch and streaming modes.
- Excellent problem-solving abilities and a proactive approach.
- Effective communication and teamwork skills.
- Ability to manage multiple projects and adhere to deadlines.
- Candidates should expect coding tests as part of the interview process.
Must-Haves:
- A minimum of 2 years of relevant hands-on experience as a Data Scientist.
- Proven experience in developing and deploying ML models using Python, PyTorch, and Scikit-learn.
- Practical knowledge of NLP tasks (e.g., entity recognition, text classification) and experience with Generative AI models like GPT or LLaMA.
- Experience utilizing tools like LangChain or LlamaIndex for building and optimizing RAG pipelines.
- Understanding of MLOps practices and the capacity to manage big data in batch and streaming environments.
- Excellent communication skills.
- Bachelor's or Master's degree in a relevant technical field.
- Availability to join within 15 days (notice period).
Company
Umanist
Pune, India
Posted on Foundit