
Data Scientist
Experience Level: Mid Level
Full Job Description
MavenMagnet AI, a dynamic AI-based data analytics firm in Mumbai, India, is seeking a creative Data Scientist specializing in Large Language Models (LLMs) to join our growing team. This role is crucial to our success and contributes directly to the development of our cutting-edge AI products.
As a Data Scientist, you will be instrumental in the transfer learning and training of LLMs. The ideal candidate possesses a strong data science foundation and the ability to build and fine-tune in-house LLMs. You will have the opportunity to work with large-scale datasets, translating your insights into tangible products. Your contributions will be vital in understanding user behavior and in identifying and recommending product opportunities, both of which are critical for scaling our expanding user base.
Responsibilities
- Analyze and modify complex datasets using advanced techniques including machine learning, statistical analysis, and Natural Language Processing (NLP). Identify modeling attributes and parameters, applying algorithms to create descriptive, predictive, or prescriptive models to meet analysis and project objectives.
- Investigate data characteristics, complexity, quality, and meaning. Use visualizations and summaries to define performance and to identify trends, outliers, and key drivers.
- Prepare data for analysis, including cleansing, conditioning, transformation, handling missing fields, identifying new feature variables, and managing multivariate data.
- Fine-tune LLMs for NLP tasks such as topic modeling, multi-label classification, and emotion detection, and for image detection.
Required Skills
- 1-3 years of industry experience in data science, with a focus on developing ML models for NLP.
- 1-2 years of experience in fine-tuning and training LLMs.
- Proven experience building NLP models for topic modeling and multi-label classification.
- Proficiency in relevant programming languages, particularly Python.
- Experience with the Python Data Science ecosystem, including NumPy, SciPy, and Jupyter.
- Experience with AWS cloud, including developing and deploying applications in a cloud environment.
- Master's degree in Data Science.
- Prior experience working with startups in a fast-paced environment is considered an advantage.
- A link to a GitHub profile showcasing your work is required.
Preferred Requirements
- Master's degree in Data Science from a premier institution like IIT/NIT.
- Experience building and deploying deep neural networks to production.
- Experience with major cloud platforms such as AWS, Google Cloud, or Azure.
- Experience handling large-volume data processing systems.
- Familiarity with Agile software development methodologies.
- Solid understanding of database technologies like SQL, PL/SQL, and relational database schema design.
At MavenMagnet AI, we offer:
- A supportive team environment dedicated to your career growth and training.
- Competitive salary and equity, commensurate with experience.
- Best-in-class health insurance, including dental and vision coverage.
- The opportunity to work alongside world-class talent.
- A flexible vacation / paid time off policy.
- Freedom to express your personal style.
We are committed to celebrating diversity and creating an inclusive environment for all our employees.
Company
MavenMagnet AI
MavenMagnet AI is a leading data analytics company based in Mumbai, India. We specialize in transforming insights generation through in-depth analysis and discovery, offering significant time and cost...