
Data Engineer
Responsibilities
Qualifications & Requirements
Experience Level: Senior Level
Full Job Description
EVERSANA seeks a Senior Associate, reporting to the Associate Director - Engineering, to play a vital role in designing, developing, and maintaining scalable data pipelines and infrastructure in Bengaluru, India. This position involves close collaboration with data scientists, analysts, and other stakeholders to ensure our data systems are efficient, reliable, and support complex analytical needs. The role combines hands-on technical work with strategic planning to meet the organization's evolving data requirements. Successful candidates will possess strong analytical skills, the ability to integrate data from diverse sources, proficiency in multiple programming languages, and an understanding of machine learning methodologies.
RESPONSIBILITIES:
- Design, construct, and manage scalable data pipelines for efficient data flow into machine learning models.
- Partner with data scientists to guarantee the availability of high-quality, pre-processed data for model training.
- Implement machine learning models into production environments utilizing tools like TensorFlow Serving, MLflow, or Seldon.
- Monitor and maintain the performance of production machine learning models, proactively addressing issues such as data or concept drift.
- Process large datasets, both batch and real-time, using frameworks such as Apache Spark, Apache Kafka, and AWS Glue.
- Implement and oversee data versioning and experiment tracking with tools like DVC and MLflow.
- Ensure data integrity and quality through rigorous validation and profiling techniques.
REQUIREMENTS:
- Proficiency in Python and SQL; familiarity with Scala or Java is advantageous.
- Substantial experience with Apache Spark, Apache Kafka, and other data processing frameworks.
- Experience deploying machine learning models using TensorFlow Serving, MLflow, or comparable tools.
- Familiarity with data lakes and data warehouses, including AWS S3, Google BigQuery, and Snowflake.
- Experience with cloud platforms (AWS, GCP, Azure) and containerization technologies such as Docker and Kubernetes.
- Understanding of machine learning workflows and experience collaborating with data scientists and ML engineers.
- Solid knowledge of ETL processes, batch and real-time data processing, and orchestration tools like Apache Airflow.
EDUCATIONAL QUALIFICATIONS:
- Bachelor’s degree in Engineering, Technology, or Computer Science.
- A minimum of 5 years of relevant industry experience, preferably in Healthcare, Pharmaceutical Consulting, or Enterprise-level data-analytical solutions.
OUR CULTURAL BELIEFS:
- Patient Minded: Prioritize the patient's best interests.
- Client Delight: Own and impact every client experience.
- Take Action: Empower self and others to act decisively.
- Grow Talent: Invest in personal and others' development.
- Win Together: Collaborate effectively to achieve results.
- Communication Matters: Foster transparent, thoughtful, and timely dialogue.
- Embrace Diversity: Cultivate an environment of awareness and respect.
- Always Innovate: Be bold and creative in all endeavors.
Company
EVERSANA
EVERSANA is a globally certified Great Place to Work organization dedicated to creating a healthier world. Our diverse team of over 7,000 professionals provides next-generation commercialization servi...