Memorial Sloan Kettering Cancer Center•6h ago
Career Pages
Data Engineer
United States
Full Time
Mid Level
Salary range: 93,000-148,000 USD
Full Job Description
Role Overview
Join Memorial Sloan Kettering Cancer Center (MSK) and become a vital part of the Prostate Cancer Clinical Trials Consortium as a Data Engineer. In this role, you will design and maintain robust data storage and access infrastructure essential for clinical trials.
Key Responsibilities
- Data Infrastructure: Implement and maintain relational database structures on AWS S3 using DuckDB/DuckLake, keeping datasets organized and analysis-ready.
- ETL Development: Build ETL pipelines that ingest data from Electronic Data Capture (EDC) systems and transform raw inputs into versioned data assets for scientific teams.
- Access Layers & Governance: Develop efficient database connectors and R utilities while maintaining strict governance standards, including naming conventions and documentation across the trial portfolio.
- Cross-Functional Collaboration: Partner with Clinical Operations to align upstream data flows from study sites with downstream analytic needs for researchers.
- Version Control & Automation: Use GitHub Enterprise for version control and contribute to CI/CD workflows that automate infrastructure maintenance where possible.
Must-Have Qualifications
- Bachelor's degree in Computer Science, Data Engineering, or a related field.
- 2–4 years of experience designing data pipelines, ETL processes, and database systems.
- Strong proficiency in SQL and relational database concepts.
- Familiarity with cloud storage (AWS S3) and Infrastructure-as-Code principles.
- Experience managing access permissions across platforms like SharePoint and Airtable.
Company
Memorial Sloan Kettering Cancer Center
Memorial Sloan Kettering Cancer Center (MSK) is a world-renowned leader in cancer research and clinical care, dedicated to its singular mission of ending cancer for life. The organization unites divers...