About this role
Calico Life Sciences is an Alphabet-founded research company harnessing advanced technologies to understand aging biology and devise interventions for longer, healthier lives. Seeking a Senior Data Engineer as the founding member of the Drug Discovery Data Engineering group. Join a highly collaborative Engineering team focused on innovative technology labs and curiosity-driven discovery.
Act as a technical bridge between Medicinal Chemistry, Automation, Machine Learning, Assay Technology, and Protein Sciences groups. Drive projects from requirements-gathering to production deployment. Engineer high-performance data systems integrating with CDD Vault, Mosaic, Benchling, BigQuery, and internal AI platform.
Work in a vibrant environment with academic and industry partners on a drug-development pipeline. As the first hire, define data flows and build web applications for stakeholder review. Establish engineering culture for this key growth area amid complex scientific challenges.
Succeed as an enthusiastic team player who is detail-oriented, organized, and adept with complex data, software, and problems. Champion best practices in infrastructure-as-code, CI/CD, and containerization. Provide mentorship to junior engineers and onboard future hires.
Requirements
- BS/MS/PhD in Computer Science, Data Science, or a related technical field, or equivalent practical experience
- 5+ years of professional software or data engineering experience on the small molecule and antibody informatics side of pharmaceutical R&D
- Proficiency in applying laboratory informatics systems such as CDD Vault, Titian Mosaic, and Benchling to the drug discovery process
- Fluency in Python with a strong grasp of software and data engineering principles including testing, modularity, design patterns, data modeling
- Demonstrated experience developing and deploying cloud-based applications on Google Cloud Platform (GCP) preferred, AWS, or Azure
- Strong experience with modern web frameworks and infrastructure, specifically FastA
Responsibilities
- Collaborate with scientists in Assay Technology, Medicinal Chemistry, and Protein Sciences to gather requirements, architect solutions, and deploy production-grade software facilitating data movement and analysis
- Design and implement robust integrations between internal pipelines and third-party platforms, specifically CDD molecular database, Mosaic inventory systems, and Benchling ELN
- Define and optimize data flows across the organization, ensuring seamless data handover from Machine Learning to Protein Sciences to Assay Technologies to accelerate drug discovery feedback loop
- Develop data systems and internal web applications using React and Python that allow stakeholders to review, visualize, and communicate complex scientific data
- Serve as a senior technical voice within the larger Engineering team and provide mentorship to junior engineers across Calico
- Help onboard future hires into the Drug Discovery Data Engineering team
- Champion best practices for infrastructure-as-code, CI/CD, and containerization while setting standards for data engineering at Calico
Similar roles

Real-World Evidence Research Scientist
1w1 week agoNovocure
Haifa, IL · Full-time · ILS 350,000 – ILS 500,000

Statistical Programmer I
1w1 week agoQuanticate
Hyderābād, IN · Full-time · INR 500,000 – INR 800,000

Senior Biostatistician
1w1 week agoClinChoice
US · Full-time · $140,000 – $180,000

PhD-Level Bioinformatics Expert
1w1 week agoWeekday
US · Contract · $120,000 – $170,000
