
Large Language Models Intern
2w2 weeks agoGenentech
New York City, US · Internship · $50 – $50/hr
About this role
Prescient Design, part of Genentech’s Research and Early Development organization, advances drug discovery through cutting-edge machine learning. The Foundation Models team builds internal large language models enabling next-generation scientific and biomedical applications across the drug-discovery pipeline. Exceptional graduate student interns with strong ML research or engineering backgrounds drive independent exploration and solve complex technical problems collaboratively.
Interns contribute to research and development of internal LLMs for scientific discovery and therapeutic molecular design. They develop and evaluate advanced post-training techniques to enhance domain knowledge and strengthen reasoning capabilities for scientific and biomedical applications. Responsibilities include supporting large-scale model training on high-performance GPU clusters.
Collaboration occurs with cross-functional teams to design and implement applied LLM use cases. The internship is on-site in New York City, fostering work with leading experts in biotechnology and AI. Interns tackle high-visibility projects with full ownership in interdisciplinary settings.
This 12-week full-time paid internship offers a location-based stipend and paid holiday time off. Participants pursue impactful work in AI for drug discovery. The program emphasizes hands-on contributions to innovative foundation models.
Requirements
- Pursuing a PhD (enrolled student)
- Major in Computer Science, Data Science, Machine Learning, Statistics, or a related technical field
- Strong Python skills and experience with ML frameworks such as PyTorch
- Solid understanding of neural networks, representation learning, and modern supervised/unsupervised methods
- Excellent written and verbal communication, and ability to work effectively with interdisciplinary teams
- Hands-on experience with large language models, especially post-training workflows (e.g., supervised fine-tuning and reinforcement learning)
- Experience with GPU clusters or distributed training systems for efficient large-scale model training
Responsibilities
- Contribute to research and development of internal LLMs for scientific discovery and therapeutic molecular design
- Develop and evaluate advanced post-training techniques to enhance domain knowledge and strengthen reasoning capabilities for scientific and biomedical applications
- Support large-scale model training on high-performance GPU clusters
- Collaborate with cross-functional teams to design and implement applied LLM use cases
- Drive independent exploration of LLM applications in drug discovery
- Solve complex technical problems in collaborative settings
Benefits
- 12-week, full-time paid internship (40 hours per week)
- Location-based stipend to support internship expenses
- Ownership of impactful, high-visibility projects
- Collaboration with leading experts in biotechnology and AI
- Paid holiday time off benefits
Similar roles

Real-World Evidence Research Scientist
1w1 week agoNovocure
Haifa, IL · Full-time · ILS 350,000 – ILS 500,000

Statistical Programmer I
1w1 week agoQuanticate
Hyderābād, IN · Full-time · INR 500,000 – INR 800,000

Senior Biostatistician
1w1 week agoClinChoice
US · Full-time · $140,000 – $180,000

PhD-Level Bioinformatics Expert
1w1 week agoWeekday
US · Contract · $120,000 – $170,000