Logo of Huzzle

Internship - Product Management (Large Language Models Integration Into Data Pipelines)

Applications are closed

  • Internship
    Full-time
    Off-cycle Internship
  • Product
    Data
  • Waltham

Requirements

  • Knowledge of data engineering, machine learning, and data science
  • Proficiency in Python programming
  • Understanding of LLMs and Natural Language Processing (NLP)
  • Familiarity with scientific workflows (such as laboratory, research, clinical)
  • Familiarity with Langchain and Hugging Face is a plus
  • Strong collaboration and communication skills, with the ability to work effectively in cross-functional teams and communicate technical concepts to non-technical stakeholders
  • Excellent problem-solving abilities and analytical thinking skills

Responsibilities

  • Your role involves:
  • Defining use cases for LLMs in scientific data pipelining
  • Researching the state of the art in the use of LLMs in data pipelines (RAG, fine-tuning, etc.)
  • Identifying relevant Foundation Models
  • Impacting product direction by writing user stories to implement new product capabilities
  • Collaborating with UX experts to create product feature that delight our customers.

Powering Smarter Treatments and Healthier People

Technology
Industry
1001-5000
Employees

Mission & Purpose

Medidata is leading the digital transformation of clinical research, creating hope for millions of patients. Our unified platform combines AI powered insights with unrivaled patient-centric clinical trial solutions to help pharmaceutical, biotech, medical device and diagnostics companies, as well as academic researchers accelerate value, minimize risk, and optimize outcomes. Medidata is the first life sciences tech company to surpass 30,000 trials and 9 million participants. We’re powering smarter treatments and healthier people, with over 1.5 million registered users across 2,000+ customers and partners harnessing the world's most trusted clinical trials software for clinical development, commercial, and real-world data. We are a Dassault Systèmes company (Euronext Paris: FR0014003TT8, DSY.PA) headquartered in New York City with offices around the world. See why experience matters at www.medidata.com and follow us on Twitter (@Medidata) and Instagram (@medidata.solutions).