Logo of Huzzle

Machine Learning Scientist Intern

  • Internship
    Full-time
    Summer Internship
  • Data
    IT & Cybersecurity
  • Portland
  • Quick Apply

AI generated summary

  • You should have a strong foundation in machine learning, NLP, and data analysis, proficiency in Python, experience with ML pipelines, and a keen interest in healthcare.
  • You will design and implement ML models, analyze performance, troubleshoot issues, collaborate with teams, and stay updated on ML advancements in healthcare. Write clean, maintainable code and communicate results.

Requirements

  • Demonstrated knowledge of data science, machine learning, and modeling.
  • Ability to use well-understood techniques and existing patterns to build, analyze, deploy, and maintain models.
  • Effective in time and task management.
  • Able to develop productive working relationships with colleagues and business partners.
  • Strong interest in the healthcare industry.
  • Ability to read, understand, and apply the latest research to enhance our products where possible.
  • Strong mathematical foundation and theoretical grasp of the concepts underlying machine learning, optimization, etc.
  • Demonstrated understanding of how to structure simple machine learning pipelines (e.g., has prepared datasets, trained and tested models end-to-end).
  • Classic ML algorithms (e.g., linear and logistic regression, decision and boosted trees, SVM, collaborative filtering, ranking)
  • Approaches (e.g., supervised, semi-supervised, unsupervised, reinforcement learning, regression, classification, time series modeling, transfer learning)
  • Foundational ML concepts such as objective functions, regularization and overfitting
  • Data partitions (train/dev/test) and model development
  • Hyperparameter tuning and grid search
  • Evaluation concepts (metrics, feature importance, etc.)
  • Familiarity with standard python packages (scikit-learn, XGBoost, TensorFlow, PyTorch, etc.)
  • Familiarity with structure of machine learning pipelines
  • Experience with large language models (LLMs) and natural language processing (NLP) techniques.
  • Knowledge of techniques such as tokenization, part-of-speech tagging, named entity recognition, sentiment analysis, and machine translation.
  • Familiarity with pre-trained models and frameworks like BERT, GPT, and spaCy for NLP tasks.
  • Understanding of text preprocessing, feature extraction, and embedding methods.
  • Experience with NLP libraries and tools (NLTK, Hugging Face Transformers, etc.)
  • Capable of building and refining NLP models for tasks such as text classification, language generation, and information extraction.
  • Awareness of the ethical considerations and challenges in deploying NLP models.
  • Strong foundation in data analysis.
  • Research and experiment design.
  • Visualization with data.
  • Answering questions with data.
  • Strong python programming skills. Familiarity with standard data science packages.
  • Familiarity with standard software development best practices. Strong SQL skills a plus.
  • Understanding of standard algorithms and data structures (ex. search & sort) and their analysis.

Responsibilities

  • Researches, designs, develops, and implements data-driven models and algorithms using machine learning, deep learning, statistical, and other mathematical modeling techniques.
  • Trains and tests models and develops algorithms to solve business problems.
  • Adheres to standard best-practices and establishes principled experimental frameworks for developing data-driven models.
  • Develops models and performs experiments and analyses that are replicable by others.
  • Uses open-source packages when appropriate to facilitate model development.
  • Identifies, measures, analyzes, and visualizes drivers to explain model performance (e.g., feature importance, interpretability, bias and error analysis), both offline (in the development phase) and online (in production).
  • Uses appropriate metrics and quantified outcomes to drive model and algorithm improvements.
  • Analyzes, diagnoses, and resolves bugs in production machine learning models and systems.
  • Evaluates model/use case feasibility by quickly generating prototypes.
  • Takes models from prototype stage and improves performance as needed.
  • Writes clean, well-commented, tested, version-controlled, and maintainable python code.
  • Collaborates with team members and Cambia business partners.
  • Actively participates in group meetings and discussions.
  • Communicates effectively both orally and in writing with both technical and non-technical audiences.
  • Keeps current with the state of the art in machine learning and AI and its application to healthcare.
  • Keeps current with evolving commercial and open-source tools, techniques, and brings these practices to projects.
  • Over time develops familiarity and insight with various subdomains of healthcare data.

FAQs

What is the duration of the Machine Learning Scientist Intern position?

The internship is a 12-week, full-time position starting in May or June 2025.

Where can the internship be performed?

The internship can be performed remotely within Washington, Idaho, Oregon, and Utah.

What is the primary job purpose of the Machine Learning Scientist Intern?

The primary job purpose is to work with various stakeholders to design, develop, and implement data-driven solutions using expertise in machine learning, deep learning, optimization, and statistical modeling, particularly within the healthcare payer domain.

Will I receive mentorship during the internship?

Yes, as an intern, you will work under the mentorship of another Machine Learning Scientist on various parts of the AI/ML lifecycle.

What types of tasks can I expect to engage in during the internship?

You can expect to research, design, develop, and implement data-driven models, train and test models, collaborate with team members and business partners, and communicate results to both technical and non-technical audiences.

What are the minimum requirements for applicants?

The minimum requirements include demonstrated knowledge of data science, machine learning, and modeling, the ability to manage time and tasks effectively, and a strong interest in the healthcare industry.

What programming skills are necessary for this internship?

Strong Python programming skills and familiarity with standard data science packages are required. Strong SQL skills are a plus.

Is there a specific pay range for the internship?

Yes, the base pay range for this role is $30.00 per hour.

How does Cambia support professional development?

Cambia is committed to helping you succeed and grow your career by providing opportunities to work alongside diverse teams and offering generous benefits.

Are there options to work from home?

Yes, Cambia offers work-from-home options for most roles, but employees must have a wired internet connection that meets specified speed requirements.

What values does Cambia emphasize in its workplace culture?

Cambia emphasizes a caring and supportive culture built on trust and innovation, where compassion, empathy, and team spirit are integral to the work environment.

Is Cambia an equal opportunity employer?

Yes, Cambia is an Equal Opportunity and Affirmative Action employer dedicated to workforce diversity. All qualified applicants will receive consideration for employment without regard to various protected statuses.

What accommodations are available for applicants with disabilities?

If you need accommodation for any part of the application process due to a medical condition or disability, you can email CambiaCareers@cambiahealth.com for assistance.

We are dedicated to making the health care experience simpler, better and more affordable for people and their families.

Science & Healthcare
Industry
501-1000
Employees

Mission & Purpose

Cambia Health Solutions is a family of companies headquartered in Portland, Oregon, working to create a person-focused and economically sustainable health care system. Through our health plans, innovative technology and strategic investments, we are delivering consumer-centric solutions that: •Help people achieve better health and well-being •Make health care more available, affordable and accessible •Are easily understood by everyone We put people at the heart of everything we do. Our company reaches more than 70 million Americans nationwide, including more than two million people in the Pacific Northwest who are enrolled in our regional health plans.