Logo of Huzzle

Data Science Intern

  • Internship
    Full-time
    Summer Internship
  • Data
    Software Engineering
  • Chicago

AI generated summary

  • You should have a strong CS and ML background, NLP expertise with Hugging Face, Python skills, and experience in SQL/Athena, AWS, Docker, and CI/CD. Strong communication and data visualization skills are essential.
  • You will collaborate with senior data scientists on projects, manage the data science workflow, evaluate model performance, participate in agile ceremonies, and present your work to the team.

Requirements

  • We are looking for an intern with strong background in Computer Science and Machine Learning to join the Places team.
  • The candidate must have prior experience in developing and testing machine learning or statistical projects.
  • Also, the candidate must have a strong background in NLP ( Natural Language Processing), Hugging Face transformer models and prompt engineering.
  • Experience in web scraping, SQL/Athena and AWS is a plus.
  • In addition, the candidate must be able to work effectively in a multidisciplinary team environment focused on innovation and be able to learn and integrate complementary technologies outside their specific area of expertise.
  • Strong programming skills in Python and proficiency in data manipulation libraries (e.g., Pandas) and machine learning frameworks (e.g., scikit-learn).
  • Solid understanding of the PyTorch and Tensorflow ecosystem.
  • Utilize state-of-the-art transformer-based models such as BERT, GPT, or other variants to solve natural language processing (NLP) tasks.
  • Build and deploy machine learning models using Docker containers to ensure consistency and reproducibility across different environments.
  • Optimize deep learning models using Nvidia CUDA libraries to accelerate computation on GPU hardware.
  • Experience developing and testing machine learning or statistical projects, with strong background in supervised, unsupervised, semi-supervised and reinforcement learning algorithms and modelling.
  • Demonstrate expertise in prompt engineering methodologies to fine-tune and optimize language models for specific tasks and domains, ensuring improved performance and efficiency.
  • Excellent communication skills, both written and verbal.
  • Familiarity with web scraping tools and techniques to gather data from diverse online sources.
  • Strong SQL/Athena skills
  • Experience working with AWS + Experience working with Hadoop and other distributed
  • Strong data visualization skills
  • Experience in a software development environment and code management or versioning
  • Experience working with continuous integration and delivery (CI/CD) pipelines

Responsibilities

  • Work with Senior Data Scientists to develop proof of concepts, minimum viable products, and fully deployable solutions
  • Take ownership for the data science workflow including exploratory data analysis, model development, and potentially deployment
  • Participate in design discussions and code reviews of your work
  • Evaluate model performance and contribute to optimization efforts, ensuring that our NLP solutions meet high standards of accuracy and efficiency
  • Participate in agile scrum ceremonies like daily standup, sprint planning, sprint review and retrospective
  • Presenting and demonstrating your work to co-workers

FAQs

What is the duration of the internship?

This is a full-time paid internship with a commitment of 20 to 40 hours per week.

What is the pay rate for the internship?

The hourly rate is based on your level of education: $25/hr for undergraduate students, $30/hr for graduate students, and $35/hr for PhD students.

What qualifications are required for this internship?

Candidates should have a strong background in computer science, machine learning, and natural language processing (NLP), along with experience in developing and testing related projects.

What programming languages and frameworks should I be proficient in?

Strong programming skills in Python and proficiency in data manipulation libraries such as Pandas and machine learning frameworks like scikit-learn are required.

Are there any preferred qualifications for this internship?

Yes, preferred qualifications include familiarity with web scraping tools, strong SQL/Athena skills, experience working with AWS, Hadoop, and experience in a software development environment.

What type of team will I be working with?

You will be working in a multidisciplinary team environment focused on innovation with collaborative and supportive colleagues.

What responsibilities will I have during the internship?

Responsibilities include developing proof of concepts, conducting exploratory data analysis, participating in design discussions and code reviews, optimizing models, and presenting work to co-workers.

Is there a focus on utilizing specific models in this role?

Yes, the role emphasizes using state-of-the-art transformer-based models like BERT and GPT to solve NLP tasks, as well as employing prompt engineering methodologies.

What opportunities for professional development are offered?

Interns will face challenging problems to solve, learn new technologies, work on impactful projects, decide how to perform tasks, and receive feedback on their performance.

Does HERE Technologies offer health benefits to interns?

Yes, US-based HERE employees, including interns, have access to health (Medical/Dental/Vision) insurance and retirement savings plans.

The world's leading location platform company

Technology
Industry
5001-10,000
Employees

Mission & Purpose

As global mobility becomes increasingly connected, electrified and automated, HERE Technologies is leading the way to a safer, greener future. Our location platform is integrated into more than 180 million vehicles across the planet, using fresh and accurate data that we have been building for over 35 years – and continue to refresh daily. Our experience in mapmaking has made HERE one of the leading innovators in location technology and spatial intelligence. In our key markets, Automated and Connected Driving, Fleet Management and Supply Chain, we work with global brands, partners, developers, and customers so that together we can move the world forward. Armed with critical location data and technology tools, we’re developing solutions that solve the biggest challenges that face us today and help us plan for a better future. To discover more about the future of location and spatial intelligence visit here.com.