Internship

• Starts Jun 16

Data Engineering Intern - Graduate

Tubi

20d ago

🚀 Off-cycle Internship

San Francisco

💻 Remote
⌛ Closed
Applications are closed

Description

  • At Tubi, data plays a vital role in keeping viewers engaged and the business thriving. Every day, data engineering pipelines analyze the massive amount of data generated by millions of viewers, turning it into actionable insights. In addition to processing terabytes of first-party user activity data per day, we manage a petabyte-scale data lake and data warehouses that several hundred consumers use daily. We have two openings, one on each of two different teams.

Requirements

  • Fluency (intermediate) in one major programming language (preferably Python, Scala, or Java) and SQL (any variant)
  • Familiarity with big data technologies (e.g., Apache Spark, Kafka) is a plus
  • Strong communication skills and a desire to learn!
  • Program Eligibility Requirements:
  • Must be actively enrolled in an accredited college or university and pursuing an undergraduate or graduate degree for the length of the program
  • Current class standing of sophomore (second-year college student) or above
  • Strong academic record (minimum cumulative 3.0 GPA)
  • Committed and available to work for the entire length of the program

Education requirements

Currently Studying
Undergraduate

Area of Responsibility

Data

Responsibilities

  • Core Data Engineering (1): In this role, you will join a team focused on Core Data Engineering, helping build and analyze business-critical datasets that fuel Tubi's success as a leading streaming platform.
  • Use SQL and SQL modeling to interact with and create massive sets of data
  • Use dbt and its semantic modeling concepts to build production data models
  • Use Databricks as a data warehouse and computing platform
  • Use Python/Scala in notebooks to interact with and create large datasets
  • Streaming Analytics (1): In this role, you will join a small, nimble Streaming Analytics team that powers our core, critical datasets for machine learning, helping improve the data quality that fuels Tubi's success as a leading streaming platform.
  • Use SQL to explore and analyze the data quality of our most critical datasets, working with different technical stakeholders across ML & data science
  • Work with engineers to implement a near-real-time data quality dashboard
  • Use Python/Scala in notebooks to transform and explore large datasets
  • Use tools like Airflow for workflow management and Terraform for cloud infrastructure automation

Details

Work type

Full time

Work mode

Remote

Start date

Jun 16, 2024

Application deadline

Apr 19, 2024

Location

San Francisco