Logo of Huzzle

🚀 Internship

Data Engineer Intern (Supplyframe, Summer 2024)

Logo of Siemens

Siemens

26d ago

🚀 Summer Internship

Los Angeles

AI generated summary

  • The ideal candidate for this Data Engineer Intern position should be a university student pursuing a degree in a quantitative discipline, comfortable with scripting languages and web analytics, adept at working with open-source tools, proficient in Java/Scala for processing unstructured data, skilled in data analysis and translating technical data into actionable insights, and familiar with Hadoop/Spark.
  • The Data Engineer Intern at Siemens will be responsible for developing and automating data pipelines, implementing algorithms, improving data frameworks, managing data lake scaling, enhancing reporting platforms, maintaining data integrity, and identifying new technologies for rapid scaling and new capabilities.

Summer Internship

EngineeringLos Angeles

Description

  • Supplyframe, recently acquired by Siemens, is currently recruiting students to kick off our Summer 2024 Internship Program. 
  • We are looking for a talented and ambitious Data Engineer Intern. This individual will have the opportunity to expand and enhance our big data platform to deliver clean, structured data to both our internal and external customers. As we expand our capabilities in the areas of data mining, machine learning, and big data analysis, this position will be key to help deliver value to our end users. They will also have the unique opportunity to create new data products from our data lakes while working to enhance the cluster's stability and flexibility. 
  • Our goal is to empower our students to become the next generation of leaders at our company!

Why You’ll Love Interning Here:

  • Strong track record of providing an inclusive culture of belonging and empowerment in an entrepreneurial environment.
  • Gain real world experiences and become more marketable as you partner with the very best minds in our industry.
  • Flexible schedules and work-from-home opportunities; casual dress environment.


Requirements

  • Actively enrolled as a university student and in pursuit of a Bachelor's degree in Computer Science, Statistics, Mathematics or a related quantitative discipline.
  • Comfortable with Unix shell or other scripting languages.
  • Demonstrate clear understanding of web analytics and tracking.
  • Comfortable working with open-source tools and have the self-learning ability to get tools to work with limited instruction.
  • Understanding of software development best practices and revision control (git).
  • Experience with:
  • Java/Scala experience working with unstructured data and perform raw text processing
  • Data analysis and the ability to translate raw, technical data into actionable insight
  • Using Hadoop/Spark

Education requirements

Currently Studying

Area of Responsibilities

Engineering

Responsibilities

  • Develop and automate data pipelines using MapReduce/Spark to model large data sets
  • Performing algorithm development and implementation in production systems
  • Developing software in Java, Scala, or scripting programming language
  • Improve existing data frameworks within the data lake to handle anticipated growth and new objectives
  • Manage data lake scaling with regards to space allocation, job optimization, and data partitioning
  • Increase the capabilities of our reporting/analytics platforms to support business insight for internal and external users
  • Maintain data integrity by enhancing our ability to remove content generated by undesirable actors such as bots, scrapers, and pen testers
  • Identify and configure additional technologies to allow for rapid scaling and new capabilities

Details

Work type

Full time

Work mode

office

Location

Los Angeles