Logo of Huzzle

Student Researcher (Doubao (Seed) - Generative AI) - 2025 Summer/Fall/Winter (PhD)

image

ByteDance

12d ago

  • Internship
    Full-time
    Off-cycle Internship
  • Research & Development
    Software Engineering
  • San Jose

AI generated summary

  • You must be pursuing a PhD in a related field, possess multi-modal research experience, have publications in top-tier venues, and be proficient in Python and deep learning frameworks.
  • You will research and develop generative AI and multimodal machine learning technologies, focusing on video generation and enhancing foundation models for new AI-driven products.

Requirements

  • Minimum Qualifications
  • Currently pursuing a PhD in Software Development, Computer Science, Computer Engineering, or a related technical discipline.
  • Research experience in multi-modal understanding, vision and language, such as video captioning, VQA, Text-to-video retrieval, audio/music understanding and generation, and other related topics.
  • Publications in top-tier venues, such as CVPR, ECCV, ICCV, NeurIPS, ICLR, ICML, EMNLP, ACL, COLING, etc.
  • Highly competent in algorithms and programming; Strong coding skills in Python and popular deep learning frameworks.
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment.
  • Preferred Qualifications:
  • Work and collaborate well with team members.
  • Ability to work independently; Strong communication skills.

Responsibilities

  • Conduct cutting-edge research and development in foundation model and multimodal machine learning, especially in the areas of generative AI (e.g. image, video generation). The primary objective is to research cutting-edge video generation technology through innovation.
  • Develop the foundation model to enhance the strategic advantages for ByteDance products
  • Explore new downstream products with artificial intelligence technology at its core.

FAQs

What is the Student Researcher position focused on?

The Student Researcher position focuses on conducting research and development in foundation models and multimodal machine learning, particularly in generative AI, with an emphasis on video generation technology.

What are the qualifications required for this position?

Candidates must be pursuing a PhD in Software Development, Computer Science, Computer Engineering, or a related technical discipline, have research experience in multi-modal understanding and vision-language tasks, and possess coding skills in Python and deep learning frameworks.

How long can the Student Researcher position last?

The duration of the Student Researcher position is flexible and can accommodate Part-Time or Full-Time commitments, depending on the project's needs and the researcher's availability.

Is there a limit on the number of positions I can apply for?

Yes, candidates can apply to a maximum of two positions within TikTok and its affiliates globally.

Are publications in top-tier venues necessary for this role?

Yes, candidates are expected to have publications in recognized venues such as CVPR, ECCV, ICCV, NeurIPS, and others relevant to the field.

What is the compensation for this position?

The hourly rate range for this position is $60 to $75, depending on location and qualifications.

Does the company offer benefits for this position?

Yes, interns have day one access to health insurance, life insurance, wellbeing benefits, 10 paid holidays per year, and paid sick time.

Is prior authorization required for employment?

Yes, candidates must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment.

What is the team culture like at the Doubao Vision team?

The team consists of experienced research scientists and engineers who collaborate to advance research boundaries in foundation models while fostering innovation in AI technologies.

Are there opportunities for collaboration in this role?

Yes, candidates are expected to work well with team members and effectively communicate while collaborating on research projects.

What kind of support is available for candidates with disabilities or other needs?

ByteDance is committed to providing reasonable accommodations during the recruitment process for candidates with disabilities, pregnancy, sincerely held religious beliefs, or other protected reasons.

What is the core mission of ByteDance?

The core mission of ByteDance is to inspire creativity and enrich life by helping people authentically express themselves, discover, and connect through innovative products.

Technology
Industry
10,001+
Employees
2012
Founded Year

Mission & Purpose

ByteDance is a global incubator of platforms at the cutting edge of commerce, content, entertainment and enterprise services - over 2.5bn people interact with ByteDance products including TikTok. Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. We are committed to building a safe, healthy and positive online environment for all our users.