Research Scientist Intern - Doubao (Seed) - Foundation Model, Vision and Language - 2025 Summer (PhD)

ByteDance

2mo ago

  • Internship
    Full-time
    Summer Internship
  • Research & Development
    Data
  • San Jose

AI generated summary

  • You must be a PhD student in a related field with research experience in multi-modal understanding, strong coding skills in Python, and publications in top-tier venues.
  • You will conduct research in computer vision and NLP, publish your findings, help build the company's brand in the research community, and turn multi-modal research results into product applications.

Requirements

  • Currently pursuing a PhD in Software Development, Computer Science, Computer Engineering, or a related technical discipline.
  • Able to commit to working for 12 weeks during Summer 2025.
  • Research experience in multi-modal understanding or vision and language, such as video captioning, VQA, text-to-video retrieval, audio/music understanding and generation, and other related topics.
  • Publications in top-tier venues such as CVPR, ECCV, ICCV, NeurIPS, ICLR, ICML, EMNLP, ACL, or COLING.
  • Highly competent in algorithms and programming; strong coding skills in Python and popular deep learning frameworks.
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment.

Preferred Qualifications

  • Graduating in December 2025 or later, with intent to return to the degree program after completing the internship.
  • Works and collaborates well with team members.
  • Able to work independently; strong communication skills.

Responsibilities

  • Conduct cutting-edge research and development in computer vision and natural language processing, especially in multi-modality and vision and language.
  • Publish our latest research results and help build our brand in the research community.
  • Transfer our research results to product applications, and explore new product ideas with CV/NLP at their core.

Industry: Technology
Employees: 10,001+
Founded Year: 2012

Mission & Purpose

ByteDance is a global incubator of platforms at the cutting edge of commerce, content, entertainment and enterprise services. Over 2.5bn people interact with ByteDance products, including TikTok. Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life, a mission we work toward every day. At ByteDance, we create together and grow together. That's how we drive impact for ourselves, our company, and the users we serve. We are committed to building a safe, healthy and positive online environment for all our users.