

Student Researcher in Foundation Model, Vision and Language - 2024 Start (PhD)

TikTok


1mo ago

💼 Graduate Job


AI-generated summary

  • You must be pursuing a PhD in a technical field and have research experience in vision and language. Strong coding skills and publications in top-tier venues are required. You should be graduating in December 2024 or later and be able to both collaborate and work independently.
  • You will conduct cutting-edge research in computer vision and natural language processing, publish research results, help build the team's brand in the research community, transfer results to product applications, and explore new product ideas with CV/NLP at their core.


Software Engineering, Seattle


  • Our team's mission is to empower content understanding and creation using CV/NLP-related technologies. We focus on cutting-edge R&D in areas such as multi-modal understanding, vision and language, foundation models, and audio/music understanding and generation, with an emphasis on content creation. The team is a mix of experienced research scientists and research engineers, aiming to push the research boundaries in multi-modality and to apply our research results to improve the experience of TikTok users.


  • Currently pursuing a PhD in Software Development, Computer Science, Computer Engineering, or a related technical discipline.
  • Research experience in multi-modal understanding and vision and language, such as video captioning, visual question answering (VQA), text-to-video retrieval, audio/music understanding and generation, and other related topics.
  • Publications in top-tier venues such as CVPR, ECCV, ICCV, NeurIPS, ICLR, ICML, EMNLP, ACL, and COLING.
  • Highly competent in algorithms and programming; strong coding skills in Python and popular deep learning frameworks.

Preferred Qualifications

  • Graduating in December 2024 or later, with the intent to return to the degree program after completing the internship.
  • Work and collaborate well with team members.
  • Ability to work independently; strong communication skills.

Education requirements

Currently Studying

Area of Responsibilities

Software Engineering


  • Conduct cutting-edge research and development in computer vision and natural language processing, especially in multi-modality and vision and language.
  • Publish our latest research results, and help to build our brand in the research community.
  • Transfer our research results to product applications, and explore new product ideas with CV/NLP at their core.


Work type

Full time

Work mode