Logo of Huzzle


AI/ML Vision & Language Research Intern

Logo of Sony


Jan 13

🚀 Off-cycle Internship

San Francisco +3

💻 Remote
⌛ Closed
Applications are closed

Off-cycle Internship

Software EngineeringSan Francisco, New York, Los Angeles, San Jose


  • Sony Corporation of America currently has an opening for AI/ML Vision & Language Research Intern in our R&D Center US Lab located in San Jose, CA for the 2024 Summer. In this role, you will have the opportunity to collaborate with a talented research team in multidisciplinary efforts to develop Sony’s future products and services. We’re inventing the technology and products that inspire customers all over the world, and soon you could be working to bring your ideas to life! This internship is a great opportunity to gain valuable experience in exciting entertainment industry applications as well as cutting-edge research experience in the field of advanced Computer Vision and Natura Language Processing, based on the AI and Machine Learning. If you’re up to the challenge, we’d love to see what you’ve got! – location is flexible


  • M.S., Ph.D. candidate, or Post-doc in Computer Science, Electrical Engineering, or a related field.
  • Research background in the area of AI/Machine Learning, Computer Vision, Natural Language Processing or related areas.
  • Strong familiarity with neural network modeling and analysis, formulate optimization flow using PyTorch, TensorFlow or equivalent deep learning framework.
  • Strong familiarity with Python and/or modern C++ for rapid algorithm prototyping.
  • Strong familiarity with camera geometry and geometric algebra
  • Excellent analytical and mathematical skills.
  • Knowledge and experiences in generative models
  • Knowledge and experiences in transformer models
  • Knowledge and experiences in large language models and fine tuning
  • Knowledge and experiences in vision & language multimodal models

Education requirements

Currently Studying

Area of Responsibilities

Software Engineering


  • Research and evaluate the performance of state-of-the-art computer vision and natural language processing with respect to latest deep learning algorithms.
  • Apply software engineering skills to prototype algorithms to identify challenges for our research.
  • Learn, collaborate, and network alongside world-class researchers and creators to investigate algorithms; present your findings and contribute to industry leading entertainment technology community.


Work type

Full time

Work mode



San Francisco, New York, Los Angeles, San Jose