Logo of Huzzle


Research Intern - Azure AI: Visual Document Intelligence

Logo of Microsoft


1mo ago

🚀 Off-cycle Internship


AI generated summary

  • You must be a PhD student in computer vision or NLP with 1 year of deep learning experience, proficiency in Python/C++, and published works in top conferences/journals. Willingness to relocate for internship.
  • You will collaborate with researchers, present findings, and contribute to exciting research and development projects in Azure AI: Visual Document Intelligence as a Research Intern at Microsoft.

Off-cycle Internship



  • The mission of the Cognitive Services Visual Document Intelligence team in the Azure AI organization is to make fundamental contributions to advancing the state-of-the-art in artificial intelligence (AI) technology related to document understanding. “Documents” stand at the intersection between natural language processing (NLP) and computer vision (CV) research. To fully understand a document, one needs to borrow from both language and visual (Layout) elements of the document. We explore both these research areas – and their synergy - to conduct research on forward-looking topics such as key-value extraction, few-shot named entity recognition (NER), hierarchical layout analysis, and many others. 
  • We are looking for passionate Research Interns to work on cutting edge research in Document Understanding. We are particularly interested in research interns with a background in AI, NLP, and/or CV, including topics like layout analysis, chart understanding, multi-page multi-document question answering, novel ways of leveraging large language models for document understanding and solving problems inherent to large language models (grounding, retrieval-based generation, etc.). Familiarity with modern large language models is a plus, but not required. 
  • In this position, you will conduct both fundamental and applied research under the supervision of Microsoft mentors. As a Research Intern, you will have the opportunity to prototype, demonstrate and publish your results. 


  • Required Qualifications:
  • Currently enrolled in a PhD program in computer vision, natural language processing, deep learning, machine learning, AI, or a related field.
  • At least one year of experience in natural language processing, deep learning, or multimodal research with hands-on deep learning experience.
  • Other Requirements:
  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications above, you’ll need to submit a minimum of two reference letters for this position. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter.
  • Preferred Qualifications:
  • Proficient algorithmic problem solving and software development skills (Python, C/C++, etc.).
  • Experience with open-source tools such as PyTorch, etc.
  • Publication(s) in top-tier conferences or journals in related fields (e.g., ACL, CVPR, ECCV, ICCV, EMNLP, NAACL, NIPS, ICML, ICLR, IJCV, PAMI, etc.).

Education requirements

Currently Studying

Area of Responsibilities



  • Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.


Work type

Full time

Work mode