Logo of Huzzle


Research Intern - Azure AI: Speech & Audio

Logo of Microsoft


2mo ago

🚀 Off-cycle Internship


AI generated summary

  • You must be a PhD student with 1+ years of speech & audio research experience, proficient in deep learning, AI, and have published work in related fields. Physically located at Microsoft during the internship. Need 2 reference letters. Good communication skills and experience with PyTorch, TensorFlow preferred.
  • You will collaborate with top researchers, present findings, and contribute to cutting-edge AI research in Azure Speech & Audio during a 12-week internship at Microsoft.

Off-cycle Internship

Data, EngineeringRedmond


  • Speech and audio AI technology is one of the key drivers for advancing natural user interfaces with natural spoken language. Our team is on a mission to develop the core speech technologies that empower millions of users to achieve more, such as the speech recognition platform that powers Microsoft 365 Copilot and Teams live captioning. Our team brings together talent in the areas of signal processing, speech recognition, speech translation, speaker recognition, and speech and audio generation to develop and deliver robust, natural and scalable speech experience across a rich set of scenarios and languages. 
  • We are seeking Research Interns to contribute to pioneering research in speech and audio. This includes areas such as end-to-end speech recognition and translation, advanced speaker recognition and diarization, as well as speech and audio generative models. As a research intern, you will have the opportunity to conduct both fundamental and applied research in collaboration with our researchers and scientists. 


  • Required Qualifications:
  • Currently enrolled in a PhD program in speech recognition, speech translation, separation, and enhancement, speaker recognition and diarization, speech and audio generation, audio processing, deep learning, machine learning, artificial intelligence (AI) or a related field.
  • At least one year of research experience in speech recognition, speech translation, separation, and enhancement, speaker recognition, speaker diarization, speech and audio generation, audio processing, deep learning, machine learning, AI or a related field.
  • Other Requirements:
  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications above, you’ll need to submit a minimum of two reference letters for this position. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter.
  • Preferred Qualifications:
  • Experience with open-source tools such as PyTorch, TensorFlow, etc.
  • Publication(s) in top-tier conferences or journals in related fields (e.g., ICASSP, Interspeech, ASRU, SLT, IEEE/ACM Transactions on Audio, Speech and Language Processing, Speech Communication, Computer Speech and Language, etc.).
  • Effective communication and writing skills.

Education requirements

Currently Studying

Area of Responsibilities



  • Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.


Work type

Full time

Work mode