Research Intern - Reinforcement Learning

Huawei


Aug 15

🚀 Off-cycle Internship


⌛ Closed
Applications are closed

Research & Development, Software Engineering
London


Huawei’s vision is a fully connected, intelligent world.

  • To achieve this, we work to inspire passion for basic research around the world. Our combined passion drives development across the global innovation value chain. Huawei has the largest Research and Development organization in the world with 96,000+ employees in research centres around the globe. In the UK, we already have design centres in Cambridge, London, Edinburgh, Ipswich and Bristol. We continue to explore and define new research directions and new services. We have expanded our collaborations with academic researchers; researched new network architectures, integration of communications and key enabling technologies; and developed the fundamental theories of these technologies. We invite you to join us on this exciting journey and drive your career forward.


Responsibilities

  • Develop high-impact research output in the fields of reinforcement learning, Bayesian optimisation, game theory, multi-agent learning, probabilistic modelling, and/or risk-averse learning
  • Write research-level code capable of testing novel ideas and new approaches
  • Effectively communicate research findings to the team and to the broader community through journal and conference publications
  • Aid in progressing research fields by open-sourcing code

Requirements

  • PhD in Computer Science or a related field
  • Strong research background demonstrated through journal and conference submissions in any of the following: ICML, NeurIPS, AISTATS, AAAI, UAI, IJCAI, JMLR, Annals of Statistics, and Annals of Probability
  • Hands-on experience in implementing reinforcement learning, Bayesian optimisation, probabilistic modelling, and/or multi-agent algorithms
  • Knowledge of Python, specifically PyTorch or TensorFlow, in addition to OpenAI Gym, GPflow, and Pyro, among others
  • Ability to work in a diverse interdisciplinary team of researchers and engineers with different backgrounds

Areas of Responsibility

Research & Development
Software Engineering


  • Huawei Technologies Research and Development in London, UK is seeking exceptional candidates to pursue research in various aspects of reinforcement learning and Bayesian optimisation for autonomous decision making under uncertainty. The successful applicant is expected to develop novel contributions that move the field a step closer to real-world applications. Key research questions include, but are not limited to, scalable high-dimensional Bayesian optimisation, Gaussian processes, Bayesian neural networks, safe and robust reinforcement learning, multi-agent reinforcement learning, and model-based reinforcement learning. This is an exceptional opportunity to conduct research while collaborating with a diverse team whose backgrounds range from mathematics and optimisation to probability and game theory. These positions involve theoretical advances and aim to apply innovations in the real world, e.g., in self-driving scenarios, 5G networks, and chip design challenges, among many others. We aim to publish our work in top-tier conferences and journals, including but not limited to ICML, ICLR, NeurIPS, and JMLR. Our goal is to contribute to an open and transparent research environment by regularly open-sourcing the code behind our novel discoveries.


Work type

Full time

Benefits

  • 33 days of annual leave (including UK public holidays)
  • Group Personal Pension
  • Life insurance
  • Private medical insurance
  • Medical expense claim scheme
  • Employee Assistance Program
  • Cycle to work scheme
  • Company sports club and social events
  • Corporate retail discounts
  • Flexible working
  • Additional time off for learning and development