Logo of Huzzle

Solutions Architect, Large Language Model Inference

image

NVIDIA

Aug 7

  • Job
    Full-time
    Senior Level
  • Munich
  • Quick Apply

AI generated summary

  • You need strong communication skills, an MS/PhD in a relevant field, 5+ years in software development, NLP expertise, knowledge of ML libraries, and the ability to multitask in a dynamic environment.
  • You will engage with customers to understand their needs, develop and showcase NLP/LLM solutions, optimize performance on GPU systems, and collaborate with various teams for effective implementations.

Requirements

  • What We Need to See:
  • Excellent verbal, written communication, and technical presentation skills in English
  • MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering fields
  • 5+ years work or research experience with Python/ C++ / other software development and Capable of working in a constantly evolving environment without losing focus.
  • A consistent record of academic and/or industry experience in fields related to machine learning, deep learning and/or data science.
  • Work experience and knowledge of modern NLP including good understanding of transformer architectures including prompt learning and adapter tuning techniques (e.g. IA3 or LORA). Understanding of model alignment approaches.
  • Understanding of key libraries used for NLP/LLM training (NeMo Framework, DeepSpeed etc.) and inference (e.g. TRT-LLM, Triton Inference Server, HF Optimum).
  • You are excited to work with multiple levels and teams across organisations (Engineering, Product, Sales and Marketing team)
  • Ability to multitask in a fast-paced environment and Driven with strong analytical and problem-solving skills.
  • Strong time-management and organisation skills for coordinating multiple initiatives, priorities and implementations of new technology and products into very sophisticated projects
  • You are a self-starter with demeanour for growth, passion for continuous learning and sharing findings across the team

Responsibilities

  • Work directly with key customers to understand their technology and provide the best solutions
  • Develop and demonstrate solutions based on NVIDIA’s and open source NLP and LLM technology
  • Perform in-depth analysis and optimisation to ensure the best performance on GPU based systems. This includes both training and inference NLP/LLM pipelines.
  • Partner with Engineering, Product and Sales teams to develop, plan best suitable solutions for customers. Enable development and growth of product features through customer feedback and proof-of-concept evaluations
  • Build industry expertise and become a contributor in integrating NVIDIA technology into Enterprise Computing architectures.
  • Work closely with customer's data science and IT teams

FAQs

What is the primary focus of the Solutions Architect role for Large Language Models at NVIDIA?

The primary focus of the Solutions Architect role is to work with neural Natural Language Processing (NLP), transformer architectures, and Large Language Model (LLM) workflows, particularly on inferencing technology such as model compression, model compilation, and model serving.

What types of customers will the Solutions Architect interact with?

The Solutions Architect will interact with a range of customers including developers, scientific researchers, data scientists, IT managers, and senior leaders to promote adoption of Large Language Models and streamline their deployment to production.

What qualifications are preferred for this position?

Preferred qualifications include an MS/PhD or equivalent experience in fields like Computer Science, Data Science, Electrical/Computer Engineering, Physics, or Mathematics, along with 5+ years of work or research experience in related areas.

What specific technologies and tools should a candidate be familiar with?

A candidate should be familiar with modern NLP techniques, transformer architectures, key libraries used for NLP/LLM training and inference (e.g., NeMo Framework, DeepSpeed, TRT-LLM, Triton Inference Server, HF Optimum), and DevOps technologies like Docker and Kubernetes.

What skills are required for effective communication in this role?

Excellent verbal, written communication, and technical presentation skills in English are required to effectively engage with customers and internal teams.

Is experience in Machine Learning and Deep Learning essential for this position?

Yes, a consistent record of academic and/or industry experience in Machine Learning, Deep Learning, and Data Science is essential for this position.

What personal attributes are important for a candidate applying for this role?

Important personal attributes include being self-motivated, passionate about continuous learning, capable of multitasking, possessing strong analytical and problem-solving skills, and having good time-management and organizational abilities.

Is prior experience in deploying NLP technology to production valued?

Yes, prior experience applying NLP technology and its deployment to production is valued and will help distinguish candidates in the application process.

What does NVIDIA value in its workplace culture?

NVIDIA values diversity and is committed to being an equal opportunity employer, ensuring a discrimination-free environment for all employees.

Will the Solutions Architect work with cross-functional teams?

Yes, the Solutions Architect will work closely with Engineering, Product, Sales, and Marketing teams to develop and implement solutions tailored to customer needs.

Manufacturing & Electronics
Industry
10,001+
Employees
1993
Founded Year

Mission & Purpose

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.