TRT-LLM Jobs

TensorRT-LLM (TRT-LLM) is NVIDIA's open-source library for optimizing large language model (LLM) inference on NVIDIA GPUs. It builds on TensorRT and uses GPU acceleration to deliver fast, efficient inference for natural language processing workloads.

Senior Solutions Architect - AI

NVIDIA

Toronto, Ontario, Canada

- Full Time

- Remote

Senior Software Test Development Engineer - NVLink Fusion

NVIDIA

Shanghai, Shanghai, China

- Full Time

Senior Software Test Development Engineer - Deep Learning

NVIDIA

Shanghai, Shanghai, China

- Full Time

11d

Senior Software Engineer, Deep Learning Inference Workflows

NVIDIA

Santa Clara, California, United States

- Full Time

Senior Solutions Architect, Gen AI

NVIDIA

Santa Clara, California, United States

+ 1 other location

- Full Time

- Remote

Deep Learning Solutions Architect – Large Scale Inference Optimization

NVIDIA

United Kingdom

+ 4 other locations

- Full Time

- Remote

Senior Deep Learning Software Engineer

NVIDIA

Santa Clara, California, United States

- Full Time

NVIDIA Solutions Architect Intern - AI/ML Specialist - 2025

NVIDIA

Hong Kong

- Full Time

Solutions Architect Intern, AI and ML - 2025

NVIDIA

Shanghai, Shanghai, China

+ 1 other location

- Full Time

Senior Applied AI Software Engineer, Distributed Inference Systems

NVIDIA

Boulder, Colorado, United States

+ 4 other locations

- Full Time