TRT-LLM Jobs
TensorRT-LLM (TRT-LLM) is NVIDIA's open-source library for optimizing large language model (LLM) inference on NVIDIA GPUs. It builds on TensorRT and uses GPU acceleration to deliver fast, efficient inference.

3d
Santa Clara, California, United States
- Full Time

3d
Germany
+ 4 other locations
- Full Time
- Remote

4d
Shanghai, Shanghai, China
- Full Time

4d
Shanghai, Shanghai, China
- Full Time

9d
San Francisco, California, United States
- Full Time
Salary: $230,000 to $300,000 per year

10d
Munich, Bavaria, Germany
+ 1 other location
- Full Time

12d
Beijing, Beijing, China
+ 1 other location
- Full Time

4w
San Francisco, California, United States

1m
San Francisco, California, United States
- Full Time
Salary: $190,000 to $250,000 per year

1m
San Francisco, California, United States
- Full Time
Salary: $190,000 to $250,000 per year