TRT-LLM Jobs
A large language model (LLM) specifically designed and optimized for use with NVIDIA's TensorRT, enhancing inference performance. It leverages GPU acceleration to deliver fast and efficient natural language processing capabilities.
A large language model (LLM) specifically designed and optimized for use with NVIDIA's TensorRT, enhancing inference performance. It leverages GPU acceleration to deliver fast and efficient natural language processing capabilities.

2d
Belgium
+ 4 other locations- Full Time
- Remote

5d
Santa Clara, California, United States
- Full Time

5d
Santa Clara, California, United States
- Full Time

10d
San Mateo, California, United States
+ 1 other location
10d
Shanghai, Shanghai, China
- Full Time

10d
Shanghai, Shanghai, China
- Full Time

1m
Santa Clara, California, United States
- Full Time

1m
Germany
+ 4 other locations- Full Time
- Remote

1m
United States
- Full Time
- Remote

1m
Santa Clara, California, United States
- Full Time