TRT-LLM Jobs
A large language model (LLM) specifically designed and optimized for use with NVIDIA's TensorRT, enhancing inference performance. It leverages GPU acceleration to deliver fast and efficient natural language processing capabilities.
A large language model (LLM) specifically designed and optimized for use with NVIDIA's TensorRT, enhancing inference performance. It leverages GPU acceleration to deliver fast and efficient natural language processing capabilities.

8d
San Francisco, California, United States

2w
San Mateo, California, United States
+ 1 other location
2w
Shanghai, Shanghai, China
- Full Time

2w
Shanghai, Shanghai, China
- Full Time

1m
Germany
+ 4 other locations- Full Time
- Remote

1m
United States
- Full Time
- Remote

2m
Santa Clara, California, United States
- Full Time

2m
Santa Clara, California, United States
- Full Time

2m
Santa Clara, California, United States
- Full Time

2m
Santa Clara, California, United States
- Full Time