TRT-LLM Jobs
A large language model (LLM) specifically designed and optimized for use with NVIDIA's TensorRT, enhancing inference performance. It leverages GPU acceleration to deliver fast and efficient natural language processing capabilities.
A large language model (LLM) specifically designed and optimized for use with NVIDIA's TensorRT, enhancing inference performance. It leverages GPU acceleration to deliver fast and efficient natural language processing capabilities.
1d
Santa Clara, California, United States
- Full Time
1d
Germany
+ 4 other locations- Full Time
- Remote
2d
Santa Clara, California, United States
- Full Time
- Remote
2d
United States
- Full Time
- Remote
3w
Santa Clara, California, United States
- Full Time
3w
Santa Clara, California, United States
- Full Time
3w
Santa Clara, California, United States
- Full Time
4w
Santa Clara, California, United States
- Full Time
1m
Santa Clara, California, United States
- Full Time
1m
Santa Clara, California, United States
- Full Time