TRT-LLM Jobs
TensorRT-LLM (TRT-LLM) is NVIDIA's open-source library for optimizing large language model (LLM) inference on NVIDIA GPUs. It builds on TensorRT and uses GPU acceleration to deliver fast, efficient inference.
2d
Santa Clara, California, United States
- Full Time
6d
Santa Clara, California, United States
- Full Time
7d
Santa Clara, California, United States
+ 2 other locations
- Full Time
10d
Warsaw, Masovian Voivodeship, Poland
- Full Time
2w
Santa Clara, California, United States
- Full Time
2w
Beijing, Beijing, China
- Full Time
1m
Santa Clara, California, United States
- Full Time
1m
Shanghai, Shanghai, China
- Full Time
1m
Santa Clara, California, United States
- Full Time
1m
United Kingdom
+ 4 other locations
- Full Time
- Remote