vLLM Jobs
A library designed for accelerated AI model inference utilizing advanced techniques for efficiency. It enhances large language models' performance on modern hardware architectures.
A library designed for accelerated AI model inference utilizing advanced techniques for efficiency. It enhances large language models' performance on modern hardware architectures.
Today
Chennai, Tamil Nādu, India
1d
Chennai, Tamil Nādu, India
1d
Bangalore, Karnātaka, India
2d
Cupertino, California, United States
2d
China
- Full Time
2d
Toronto, Ontario, Canada
- Full Time
2d
Shanghai, Shanghai, China
- Full Time
2d
McLean, Virginia, United States
+ 2 other locations- Full Time
3d
Sunnyvale, California, United States
3d
Markham, Ontario, Canada
- Full Time