vLLM Jobs
A library designed for accelerated AI model inference utilizing advanced techniques for efficiency. It enhances large language models' performance on modern hardware architectures.
A library designed for accelerated AI model inference utilizing advanced techniques for efficiency. It enhances large language models' performance on modern hardware architectures.
2d
San Sebastián, Autonomous Community of the Basque Country, Spain
- Full Time
2d
Chennai, Tamil Nadu, India
- Full Time
2d
San Diego, California, United States
- Full Time
2d
Markham, Ontario, Canada
- Full Time
3d
Cupertino, California, United States
3d
- Remote
3d
London, England, United Kingdom
3d
Toronto, Ontario, Canada
- Full Time
- Remote
3d
County Waterford, Munster, Ireland
- Full Time
3d
Beijing, Beijing, China
- Full Time
- Remote