vLLM Jobs
A library for fast, memory-efficient inference and serving of large language models. It improves throughput on modern hardware such as GPUs through techniques like PagedAttention and continuous batching.

Today
Shanghai, Shanghai, China
- Full Time

Today
Shanghai, Shanghai, China
- Full Time

Today
Shanghai, Shanghai, China
- Full Time

Today
India
- Full Time

1d
Palo Alto, California, United States

1d
Norrtälje kommun, Sweden
+ 1 other location
- Full Time

1d
Bengaluru, Karnataka, India
- Full Time

1d
Bangalore, Karnataka, India
- Full Time

1d
Bochum, North Rhine-Westphalia, Germany
+ 1 other location
- Full Time

1d
Madrid, Community of Madrid, Spain