RLHF Jobs
A methodology for aligning language models with human intents and preferences via reinforcement learning and human feedback. Enhances AI systems by refining their outputs based on user guidance and supervision.
A methodology for aligning language models with human intents and preferences via reinforcement learning and human feedback. Enhances AI systems by refining their outputs based on user guidance and supervision.
2d
Cupertino, California, United States
3d
India
- Other
4d
Pune, Mahārāshtra, India
- Full Time
4d
Palo Alto, California, United States
+ 1 other location- Full Time
4d
Pune, Mahārāshtra, India
- Full Time
5d
Paris, Île-de-France, France
- Full Time
6d
Sydney, New South Wales, Australia
- Contract
6d
Hyderābād, Telangāna, India
- Full Time
Salary: $25 to $40 per year
6d
McLean, Virginia, United States
+ 2 other locations- Full Time
6d
McLean, Virginia, United States
+ 2 other locations- Full Time