RLHF Jobs
A methodology for aligning language models with human intents and preferences via reinforcement learning and human feedback. Enhances AI systems by refining their outputs based on user guidance and supervision.
A methodology for aligning language models with human intents and preferences via reinforcement learning and human feedback. Enhances AI systems by refining their outputs based on user guidance and supervision.
1d
San Francisco, California, United States
1d
- Full Time
- Full Time
- Experienced
1d
Barcelona, Catalonia, Spain
+ 1 other location- Contractor
1d
San Francisco, California, United States
+ 4 other locations- Full Time
2d
Mountain View, California, United States
+ 1 other location2d
Washington, District of Columbia, United States
2d
San Francisco, California, United States
- Full Time Contract
2d
France
- Full Time
3d
Mountain View, California, United States
- Contractor
3d
Mountain View, California, United States
- Contractor