Compete in HackAPrompt 2.0, the world's largest AI Red-Teaming competition!
Last updated on November 12, 2024
Reinforcement Learning from Human Feedback is a method for fine tuning LLMs according to human preference data.