Comment on Father sues Google, claiming Gemini chatbot drove son into fatal delusion
calamitycastle@lemmy.world 2 weeks agoWhat is an rlhf data set?
Comment on Father sues Google, claiming Gemini chatbot drove son into fatal delusion
calamitycastle@lemmy.world 2 weeks agoWhat is an rlhf data set?
wonderingwanderer@sopuli.xyz 2 weeks ago
Reinforcement Learning from Human Feedback
It’s a method of fine-tuning and aligning LLMs which requires active human input