Comment on Father sues Google, claiming Gemini chatbot drove son into fatal delusion
calamitycastle@lemmy.world 2 months agoWhat is an rlhf data set?
Comment on Father sues Google, claiming Gemini chatbot drove son into fatal delusion
calamitycastle@lemmy.world 2 months agoWhat is an rlhf data set?
wonderingwanderer@sopuli.xyz 2 months ago
Reinforcement Learning from Human Feedback
It’s a method of fine-tuning and aligning LLMs which requires active human input