HiggsBroson

@HiggsBroson@lemmy.world

Comment on "Bill Gates feels Generative AI has plateaued, says GPT-5 will not be any better", 1 year ago:
You can fine-tune LLMs on smaller datasets, or with RLHF (reinforcement learning from human feedback), in which people rate responses and the model is “rewarded” or “penalized” according to the rating each output receives. This retrains the LLM toward outputs that people prefer.
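To make the reward/penalty idea concrete, here is a toy sketch, not how any production system actually does it. It assumes the "model" is just a softmax policy over three canned responses and that each response has one fixed human rating. Real RLHF usually trains a separate reward model from the ratings and updates the full LLM with something like PPO; this swaps in a plain REINFORCE update on a bandit. The names `train` and `ratings` and all the numbers are made up for illustration.

```python
import math
import random


def softmax(logits):
    """Turn raw scores into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def train(ratings, steps=2000, lr=0.1, seed=0):
    """Toy REINFORCE loop: sample a response, then nudge the policy by its rating.

    ratings[i] is the (hypothetical) human rating for canned response i.
    """
    rng = random.Random(seed)
    logits = [0.0] * len(ratings)  # start with a uniform policy
    baseline = sum(ratings) / len(ratings)  # mean rating, used to reduce variance
    for _ in range(steps):
        probs = softmax(logits)
        # The "model" produces an output by sampling from its current policy.
        i = rng.choices(range(len(ratings)), weights=probs)[0]
        # Positive advantage acts as a reward, negative as a penalty.
        advantage = ratings[i] - baseline
        for j in range(len(logits)):
            # Gradient of log pi(i) with respect to logit j for a softmax policy.
            grad = (1.0 if j == i else 0.0) - probs[j]
            logits[j] += lr * advantage * grad
    return softmax(logits)


# Hypothetical ratings: response 0 is liked, 1 is disliked, 2 is lukewarm.
ratings = [1.0, -1.0, 0.2]
probs = train(ratings)
print([round(p, 3) for p in probs])
```

After training, most of the probability mass should sit on the best-rated response, which is the "produce outputs that people prefer" part in miniature.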