Comment on Microsoft Copilot falls Atari 2600 Video Chess

<- View Parent
webghost0101@sopuli.xyz ⁨5⁩ ⁨days⁩ ago

Tbf they don’t really claim that when you read the research, thats mostly media hype and ceo assholes spinning words.

Its good at lots specific tasks like rewriting emails and summarising gives text, short roleplay, boilerplate code. Some undiscovered uses.

Anthropic latest claims they would not hire their own ai because of how hard it failed at the test they give, They didnt do that expecting validation but to measure how far we are still off from ai doing meaningful full work.

source
Sort:hotnewtop