Yes i saw that benchmark and was honestly not surprised with the results. It seems that Anthropic really focused on those issues above and beyond what was done in other labs.
Yes i saw that benchmark and was honestly not surprised with the results. It seems that Anthropic really focused on those issues above and beyond what was done in other labs.
probably2high@lemmy.world 4 days ago
With its prior government contact, maybe anthropic was tuning it to ward against all the fucking dolts in decision-making roles.