Comment on Is Meta's Superintelligence Overhaul a Sign Its AI Goals Are Struggling?
nymnympseudonym@lemmy.world 2 weeks ago
This particular coding leaderboard matches my own personal experience. Llama 4 is hitting ~15%; Claude Opus 4 ~70%. (I haven’t used the others personally.)