Comment on Large Language Model Performance Doubles Every 7 Months
vrighter@discuss.tchncs.de 1 week ago
In yes/no type questions, a 50% success rate is the absolute worst one can do. Any worse and you're just giving the inverted correct answer more than half the time, so flipping every answer would put you back above 50%.
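
To make the point concrete, here is a rough simulation sketch. The function name `simulate_accuracy`, the 30% figure, and the assumption that "yes" and "no" are equally likely are all made up for illustration; nothing here comes from the linked article.

```python
import random

def simulate_accuracy(p_correct, n_questions=100_000, seed=0):
    """Simulate a yes/no answerer that is right with probability p_correct.

    Returns (accuracy, accuracy_if_every_answer_is_flipped).
    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)
    correct = 0
    inverted_correct = 0
    for _ in range(n_questions):
        truth = rng.random() < 0.5  # assumed: yes and no equally likely
        # Give the right answer with probability p_correct, else the wrong one.
        answer = truth if rng.random() < p_correct else (not truth)
        correct += (answer == truth)
        inverted_correct += ((not answer) == truth)
    return correct / n_questions, inverted_correct / n_questions

# An answerer that is right only about 30% of the time...
acc, inv_acc = simulate_accuracy(0.3)
# ...becomes right about 70% of the time once you flip every answer.
print(f"as given: {acc:.3f}, flipped: {inv_acc:.3f}")
```

On a binary question the flipped answer is right exactly when the original is wrong, so the two accuracies always sum to 1, and the better of the two can never drop below 50%.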