The article says “sometimes provide less-accurate and less-truthful responses to users who have lower English proficiency”. This is what I was commenting on.
What if you ask the exact same question with the same wording, but share beforehand that you don’t have a PhD or are from Iran? Because that’s what the study looked at.
tias@discuss.tchncs.de 7 hours ago
inconel@lemmy.ca 7 hours ago
Actually quote is below (emphasis mine):
For this research, the team tested how the three LLMs responded to questions from two datasets: TruthfulQA and SciQ. TruthfulQA is designed to measure a model’s truthfulness (by relying on common misconceptions and literal truths about the real world), while SciQ contains science exam questions testing factual accuracy. The researchers prepended short user biographies to each question, varying three traits: education level, English proficiency, and country of origin.
melfie@lemy.lol 6 hours ago
I wonder if the bios are written in the style of the less educated, non-native speaker. Ring described, because I can imagine that would have an effect.
fiat_lux@lemmy.world 5 hours ago
They were also somewhat offensive, being generated by llm - because the researchers somehow could not find real non-PhD examples to draw from (wtf?). Here’s one:
Ignoring the bad tropes that would be suited to a bad 1930s movie, and that I’ve never heard a Russian speak like that, or any ESL speaker for that matter… GPT-4 leaned on the word “fancy” for the “less educated” bios for some reason.
It definitely affected the style of the output, but here’s a question and answer response for the less educated Russian persona:
The cherry on top is that it was provided this line in the system prompt:
Which just raises further questions about the response to what was supposed a multiple choice selection task.
melfie@lemy.lol 4 hours ago
Wow, that’s really patronizing.
Passerby6497@lemmy.world 5 hours ago
They are, but the effect is absolutely fucking stupid and makes no goddamned sense outside of just being as asshole (were this a person, so the makers in this case are assholes
Bio:
Question:
Answer: