A simple web search is going to hit their massive distributed DB to return answers in subsecond time.
It’s going to hit an index, not the actual data, and it’s going to return approximate rather than exact results. Tons of engineering has been done around basic search precisely to get more data locality.
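To make the "index, not the actual data" point concrete, here is a toy inverted index. It is only a sketch of the general idea, not how any real search engine is built. The names `build_index` and `search` and the sample documents are made up for illustration.

```python
from collections import defaultdict


def build_index(docs):
    """Map each lowercase term to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents containing every query term (no ranking)."""
    terms = query.lower().split()
    if not terms:
        return set()
    # Start from the first term's posting set and intersect with the rest.
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


# Hypothetical documents, purely for illustration.
docs = {
    1: "cheap flights to berlin",
    2: "berlin weather today",
    3: "cheap hotels in paris",
}
index = build_index(docs)
print(search(index, "cheap berlin"))  # {1}
```

The query never rescans the document text: it only intersects small precomputed sets, which is the kind of locality the engineering effort is after.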
I read a blog post at some point (please don’t ask me where) about Bing vs. Google from when Bing started using ChatGPT, and it basically boiled down to “Google has the tech to do it, they don’t roll it out because they don’t want to eat the electricity bill. This is MS spending money to get market share.” The cost difference between serving a search and having ChatGPT answer a question was something like 10x. It might not stay that way forever, though, what with models being beaten down to work in ternary and such. That’s not just massive quantisation but also much easier maths: convolutions don’t need much maths when all you deal with is -1, 0 and 1. IIRC you can throw out the multiplication unit and work with nothing but adds and subtracts.
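A rough sketch of why trinary (ternary) weights make the maths cheaper: when every weight is -1, 0 or 1, each multiply in a dot product reduces to an add, a subtract or a skip. The function name `ternary_dot` and the sample values are assumptions for illustration, not taken from any particular model or library.

```python
def ternary_dot(weights, activations):
    """Dot product for weights in {-1, 0, 1} without any multiplication."""
    acc = 0
    for w, x in zip(weights, activations):
        if w == 1:
            acc += x      # multiplying by +1 is just an add
        elif w == -1:
            acc -= x      # multiplying by -1 is just a subtract
        # w == 0 contributes nothing, so the term is skipped
    return acc


# Hypothetical values, purely for illustration.
weights = [1, 0, -1, 1]
activations = [3, 7, 2, 5]

# Same result as the ordinary multiply-and-sum version.
assert ternary_dot(weights, activations) == sum(
    w * x for w, x in zip(weights, activations)
)
print(ternary_dot(weights, activations))  # 6
```

In hardware, the same idea would mean replacing multipliers with much simpler adder logic, which is where the hoped-for energy savings come from.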
sheogorath@lemmy.world 9 months ago
Based on this article, it seems that on average an LLM query costs about 10x as much as a search engine query.