ChatGPT 2 was literally an Excel spreadsheet.
I guesstimate that it’s effectively a supermassive autocomplete algo that uses some TOTP-like factor to help it produce “unique” output every time.
And they’re running into issues because they’re ingesting more and more AI-generated data.
Get your popcorn out! 🍿
kromem@lemmy.world 18 hours ago
It very much isn’t and that’s extremely technically wrong on many, many levels.
Yet it’s still one of the more highly upvoted comments here.
Which says a lot.
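For anyone wondering where the "unique output every time" actually comes from: as far as I know it is ordinary random sampling from the model's predicted next-token distribution, usually scaled by a temperature setting, and nothing like a TOTP. A minimal sketch in plain Python, where the function names and the toy logits are made up for illustration and not taken from any real model or library:

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; lower temperature sharpens them."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_next_token(logits, temperature=1.0, rng=random):
    """Pick one token index at random, weighted by its probability."""
    probs = softmax(logits, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


# Hypothetical scores for a 3-token vocabulary.
toy_logits = [1.0, 5.0, 2.0]
rng = random.Random(0)
picks = [sample_next_token(toy_logits, temperature=1.0, rng=rng) for _ in range(10)]
print(picks)  # mostly index 1, with occasional other picks
```

With a temperature close to zero the same code picks the top-scoring token almost every time, which is why low-temperature output looks nearly deterministic.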
elbarto777@lemmy.world 4 hours ago
I’ll be pedantic, but yeah. It’s all transistors all the way down, and transistors are pretty much chained if/then switches.
Blue_Morpho@lemmy.world 8 hours ago
Given that the weights in a model are transformed into a set of conditional if statements (GPU or CPU JMP machine code), he’s not technically wrong. (Of course, it’s more than just JMP and JMP represents the entire class of jump commands like JE and JZ. Something needs to act on the results of the TMULs.)
tmpod@lemmy.pt 5 hours ago
That is not really true. Yes, there are jump instructions being executed when you run inference on a model, but they are in no way related to the model itself. There’s no translation of weights to jumps in transformers and the underlying attention mechanisms.
I suggest reading …wikipedia.org/…/Transformer_(deep_learning_archi…
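To make that concrete, here is a rough sketch of single-head scaled dot-product attention in plain Python. It is a simplified illustration, not how any real framework implements it, and the function names and tiny matrices are my own assumptions. The point is that the weights only ever show up as operands of multiply-adds; the loops depend on the matrix shapes, not on the weight values.

```python
import math


def matmul(a, b):
    """Plain matrix multiply: every entry is a sum of products."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax_rows(m):
    """Row-wise softmax; max() is only used for numerical stability."""
    out = []
    for row in m:
        peak = max(row)
        exps = [math.exp(v - peak) for v in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out


def attention(x, w_q, w_k, w_v):
    """softmax(Q K^T / sqrt(d_k)) V, with Q, K, V projected from x."""
    q = matmul(x, w_q)  # weights are multiplied in, never branched on
    k = matmul(x, w_k)
    v = matmul(x, w_v)
    d_k = len(k[0])
    k_t = [list(col) for col in zip(*k)]
    scores = [[s / math.sqrt(d_k) for s in row] for row in matmul(q, k_t)]
    return matmul(softmax_rows(scores), v)


# Toy input: 2 tokens with 2 features each, identity weights.
identity = [[1.0, 0.0], [0.0, 1.0]]
print(attention(identity, identity, identity, identity))
```

Swapping in different weight values changes the numbers that come out, but the sequence of operations stays exactly the same.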
Blue_Morpho@lemmy.world 4 hours ago
The model is data. It needs to be operated on to get information out. That means lots of JMPs.
If someone said viewing a gif is just a bunch of if-else’s, that’s also true. That the data in the gif isn’t itself a bunch of if-else’s isn’t relevant.
Executing LLMs is particularly JMP heavy. It’s why you need massive, fast RAM, because caching doesn’t help them.
Hotzilla@sopuli.xyz 7 hours ago
Calling these new LLMs just if statements is quite an oversimplification. They are technically something that has not existed before, and they enable use cases that were previously impossible to implement.
This is far from general intelligence, but there are now solutions to a few coding problems that were nearly impossible 5 years ago.
5 years ago I would have laughed in your face if you had asked me to write code that summarizes a description entered by a user.
JcbAzPx@lemmy.world 4 hours ago
I think the point is that this is not the path to general intelligence. This is more like cheating on the Turing test.