Instead of the traditional open models (like llama, qwen, gemma…) that are only open weight, this model says that it has :
Fully open-source release of model weights, training hyperparameters, datasets, and code
Making it different from other big tech “open” models. Tough it exists other “fully open” models like GPT neo, and more
frezik@midwest.social 1 year ago
The source code on these models is almost too boring to care about. Training data and weights is what really matters.