Small LLMs are quite fast these days, even the more compute-heavy multimodal ones. The same goes for small models used specifically to filter diffusion output.