Comment

Small LLMs are quite fast these days, even the more compute heavy multimodal ones. Same with small models explicitly used to filter diffusion output.

Sort:hotnew top