r/LocalLLaMA

are you ready for small Qwens?


title: are you ready for small Qwens?
author: u/jacek2023
contenttype: redditpost
publication: r/LocalLLaMA
published: 2026-02-28T11:02:10+00:00
sourceurl: https://www.reddit.com/r/LocalLLaMA/comments/1rgzul5/areyoureadyforsmallqwens/

word_count: 213

13-9=4

unsloth collection has been updated with 4 hidden items too ;)

Link: https://i.redd.it/bwc4xcf0w7mg1.png

Score: 408 | Comments: 171 | Subreddit: r/LocalLLaMA


Top Comments

u/FewPainter5588 (67 pts):
Probably a base model and the instruct trained model. So 2 sizes.

If I hazard to take a guess, the 9B model and a 4B model. So this way they have size equivalents of the Gemma 3 Models, 4B, 9B, 27B

u/Slow_Concentrate3831 (65 pts):
Please, a model between 14 and 20b for us, poor 16Gb vram users.

u/eidrag (119 pts):
^^qwen

u/ProfessionalSpend589 (16 pts):
Yep, ready!

I bought a new GPU with 32GB VRAM today and will hook it in a couple of hours.

I’ll finally be able to play with small dense models at decent speeds :)

u/dryadofelysium (11 pts):
I am horny for the 9B one.

u/meganoob1337 (10 pts):
hopefully they also provide a qwen3.5 based multimodal embedding model 🎉

u/hum_ma (11 pts):
It could be just quants for the 4 already released big models.

u/DrNavigat (9 pts):
Qwen is the replacement for the late Gemma.

u/LosEagle (13 pts):
I'm ready for any Qwens. They're all broken in ollama and there's like 10 open pull requests trying to make them work in the first place and I'm too stupid to compile llama.cpp.

u/chris_0611 (6 pts):
Only if they work as a draft model and provide speed-up for the bigger models.