r/LocalLLaMA

DeepSeek V4 will be released next week and will have image and video generation capabilities, according to the Financial Times


title: DeepSeek V4 will be released next week and will have image and video generation capabilities, according to the Financial Times
author: u/Nunki08
contenttype: redditpost
publication: r/LocalLLaMA
published: 2026-02-28T11:25:49+00:00
sourceurl: https://www.reddit.com/r/LocalLLaMA/comments/1rh095c/deepseekv4willbereleasednextweekand_will/

Financial Times: DeepSeek to release long-awaited AI model in new challenge to US rivals (paywall): https://www.ft.com/content/e3366881-0622-40a7-9c34-a0d82e3d573e

Link: https://i.redd.it/kwyym79lz7mg1.jpeg

Score: 652 | Comments: 110 | Subreddit: r/LocalLLaMA


Top Comments

u/dampflokfreund (157 pts):
Generation!? Surely they mean video/image input, right?

It would be immensely cool to have an omni-modal model that can do everything, though. That would be real innovation.

u/FewPainter5588 (195 pts):
It's more likely they mean the model will be text-image to text.

u/nullmove (48 pts):
If you report next week every week, you will get it right at some point. I believe in you.

u/Kirigaya_Mitsuru (16 pts):
This Next Week really never ends...

u/RobertLigthart (15 pts):
Everyone's been saying V4 is coming for months now lol. But if it actually ships with native image gen and not just routing to a separate model... that's huge for open source. The closed labs have been gatekeeping multimodal generation for way too long.

u/NoAfternoon4260 (52 pts):
For months everybody has been saying that V4 is just around the corner... IMHO they'll wait to digest the Opus 4.6 moment.

u/HeftyAeon (13 pts):
I'd just be happy if it uses Engram and we can offload a good part of the model to disk with no inference speed cost.

u/pmttyji (17 pts):
Hope this release shakes the market like last time. I'm just expecting a small drop in GPU prices, at least for a short time.

u/yogthos (6 pts):
I'm hoping its agentic coding capability will match Claude.

u/bakawolf123 (7 pts):
Opus and GPT on life watch?
I mean, GLM-5 is already strong enough competition, and the research prep for DeepSeek V4 was quite significant. Some technical breakthrough is very possible, which would put it at least uncomfortably close to current SOTA.
That would be a very stark contrast to Dario Amodei's words just a few months ago that scaling is still the only thing you need, plus some minor architecture tweaks here and there.

u/Technical-Earth-3254 (6 pts):
Let's see if it stays oss then.

u/Ok-Adhesiveness-4141 (11 pts):
Hope this release causes Nvidia, Anthropic & OpenAI stocks to crash.

u/lacerating_aura (8 pts):
This would be a real double-edged sword. IF it is to be believed that their model will be an omni, it'll be nearly impossible for the community in general to make finetunes of it, which is a BIG part of the image/video gen community. There are many reasons for fine-tuning and LoRA creation, and a trillion-plus-parameter model will make that practically impossible. Although, because it will be trained on multimodal data, the general intelligence of the model would probably be better. I really hope it's a multimodal ingestion model for now and not a fully omni one.