网易首页 > 网易号 > 正文 申请入驻

DeepSeek Releases Open-Source Multimodal AI Model Janus-Pro, Surpassing DALL...

0
分享至

TMTPOST -- In the early hours of Tuesday, the AI community was abuzz as Hugging Face announced the release of DeepSeek's latest open-source multimodal AI model, Janus-Pro. Available in two configurations with 1 billion and 7 billion parameters, the model marks a significant leap in AI capabilities.

The Janus-Pro-7B model has outperformed OpenAI's DALL-E 3 and Stable Diffusion in benchmark tests such as GenEval and DPG-Bench, establishing its superiority in both image generation and understanding.

Janus-Pro integrates cutting-edge advancements in multimodal AI. The model's ability to process and understand images is powered by the innovative SigLIP-L architecture, while its image generation capabilities draw inspiration from LlamaGen. The model is offered in two sizes, with configurations at 1.5 billion and 7 billion parameters, catering to a range of computational needs.

This launch comes at a time when OpenAI's highly anticipated multimodal image-generation model, GPT-4o, remains unavailable to the public, adding to the excitement surrounding Janus-Pro's open-source debut.

DeepSeek has been at the forefront of multimodal generative AI research. The company launched its original Janus model in late 2024 as a unified framework for understanding and generating multimodal content. Built on DeepSeek-LLM-1.3b-base, Janus utilized a massive dataset of 500 billion text tokens for training. Its design decoupled visual encoding to optimize both understanding and generation tasks, employing advanced techniques like SigLIP-L for visual input and an innovative rectified flow for image generation.

This progress culminated in Janus-Pro, an enhanced self-regressive framework with significant architectural refinements. By decoupling visual encoding into independent pathways, Janus-Pro eliminates previous conflicts in understanding and generation tasks while maintaining a unified Transformer architecture. This modularity improves flexibility and task-specific performance.

Janus-Pro is built on DeepSeek-LLM-1.5b-base and DeepSeek-LLM-7b-base, trained using HAI-LLM, a high-performance distributed training framework on PyTorch. The training involved clusters of 16 to 32 nodes, each equipped with 8 Nvidia A100 GPUs, and required 7–14 days depending on the model size.

The complete Janus-Pro codebase is now available on GitHub: Janus GitHub Repository.

DeepSeek’s rapid advancements in multimodal AI may heighten competition with industry giants such as OpenAI, Meta, and Nvidia. However, the company has faced challenges, including recent large-scale cyberattacks on its online services. To mitigate these issues, DeepSeek has temporarily restricted new user registrations outside China, requiring international users to register using virtual numbers.

With Janus-Pro setting new standards for multimodal AI, the industry eagerly anticipates further developments, including potential advancements in text-to-image and text-to-video capabilities.

特别声明:以上内容(如有图片或视频亦包括在内)为自媒体平台“网易号”用户上传并发布,本平台仅提供信息存储服务。

Notice: The content above (including the pictures and videos if any) is uploaded and posted by a user of NetEase Hao, which is a social media platform and only provides information storage services.

相关推荐
热点推荐
孩子竟然是来报恩的?网友直呼:真是惊掉下巴!

孩子竟然是来报恩的?网友直呼:真是惊掉下巴!

特约前排观众
2026-04-03 12:05:48
财政压力的下半场:退休人员占比近四成,才是硬账

财政压力的下半场:退休人员占比近四成,才是硬账

超先声
2026-01-09 16:45:39
小孩子的嘴有多口无遮拦?网友:妈妈的脸瞬间红了!

小孩子的嘴有多口无遮拦?网友:妈妈的脸瞬间红了!

另子维爱读史
2026-04-02 18:18:16
教师大势已明朗:不出意外,2026年中国教师队伍,会迎来4大变化

教师大势已明朗:不出意外,2026年中国教师队伍,会迎来4大变化

小谈食刻美食
2026-04-02 08:46:43
一旦西巴布亚成功独立,印尼面临的不仅是领土缩水,而是国家解体

一旦西巴布亚成功独立,印尼面临的不仅是领土缩水,而是国家解体

鹤羽说个事
2026-04-02 22:12:25
央视直播突然换台!国乒男单告急?王楚钦、温瑞博双双遭遇硬仗

央视直播突然换台!国乒男单告急?王楚钦、温瑞博双双遭遇硬仗

小犙拍客在北漂
2026-04-03 11:23:21
无锡植物园悬赏10000元寻越狱卡皮巴拉,最新回应:属实,找到除奖金外还可获终身免费入园资格

无锡植物园悬赏10000元寻越狱卡皮巴拉,最新回应:属实,找到除奖金外还可获终身免费入园资格

大象新闻
2026-04-02 22:45:08
乌克兰无人机捅破能源神话,普里莫尔斯克40%储油能力直接报废

乌克兰无人机捅破能源神话,普里莫尔斯克40%储油能力直接报废

老马拉车莫少装
2026-04-03 12:10:22
很多人只看到大清丢失很多领土,但没有看到它打下的千万领土

很多人只看到大清丢失很多领土,但没有看到它打下的千万领土

秀心文雅
2026-03-31 09:17:19
存款要变天?若不出意外的话,下个月银行存款利息将迎来4大转变

存款要变天?若不出意外的话,下个月银行存款利息将迎来4大转变

混沌录
2026-04-02 19:50:10
只要聚餐有她,大家食欲直接大增!

只要聚餐有她,大家食欲直接大增!

飛娱日记
2026-03-05 08:22:06
重庆给张雪机车划了200亩地,但真正动起来的是整个摩托车产业链

重庆给张雪机车划了200亩地,但真正动起来的是整个摩托车产业链

蓝色海边
2026-04-03 08:43:49
女游击队员,陈幸同4-2迪亚兹进8强,腿都软了

女游击队员,陈幸同4-2迪亚兹进8强,腿都软了

真理是我亲戚
2026-04-03 12:39:47
科瓦奇回击朗尼克: 我们是国际化俱乐部,我们说英语

科瓦奇回击朗尼克: 我们是国际化俱乐部,我们说英语

懂球帝
2026-04-03 11:31:18
大罗前女友下海当模特,刚开号就被网友的奇葩要求轰炸了!

大罗前女友下海当模特,刚开号就被网友的奇葩要求轰炸了!

仰卧撑FTUer
2026-04-02 21:14:13
不可错过!4月3日中午11:00比赛!中央5套CCTV5、CCTV5+直播表

不可错过!4月3日中午11:00比赛!中央5套CCTV5、CCTV5+直播表

皮皮观天下
2026-04-03 08:23:21
宋宁峰发长文承认出轨:将无限期暂停所有演艺工作;出轨时女儿就在另一个房间睡觉

宋宁峰发长文承认出轨:将无限期暂停所有演艺工作;出轨时女儿就在另一个房间睡觉

大象新闻
2026-04-01 12:53:28
陈光标赠张雪劳斯莱斯骑虎难下,想私了热度太高,二手车商已盯上

陈光标赠张雪劳斯莱斯骑虎难下,想私了热度太高,二手车商已盯上

小怪吃美食
2026-04-03 04:56:08
整容脸千万别演年代剧!看冬去春来里章若楠和林允对比就全明白了

整容脸千万别演年代剧!看冬去春来里章若楠和林允对比就全明白了

TVB的四小花
2026-04-02 14:56:47
为什么只有革命卫队与美以干,而伊朗40万国防军沉默观战?

为什么只有革命卫队与美以干,而伊朗40万国防军沉默观战?

廖保平
2026-03-17 09:04:38
2026-04-03 13:35:00
钛媒体APP incentive-icons
钛媒体APP
独立财经科技媒体
131766文章数 862051关注度
往期回顾 全部

教育要闻

2026教育新规:7种情况不许责备孩子,否则伤大脑又伤自尊

头条要闻

牛弹琴:美国干了一件令人发指的事 全世界都无法接受

头条要闻

牛弹琴:美国干了一件令人发指的事 全世界都无法接受

体育要闻

冲击世界杯失败,80岁老帅一气之下病倒了

娱乐要闻

《浪姐7》最新人气TOP 曾沛慈断层第一

财经要闻

专家称长期摄入“飘香剂”存在健康隐患

科技要闻

5万辆库存车,给了特斯拉一记重拳

汽车要闻

你介意和远房亲戚长得很像吗?

态度原创

教育
健康
时尚
亲子
旅游

教育要闻

高考志愿不要比着分数填,给大家看个例子,如何把一把好牌打烂的

干细胞抗衰4大误区,90%的人都中招

为什么“这个颜色”成为今年顶流?这样穿好看又治愈

亲子要闻

清明假期,想更快的疗愈躺平孩子,一定要这样做!

旅游要闻

视窗|杭州西湖:又是桃红柳绿时

无障碍浏览 进入关怀版