网易首页 > 网易号 > 正文 申请入驻

Nvidia Accelerates Mistral AI Models, Help Close Gap with OpenAI

0
分享至

TMTPOST -- Nvidia Corp. and French artificial intelligence (AI) startup Mistral AI have achieved significant performance breakthroughs through their latest collaboration, delivering up to 10 times faster inference speeds for Mistral's new model family on Nvidia's GB200 NVL72 systems compared to the previous-generation H200 chips.


AI Generated Image

Mistral AI on Tuesday released its Mistral 3 family of open-weight models, optimized for Nvidia platforms from data centers to edge devices. The release includes Mistral Large 3, a 675 billion total parameter mixture-of-experts model with multilingual and multimodal capabilities, alongside nine smaller Ministral 3 variants designed for deployment on robots, drones and offline devices.

The partnership positions the two-year-old French company to better compete with leading AI labs including OpenAI and Google, particularly in enterprise deployments where customization and cost efficiency matter. Mistral has raised $2.7 billion at a $13.7 billion valuation, with Nvidia among its investors.

The collaboration delivers practical advantages for enterprise users. On the GB200 NVL72, Mistral Large 3 achieved over 5 million tokens per second per megawatt at 40 tokens per second per user, translating to lower per-token costs and improved energy efficiency for production AI systems.

GB200 Systems Drive Performance Gains

Mistral Large 3's architecture leverages Nvidia's hardware optimizations to unlock substantial efficiency improvements. The model's mixture-of-experts design activates only the most relevant parts for each task rather than engaging all 675 billion parameters, reducing computational waste while maintaining accuracy.

The performance leap stems from several technical advances. Nvidia's TensorRT-LLM Wide Expert Parallelism exploits the GB200 NVL72's coherent memory domain through NVLink fabric, enabling optimized expert distribution and load balancing. The system also employs NVFP4 low-precision inference and Dynamo disaggregated inference optimizations to deliver peak performance for large-scale training and deployment.

These optimizations work across Nvidia's inference frameworks including TensorRT-LLM, SGLang and vLLM. The models are available through leading open-source platforms and cloud service providers, with deployment expected soon as Nvidia NIM microservices.

Ministral 3 Targets Edge Deployment

The compact Ministral 3 suite brings AI capabilities to devices operating without network connectivity. Available in 3 billion, 8 billion and 14 billion parameter configurations, each size offers Base, Instruct and Reasoning variants to match specific use cases.

Performance on edge platforms demonstrates practical viability. The Ministral-3B variants achieve up to 385 tokens per second on Nvidia's RTX 5090 GPU. On Nvidia Jetson Thor, the models deliver 52 tokens per second for single concurrency, scaling to 273 tokens per second with eight concurrent requests.

Guillaume Lample, Mistral co-founder and chief scientist, emphasized the efficiency advantage: "The huge majority of enterprise use cases are things that can be tackled by small models, especially if you fine-tune them." All Ministral 3 variants support vision, handle 128,000 to 256,000 context windows, and run on single GPUs, reducing deployment costs and latency.

Commercial Push Intensifies Competition

The release comes as Mistral accelerates commercial activity following a 1.7 billion euro funding round in September that valued the company at 11.7 billion euros. Dutch chip equipment maker ASML contributed 1.3 billion euros, with Nvidia also participating.

Mistral has secured contracts worth hundreds of millions of dollars with corporate clients and announced a deal Monday with HSBC for financial analysis and translation tasks. The company is also expanding through acquisitions to compete with U.S. rivals establishing European operations, including Anthropic and OpenAI, which both opened European offices this year.

The startup's open-weight approach contrasts with closed-source competitors. While OpenAI and Anthropic maintain proprietary models accessible only through APIs, Mistral releases model weights publicly for download and customization. Lample argues this delivers superior results for specific enterprise deployments: "In many cases, you can actually match or even out-perform closed-source models" through fine-tuning.

特别声明:以上内容(如有图片或视频亦包括在内)为自媒体平台“网易号”用户上传并发布,本平台仅提供信息存储服务。

Notice: The content above (including the pictures and videos if any) is uploaded and posted by a user of NetEase Hao, which is a social media platform and only provides information storage services.

相关推荐
热点推荐
汪小菲将辞退月嫂,小杨阿姨仍未复工曝处境,老公跑滴滴还房贷!

汪小菲将辞退月嫂,小杨阿姨仍未复工曝处境,老公跑滴滴还房贷!

古希腊掌管月桂的神
2026-03-03 11:30:22
1.76亿独生子女,迎来一个坏消息,以后可能真的没亲戚了

1.76亿独生子女,迎来一个坏消息,以后可能真的没亲戚了

老特有话说
2026-03-01 21:57:03
伊朗外长:美以打完后,愿重启谈判

伊朗外长:美以打完后,愿重启谈判

观察者网
2026-03-01 08:39:35
“全部拆除”将至?2026住建部官宣:这两类房屋一律拆除

“全部拆除”将至?2026住建部官宣:这两类房屋一律拆除

慧眼看世界哈哈
2026-03-02 14:13:14
伊朗导弹越打越少,美以却开始慌了?第三天战况:谁先顶不住?

伊朗导弹越打越少,美以却开始慌了?第三天战况:谁先顶不住?

瞳眼天下
2026-03-03 10:47:04
伊朗发出警告:如果伊朗石油和天然气设施遭袭击,作为回应,该地区所有国家的油气设施都将被摧毁

伊朗发出警告:如果伊朗石油和天然气设施遭袭击,作为回应,该地区所有国家的油气设施都将被摧毁

大象新闻
2026-03-02 15:50:38
伊朗导弹炸翻比亚迪,史上最硬核广告爆了

伊朗导弹炸翻比亚迪,史上最硬核广告爆了

营销头版
2026-03-03 11:42:23
3月3日人民币对美元中间价调升148个基点

3月3日人民币对美元中间价调升148个基点

证券时报
2026-03-03 09:31:33
围绕霍尔木兹海峡,美伊还有什么撒手锏

围绕霍尔木兹海峡,美伊还有什么撒手锏

枢密院十号
2026-03-03 19:47:57
“大年初五回家”成最后留言!重庆男子春节前赴迪拜旅游,失联已超十天

“大年初五回家”成最后留言!重庆男子春节前赴迪拜旅游,失联已超十天

封面新闻
2026-03-03 21:31:07
以色列已经告诉世界:日本若敢拥有核武器,美国并不会第一个翻脸

以色列已经告诉世界:日本若敢拥有核武器,美国并不会第一个翻脸

八斗小先生
2025-12-26 09:33:27
见证历史,一场史诗级的绝杀!

见证历史,一场史诗级的绝杀!

君临财富
2026-03-02 09:44:11
畜生父亲虞天华被执行死刑,押赴刑场前高喊:这辈子值了!

畜生父亲虞天华被执行死刑,押赴刑场前高喊:这辈子值了!

纸鸢奇谭
2024-12-04 21:37:57
迪拜机场公司宣布:自3月2日傍晚起,迪拜国际机场和阿勒马克图姆国际机场将有限度地恢复航班起降

迪拜机场公司宣布:自3月2日傍晚起,迪拜国际机场和阿勒马克图姆国际机场将有限度地恢复航班起降

三湘都市报
2026-03-02 23:21:55
一新能源车高速上两次突然断电 转向、动力全部丢失!车主:不敢开了

一新能源车高速上两次突然断电 转向、动力全部丢失!车主:不敢开了

快科技
2026-03-03 17:21:04
哈梅内伊死后第三天,妻子追随而去:背后藏着伊朗最危险的信号

哈梅内伊死后第三天,妻子追随而去:背后藏着伊朗最危险的信号

爱竞彩的小周
2026-03-03 07:32:48
特朗普大厦发现可疑包裹

特朗普大厦发现可疑包裹

潇湘晨报
2026-03-03 11:25:11
哈梅内伊之死,它帮了美军?

哈梅内伊之死,它帮了美军?

环球时报国际
2026-03-03 10:37:33
《求是》暗示不再盲目追求增长数字

《求是》暗示不再盲目追求增长数字

凯利经济观察
2026-03-03 11:43:31
真的难!2026年B级车市场开启“大降价”,最大降幅52%,合资霸榜

真的难!2026年B级车市场开启“大降价”,最大降幅52%,合资霸榜

小怪吃美食
2026-03-03 03:58:15
2026-03-04 04:32:49
钛媒体APP incentive-icons
钛媒体APP
独立财经科技媒体
130185文章数 861867关注度
往期回顾 全部

科技要闻

拥抱AI的"牛马":边提效边自嘲"自费"上班

头条要闻

美国突发史无前例撤离令引外界担忧:终极空袭或来临

头条要闻

美国突发史无前例撤离令引外界担忧:终极空袭或来临

体育要闻

35轮后积分-7,他们遭遇史上最早的降级

娱乐要闻

谢娜霸气护夫:喊话薛之谦给张杰道歉

财经要闻

特朗普“不惜一切”!全球股债齐崩

汽车要闻

第一梯队辅助驾驶加持 iCAR V27定档3月13日上市

态度原创

时尚
本地
艺术
手机
军事航空

今年流行的“新老钱风”,优雅又时髦,太适合春天了!

本地新闻

食味印象|一口入魂!康乐烤肉串起千年丝路香

艺术要闻

柔滑裙装女神出场,惊艳程度超乎想象!

手机要闻

荣耀Magic V6下周见,开启折叠屏7000mAh时代

军事要闻

伊朗:击中美空军基地大楼

无障碍浏览 进入关怀版