ByteDance and OpenAI Race to Develop AI Agents, Nvidia Partner Predicts AI A...

(Image Source: Photo by Lin Zhijia, TMTPost AGI Editor)

AsianFin -- In the ever-evolving world of artificial intelligence, the race to build AI agents is heating up. Following OpenAI’s launch of its first AI-powered agent application, “Operator,” ByteDance on Sunday launched its next-generation automation model, UI-TARS, on GitHub.

With seven billion parameters, this AI agent integrates crucial components such as visual understanding, text processing, task planning, and memory management into one unified model.

UI-TARS can perform complex, cross-platform tasks, perceiving user interfaces, reasoning through action steps, and interacting with web interfaces in ways previously thought to be exclusive to human operators.

While still in its preview phase and undergoing constant updates, UI-TARS has already made its mark by demonstrating the ability to “automatically” publish tweets, as seen in the official promotional video. Although the system currently requires human assistance for certain steps, such as inputting text and clicking through options, its potential is unmistakable. The model is already available for macOS and Windows users.


The Operator Revolution

Only two days earlier, OpenAI introduced its first AI agent, “Operator.” Available to U.S. ChatGPT Pro subscribers, who pay $200 a month, Operator is a digital assistant that simulates human operations on the web. It can perform tasks such as shopping, ordering food, and organizing papers by combining visual recognition with advanced reasoning. Using GPT-4o’s vision capabilities together with reinforcement learning, the agent plans multi-step tasks and then carries them out on the user’s behalf.

The proliferation of AI agents in recent months has been nothing short of remarkable. Other notable players, including Zhipu AI and Genius by Verses, have joined the AI agent race. Zhipu’s AutoGLM and GLM-PC have garnered attention, while Genius—an AI agent that only needed two hours of training and a fraction of the data—has already surpassed human-level players in the classic Pong game.

Even Nvidia’s CEO, Jensen Huang, weighed in at CES 2025, predicting that AI agents will be the next frontier of the robotics industry, with a potential value in the trillions of dollars. OpenAI’s CEO, Sam Altman, has also said that AI agents could become a significant force in 2025, heralding the beginning of a new era in AI applications. This suggests that 2025 could be a watershed year for AI agents, positioning them as a key area of technological growth.

A New Frontier in AI Development

AI agents are essentially intelligent entities that can autonomously perceive their environment, make decisions, and take action. Think of them as highly capable assistants that can understand tasks and help humans perform them more efficiently. For example, UI-TARS can act like a "smart assistant" that can navigate the web, recognize visual cues, plan the necessary steps, and execute complex actions—such as publishing content or making purchases—without human intervention.

The concept of AI agents began to take off after the success of ChatGPT in late 2022. Researchers at Stanford University and Google published a paper on “Generative Agents,” which described how virtual people in a simulated environment exhibited behaviors similar to humans when integrated with ChatGPT. This research sparked widespread interest in the idea of AI agents.

By 2024, AI agents had been recognized as essential components in the development of Artificial General Intelligence (AGI). Stanford professor Andrew Ng has pointed out that AI agents will play a critical role in the progression toward AGI, describing them as systems that not only think but can also take action. OpenAI’s roadmap for AGI, which spans five stages, places AI agents at the third level, between reasoning AI and fully autonomous, innovative systems.

A recent report highlighted the exponential growth of the AI agent market in China. In 2023, the Chinese AI agent market was valued at 55.4 billion yuan, and it is projected to grow to 852 billion yuan by 2028, with an impressive compound annual growth rate of 72.7%. These projections underscore the immense potential of AI agents as an integral part of future industries.
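
The growth rate quoted above can be checked against the two market-size figures. The short Python sketch below assumes the standard compound annual growth rate formula and a five-year span (2023 to 2028); the function name `cagr` is illustrative and does not come from the report.

```python
def cagr(start_value: float, end_value: float, years: int) -> float:
    """Compound annual growth rate as a fraction (0.10 means 10% per year)."""
    if start_value <= 0 or years <= 0:
        raise ValueError("start_value and years must be positive")
    return (end_value / start_value) ** (1 / years) - 1


# Figures quoted in the report, in billions of yuan.
market_2023 = 55.4
market_2028 = 852.0

rate = cagr(market_2023, market_2028, 2028 - 2023)
print(f"Implied CAGR: {rate:.1%}")
```

Under these assumptions the implied rate comes out at roughly 72.7%, which matches the figure cited in the report.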

AI Agents Across Industries

AI agents are rapidly gaining traction in various industries, from customer service to programming, content creation, and financial management. In content creation, for instance, AI agents can generate videos or even write scripts autonomously. This level of efficiency has led to broader adoption of AI assistants by creators, further cementing AI’s role as an indispensable tool in modern workflows.

Operator, for example, serves as a highly practical tool. It can perform everyday tasks such as making restaurant reservations, buying groceries, and even booking tickets for sports events. It employs a straightforward workflow in which it captures and analyzes screen content, adds the relevant information to its model context, and determines the next steps through reasoning. It then executes these steps using a virtual mouse and keyboard. The human user can intervene if necessary, particularly in situations involving sensitive information like payment details or addresses.

According to OpenAI, Operator is designed to perform tasks independently for users, providing them with a smooth, automated experience. In a demonstration, the AI agent completed various tasks with minimal input from the user. However, it pauses before sensitive steps, such as payment, so that users can take control when needed.
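
The workflow described above (capture the screen, add it to the context, reason about the next step, act, and hand sensitive steps to the user) can be sketched as a simple loop. This is only an illustration under stated assumptions, not OpenAI's implementation: `FakeBrowser`, `choose_action`, `run_agent`, and the action names are all hypothetical, and the "reasoning" step is a fixed script standing in for a model.

```python
from dataclasses import dataclass, field

# Hypothetical action kinds that the agent hands over to the human user.
SENSITIVE_KINDS = {"enter_payment", "enter_address"}


@dataclass
class Action:
    kind: str          # e.g. "click", "enter_payment", "done"
    target: str = ""


@dataclass
class FakeBrowser:
    """Stand-in for a real screen: it only tracks which step we are on."""
    step: int = 0
    log: list = field(default_factory=list)

    def screenshot(self) -> str:
        return f"screen-{self.step}"

    def execute(self, action: Action, by: str) -> None:
        # Record who performed the action, then advance to the next screen.
        self.log.append((by, action.kind))
        self.step += 1


def choose_action(context: list) -> Action:
    """Placeholder for model reasoning: a fixed script keyed on the latest screen."""
    script = {
        "screen-0": Action("click", "add to cart"),
        "screen-1": Action("click", "checkout"),
        "screen-2": Action("enter_payment", "card form"),
    }
    return script.get(context[-1], Action("done"))


def run_agent(browser: FakeBrowser, max_steps: int = 10) -> list:
    context = []
    for _ in range(max_steps):
        context.append(browser.screenshot())   # perceive: capture the screen
        action = choose_action(context)        # reason: decide the next step
        if action.kind == "done":
            break
        # Sensitive steps are attributed to the user rather than the agent.
        by = "user" if action.kind in SENSITIVE_KINDS else "agent"
        browser.execute(action, by)            # act: virtual mouse and keyboard
    return browser.log


print(run_agent(FakeBrowser()))
```

In this toy run the agent performs the two clicks itself and leaves the payment step to the user, mirroring the pause-and-hand-over behavior described in the article.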

AI agents are also poised to make a major impact on enterprise operations. According to F5’s Mohan Veloo, AI applications will increasingly rely on APIs, and the growth of AI usage will lead to an explosion of these interfaces. By 2025, it’s expected that 77% of global enterprises will deploy generative AI tools to improve productivity, with over 84% of all applications incorporating AI inference capabilities by 2028.

AI agents can streamline processes, reduce human labor costs, and provide businesses with new opportunities for automation. However, as AI becomes more pervasive, some experts warn that AI’s democratization of knowledge may level the playing field, removing some of the competitive advantages previously held by leading firms.

For enterprises, the challenge lies in finding the most effective ways to integrate AI agents into their operations. As Zhang Xin from Volcano Engine noted, while AI models bring new productivity tools, they also introduce challenges related to managing the massive amounts of data generated by AI operations. Companies must focus on creating AI solutions that drive innovation while leveraging existing technologies.

The Future of AI Agents: From Adoption to Integration

In the coming years, the widespread adoption of AI agents will likely become a defining characteristic of business transformation. According to F5’s Veloo, the increasing fusion of AI technologies with IoT, edge computing, and cloud-native architecture is accelerating AI’s integration into business processes. This trend will drive enterprises to implement AI solutions that can seamlessly collaborate with human workers, boosting both productivity and efficiency.

In the second phase of AI’s revolution, AI agents like those from ByteDance, OpenAI, and other major players in the industry are pushing the boundaries of what’s possible. Whether it’s automating daily tasks or offering new solutions for business optimization, the future of AI agents looks promising.

In 2025, AI agents are expected to become a significant part of the business landscape, offering a glimpse into the future of work. As the technology continues to evolve, AI agents will not just be tools; they will be invaluable partners in the journey toward a more intelligent and automated world.

Notice: The content above (including the pictures and videos if any) is uploaded and posted by a user of NetEase Hao, which is a social media platform and only provides information storage services.
