网易首页 > 网易号 > 正文 申请入驻

ByteDance and OpenAI Race to Develop AI Agents, Nvidia Partner Predicts AI A...

0
分享至

(Image Source: Photo by Lin Zhijia, TMTPost AGI Editor)

AsianFin -- In the ever-evolving world of artificial intelligence, the race to build AI agents is heating up. Following OpenAI’s launch of its first AI-powered agent application, “Operator,” ByteDance on Sunday launched its next-generation automation model, UI-TARS, on GitHub.

With seven billion parameters, this AI agent integrates crucial components such as visual understanding, text processing, task planning, and memory management into one unified model.

UI-TARS can perform complex, cross-platform tasks, perceiving user interfaces, reasoning through action steps, and interacting with web interfaces in ways previously thought to be exclusive to human operators.

While still in its preview phase and undergoing constant updates, UI-TARS has already made its mark by demonstrating the ability to “automatically” publish tweets, as seen in the official promotional video. Although the system currently requires human assistance for certain steps, such as inputting text and clicking through options, its potential is unmistakable. The model is already available for macOS and Windows users.


The Operator Revolution

Only two days earlier, OpenAI introduced its first AI agent, “Operator.” Aimed at U.S. ChatGPT Pro users with a monthly subscription of $200, Operator is a digital assistant capable of simulating human operations on the web. It can perform tasks such as shopping, ordering food, and organizing papers by seamlessly integrating visual recognition and advanced reasoning models. By using a combination of GPT-4’s visual capabilities and reinforcement learning, the AI agent plans complex steps and takes actions with impressive accuracy.

The proliferation of AI agents in recent months has been nothing short of remarkable. Other notable players, including Zhipu AI and Genius by Verses, have joined the AI agent race. Zhipu’s AutoGLM and GLM-PC have garnered attention, while Genius—an AI agent that only needed two hours of training and a fraction of the data—has already surpassed human-level players in the classic Pong game.

Even Nvidia's CEO, Jensen Huang, weighed in at CES 2024, predicting that AI agents will be the next frontier of the robotics industry, with a potential value in the trillions of dollars. OpenAI’s CEO, Sam Altman, has also said that AI agents could become a significant force in 2025, heralding the beginning of a new era in AI applications. This suggests that 2025 could be a watershed year for AI agents, positioning them as a key area of technological growth.

A New Frontier in AI Development

AI agents are essentially intelligent entities that can autonomously perceive their environment, make decisions, and take action. Think of them as highly capable assistants that can understand tasks and help humans perform them more efficiently. For example, UI-TARS can act like a "smart assistant" that can navigate the web, recognize visual cues, plan the necessary steps, and execute complex actions—such as publishing content or making purchases—without human intervention.

The concept of AI agents began to take off after the success of ChatGPT in late 2022. Researchers at Stanford University and Google published a paper on “Generative Agents,” which described how virtual people in a simulated environment exhibited behaviors similar to humans when integrated with ChatGPT. This research sparked widespread interest in the idea of AI agents.

By 2024, AI agents hadbeen recognized as essential components in the development of Artificial General Intelligence (AGI). Stanford professor Andrew Ng has pointed out that AI agents will play a critical role in the progression toward AGI, describing them as systems that not only think but can also take action. OpenAI’s roadmap for AGI, which spans five stages, places AI agents at the third level, between reasoning AI and fully autonomous, innovative systems.

A recent report highlighted the exponential growth of the AI agent market in China. In 2023, the Chinese AI agent market was valued at 55.4 billion yuan, and it is projected to grow to 852 billion yuan by 2028, with an impressive compound annual growth rate of 72.7%. These projections underscore the immense potential of AI agents as an integral part of future industries.

AI Agents Across Industries

AI agents are rapidly gaining traction in various industries, from customer service to programming, content creation, and financial management. In content creation, for instance, AI agents can generate videos or even write scripts autonomously. This level of efficiency has led to broader adoption of AI assistants by creators, further cementing AI’s role as an indispensable tool in modern workflows.

Operator, for example, serves as a highly practical tool. It can perform everyday tasks such as making restaurant reservations, buying groceries, and even booking tickets for sports events. It employs a straightforward workflow in which it captures and analyzes screen content, adds the relevant information to its model context, and determines the next steps through reasoning. It then executes these steps using a virtual mouse and keyboard. The human user can intervene if necessary, particularly in situations involving sensitive information like payment details or addresses.

According to OpenAI, the Operator is designed to perform tasks independently for users, providing them with a smooth, automated experience. In a demonstration, the AI agent successfully completed various tasks with minimal input from the user. However, it pauses when handling sensitive tasks, such as payment, so users can take control when needed.

AI agents are also poised to make a major impact on enterprise operations. According to F5’s Mohan Veloo, AI applications will increasingly rely on APIs, and the growth of AI usage will lead to an explosion of these interfaces. By 2025, it’s expected that 77% of global enterprises will deploy generative AI tools to improve productivity, with over 84% of all applications incorporating AI inference capabilities by 2028.

AI agents can streamline processes, reduce human labor costs, and provide businesses with new opportunities for automation. However, as AI becomes more pervasive, some experts warn that AI’s democratization of knowledge may level the playing field, removing some of the competitive advantages previously held by leading firms.

For enterprises, the challenge lies in finding the most effective ways to integrate AI agents into their operations. As Zhang Xin from Volcano Engine noted, while AI models bring new productivity tools, they also introduce challenges related to managing the massive amounts of data generated by AI operations. Companies must focus on creating AI solutions that drive innovation while leveraging existing technologies.

The Future of AI Agents: From Adoption to Integration

In the coming years, the widespread adoption of AI agents will likely become a defining characteristic of business transformation. According to F5’s Veloo, the increasing fusion of AI technologies with IoT, edge computing, and cloud-native architecture is accelerating AI’s integration into business processes. This trend will drive enterprises to implement AI solutions that can seamlessly collaborate with human workers, boosting both productivity and efficiency.

In the second phase of AI’s revolution, AI agents like those from ByteDance, OpenAI, and other major players in the industry are pushing the boundaries of what’s possible. Whether it’s automating daily tasks or offering new solutions for business optimization, the future of AI agents looks incredibly promising.In 2025, AI agents are expected to become a significant part of the business landscape, offering a glimpse into the future of work. As the technology continues to evolve, it’s clear that AI agents will not just be tools—they will be invaluable partners in the journey toward a more intelligent and automated world.

特别声明:以上内容(如有图片或视频亦包括在内)为自媒体平台“网易号”用户上传并发布,本平台仅提供信息存储服务。

Notice: The content above (including the pictures and videos if any) is uploaded and posted by a user of NetEase Hao, which is a social media platform and only provides information storage services.

相关推荐
热点推荐
外媒:乌克兰军队中出现东大FN-16便携式防空导弹,真实来源成疑

外媒:乌克兰军队中出现东大FN-16便携式防空导弹,真实来源成疑

零度Military
2026-05-14 05:49:57
中纪委怒批:公务员也是人,正常生活不应问责处理

中纪委怒批:公务员也是人,正常生活不应问责处理

职场资深秘书
2026-05-14 09:45:16
这一幕让全世界震撼!中国军人在“空军一号”轰鸣声前岿然不动

这一幕让全世界震撼!中国军人在“空军一号”轰鸣声前岿然不动

澎湃新闻
2026-05-14 10:30:25
射程超过35000公里,俄军方:可经南极至美国境内目标的“世界上最强大导弹”试射成功

射程超过35000公里,俄军方:可经南极至美国境内目标的“世界上最强大导弹”试射成功

红星新闻
2026-05-13 13:21:19
公然拒挂国旗,订单全给日韩,长荣如今的结局早已注定

公然拒挂国旗,订单全给日韩,长荣如今的结局早已注定

潋滟晴方DAY
2026-05-11 06:31:37
黄仁勋赶飞机 藏着中美科技关系最真实的底色

黄仁勋赶飞机 藏着中美科技关系最真实的底色

看看新闻Knews
2026-05-13 23:00:02
专机落地!特朗普又舞起熟悉手势 乘专车前往酒店

专机落地!特朗普又舞起熟悉手势 乘专车前往酒店

看看新闻Knews
2026-05-13 23:14:07
深圳一楼盘3小时卖212套,购房者扬言来晚就没了,评论区早已清醒

深圳一楼盘3小时卖212套,购房者扬言来晚就没了,评论区早已清醒

谭谈社会
2026-05-14 04:44:59
随特朗普抵京:马斯克第四个下机 黄仁勋换上西装

随特朗普抵京:马斯克第四个下机 黄仁勋换上西装

看看新闻Knews
2026-05-14 01:34:05
一家长称儿子早恋被叫学校,想开宝马镇住对方家长,评论玩梗笑死

一家长称儿子早恋被叫学校,想开宝马镇住对方家长,评论玩梗笑死

观察鉴娱
2026-05-13 11:22:56
马斯克:空军一号上只有我和黄仁勋!网友:全球最有钱的和全球市值最高的才有机会坐

马斯克:空军一号上只有我和黄仁勋!网友:全球最有钱的和全球市值最高的才有机会坐

大白聊IT
2026-05-14 00:58:40
黄仁勋:这会是一次非常成功的会晤

黄仁勋:这会是一次非常成功的会晤

财闻
2026-05-14 11:52:57
“空军一号”轰鸣而过,解放军岿然不动,视频火爆外网

“空军一号”轰鸣而过,解放军岿然不动,视频火爆外网

极目新闻
2026-05-14 10:08:46
性,已成为职场流通的硬资源!

性,已成为职场流通的硬资源!

灯锦年
2026-05-14 00:10:06
形势有多严峻?坐标上海:80末90初程序员都开始失业,评论区炸了

形势有多严峻?坐标上海:80末90初程序员都开始失业,评论区炸了

慧翔百科
2026-05-14 09:00:11
卢比奥来了,那些所谓的专家又被狠狠打脸

卢比奥来了,那些所谓的专家又被狠狠打脸

壹家言
2026-05-14 10:51:40
停更3年,千万粉丝网红改名宣布回归,4小时涨粉240万

停更3年,千万粉丝网红改名宣布回归,4小时涨粉240万

天津生活通
2026-05-14 10:34:09
特朗普抵京第一天就签了400亿大单,但真正让白宫失眠的是这件事

特朗普抵京第一天就签了400亿大单,但真正让白宫失眠的是这件事

浪子的烟火人间
2026-05-14 08:44:32
摩洛哥幸福新娘事件升级!河南一男子刷到该视频,断然与对象退婚

摩洛哥幸福新娘事件升级!河南一男子刷到该视频,断然与对象退婚

火山詩话
2026-05-14 07:04:08
扎心!朋友孩子的班34人处于“零就业”状态,引热议

扎心!朋友孩子的班34人处于“零就业”状态,引热议

火山詩话
2026-05-13 15:02:14
2026-05-14 13:23:00
钛媒体APP incentive-icons
钛媒体APP
独立财经科技媒体
133537文章数 862156关注度
往期回顾 全部

教育要闻

独家!海淀高三二模排名出炉!清北线预估665分,14970人参与

头条要闻

兄妹救4名落水者后遭拉黑 被告知获救者身份不便公开

头条要闻

兄妹救4名落水者后遭拉黑 被告知获救者身份不便公开

体育要闻

登海报!哈登30+8+6创多项纪录 第8次赢天王山

娱乐要闻

肖战提名金海燕奖,这一步走得太稳

财经要闻

片仔癀依旧困在“片仔癀”

科技要闻

马斯克:只有我和黄仁勋坐上了"空军一号"

汽车要闻

C级纯电轿跑 吉利银河"TT"申报图来了

态度原创

游戏
本地
时尚
艺术
教育

卡牌生存射击游戏《掏枪干吧》公开

本地新闻

用苏绣的方式,打开江西婺源

T恤+低腰阔腿裤、衬衫+低腰半裙,今年夏天最时髦的搭配,谁穿谁好看!

艺术要闻

充满光感的花卉油画 | 亚历山大·沙巴德伊

教育要闻

武汉交通职业学院:勤勉破局!她从专科逆袭硕士!

无障碍浏览 进入关怀版