万物一体 Agents at Scale百度AI开发者大会
文章语言:
简
繁
EN
Share
Minutes
原文
会议摘要
Baidu Intelligent Cloud showcases the applications of AI agents in industries such as logistics, manufacturing, and finance, leveraging full-stack AI technology to support agent development, enhance enterprise efficiency, reduce costs, and create business value. Li Yanhong emphasized that the rise of intelligent agents marks the advent of the AI application phase. Baidu has launched several intelligent agent products to help enterprises undergo intelligent transformation, urging them to adapt to change, increase delegation, and reduce hierarchical layers in order to embrace the opportunities brought by AI.
会议速览
It was discussed how agents are poised to replace models as the focal point in the AI field, signaling a shift in AI development from the model-centric phase to the application-driven phase. The discussion underscored the capabilities of agents in task execution, autonomous learning, and environmental adaptation, as well as their role in catalyzing self-evolution among individuals, enterprises, and organizations.
The discussion highlighted three major transformations brought about by AI agents: first, a shift in the entry point—from information retrieval to task completion, with AI agents offering greater value than chatbots; second, a deepening of trust, as AI is entrusted with handling more personal matters; and third, a reconfiguration of organizational structures, with AI agents enhancing individual capabilities, promoting organizational flattening, and shifting the focus of management to “alignment.” These changes herald the advent of a new era of productivity, in which established norms and traditional divisions of labor are being challenged and must be proactively adapted to evolve.
The discussion highlighted the importance of shifting from token consumption to DAA (Daily Active Agents) as a metric in the AI era, emphasizing the pivotal role of agents in organizational evolution and individual productivity enhancement, as well as the need to build full-stack AI capabilities to support the explosive growth of agent-based applications.
The discussion explored the application value of large models in real-world business scenarios, highlighting the performance improvements and cost optimizations of ERNIE Bot 5.1. It underscores the importance of agent-native infrastructure and introduces Baidu Do-Mate as a unified, general-purpose agent solution designed to integrate capabilities across multiple scenarios, alleviate users’ decision‑making burden, and enable co‑evolution with customers.
Introducing Baidu’s newly launched general-purpose AI agent, Do‑Mate. It can handle multiple roles—customer service, sales, marketing, and more—through natural language interaction, enabling end-to‑end automation from idea to outcome. Do‑Mate significantly boosts productivity, supports scheduled task creation and knowledge accumulation, and continuously evolves to become a true collaborative partner for users.
Introducing Baidu DoMate’s technical framework, skill ecosystem, and continuous evolution capabilities, highlighting its advantages in executing complex tasks, as well as the newly launched mobile app that provides round-the-clock AI assistant services, while also sharing event perks and inviting users to download and experience it.
The dialogue showcases the application of smart education on campus, particularly how AI technology can spark students’ creativity and streamline the software development process. Students leverage AI tools to bring their ideas to life, such as designing mini-programs to solve real-world problems, demonstrating the transformation of educational models and the widespread adoption of AI technology, while breaking down age and skill barriers to enable the practical application of innovative thinking.
We introduce an app called “Miaoda,” which stands out for its ability to automatically generate code, enabling individual developers to rapidly build applications using only voice commands. Through concrete examples, it demonstrates how to build an AI-powered social app in just a few lines of code, and notes that Miaoda has already helped numerous individual developers achieve commercial success, with some even earning annual incomes in the tens of millions.
The Enterprise Edition of Miaoda has been launched, designed specifically for power developers and enterprise users, addressing three key pain points: multi‑user collaboration, end‑to‑end closed loops, and enterprise‑grade stability. Ziroom’s “Miaoda” app is integrated into its rental business, enabling rapid visualization of interior design and enhancing both customer experience and team creativity. In the AI era, code intelligence has become a core capability, driving business growth and boosting productivity.
The emergence of code agents has lowered the barriers to entry and reduced costs in software development, enabling the fulfillment of personalized needs. As a result, the software industry may be redefined, with enormous market potential.
Baidu has announced that its digital human platform, “Huibo Xing,” has been upgraded to “Baidu Yijing,” with the aim of delivering end-to‑end digital human services across all scenarios, including live streaming, video production, and real‑time interaction. The company particularly highlights its strengths in video generation, multilingual support, and commercial conversion. Launching an international edition to support Chinese companies in their global expansion and enhance the efficiency and quality of content creation for creators worldwide.
We introduced a smart content‑creation tool called “Yijing Overseas Edition,” which offers an end‑to‑end solution for turning ideas into finished videos. We highlighted its applications in livestream‑driven e‑commerce and brand‑advertising production, including case studies such as a collaboration with a fitness influencer to promote protein bars and a co‑created short ad film with Coca‑Cola. The platform supports multiple languages, spanning everything from product seeding to brand storytelling, and offers a digital‑human livestreaming solution. It aims to leverage technology to transcend linguistic and cognitive barriers, empowering brand communication.
Baidu Vamou 2.0 is aimed at business users. By strengthening dialogue deduction capability and interdisciplinary knowledge integration modeling, Baidu Vamou solves industrial pain points such as production scheduling, process optimization and logistics planning, improves decision-making efficiency and effectiveness, and helps enterprises achieve value growth.
The dialogue explores how agents, particularly “penalty-cat”‑style agents, can achieve efficient closed-loop systems through self-evolution and autonomous decision-making, thereby driving enterprise process reengineering and the accumulation of institutional knowledge. Strong enterprise needs self-evolution, including authorization, reduce hierarchy, improve talent density, task-oriented, in order to adapt to the AI era, the realization of continuous optimization and competitive advantage.
The AI factory director and AI store manager created by Baidu Intelligent Cloud have effectively improved the production and sales efficiency of Yiwu small commodity market. AI factory directors can uniformly dispatch all cameras, learn factory management requirements, and improve production management capabilities; AI store managers can guide on-site purchases, analyze sales data, and provide sales promotion suggestions based on inventory to realize global buying and selling. The application of these intelligent agents has helped Chinese products integrate more seamlessly into the global supply chain, showcasing the dynamism and resilience of China’s manufacturing sector.
The conversation delved into the intelligent transformation of China’s automotive industry, highlighting that since 2019, tech companies such as Baidu have accelerated the development of AI-driven autonomous driving through a full-stack AI strategy, helping several leading automakers achieve mass production of intelligent driver-assistance systems, including Changan Automobile and Horizon Robotics. By jointly building intelligent computing centers and optimizing computational efficiency, the initiative is accelerating the transition of autonomous driving technology from the laboratory to mass production, showcasing the Chinese automotive industry’s progress and collaboration in both vehicle intelligence and industrial‑level intelligence.
Horizon has unveiled high-performance autonomous-driving chips and a central computing architecture, and is collaborating with Baidu Intelligent Cloud to build a large-scale heterogeneous computing cluster, enabling the mass production of intelligent driving systems, accelerating the智能化 transformation of the automotive industry, and benefiting consumers worldwide.
It explores the wide-ranging applications of AI in the smart hardware sector, from smartphones to AI glasses, and underscores the significance of AI-enabled devices as a new super entry point. Through case studies, it demonstrates how an AI assistant can optimize user experience, such as in trip planning and schedule management. Meanwhile, it mentions Baidu Intelligent Cloud’s role in supporting over 1,000 AI hardware companies and enhancing product experiences, as well as its technological breakthroughs in the field of embodied intelligence—covering data collection, model-training acceleration, and more—that help enterprises build skewing capabilities for the Physical AI era.
It is shared that the key to physical intelligence entrepreneurship lies in the combination of hardware and software, emphasizing the building of a complete product chain from motor to model, and the core position of dual-arm operation in the operation scenario. Xinghai Tu Gao adheres to an integrated, intelligence-driven approach, developing robotic brains that adapt to diverse form factors and already serving top-tier developers worldwide, including Professor Fei-Fei Li’s research group.
The importance of pre-training and post-training for embodied intelligence models was discussed, emphasizing the synergistic cooperation of fast–slow dual systems to emulate the principles of the human brain. An innovative fast WEM model was proposed to achieve real-time control, thereby advancing the application of embodied intelligence in industrial automation and other fields. The open-source foundation model G0.5 and the EDP platform are accelerating model deployment and boosting development efficiency, earning widespread acclaim in the industry.
The conversation centers on the field of embodied intelligence, underscoring the critical role of real-world data in training foundational models. It introduces open-source datasets and a global initiative to build large-scale real‑world scene datasets, aiming to create a dataset totaling one million hours of data. The goal is to accelerate the transition of technology from the lab to practical applications. Looking ahead to 2026, when the industry will enter a phase of成果 validation, the discussion highlights that application and productivity optimization are paramount. The team is actively collaborating with industry partners to jointly advance the deployment of embodied intelligence.
Shared the progress of full-stack AI applications in the financial industry, with particular emphasis on the pivotal role of domestically produced computing power in the AI projects of China Merchants Bank and Shanghai Pudong Development Bank, thereby driving the intelligent upgrade of core scenarios such as risk control and marketing.
The discussion covered the stringent requirements of financial operations for AI services, as well as a case study on optimizing specialized models by building a domestically developed heterogeneous computing cluster. It showcased AI applications in vertical domains such as financial analysis, underscored the importance of integrating AI with finance, and highlighted its role in serving the real economy and driving industrial upgrading.
Looking back on the decade-long collaboration between Baidu Intelligent Cloud and State Grid, from enhancing customer service efficiency to co‑developing the Guangming Power Large Model and the widespread deployment of intelligent inspection technologies, the two partners have jointly propelled China’s transformation from a major energy player into a global energy powerhouse. In addition, Baidu Intelligent Cloud already serves more than 80% of China’s central state-owned enterprises, leveraging technological innovation to empower diverse industries, fostering a virtuous cycle of continuous evolution and driving sustained innovation in sectors such as industrial manufacturing and smart energy.
Baidu Intelligent Cloud announced that it will focus on the intelligent field and upgrade to AI cloud for large-scale intelligent body applications, providing full stack capabilities of AI info and agent info, including optimizing model services and controlling projects, so as to improve the efficiency of intelligent body task completion and reduce costs, and promote enterprises to convert each token into productivity.
In response to the complexity of agent tasks, we have comprehensively upgraded AI computing power, including Qwen inference optimization for agents, a multimodal training framework, and support for domestically produced computing hardware, significantly enhancing inference efficiency and training performance to deliver more efficient AI applications and services.
AI technology has simplified the complexities of the world. Baidu Intelligent Cloud, as a new‑era infrastructure, empowers industries and individuals, accelerates the development of the intelligent agent ecosystem, and strengthens the industrial system. Technological advancements have made inconvenience a thing of the past. AI helps preserve memories, overcome barriers, interpret history, and promote cultural dissemination, ensuring that technological equality benefits everyone and co-creating a better future.
要点回答
Q:Against the backdrop of the rapid advancement of large models, what new trends have garnered attention at this year’s AI developer conference?
A:This year’s conference has focused on a phenomenon: every year around the Spring Festival, a tech trend of unprecedented scale emerges. For example, in recent years the spotlight has shifted to ChatGPT, Sora, DeepMind, and others, while this year the Lobster Model has gained widespread attention thanks to its practical applications and agent‑based capabilities. This marks a transition in the agent‑intelligence space, moving from model‑centric reliance to an application‑driven paradigm.
Q:What does it mean when AI agents step out of their niche? How are AI agents reshaping public perceptions of AI?
A:The emergence of AI agents signifies that AI development has shifted from the model stage to the application stage; it will permeate all industries at an unprecedented pace, with the focus of competition shifting from intelligence to execution. Users place greater emphasis on whether an agent can accomplish practical tasks rather than its intrinsic level of intelligence. The agent can not only automatically write code and independently invoke tools, but also build new tools, which has led people to recognize that AI is not merely a chat tool, but is evolving into a digital employee and an agent. They can autonomously perform complex tasks and continuously enhance their capabilities through self-evolution.
Q:What efforts has Baidu made to build end-to-end capabilities?
A:Baidu has built a full-stack capability for the Nebula Agent, encompassing chips, models, intelligent agents, and optimization tools and capabilities. Among them, the development of large models follows an application-driven principle; for example, the ERNIE 5.1 model has performed exceptionally well in multiple evaluations and supports seamless integration of various AI capabilities, such as search, coding, and in-depth research, forming a unified portal called do mate to achieve efficient collaborative work.
Q:How can we understand and respond to the changes brought about by intelligent agents?
A:It is necessary to stay focused on the core value proposition of intelligent agents, filter out noise, build consensus, and accelerate action. AI agents will transform user interfaces, trust dynamics, and organizational structures. Enterprises must adapt to these changes by adopting flatter management hierarchies and enhancing the efficiency of information flow, while also tracking daily active agents (DAA) as a key metric for gauging platform vitality.
Q:How can Do Mate leverage natural language interaction to simultaneously handle the responsibilities of customer service, sales, and marketing?
A:Do Mate is capable of handling and executing the tasks of these three positions in a single engagement. First, it handles customer service tasks by autonomously accessing the inbox to process customer complaint emails, crafting emotionally resonant responses, and proposing compensation measures. Secondly, in sales‑related tasks, do mate can cleanse and standardize data from heterogeneous source spreadsheets, analyze sales performance, identify anomalous fluctuations, and generate in‑depth industry‑level sales reports. Ultimately, it provides recommendations on product assortment and promotional priorities for both e‑commerce channels and company‑operated stores. Finally, for marketing campaigns, do mate generates model-wearing renderings and high‑quality HTML‑formatted promotional articles to boost the launch of new products, while leveraging flash‑sale pages to drive sales.
Q:What key characteristics does do mate possess when performing complex tasks?
A:Key features that enable Do Mate to tackle complex tasks include: a cutting-edge technical framework that excels at constraint enforcement, guidance, validation, and feedback; a robust skill ecosystem and comprehensive security mechanisms; and the ability to evolve continuously, becoming smarter with use. In addition, do mate integrates Baidu’s high-quality AI capabilities accumulated over many years, such as AI search, Instant Match, and Famo Encyclopedia, and can ensure that the model stays on course and maintains consistent, undistorted actions when executing complex tasks lasting for hours or even days.
Q:What are the main features and advantages of the Miaoda app?
A:Miaoda App is the first professional‑grade app‑building platform. Users can create web pages, mini‑programs, and mobile apps through natural‑language dialogue, with the entire development process requiring just one person, as 90% of the code is automatically generated by Miaoda’s AI engine. Users can move their mouths at any time and place, and their ideas can fall to the ground immediately. The MiaoDa app also supports multi‑user collaboration, end‑to‑end closed‑loop development, and enterprise‑grade stability, helping users rapidly build high‑quality applications and already enabling numerous developers to unlock business value.
Q:In the AI era, what changes have occurred in designers’ work?
A:In the AI era, designers’ work has become more diverse; for example, Ziroom leverages AI tools like MiaoDan to build home‑interior‑design applications on demand without writing any code. By integrating with the home‑decor module, users can simply upload a base image and select a design style to quickly generate renderings, turning their vision into reality—what you imagine is what you see, and what you see is what you get. As AI becomes more deeply integrated, frontline employees in enterprises can directly leverage AI capabilities to turn their business ideas into tangible growth and productivity gains.
Q:Why has coding ability become crucial in the AI era, yet also seems to have lost its value?
A:Code proficiency is a foundational technical capability, playing a decisive role in advancing base model capabilities (such as natural language processing, multimodal generation, and creative writing) and in numerous AI-native applications. However, as AI advances—especially with the emergence of code agents like MiaoDa—people who don’t know how to code can also build applications, leading to the devaluation of code itself; some in the industry even believe that code will become free. This has disrupted the software industry, making one-time‑use or disposable software feasible, as production costs are extremely low, rendering even single‑use requirements worthwhile to develop.
Q:What impact does this change have on the software industry?
A:In the past, the software industry was characterized by two significant barriers—high entry thresholds and high costs—leaving a vast array of long-tail and personalized demands unmet. But now, code agents like MiaoDa are breaking down these two barriers, lowering the entry threshold for applications, reducing development costs, and making one-off software a viable option. As a result, the software industry is being redefined, and its market size could potentially expand by a factor of ten—presenting a tremendous opportunity.
Q:What role do digital human agents play in the AI era?
A:Digital human agents are the universal interaction interface of the AI era, endowed with expressive capabilities such as speech, facial expressions, and gestures; they can earn people’s trust, provide real-time feedback, and deliver emotional value. As digital human agents become increasingly sophisticated, their application scenarios continue to expand, extending from live‑commerce to areas such as advertising and marketing, content creation, and public services, ushering in a new phase of large‑scale deployment.
Q:How does Baidu’s Yijing differ from conventional video-generation models?
A:Baidu Yijing can generate high-quality video content lasting several minutes or even hours, while offering robust interactivity to meet users’ creative needs across a variety of scenarios, such as livestream‑driven e‑commerce, advertising and marketing, and content creation. It can generate any video centered on a human‑character subject with a single click, significantly boosting individual creators’ productivity and helping businesses seamlessly enter the global market by producing high‑quality, commercially valuable content.
Q:In the chemical materials industry, how does Baidu help companies optimize their process flows?
A:By integrating interdisciplinary knowledge and model‑building capabilities, Baidu Famo can deeply understand and decipher the underlying principles governing product formulations and process parameter settings, thereby addressing the limitations of traditional approaches that rely on seasoned process experts while lacking systematic knowledge transfer. It can leverage scientific computing models to provide chemical enterprises with optimized process solutions and, in collaboration with the Asia Bay National Laboratory, has refined rice breeding and drought‑stress modeling, significantly enhancing detection efficiency.
Q:In the actual production operations at Qingdao Port, what results has the “fa mou” technology achieved?
A:During the May Day holiday, on its live production system, Qingdao Port adopted Baidu’s Famo technology and received highly positive feedback from operations staff. The efficiency of the ATOS system improved by 10.21%, with projections indicating it can handle nearly one million additional TEUs annually, helping Chinese products reach global markets. Moreover, strategy optimization not only improves individual production metrics but also supports multi‑directional, multi‑scenario simulations, serving as an intelligent assistant that promptly adjusts plans to aid business experts in decision‑making and enabling the reusable deployment of AI assets.
Q:How does the Port of Qingdao leverage its automated terminal operating system (ATOS) to enhance port‑terminal operational efficiency?
A:As a world-leading automated container terminal, Qingdao Port’s intelligent control system, A‑TOS, faces the challenge of performing extreme‑case simulations in three-dimensional space when determining vessel stowage plans. By collaborating with Baidu’s strategic planning team, starting from data reconstruction, focusing on the positional details of each container, establishing real-world constraints, and screening candidate solutions, we leveraged AI for holistic decision-making and conducted 248 rounds of self-iterative optimization, successfully reducing the number of toppled containers, increasing the biphasic coupling rate, and improving container‑outage balance, thereby enhancing the port’s overall input‑output ratio.
Q:What makes the Punish-Cat agent unique compared to other agents?
A:The Punish-Cat agent is capable of self-evolution and autonomous decision-making; it can self‑verify and optimize task outcomes, establishing a near‑closed loop that requires minimal human intervention and continuously seeks better solutions. At the same time, the requirements for AI’s verifiability and closed-loop operation also drive process reengineering and the codification of experience into verifiable context, enabling decision-making agents to move beyond single-use cases and evolve into enterprise‑wide intelligence capable of understanding and implementing industry logic while continuously seeking optimal solutions.
Q:How should companies respond to the evolution of AI in order to maintain their competitiveness?
A:Enterprises must undergo self-evolution by delegating more authority and reducing micromanagement, accelerating alignment and flattening hierarchies, increasing talent density while moving away from labor‑intensive strategies, and integrating more tasks while minimizing rigid division of labor. In the AI era, corporate management must shift its paradigm, embrace innovation and creativity, and establish a systematic transformation geared toward the age of intelligent agents, in order to adapt to the disruptive impact of AI on how businesses operate.
Q:In Yiwu, consumers can experience the AI store manager powered by Baidu Intelligent Cloud. What functions does it offer?
A:The AI store manager can provide in-store guidance, review sales data, and, based on inventory levels, offer recommendations for product transfers and promotional strategies, while also issuing alerts for overdue orders. It comes with a suite of marketing tools to help businesses manage their operations.
Q:Can the AI store manager’s capabilities be extended and evolved? How is this front-store, back‑factory solution currently made available to customers and partners?
A:Yes, the AI store manager’s capabilities can continuously expand and evolve. For example, once logistics data is integrated, it can calculate shipping costs and delivery lead times, helping merchants optimize their operations and management. We have made this capability available to our customers and partners. A local partner has leveraged Baidu Smart Cloud’s Hold‑On technology and related intelligent agents to build a solution for Yiwu merchants.
Q:What are Baidu’s strategies and partnerships in the field of automotive intelligence?
A:Baidu leverages its full-stack AI capabilities to participate in the intelligent transformation of the automotive industry, such as supporting Geely in building its first proprietary cloud and assisting Horizon in constructing an ultra-large-scale cluster. Baidu collaborates with leading companies across sectors—including automakers, battery and chip manufacturers, and autonomous‑vehicle providers—to co‑develop, train, simulate, test, and mass‑produce intelligent driver‑assistance solutions. Last year alone, it successfully supported the delivery of over 20 million new vehicles equipped with Level 2 advanced driver‑assistance systems.
Q:How has Changan Automobile achieved its intelligent transformation through its partnership with Baidu?
A:Changan Automobile has leveraged the intelligent computing center it co-built with Baidu to establish core competitive advantages spanning from the smart cockpit to autonomous driving. At present, Changan Automobile has leveraged Baidu’s platform to significantly optimize the performance of its computing center, enabling efficient training and iteration of autonomous-driving models, and has become one of the companies approved by the Ministry of Industry and Information Technology to produce L3‑level autonomous vehicles.
Q:How does Horizon's product strategy upgrade and its cooperation with Baidu AI Cloud?
A:Horizon’s strategic upgrade includes the launch of its new product line, Xingkong, an integrated chip designed for full‑vehicle computing. It adopts a central computing architecture, seamlessly integrating intelligent driving and smart cockpit processing, and supports a revolutionary human–machine interaction experience—such as the introduction of an in‑vehicle agent OS. Horizon has established a deep strategic partnership with Baidu AI Cloud, jointly building a full‑process closed loop based on big data, creating an ultra‑large‑scale heterogeneous computing cluster, supporting the distributed collaborative work of thousands of engineers, and raising industry benchmarks in the autonomous driving field in terms of scale, quality, and iteration speed. At present, Horizon Robotics has empowered more than 40 automakers worldwide, bringing hundreds of vehicle models to market, and is collaborating with companies such as the Volkswagen Group and Toyota.
Q:What are the latest advancements of Horizon Robotics in the field of intelligent driver assistance systems?
A:Horizon has launched a high-performance autonomous driving chip with 560 TOPS of computing power, advancing intelligent driver-assistance technologies worldwide. Vehicles equipped with Horizon’s HSD have won favor among consumers thanks to their exceptional user experience, securing the top spot in China’s market with a share exceeding 30%.
Q:How can AI become a new super entry point on mobile phones and other hardware platforms, and what examples have been cited?
A:AI is becoming a new super entry point by integrating with more hardware devices, such as smartphones and AI glasses. Taking Honor as an example, Baidu Intelligent Cloud provides it with end‑cloud integrated solutions and large‑model capabilities such as the ERNIE model, thereby enhancing user experience. For example, Honor UU has amassed over 100 million users, offering travel guides, attraction commentary, and personalized tour‑guide services, while also recommending local cuisine and providing navigation through Baidu Maps. At present, all of the world’s top ten smartphone manufacturers are leveraging Baidu Intelligent Cloud’s agent‑info capabilities. As open‑cloud general‑purpose agents gain traction, phone makers are beginning to expose OS‑level skills; when paired with agent‑building capabilities, this enables them to deliver a natively integrated super assistant for their devices.
Q:How does AI help users plan their itineraries?
A:When a user requests travel from Guangzhou to Chengdu during the May Day holiday, the AI assistant will invoke its built-in schedule management skill to check for time conflicts and, based on the user’s preferences, select suitable flights and hotels. After the user confirms the flight and hotel, the AI assistant will generate a final itinerary based on their choices, then use calendar and reminder skills to add it to the calendar and set a departure alarm, thereby achieving personalized trip planning.
Q:How does Baidu Intelligent Cloud play a role in the AI hardware field?
A:Baidu Intelligent Cloud not only serves more than 1,000 AI hardware companies, providing services such as building core computing power models to enhance product experience, but also helps traditional hardware evolve into super-intelligent entities. Furthermore, in response to the challenges facing the embodied intelligence industry, Baidu Intelligent Cloud draws on its experience in the autonomous driving field, supporting large-scale data collection and annotation, accelerating model training and improving efficiency, reducing inference latency, and providing end-to-end capabilities that encompass computing platforms, toolchains, data services, voice interaction security, as well as training, evaluation, and feedback loop systems, thereby helping enterprises build competitive advantages in the era of physical AI.
Q:What are Xinghai Map Company’s practices and insights in the field of generative AI?
A:Mr. Gao, CEO of Xinghai Map, shared his company’s experience in the entrepreneurial journey of Jusheng Intelligence, emphasizing that it is a well-rounded, integrated hardware‑software venture with no weak links, encompassing multiple stages such as motors, models, data, and toolchains. Jusheng Intelligence differs from large language models and autonomous driving in that its value chain is longer and every link is indispensable, necessitating an integrated hardware‑software solution. Xinghai Map adheres to an integrated hardware-and-intelligence strategy, placing particular emphasis on operational excellence as its core competency, and has independently developed the RE platform to serve developers worldwide. The Starry Sky Map focuses on two co‑evolving, symbiotic technical models—WAM and VLA—aiming to endow robots with general intelligence capable of understanding the world and manipulating all objects. They propose that pre-training should enable robots to learn the fundamental principles governing their own bodies and their interactions with the physical world, while fine-tuning serves as the adaptation process tailored to specific tasks. The world’s fastest world model, fast WEM, released in March of this year, has reduced one-step latency to 190 milliseconds, thereby enhancing deployment efficiency. The foundational model training for the Star Map is conducted on Baidu Intelligent Cloud, leveraging its efficient AI infrastructure to enhance development efficiency, with plans to further optimize model performance. Meanwhile, Xinghai Map has open-sourced its foundational models G0 and G0.5 and launched the Jusheng Intelligence one-stop development platform EDP, supporting end-to-end workflows that encompass data, model, device, and personnel management.
Q:In the development of the financial industry, why are the professionalism, stability, and timeliness of AI services so important? How does Baidu Intelligent Cloud collaborate with the financial sector to drive technological innovation and business transformation?
A:Financial operations place high demands on the professionalism, stability, and timeliness of AI services; particularly in an environment emphasizing self-reliance and controllability, it is essential to ensure that chip computing power, models, and applications have undergone rigorous testing. For example, China Merchants Bank has deployed the Kunlun Core P800 at scale and has already launched hundreds of AI applications covering core scenarios such as risk control, marketing, R&D, and office operations. SPDB has also leveraged AI to enhance its capabilities in refined operations and risk management. Baidu Intelligent Cloud has established deep partnerships with several financial institutions, including China Merchants Bank and Shanghai Pudong Development Bank, to jointly build a domestically developed heterogeneous computing cluster. The collaboration has enabled the evaluation of open-source models and the adaptation of mainstream large-scale models, resulting in the launch of specialized financial‑industry‑specific models tailored for applications such as financial‑statement‑form recognition and other vertical research areas. For example, in applications at Shanghai Pudong Development Bank, AI technology has reduced the time required for financial analysis from hours or days to just minutes, significantly boosting efficiency in risk management, marketing, and other areas.
Q:How can Baidu Intelligent Cloud establish long-term partnerships with central state-owned enterprises such as State Grid and deploy AI technologies in real-world scenarios?
A:Baidu Intelligent Cloud has collaborated with State Grid for ten years, advancing from intelligent customer service to the development of a trillion-parameter Guangming Power large model, successfully integrating AI technology into the grid’s core operations. For example, in the field of equipment inspection, combining small and large models has significantly improved recognition accuracy and reduced inspection time from 2.5 hours to 45 minutes, covering hundreds of substations nationwide and ensuring reliable power supply for 1.1 billion people.
Q:In the face of industry diversification and evolving customer needs, how is Baidu Intelligent Cloud adjusting its strategies to meet these new demands?
A:Baidu Intelligent Cloud recognizes that customer demand for agent‑based applications has evolved, shifting from a focus on business elasticity and cost reduction with efficiency gains to an increasing need for highly active, high‑value, and scalable agent‑driven solutions. Therefore, Baidu Intelligent Cloud is focusing on building intelligent agents, comprehensively upgrading its product technologies, and developing flexible AI capabilities to address customers’ real-world business challenges.
Q:How does Baidu Intelligent Cloud’s full-stack AI capability meet enterprises’ needs for large-scale deployment, continuous evolution, and secure, controllable intelligent agents?
A:Baidu Intelligent Cloud has comprehensively upgraded its capabilities in the AI info and agent info sections. AI info is committed to delivering the highest performance-per-watt and most cost-effective AI computing power, optimizing model performance; agent info, through token factory, enhances the speed and efficiency of agents in handling ultra-long-context tasks, and achieves more efficient agent operation via harness engineering. Meanwhile, by establishing collaboratively optimized infrastructure, agents can better adapt to complex task requirements and enhance their modeling and engineering capabilities.
Q:What upgrades has Baidu Intelligent Cloud made to its underlying computing power to support the development of AI cloud?
A:Baidu Intelligent Cloud has achieved significant breakthroughs in underlying computing power, including its XunTui service optimized for intelligent agents, which delivers industry-leading memory reuse and heterogeneous scheduling technologies, as well as a multimodal training framework and reinforcement learning acceleration techniques, substantially boosting model training and inference efficiency. In addition, Baidu Intelligent Cloud has also launched the Kunlun Core P800 series, which deploys large-scale clusters on a large scale, and has enabled efficient and stable training and reasoning. It has also created high-performance and highly available AI computing infrastructure through optimized network architecture and data center design to promote the ecological development and technological innovation of intelligent bodies.

Baidu, Inc.
Follow





