Large Model Competition Heats Up: More Applications Emerge

In an ever-accelerating race towards advancing artificial intelligence, the landscape of AI large models has witnessed significant upheaval. Recently, Elon Musk, a prominent entrepreneur and figure in the tech world, teamed up with his xAI crew to unveil the latest iteration of their AI model, known as Grok 3, on February 18 in a live stream on X platform. Musk boldly proclaimed it as “the smartest AI on Earth.” In a feat of technological excellence, the xAI team reportedly scaled their data center by doubling their GPU count from 100,000 to 200,000, underlining their commitment to refining AI capabilities.

Earlier, in February, the Chinese startup DeepSeek introduced its revamped large model, DeepSeek-R1, stirring up the market with its remarkably low training and usage costs. The competitive pressure intensified as tech giants such as Google, xAI, OpenAI, and Anthropic all announced forthcoming models. A significant trend began to emerge; many of these entities signaled a shift towards open-source strategies, which stakeholders believe could substantially reduce application costs and unlock new opportunities within AI sectors.

The announcement of Grok 3 signifies a crucial milestone in the ongoing development of large models. Musk emphasized its enhanced functionality over its predecessor Grok 2, claiming that interacting with Grok 3 is a uniquely engaging experience. The genesis of xAI began in July 2023, with its first model, Grok 1, emerging onto the scene in November with an unprecedented 314 billion parameters—setting a record as the largest open-source language model at that moment. The release of Grok 2 in August 2024 drew comparisons with the latest iterations of ChatGPT, suggesting a new level of performance parity in the sector.

To construct Grok 3's formidable architecture, Musk and his team faced several technical challenges, particularly in establishing a robust computational cluster capable of handling immense processing demands, while navigating hurdles related to heat dissipation and power supply. The team successfully deployed the initial 100,000 GPUs over a concentrated duration of 122 days, before expanding the setup further in mere 92 days—signifying a remarkable feat in hardware orchestration.

During the live demonstration, xAI showcased Grok 3's abilities, indicating that it outperformed or matched competitors such as Gemini, DeepSeek, and ChatGPT in various tests. One highlight was Grok 3's capability to generate code, which animated a spacecraft's journey between Earth and Mars. The versatility of Grok 3 was further exemplified through the creation of a Tetris-like game, showcasing its multi-dimensional applications.

Additionally, xAI introduced an intelligent search engine named DeepSearch, leveraging Grok 3's inherent capabilities. The team announced that all functionalities of Grok 3 would gradually go live within a week, alongside plans to make Grok 2 open-source. This strategic movement into open-source development is poised to foster a multitude of advancements in the AI landscape, amplifying the proliferation of AI applications and encouraging a democratization of technology.

The competitive arena has grown increasingly intense, with Musk's xAI emerging as a formidable contender among other major players like DeepSeek, OpenAI, and Google. The recent release of DeepSeek-R1 proved to be a game-changer in the open-source inference model segment—delivering high-performance outputs at remarkably low costs. Training DeepSeek-V3 reportedly utilized 2,048 NVIDIA H800 GPUs over a two-month period, totaling merely $5.576 million, a fraction of the costs typically associated with other models like GPT-4o.

Following DeepSeek’s launch, Google quickly responded with its Gemini 2.0 model series, enhancing capabilities in encoding and reasoning, thus making it publicly available. Meanwhile, OpenAI shared its intentions to roll out its next-generation models, GPT-5 and GPT-4.5, asserting that GPT-5 would incorporate several foundational technologies, including the new o3 reasoning model. Reports also suggest that Anthropic has plans to unveil a hybrid large model, Claude 4, allowing user control over inference costs in the imminent future.

DeepSeek's innovative approach has propelled an open-source revolution in the large model sector, aiming to integrate them as essential utilities—akin to water, electricity, or gas—across various industries. On February 18, a collaboration was announced between Jietiao Star and Geely Automotive Group, resulting in the open-source release of two new multi-modal models—Step-Video-T2V for video generation and Step-Audio for auditory modeling. In a parallel move, Baidu proclaimed that its large model product, Wenxin Yiyan, would be offered entirely free of charge starting April 1, reaching users on both PC and mobile app platforms. In another significant update, OpenAI declared that its free edition of ChatGPT would now enable unrestricted conversations utilizing GPT-5 under standard intelligent settings.

Reflecting on the implications, a researcher from the China Academy of Information and Communications Technology emphasized that the maturation of AI technologies is fundamentally reshaping commercial models. The ascendance of models from entities like DeepSeek is projected to reconstruct the industry ecosystem fundamentally.

In a broader context, an initial review indicated that hundreds of companies and institutions are now tapping into the expansive capabilities of DeepSeek's AI models across varied sectors, including chip production, cloud services, financial technology, and automotive industries. Just recently, DeepSeek's technologies were integrated into WeChat—a platform boasting nearly 1.4 billion users. Additionally, Baidu Search revealed intentions to comprehensively integrate the latest functions of DeepSeek and Wenxin models into their search capabilities.

Industry analysts highlighted a shift in how AI model enterprises will derive revenue moving forward: rather than relying solely on the intrinsic value of their models, profitability will increasingly hinge on considerations surrounding ecological contributions, user engagement, data integrity, and the capacity to provide valuable services.

The impact of large models is burgeoning across multiple realms such as content creation, finance, telecommunication, and autonomous driving. Reportedly, the major state-owned telecommunications operators will soon integrate DeepSeek technology, capitalizing on their vast data reservoirs to enrich and refine model training processes. Meanwhile, telecom firms aim to leverage this technology to unlock novel AI-driven business opportunities and enhance the operational capabilities of their cloud services.

In the content creation domain, large models are proving indispensable for enterprises aiming to boost creativity and efficiency in generating written texts, images, and videos. Recently, the Reading-Wen Group has integrated the newly deployed DeepSeek-R1 into its author assistance tool. Similarly, the digital cultural entity Chinese Online announced the implementation of DeepSeek-R1 in its AI-driven online content creation workflows, yielding substantive boosts to creative efficiency.

On the frontiers of intelligent customer service, large models now facilitate smarter interactions, thereby elevating customer satisfaction rates. For instance, FAW Toyota Motor Sales has harnessed DeepSeek's model combined with Tencent Cloud’s knowledge engine to remarkably enhance service efficiency across various scenarios, including online support and outbound call systems.

In the finance sector, large models stand as powerful tools for risk assessment and investment decision-making capabilities, driving operational efficiency and bolstering risk management within financial institutions. Guojin Securities, for example, is determining a pathway to incorporate DeepSeek into areas like intelligence retrieval and industry analysis, with aspirations to broaden applications to include cutting-edge services in risk management and investment analysis.

As the dialogue of the future unfolds, Tsinghua University professor Liang Zheng pointed out the trajectory towards terminalization and lightweight innovations in AI, suggesting that advancements in multi-modal and reinforced learning technologies may herald the age of scaled deployment for service robots, autonomous vehicles, and drones. This evolution holds promise for rendering AI capabilities more accessible and integrated into everyday activities.

In conclusion, research briefs emphasize a sunnier outlook for niche large models within fields like office applications, retail, customer service, finance, marketing, and entertainment, while tech giants such as Baidu, Alibaba, and Tencent (collectively known as BAT) may benefit from a re-evaluation of their inherent value amidst the growing AI boom. Investment experts at Morgan Asset Management are keen on tracking industries driven by AI technology, including renewable energy, high-end manufacturing, and innovative healthcare solutions focused on new medications.

Related stories

Morning Brief FM-Radio | February 19, 2025

Baidu AI Chatbot: A Practical Guide to Using Ernie Bot for Work & Life

Key Trends Shaping Banking Wealth Management

Why is Baidu Struggling? 3 Core Reasons Behind Its Decline

Musk Unveils Grok3 Model

AI in Combat: Transforming Warfare and Defense Stock Investments